Transform Your Videos into Immersive Interactive Worlds with Odyssey’s AI Model
London-based AI lab Odyssey has unveiled a research preview that could change the way we engage with video content. The team initially set out to build world models for film and game production, and along the way stumbled upon what might become a new medium in entertainment.
Imagine a video that responds to everything you do. Odyssey’s AI-powered interactive video generates its frames in real time in response to user input. Whether you’re navigating with a keyboard, phone, or controller (and soon, by voice), the experience is meant to feel like stepping into a digital dimension reminiscent of a futuristic “Holodeck.”
The Technology Behind the Magic
So, what sets Odyssey’s interactive video apart from conventional video games or CGI? It all hinges on a revolutionary concept they call a "world model."
- Frame-by-Frame Prediction: Unlike traditional video models that deliver complete clips, world models function frame-by-frame. They analyze the current situation and user interactions to anticipate subsequent video frames.
- Complex Learning: The method resembles how large language models predict the next word in a sentence, but the prediction target is a high-resolution video frame rather than a word, which makes the problem considerably harder.
“A world model is, at its core, an action-conditioned dynamics model,” Odyssey explains. Each new frame is generated from the current state and the user’s latest input, so the video can feel organic and unpredictable. There is no fixed script; the output evolves with what the user does.
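To make the phrase "action-conditioned dynamics model" concrete, here is a minimal toy sketch in Python. It is not Odyssey's model: the `State` class, the `ACTIONS` table, and the one-row text "frames" are all hypothetical stand-ins, and a hand-written rule takes the place of a learned neural network. It only illustrates the shape of the loop: take the current state and the user's action, predict the next state, render a frame, repeat.

```python
from dataclasses import dataclass

WIDTH = 8  # a toy "frame" is a single row of WIDTH characters


@dataclass(frozen=True)
class State:
    x: int  # viewer position along the row


# Hypothetical action set standing in for keyboard or controller input.
ACTIONS = {"left": -1, "right": 1, "stay": 0}


def predict_next(state: State, action: str) -> State:
    """Action-conditioned dynamics: the next state depends on state AND action."""
    x = min(WIDTH - 1, max(0, state.x + ACTIONS[action]))  # clamp to the row
    return State(x)


def render(state: State) -> str:
    """Turn a state into a toy frame: '#' marks the viewer, '.' is empty."""
    return "".join("#" if i == state.x else "." for i in range(WIDTH))


def rollout(start: State, actions: list[str]) -> list[str]:
    """Generate frames one at a time, feeding each predicted state back in."""
    frames = [render(start)]
    state = start
    for action in actions:
        state = predict_next(state, action)
        frames.append(render(state))
    return frames


frames = rollout(State(x=3), ["right", "right", "left", "stay"])
for frame in frames:
    print(frame)
```

Because each frame is produced only after the latest action is known, nothing is scripted in advance; a different sequence of actions yields a different sequence of frames.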
Overcoming Challenges with AI-Generated Video
Creating this dynamic video experience isn’t without its hurdles. Maintaining stability over time is one of the greatest challenges. The process of generating each frame might seem straightforward, but small discrepancies can quickly snowball—a phenomenon known as “drift.”
To combat this, Odyssey uses what it calls a “narrow distribution model”: the model is pre-trained on extensive general video footage and then fine-tuned on a smaller set of specific environments. The trade-off is less variety in exchange for greater stability over long sessions.
Currently, running the model costs between £0.80 and £1.60 per user-hour and relies on clusters of H100 GPUs spread across the US and Europe. While this price point may seem steep for streaming, it’s a fraction of the cost of traditional game or film production. Odyssey expects these costs to fall as its models become more efficient.
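The drift problem described above can be illustrated with a small numerical sketch. The numbers here are assumptions chosen for illustration, not measurements from Odyssey's system: the "true" world moves one unit per step, while a hypothetical model has learned a dynamics rule that is off by 2%. Because each prediction is fed back in as the next input, the gap between the two grows with every step.

```python
def true_step(s: float) -> float:
    """Ground-truth dynamics of the toy world: move one unit per step."""
    return s + 1.0


def model_step(s: float) -> float:
    """Hypothetical learned dynamics with a small (2%) multiplicative error."""
    return 1.02 * s + 1.0


def drift_over_time(steps: int) -> list[float]:
    """Roll both systems forward from 0 and record the gap after each step."""
    truth, predicted = 0.0, 0.0
    errors = []
    for _ in range(steps):
        truth = true_step(truth)
        predicted = model_step(predicted)  # the model consumes its own output
        errors.append(abs(predicted - truth))
    return errors


errors = drift_over_time(50)
print(f"gap after 5 steps:  {errors[4]:.3f}")
print(f"gap after 50 steps: {errors[49]:.3f}")
```

The error per step is tiny, but it never gets corrected, which is why long sessions are harder to keep stable than short clips.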
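As a rough back-of-the-envelope illustration of what the per-user-hour figure means in practice, the sketch below scales the quoted range to a session. Only the £0.80 and £1.60 rates come from the article; the session length and user count are hypothetical, and the function name is made up for this example.

```python
LOW_GBP_PER_USER_HOUR = 0.80   # lower bound quoted in the article
HIGH_GBP_PER_USER_HOUR = 1.60  # upper bound quoted in the article


def session_cost_range(minutes: float, users: int = 1) -> tuple[float, float]:
    """Return the (low, high) cost in GBP for `users` sessions of `minutes` each."""
    hours = minutes / 60
    low = round(LOW_GBP_PER_USER_HOUR * hours * users, 2)
    high = round(HIGH_GBP_PER_USER_HOUR * hours * users, 2)
    return low, high


# Hypothetical scenario: 100 users, each in a 30-minute session.
low, high = session_cost_range(minutes=30, users=100)
print(f"Estimated cost: £{low:.2f} to £{high:.2f}")
```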
Is Interactive Video the Future of Storytelling?
Throughout history, advances in technology have birthed new storytelling mediums—from cave paintings to literature, and from radio to video games. Odyssey believes that AI-generated interactive video will usher in the next chapter of this storytelling evolution.
Consider the possibilities: training scenarios that allow you to practice skills in a simulated environment, or travel experiences that let you explore foreign cities from the comfort of your couch. While the current research preview is merely a proof of concept, it offers a tantalizing glimpse into a world where AI-generated environments transform entertainment, education, and advertising.
Are you ready to step into this brave new world? You can try Odyssey’s research preview to see interactive video for yourself and watch how the future of storytelling might unfold.

