All the world’s a robot-staging ground for tech entrepreneurs building ‘physical AI’

AP
Published
0
0
All the world’s a robot-staging ground for tech entrepreneurs building ‘physical AI’
Read the full story at APOriginal
Computer scientist Fei-Fei Li speaks during the Clinton Global Initiative, Sept. 24, 2024, in New York. (AP Photo/Andres Kudacki, File)

2026-06-24T04:01:06Z

PROVIDENCE, R.I. (AP) — Computer scientist Louis Castricato was in his eighth year studying large language models — the artificial intelligence technology behind chatbots like ChatGPT and Claude — when he started to feel like he was hitting a dead end.

“We basically have passed the point of doing real fundamental LLM research,” Castricato said. “Now it’s just applications.”

The researcher quit his studies at Brown University and started a new company, called Overworld. Its ambition is in its name: AI that can understand and navigate a world, not just words.

There’s still plenty of money to be made from AI chatbots — investors are counting on it as they commit trillions of dollars to leading developers like Anthropic and OpenAI. But a growing number of AI entrepreneurs are dedicating themselves to what they see as the next frontier: “world models” that teach AI systems, and sometimes robots, how to react in a physical environment.

They include some of the field’s most prominent scientists, such as “Godmother of AI” Fei-Fei Li, who describes the concept of a world model as “one of the most important and most overloaded terms in AI today.”

Scientists are applying AI in new dimensions with ‘world models’

At the heart of world model research is the idea that AI can’t be truly intelligent if it can only read a book. It also needs to read the room.

“Where language models learn the statistical structure of text, world models learn the statistical structure of space and time: how light falls on a surface, how a garden looks from an angle no camera has captured, how objects respond to force and follow the laws of physics,” wrote Li, founder of the San Francisco startup World Labs, in an essay published this month.

Another proponent is AI pioneer Yann LeCun, who quit his job as Meta’s chief AI scientist last year to start Paris-based Advanced Machine Intelligence Labs.

“World model is quickly becoming a buzzword,” LeCun said on a recent “Unsupervised Learning” podcast. He said he views it as something that enables an AI agent “to predict the consequences of its own actions.”

There are multiple ways of defining world models, often based on the technologies someone hopes to build with it — be it robots or a more interactive video game.

Robots can’t learn much from AI models trained on books

Training on all of humanity’s books, news articles and visual media, as AI language models have done, has led to AI assistants that are changing the nature of office-based work and some creative fields. But some proponents see limitations in generative AI models that work by repeatedly predicting the next word or pixel to produce new dialogue, images or lines of code.

Chatbots can’t pick up a coffee mug, notes Martin Hebert, dean of computer science at Carnegie Mellon University.

“There’s all the geometry of the world, the dynamic of how I move my hand, the physical interaction of the contact with the cup,” Hebert said. “This is much more complex than just predicting the next word in a sentence.”

Sign up for Morning Wire: Our flagship newsletter breaks down the biggest headlines of the day.

For scientists like Hebert, who has spent more than four decades researching robotics, the most useful application for world models is as a faster and cheaper path to “physical AI” — another tech industry buzzword.

“Some people may have different definitions, but physical and embodied AI are kind of the evolution of what we used to call robotics,” Hebert said in an interview. Some of the AI advances that have made chatbots so useful can also be applied to building AI with a broad enough awareness of its environment to work like a robot’s brain, he said.

“In your body and spinal cord you have a very general model of how to balance, how to walk around, and you can adapt to your knee hurting in the morning, so you now walk a little differently,” he said. “You don’t need to think about that. You have a general model somewhere in your nervous system and brain that allows your body to adapt very quickly.”

Simulated worlds are drawing interest from investors

Smarter robots aren’t the only end game for world models. Castricato started Overworld last year and the tiny Rhode Island-based startup is now building video game worlds where a scene, say, of a spooky forest, can adapt as a virtual character moves through it and interacts with the objects in it.

“There’s no other world model where you can just walk through doors or where you can interact with a detailed environment like this,” he said in an interview. “We optimize for interaction above anything else.”

While the near-term applications aren’t as readily apparent as AI coding tools, world model makers are attracting interest from venture capitalists like Steve Jang, co-founder and managing partner at Kindred Ventures.

The firm is investing in Overworld and other world model-focused companies, including Causal Labs, which is building AI models for weather prediction, and Extropic, which is building specialized computer chips suited to world models.

“I think that the future is many different types of models with many different philosophies and architectures,” Jang said. “I don’t think that it’ll be one large, dense model to rule them all.”

In her recent essay, Li sought to create a “taxonomy of world models” to help sort out the confusion about the competing visions.

“A video model that produces gorgeous but physically impossible flames, a language model improvising a playable game, and a physics engine that faithfully simulates combustion all go by the same name,” she wrote.

She divided world models into three categories. The most commercially viable today are “renderers” that prioritize the visual fidelity of the virtual worlds they create but can’t be trusted to teach robots much.

Then, there are “simulators” that create virtual training grounds that faithfully represent the physical structure of a world; and “planners” that try to predict what an AI agent or robot should do in an unstructured world.

“A robot that can plan is a robot that can work, and the entire industry is racing to be the one that gets there first,” she wrote.

MATT O’BRIEN O’Brien covers the business of technology and artificial intelligence for The Associated Press. mailto

Related Markets

All Markets
View full chart →
View Full Chart
View full chart →
View Full Chart
View full chart →
View Full Chart

Market data may be delayed. Not financial advice.

Reader Reactions
Reading the article

💡 AI analysis provides alternative perspectives on current events

Support Alto & Gab

Alto is funded entirely by readers like you. Your donation helps us continue delivering curated news from a right-wing Christian Nationalist perspective, powered by Gab AI.

Gab Shop

Support free speech with official merchandise

View All Products

Install Alto on Your Phone

Add Alto to your home screen for quick access to breaking news — no app store required.

iPhone & iPad

Using Safari Browser

1

Open alto.gab.com in Safari

alto.gab.com
2

Tap the Share button

at the bottom of Safari
3

Tap "More"

More
4

Scroll and tap "Add to Home Screen"

Add to Home Screen

Tap "Add" to confirm

Alto will appear on your home screen like any other app!

Android

Using Chrome Browser

1

Open alto.gab.com in Chrome

alto.gab.com
2

Tap the menu button

three dots in top right
3

Tap "Add to Home screen"

Add to Home screen

Tap "Add" to confirm

Alto will appear on your home screen like any other app!
gab

Speak Freely

Join millions on the original and only true free speech social network.

What Makes Gab Different

We're not just another social network. We're a platform built on principles that matter.

Freedom of Speech & Reach

All First Amendment protected speech is welcome. No algorithmic throttling or shadow banning.

Family-Friendly Platform

We maintain a clean environment. Explicit adult content is strictly prohibited.

Western Nations Only

Third-world IPs are blocked. No scammers, no spam farms. Built for Western civilization.

Funded By Users

Our users are our investors and customers. You're not the product being sold.

Battle Tested

A decade of standing strong. Banned from app stores, banks—and still here.

American Owned & Operated

We reject foreign censorship demands. Built by Americans, for free people.