Vision–Language–Action Models in Humanoid Robotics Practical Limits, Strategic Bets, and Market Implications

less than 1 minute read

Published: August 12, 2025

Originally published on Substack.

In the recent speech, Unitree’s founder pointed out that the bigger problem is model architecture, not data quantity, and he also commented that the VLA is too limited. However, there is another startup Spirit AI is betting its roadmap on a VLA- first approach.

The strategic divergence is clear.

Yong Qian