Vision-Language-Action for Robotics
A living reference of foundational architectures, rigorous validation strategies, and deploying robot foundation models.
Scene Encoding
Reasoning
Control Policies
Core concepts and problem formulation
Model designs and network topologies
Dataset construction and curation
Optimization and learning methods
Metrics and benchmarking protocols
Production systems and scaling
Real-world use cases
Open problems and frontiers