Introducing PhAIL: We Measured How Good Robot AI Actually Is. It's Not.
We built PhAIL – the Physical AI Leaderboard – to measure how foundation models actually perform on real commercial robotics tasks. The best model runs at 5% of human throughput.