Back to feed
arXiv cs.AI·

Structured Labeling Enables Faster Vision-Language Models for End-to-End Autonomous Driving

Signal
72
Hype
28
In three linesFastDrive, a compact 0.9B-parameter VLM, outperforms 7B+ models (LLaVA-1.5) on autonomous driving tasks. Trained on NuScenes-S, a benchmark with structured representations, it achieves +20% accuracy on decision-making with 10x inference speedup.
Read source
Your take?
VisionReasoningBenchmarksCode generationRobotics

Summary generated by Claude — human-verified