FD-VLA: Force-Distilled Vision-Language-Action Model for Contact-Rich Manipulation
Paper • Feb 13, 2026 • arXiv • Ruiteng Zhao, Wenshuo Wang, Marcelo H. Ang Jr., Haiyue Zhu
Current VLA models primarily rely on visual feedback, which is insufficient for contact-rich tasks like precision assembly or handling delicate objects. We introduce FD-VLA, a force-distilled frame...