LDA Language Cards Action

Object-Centric Prompt-Driven Vision-Language-Action Model for Robotic Manipulation

Abstract: In robotics, task goals can be conveyed through various modalities, such as language, goal images, and goal videos. However, natural language can be ambiguous, while images or videos may ...

IEEE

TinyVLA: Toward Fast, Data-Efficient Vision-Language-Action Models for Robotic Manipulation

Vision-Language-Action (VLA) models have shown remarkable potential in visuomotor control and instruction comprehension through end-to-end learning processes. However, current VLA models face ...


