OTTER:A Vision-Language-Action Model with Text-Aware Visual Feature Extraction

本文最后更新于 2025年3月7日下午

OTTER:A Vision-Language-Action Model with Text-Aware Visual Feature Extraction

加强VLA对语言的理解能力

A simple but efficient method

AI-Paper-Reading > Robot

#Robot #VLA

OTTER:A Vision-Language-Action Model with Text-Aware Visual Feature Extraction

http://example.com/2025/03/06/2025-3/paper4/

作者

Artimis

发布于

2025年3月6日

许可协议

如何解压缩大文件上一篇

Improving Vision-Language-Action Model with Online Reinforcement Learning 下一篇