Paper • 2025-08-01

LLaVA-Video: Video Instruction Tuning With Synthetic Data

Authors

Yuanhan Zhang · Jinming Wu · Wei Li · Bo Li · Zejun Ma · Ziwei Liu · Chunyuan Li

ABSTRACT

The development of video large multimodal models (LMMs) has been hindered by the difficulty of curating large amounts of high-quality raw data from the web. To address this, we propose an alternative approach by creating a high-quality synthetic dataset specifically for video instruction-following, namely LLaVA-Video-178K. This dataset includes key tasks such as detailed captioning, open-ended question-answering (QA), and multiple-choice QA. By training on this dataset, in combination with existing visual instruction tuning data, we introduce LLaVA-Video, a new video LMM. Our experiments demonstrate that LLaVA-Video achieves strong performance across various video benchmarks, highlighting the effectiveness of our dataset. We plan to release the dataset, its generation pipeline, and the model checkpoints.

ByteDance Research • 📋 Awesome List: multimodal

#llm/paper/month/202508 #computer-version/year/2025 #llm/paper/year/2025 #machine-learning/month/202508 #machine-learning #computer-version #multimodal-model #llm/paper #deep-learning/from/bytedance-research #deep-learning/year/2025 #llm/year/2025 #llm/month/202508 #multimodal #world-model #deep-learning #machine-learning/year/2025 #llm #computer-version/month/202508 #deep-learning/month/202508
