Claim

Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling

Authors
Xiaokang Chen
Zhiyu Wu
Xingchao Liu
Zizheng Pan
Wen Liu
Zhenda Xie
Xingkai Yu
Chong Ruan

ABSTRACT

In this work, we introduce Janus-Pro, an advanced version of the previous work Janus. Specifically, Janus-Pro incorporates (1) an optimized training strategy, (2) expanded training data, and (3) scaling to a larger model size. With these improvements, Janus-Pro achieves significant advancements in both multimodal understanding and text-to-image instruction-following capabilities, while also enhancing the stability of text-to-image generation. We hope this work will inspire further exploration in the field. Code and models are publicly available.
