Paper · 2023-08-28

MagicEdit: High-Fidelity and Temporally Coherent Video Editing

Authors

Jun Hao Liew · Hanshu Yan · Jianfeng Zhang · Zhongcong Xu · Jiashi Feng

ABSTRACT

In this report, we present MagicEdit, a surprisingly simple yet effective solution to the text-guided video editing task. We found that high-fidelity and temporally coherent video-to-video translation can be achieved by explicitly disentangling the learning of content, structure and motion signals during training. This contrasts with most existing methods, which attempt to jointly model appearance and temporal representation within a single framework, an approach that, we argue, degrades per-frame quality. Despite its simplicity, MagicEdit supports various downstream video editing tasks, including video stylization, local editing, video-MagicMix and video outpainting.

#computer-vision #multimodal-model #world-model #deep-learning #llm

ByteDance Research
