Topic: deep-learning/from/bytedance-research


Short answer

This page shows the most relevant public items for deep-learning/from/bytedance-research, ranked by trend activity and review signals. Use the weekly view for fast-moving changes, the monthly view for more stable patterns, and the all-time view for evergreen picks.



  1. FAN: Fourier Analysis Networks

    Paper · Oct 26, 2025 · arxiv.org · Yihong Dong, Ge Li, Yongding Tao, Xue Jiang, Kechi Zhang, Jia Li, Jinliang Deng, Jing Su, Jun Zhang, Jingjing Xu

    Despite the remarkable successes of general-purpose neural networks, such as MLPs and Transformers, we find that they exhibit notable shortcomings in modeling and reasoning about periodic phenomena...

  2. Loong: Generating Minute-level Long Videos with Autoregressive Language Models

    Paper · Apr 2, 2025 · arxiv.org · Yuqing Wang, Tianwei Xiong, Daquan Zhou, Zhijie Lin, Yang Zhao, Bingyi Kang, Jiashi Feng, Xihui Liu

    It is desirable but challenging to generate content-rich long videos on the scale of minutes. Autoregressive large language models (LLMs) have achieved great success in generating coherent and long...

  3. Hyper-Connections

    Paper · Mar 18, 2025 · arxiv.org · Defa Zhu, Hongzhi Huang, Zihao Huang, Yutao Zeng, Yunyao Mao, Banggu Wu, Qiyang Min, Xun Zhou

    We present hyper-connections, a simple yet effective method that can serve as an alternative to residual connections. This approach specifically addresses common drawbacks observed in residual conn...

  4. ByteCheckpoint: A Unified Checkpointing System for Large Foundation Model Development

    Paper · Apr 2, 2025 · arxiv.org · Borui Wan, Mingji Han, Yiyao Sheng, Yanghua Peng, Haibin Lin, Mofan Zhang, Zhichao Lai, Menghan Yu, Junda Zhang, Zuquan Song, Xin Liu, Chuan Wu

    Checkpointing to preserve training states is crucial during the development of Large Foundation Models (LFMs), for training resumption upon various failures or changes in GPU resources and parallel...

  5. Let the Code LLM Edit Itself When You Edit the Code

    Paper · Mar 4, 2025 · arxiv.org · Zhenyu He, Jun Zhang, Shengjie Luo, Jingjing Xu, Zhi Zhang, Di He

    In this work, we investigate a typical scenario in code generation where a developer edits existing code in real time and requests a code assistant, e.g., a large language model, to re-predict the ...

  6. ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance

    Paper · Mar 14, 2025 · arxiv.org · Jiannan Huang, Jun Hao Liew, Hanshu Yan, Yuyang Yin, Yao Zhao, Humphrey Shi, Yunchao Wei

    Recent text-to-image customization works have proven successful in generating images of given concepts by fine-tuning diffusion models on a few examples. However, tuning-based methods inherently te...




FAQ

What does this deep-learning/from/bytedance-research page rank?

It ranks public content for deep-learning/from/bytedance-research using recent discussion, review, and engagement signals so you can triage faster. This guidance is specific to the deep-learning/from/bytedance-research topic page on Attendemia and is written to make sense without reading other sections of the page.

How should I use weekly vs monthly vs all-time?

Use weekly for fast-moving updates, monthly for stable trend confirmation, and all-time for evergreen references.

How can I discover organizations active in deep-learning/from/bytedance-research?

Use the linked entities section to jump to labs, companies, and experts connected to this topic and explore their timelines.

Can I follow this topic for updates?

Yes. Use the follow button on this page to subscribe and track new high-signal activity.