AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via Reinforcement Learning · Paper · Chenwei Lou, Zewei Sun, Xinnian Liang, Meng Qu, Wei Shen,… · 5/25/2025
Model Merging in Pre-training of Large Language Models · Paper · Yunshui Li, Yiyuan Ma, Shen Yan, Chaoyi Zhang, Jing Liu, … · 5/22/2025
Scaling Law for Quantization-Aware Training · Paper · Mengzhao Chen, Chaoyi Zhang, Jing Liu, Yutao Zeng, Zeyue … · 5/20/2025
DAPO: An Open-Source LLM Reinforcement Learning System at Scale · Paper · Qiying Yu, Zheng Zhang, Ruofei Zhu, Yufeng Yuan, Xiaochen… · 5/20/2025
Reformulation for Pretraining Data Augmentation · Paper · Xintong Hao, Ruijie Zhu, Ge Zhang, Ke Shen, Chenggang Li · 5/19/2025
AttentionInfluence: Adopting Attention Head Influence for Weak-to-Strong Pretraining Data Selection · Paper · Kai Hua, Steven Wu, Ge Zhang, Ke Shen · 5/12/2025
FullStack Bench: Evaluating LLMs as Full Stack Coders · Paper · Bytedance-Seed-Foundation-Code-Team, Yao Cheng, Jianfe… · 5/12/2025
Seed1.5-VL Technical Report · Paper · Dong Guo, Faming Wu, Feida Zhu, Fuxing Leng, Guang Shi, H… · 5/11/2025
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs · Paper · Shenao Zhang, Zhihan Liu, Boyi Liu, Yufeng Zhang, Yingxia… · 5/11/2025
Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning · Paper · ByteDance Seed, Jiaze Chen, Tiantian Fan, Xin Liu, Lin… · 4/29/2025
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs · Paper · Jiazhan Feng, Shijue Huang, Xingwei Qu, Ge Zhang, Yujia Q… · 4/17/2025
Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving · Paper · Daoguang Zan, Zhirong Huang, Wei Liu, Hanwu Chen, Linhao … · 4/3/2025
Loong: Generating Minute-level Long Videos with Autoregressive Language Models · Paper · Yuqing Wang, Tianwei Xiong, Daquan Zhou, Zhijie Lin, Yang… · 4/2/2025
ByteCheckpoint: A Unified Checkpointing System for Large Foundation Model Development · Paper · Borui Wan, Mingji Han, Yiyao Sheng, Yanghua Peng, Haibin … · 4/2/2025
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines · Paper · P Team, Xinrun Du, Yifan Yao, Kaijing Ma, Bingli Wang, Ti… · 3/28/2025
Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models · Paper · Zhijian Zhuo, Ya Wang, Yutao Zeng, Xiaoqing Li, Xun Zhou,… · 3/20/2025
Multi-Reward as Condition for Instruction-based Image Editing · Paper · Xin Gu, Ming Li, Libo Zhang, Fan Chen, Longyin Wen, Tieji… · 3/20/2025
FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis · Paper · Luxi Chen, Zihan Zhou, Min Zhao, Yikai Wang, Ge Zhang, We… · 3/19/2025
ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance · Paper · Jiannan Huang, Jun Hao Liew, Hanshu Yan, Yuyang Yin, Yao … · 3/14/2025