DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction · Paper · Yiheng Liu, Liao Qu, Huichao Zhang, Xu Wang, Yi Jiang, Yi… · 11/11/2025
SAIL-Embedding Technical Report: Omni-modal Embedding Foundation Model · Paper · Lin Lin, Jiefeng Long, Zhihe Wan, Yuchi Wang, Dingkang Ya… · 11/2/2025
Recitation over Reasoning: How Cutting-Edge Language Models Can Fail on Elementary School-Level Reasoning Problems? · Paper · Kai Yan, Yufei Xu, Zhengyin Du, Xuesong Yao, Zheyu Wang, … · 11/1/2025
Scaling Diffusion Transformers Efficiently via $μ$P · Paper · Chenyu Zheng, Xinyu Zhang, Rongzhen Wang, Wei Huang, Zhi … · 10/31/2025
FAN: Fourier Analysis Networks · Paper · Yihong Dong, Ge Li, Yongding Tao, Xue Jiang, Kechi Zhang,… · 10/26/2025
Video-As-Prompt: Unified Semantic Control for Video Generation · Paper · Yuxuan Bian, Xin Chen, Zenan Li, Tiancheng Zhi, Shen Sang… · 10/23/2025
MoGA: Mixture-of-Groups Attention for End-to-End Long Video Generation · Paper · Weinan Jia, Yuning Lu, Mengqi Huang, Hualiang Wang, Binyu… · 10/21/2025
MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production · Paper · Chao Jin, Ziheng Jiang, Zhihao Bai, Zheng Zhong, Juncai L… · 10/17/2025
BFS-Prover: Scalable Best-First Tree Search for LLM-based Automatic Theorem Proving · Paper · Ran Xin, Chenguang Xi, Jie Yang, Feng Chen, Hang Wu, Xia … · 10/9/2025
Diffusion Adversarial Post-Training for One-Step Video Generation · Paper · Shanchuan Lin, Xin Xia, Yuxi Ren, Ceyuan Yang, Xuefeng Xi… · 10/1/2025
ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs · Paper · Jiaru Zou, Ling Yang, Jingwen Gu, Jiahao Qiu, Ke Shen, Ji… · 9/25/2025
Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning · Paper · Yinjie Wang, Ling Yang, Ye Tian, Ke Shen, Mengdi Wang · 9/25/2025
MMaDA: Multimodal Large Diffusion Language Models · Paper · Ling Yang, Ye Tian, Bowen Li, Xinchen Zhang, Ke Shen, Yun… · 9/25/2025
Lynx: Towards High-Fidelity Personalized Video Generation · Paper · Shen Sang, Tiancheng Zhi, Tianpei Gu, Jing Liu, Linjie Luo · 9/19/2025
Robix: A Unified Model for Robot Interaction, Reasoning and Planning · Paper · Huang Fang, Mengxi Zhang, Heng Dong, Wei Li, Zixuan Wang,… · 9/11/2025
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning · Paper · Zhiheng Xi, Jixuan Huang, Chenyang Liao, Baodai Huang, Ho… · 9/10/2025
Reverse-Engineered Reasoning for Open-Ended Generation · Paper · Haozhe Wang, Haoran Que, Qixin Xu, Minghao Liu, Wangchuns… · 9/7/2025
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning · Paper · Haoming Wang, Haoyang Zou, Huatong Song, Jiazhan Feng, Ju… · 9/5/2025
Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions? · Paper · Qinyan Zhang, Xinping Lei, Ruijie Miao, Yu Fu, Haojie Fan… · 9/4/2025
DanceGRPO: Unleashing GRPO on Visual Generation · Paper · Zeyue Xue, Jie Wu, Yu Gao, Fangyuan Kong, Lingting Zhu, M… · 8/28/2025