Topic: cs.AI

Short answer

This page shows the most relevant public items for cs.AI, ranked by trend activity and review signal. Use the weekly ranking for fast-moving work, the monthly ranking for more stable patterns, and the all-time ranking for evergreen picks.

  1. MiRA: A Zero-Shot Mixture-of-Reasoning Agents Framework

    Paper · Feb 20, 2026 · arXiv · Sethuraman et al., AAMAS 2026 Main Track

    We propose Mixture-of-Reasoning Agents (MiRA), a zero-shot multimodal framework that decomposes reasoning across three specialized agents: Visual Analyzing, Text Comprehending, and Judge. By consol...

  2. Agentic Test-Time Scaling for WebAgents

    Paper · Feb 12, 2026 · arXiv · Nicholas Lee, Lutfi Eren Erdogan, Chris Joseph John, Surya Krishnapillai, Kurt Keutzer, Amir Gholami

    Current WebAgents struggle with long-horizon tasks and complex navigation. We propose an agentic scaling framework that increases compute at test-time through iterative trajectory pruning and rewar...

  3. Intelligent AI Delegation

    Paper · Feb 12, 2026 · arXiv · Nenad Tomašev, Kevin R. McKee, Jack Rae, Iason Gabriel, Vukosi Marivate, Milind Tambe, Demis Hassabis, Charles Blundell

    As advanced AI agents evolve beyond query-response models, their utility is increasingly defined by how effectively they can decompose complex objectives and delegate sub-tasks. We propose an adapt...

  4. Minimax M2.5: Scaling RL for Industrial-Grade Agentic AI

    Paper · Feb 16, 2026 · arXiv · MiniMax Research Team

    Training agents for industrial-scale deployment requires extreme stability and data throughput. We present Minimax M2.5, a model trained using a novel asynchronous RL architecture designed to proce...

  5. Fast KV Compaction via Attention Matching

    Paper · Feb 18, 2026 · arXiv · Adam Zweiger, Xinghong Fu, Han Guo, MIT Team

    Large Language Models struggle with memory overhead during long-context inference due to the linear growth of the Key-Value (KV) cache. We propose Attention Matching (AM), a framework for high-qual...

  6. KLong: Training LLM Agents for Extremely Long-horizon Tasks

    Paper · Feb 19, 2026 · arXiv · Yue Liu, Zhiyuan Hu, Flood Sung

    Current LLM agents frequently fail in tasks requiring hundreds of steps due to error accumulation and context overflow. We introduce KLong, an agentic framework that utilizes 'Trajectory-Splitting ...

  7. Simplicity and Complexity in Combinatorial Optimization

    Paper · Feb 15, 2026 · arXiv · DeepMind Research Team

    We explore the boundary between simple heuristics and complex neural-cognitive models in combinatorial optimization. This paper demonstrates how hybrid architectures can leverage memory to shape re...

  8. GLM-5: From Vibe Coding to Agentic Engineering

    Paper · Feb 17, 2026 · arXiv · Zhipu AI Team, Tsinghua University Researchers

    We present GLM-5, a foundation model designed to bridge the gap between human-guided 'vibe coding' and autonomous 'agentic engineering.' GLM-5 introduces DeepSeek-inspired Sparse Attention (DSA) to...

  9. GSR: Learning Structured Reasoning for Embodied Manipulation

    Paper · Feb 10, 2026 · arXiv · Kewei Hu, Michael Zhang, Hanwen Kang

    We introduce Grounded Scene-graph Reasoning (GSR), a structured reasoning paradigm that explicitly models world-state evolution as transitions over semantically grounded scene graphs. By reasoning ...

Related Topics

cs.LG (11) · cs.RO (5) · Robotics (5) · Machine Learning (5) · cs.CL (5)