Topic: cs.AI

Track this topic after sign-in.

Short answer

This page shows the most relevant public items for cs.AI, ranked by trend activity and review signal. Use weekly for fast changes, monthly for more stable patterns, and all-time for evergreen picks.

WeeklyMonthlyAll time
Current monthLast month2 months ago

← Back to home

  1. Grandmaster level in StarCraft II using multi-agent reinforcement learning

    PaperOct 30, 2019NatureOriol Vinyals, Igor Babuschkin, Wojciech M. Czarnecki, Michaël Mathieu, Andrew Dudzik, David Silver

    The game of StarCraft II has emerged as a grand challenge for artificial intelligence research owing to its complex, multi-agent, and partially observable environment. Here we introduce AlphaStar, ...

  2. Sandboxing Agency: Isolation Protocols for Third-Party Tool Use

    PaperFeb 21, 2026arXivLiu et al., Wang et al.

    Current agents often utilize third-party tools (APIs, web browsers, databases) with full authority, creating a 'Tools-as-Attack-Vector' problem. We introduce 'Agency Sandboxing,' a software enginee...

  3. Intelligent AI Delegation

    PaperFeb 12, 2026arXivNenad Tomašev, Kevin R. McKee, Jack Rae, Iason Gabriel, Vukosi Marivate, Milind Tambe, Demis Hassabis, Charles Blundell

    As advanced AI agents evolve beyond query-response models, their utility is increasingly defined by how effectively they can decompose complex objectives and delegate sub-tasks. We propose an adapt...

  4. Minimax M2.5: Scaling RL for Industrial-Grade Agentic AI

    PaperFeb 16, 2026arXivMiniMax Research Team

    Training agents for industrial-scale deployment requires extreme stability and data throughput. We present Minimax M2.5, a model trained using a novel asynchronous RL architecture designed to proce...

  5. Fast KV Compaction via Attention Matching

    PaperFeb 18, 2026arXivAdam Zweiger, Xinghong Fu, Han Guo, MIT Team

    Large Language Models struggle with memory overhead during long-context inference due to the linear growth of the Key-Value (KV) cache. We propose Attention Matching (AM), a framework for high-qual...

  6. KLong: Training LLM Agents for Extremely Long-horizon Tasks

    PaperFeb 19, 2026arXivYue Liu, Zhiyuan Hu, Flood Sung

    Current LLM agents frequently fail in tasks requiring hundreds of steps due to error accumulation and context overflow. We introduce KLong, an agentic framework that utilizes 'Trajectory-Splitting ...

  7. Simplicity and Complexity in Combinatorial Optimization

    PaperFeb 15, 2026arXivDeepMind Research Team

    We explore the boundary between simple heuristics and complex neural-cognitive models in combinatorial optimization. This paper demonstrates how hybrid architectures can leverage memory to shape re...

  8. OpenAI o1 System Card

    PaperSep 12, 2024OpenAIOpenAI

    We introduce OpenAI o1, a new series of large language models trained with reinforcement learning to perform complex reasoning. o1 models are designed to spend more time thinking before they respon...

  9. GPT-4 Technical Report

    PaperMar 15, 2023arXivOpenAI

    We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT...

  10. Training language models to follow instructions with human feedback

    PaperMar 4, 2022arXivLong Ouyang, Jeffrey Wu, Xu Jiang, Diogo Almeida, Carroll Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan Lowe

    Making language models bigger does not inherently make them better at following a user's intent. For example, large language models can generate outputs that are untruthful, toxic, or simply not he...

  11. A Generalist Agent

    PaperMay 12, 2022arXivScott Reed, Konrad Zolna, Emilio Parisotto, Sergio Gomez Colmenarejo, Nando de Freitas

    Inspired by progress in large-scale language modeling, we apply a similar approach towards building a single generalist agent beyond the realm of text outputs. The agent, which we refer to as Gato,...

← PreviousPage 3Next →

Top Entities In This Topic

Related Topics

FAQ

What does this cs.AI page rank?

It ranks public content for cs.AI using recent discussion, review, and engagement signals so you can triage faster. This guidance is specific to cs.AI topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

How should I use weekly vs monthly vs all-time?

Use weekly for fast-moving updates, monthly for stable trend confirmation, and all-time for evergreen references. This guidance is specific to cs.AI topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

How can I discover organizations active in cs.AI?

Use the linked entities section to jump to labs, companies, and experts connected to this topic and explore their timelines. This guidance is specific to cs.AI topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

Can I follow this topic for updates?

Yes. Use the follow button on this page to subscribe and track new high-signal activity. This guidance is specific to cs.AI topic page on Attendemia and is written so it still makes sense without reading other sections on the page.