Topic: cs.AI

Track this topic after sign-in.

Short answer

This page shows the most relevant public items for cs.AI, ranked by trend activity and review signal. Use weekly for fast changes, monthly for more stable patterns, and all-time for evergreen picks.

Weekly Monthly All time

Current month Last month 2 months ago

← Back to home

Graph of Thoughts (GoT) in Agentic Workflows for Non-Linear Problem Solving
Paper • Apr 18, 2025 • arXiv • Maciej Besta, Nils Blach, Torsten Hoefler
While Chain-of-Thought and Tree-of-Thoughts prompting greatly enhance LLM reasoning, they strictly enforce linear or hierarchical cognitive paths. We introduce Graph of Thoughts (GoT), a novel cogn...
AgentBench-2025: Evaluating Autonomous Swarms in Adversarial and Dynamic Environments
Paper • Sep 10, 2025 • arXiv • Yao Mu, Tianyu Zheng, Percy Liang, Dan Hendrycks
As multi-agent swarms are deployed in open-ended web environments, standard static benchmarks fail to capture their vulnerability to dynamic threats and deceptive actors. We introduce AgentBench-20...
Minecraft as a Turing Test: Evaluating Open-Ended Agentic AI
Paper • Jul 15, 2025 • arXiv • Kevin Zhu, Lara Croft, Julian Bao
Evaluating the long-horizon planning and adaptability of Agentic AI in the real world is fraught with safety and cost limitations. We establish Minecraft as the premier sandbox for open-ended agent...
Scaling Laws for Agentic AI: When Does Swarm Intelligence Peak?
Paper • Aug 8, 2025 • arXiv • Jian Lu, Percy Liang, Chelsea Finn
While scaling laws for single-model LLMs are well established, the relationship between the number of collaborating agents and overall system performance remains poorly understood. This paper inves...
Neurosymbolic Agentic AI for Automated Theorem Proving
Paper • Mar 21, 2025 • arXiv • Albert Q. Jiang, Wenda Li, Szymon Tworkowski, Kuhu Syal
Automated theorem proving requires a blend of creative intuition and rigorous logical deduction, a combination that eludes pure deep learning models. We propose a Neurosymbolic Agentic AI framework...
Agentic AI: A Comprehensive Survey of Architectures, Applications, and Future Directions
Paper • Oct 29, 2025 • arXiv • Mohamad Abou Ali, Fadi Dornaika
Agentic AI represents a transformative shift in artificial intelligence, but its rapid advancement has led to a fragmented understanding, often conflating modern neural systems with outdated symbol...
Agentic Alignment: Inverse Reinforcement Learning from Swarm Behavior
Paper • Dec 22, 2025 • arXiv • Percy Liang, Thomas K. V., Eleanor Rigby
Aligning multi-agent systems via traditional human feedback is intractable due to the sheer volume and speed of agent-to-agent interactions. We introduce a novel alignment framework utilizing Inver...
MiRA: A Zero-Shot Mixture-of-Reasoning Agents Framework
Paper • Feb 20, 2026 • arXiv • Sethuraman et al., AAMAS 2026 Main Track
We propose Mixture-of-Reasoning Agents (MiRA), a zero-shot multimodal framework that decomposes reasoning across three specialized agents: Visual Analyzing, Text Comprehending, and Judge. By consol...
Gemini 3.1 Pro: A Smarter Baseline for Complex Reasoning on ARC-AGI-2
Paper • Feb 19, 2026 • arXiv • Google DeepMind Team
We introduce Gemini 3.1 Pro, an enhanced version of the Gemini 3 series optimized for rigorous logic and long-horizon problem solving. Built on a hybrid architecture that fuses linear attention wit...
Concrete Problems in AI Safety
Paper • Jun 21, 2016 • arXiv • Dario Amodei, Chris Olah, Jacob Steinhardt, Paul Christiano, John Schulman, Dan Mané
Rapid progress in machine learning and artificial intelligence (AI) has brought increasing attention to the potential impacts of AI technologies on society. In this paper, we discuss one such poten...
Language models can explain neurons in language models
Paper • May 9, 2023 • OpenAI • Steven Bills, Nick Cammarata, Dan Mossing, Henk Tillman, Leo Gao, Gabriel Goh, Ilya Sutskever, Jan Leike, Jeff Wu, William Saunders
Understanding the internal mechanisms of massive language models is a critical bottleneck for AI safety and alignment. Given the billions of parameters in modern models, manual human inspection of ...
WebGPT: Browser-assisted question-answering with human feedback
Paper • Dec 16, 2021 • arXiv • Reiichiro Nakano, Jacob Hilton, Suchir Balaji, Jeff Wu, Long Ouyang, Christina Kim, Christopher Hesse, Shantanu Jain, Vineet Kosaraju, William Saunders, Xu Jiang, Karl Cobbe, Tyna Eloundou, Gretchen Krueger, Kevin Button, Matthew Knight, Benjamin Chess, John Schulman
We introduce a method for fine-tuning language models to interact with a text-based web browser to answer open-ended questions. This model, WebGPT, searches the web, navigates through links, and sy...
Dota 2 with Large Scale Deep Reinforcement Learning
Paper • Dec 13, 2019 • arXiv • Christopher Berner, Greg Brockman, Brooke Chan, Vicki Cheung, Przemysław Dębiak, Christy Dennison, David Farhi, Quirin Fischer, Shariq Hashme, Chris Hesse, Ilya Sutskever, et al.
We present OpenAI Five, a system of five neural networks that learned to play the highly complex, imperfect-information esports game Dota 2 entirely through self-play. Dota 2 involves long time hor...
Open X-Embodiment: Robotic Learning Datasets and RT-X Models
Paper • Oct 13, 2023 • arXiv • Open X-Embodiment Collaboration (Google DeepMind & Academic Partners)
Large, diverse datasets have catalyzed breakthroughs in natural language and computer vision, yet robotics has struggled to build generalist models due to the fragmented nature of hardware platform...
Mastering the game of Stratego with model-free multiagent reinforcement learning
Paper • Dec 1, 2022 • Science • Julien Perolat, Bart De Vylder, Daniel Hennes, Eugene Tarassov, Florian Strub, Vincent de Boer, Paul Muller, Jerome T. Connor, Neil Burch, Thomas Anthony, Stephen McAleer, Romuald Elie, Sarah H. Cen, Zhe Wang, Audrunas Gruslys, Aleksander Malyshev, Mina Khan, Sherjil Ozair, Finbarr Timbers, Toby Pohlen, Tom Eccles, Mark Rowland, Marc Lanctot, Jean-Baptiste Lespiau, Bilal Piot, Shayegan Omidshafiei, Edward Lockhart, Laurent Sifre, Nathalie Beauguerlange, Remi Munos, David Silver, Satinder Singh, Demis Hassabis, Karl Tuyls
Imperfect information games, where players have hidden information, represent a significant challenge for artificial intelligence. Stratego is a complex, imperfect-information board game with an en...
Mastering the game of Go without human knowledge
Paper • Oct 18, 2017 • Nature • David Silver, Julian Schrittwieser, Karen Simonyan, Ioannis Antonoglou, Aja Huang, Arthur Guez, Thomas Hubert, Lucas Baker, Matthew Lai, Adrian Bolton, Yutian Chen, Timothy Lillicrap, Fan Hui, Laurent Sifre, George van den Driessche, Thore Graepel, Demis Hassabis
A long-standing goal of artificial intelligence is an algorithm that learns, tabula rasa, superhuman proficiency in challenging domains. We introduce AlphaGo Zero, an AI that achieves superhuman pe...
Mastering Diverse Domains through World Models
Paper • Jan 10, 2023 • arXiv • Danijar Hafner, Jurgis Pasukonis, Jimmy Ba, Timothy Lillicrap
General intelligence requires solving tasks across diverse domains without human intervention. We present DreamerV3, a general and scalable reinforcement learning algorithm that masters a wide rang...
RT-1: Robotics Transformer for Real-World Control at Scale
Paper • Dec 13, 2022 • arXiv • Anthony Brohan, Noah Brown, Justice Carbajal, Yevgen Chebotar, Joseph Dabis, Chelsea Finn, Keerthana Gopalakrishnan, Karol Hausman, Alex Herzog, Jasmine Hsu, Julian Ibarz, Brian Ichter, Alex Irpan, Tomas Jackson, Google DeepMind
By transferring knowledge from large, diverse, task-agnostic datasets, modern machine learning models can solve specific downstream tasks either zero-shot or with small task-specific datasets. We i...
Scaling Instructable Agents Across Many Simulated Worlds
Paper • Apr 15, 2024 • arXiv • SIMA Team, Google DeepMind
We introduce the Scalable Instructable Multiworld Agent (SIMA), an AI agent capable of following natural-language instructions to carry out tasks in a wide variety of 3D virtual environments and vi...
Solving olympiad geometry without human demonstrations
Paper • Jan 17, 2024 • Nature • Trieu Trinh, Yuhuai Wu, Quoc V. Le, He He, Thang Luong
Proving mathematical theorems requires deep logical reasoning and intuition, representing a grand challenge for AI. We introduce AlphaGeometry, a neuro-symbolic system that solves complex geometry ...

← PreviousPage 1Next →

FAQ

What does this cs.AI page rank?

It ranks public content for cs.AI using recent discussion, review, and engagement signals so you can triage faster. This guidance is specific to cs.AI topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

How should I use weekly vs monthly vs all-time?

Use weekly for fast-moving updates, monthly for stable trend confirmation, and all-time for evergreen references. This guidance is specific to cs.AI topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

How can I discover organizations active in cs.AI?

Use the linked entities section to jump to labs, companies, and experts connected to this topic and explore their timelines. This guidance is specific to cs.AI topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

Can I follow this topic for updates?

Yes. Use the follow button on this page to subscribe and track new high-signal activity. This guidance is specific to cs.AI topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

Topic: cs.AI

Short answer

Graph of Thoughts (GoT) in Agentic Workflows for Non-Linear Problem Solving

AgentBench-2025: Evaluating Autonomous Swarms in Adversarial and Dynamic Environments

Minecraft as a Turing Test: Evaluating Open-Ended Agentic AI

Scaling Laws for Agentic AI: When Does Swarm Intelligence Peak?

Neurosymbolic Agentic AI for Automated Theorem Proving

Agentic AI: A Comprehensive Survey of Architectures, Applications, and Future Directions

Agentic Alignment: Inverse Reinforcement Learning from Swarm Behavior

MiRA: A Zero-Shot Mixture-of-Reasoning Agents Framework

Gemini 3.1 Pro: A Smarter Baseline for Complex Reasoning on ARC-AGI-2

Concrete Problems in AI Safety

Language models can explain neurons in language models

WebGPT: Browser-assisted question-answering with human feedback

Dota 2 with Large Scale Deep Reinforcement Learning

Open X-Embodiment: Robotic Learning Datasets and RT-X Models

Mastering the game of Stratego with model-free multiagent reinforcement learning

Mastering the game of Go without human knowledge

Mastering Diverse Domains through World Models

RT-1: Robotics Transformer for Real-World Control at Scale

Scaling Instructable Agents Across Many Simulated Worlds

Solving olympiad geometry without human demonstrations

Top Entities In This Topic

Related Topics

FAQ

What does this cs.AI page rank?

How should I use weekly vs monthly vs all-time?

How can I discover organizations active in cs.AI?

Can I follow this topic for updates?