Topic: cs.AI

Track this topic after sign-in.

Short answer

This page shows the most relevant public items for cs.AI, ranked by trend activity and review signal. Use weekly for fast changes, monthly for more stable patterns, and all-time for evergreen picks.

WeeklyMonthlyAll time
Current monthLast month2 months ago

← Back to home

  1. Minecraft as a Turing Test: Evaluating Open-Ended Agentic AI

    PaperJul 15, 2025arXivKevin Zhu, Lara Croft, Julian Bao

    Evaluating the long-horizon planning and adaptability of Agentic AI in the real world is fraught with safety and cost limitations. We establish Minecraft as the premier sandbox for open-ended agent...

  2. Scaling Laws for Agentic AI: When Does Swarm Intelligence Peak?

    PaperAug 8, 2025arXivJian Lu, Percy Liang, Chelsea Finn

    While scaling laws for single-model LLMs are well established, the relationship between the number of collaborating agents and overall system performance remains poorly understood. This paper inves...

  3. Neurosymbolic Agentic AI for Automated Theorem Proving

    PaperMar 21, 2025arXivAlbert Q. Jiang, Wenda Li, Szymon Tworkowski, Kuhu Syal

    Automated theorem proving requires a blend of creative intuition and rigorous logical deduction, a combination that eludes pure deep learning models. We propose a Neurosymbolic Agentic AI framework...

  4. Agentic Alignment: Inverse Reinforcement Learning from Swarm Behavior

    PaperDec 22, 2025arXivPercy Liang, Thomas K. V., Eleanor Rigby

    Aligning multi-agent systems via traditional human feedback is intractable due to the sheer volume and speed of agent-to-agent interactions. We introduce a novel alignment framework utilizing Inver...

  5. MiRA: A Zero-Shot Mixture-of-Reasoning Agents Framework

    PaperFeb 20, 2026arXivSethuraman et al., AAMAS 2026 Main Track

    We propose Mixture-of-Reasoning Agents (MiRA), a zero-shot multimodal framework that decomposes reasoning across three specialized agents: Visual Analyzing, Text Comprehending, and Judge. By consol...

  6. Concrete Problems in AI Safety

    PaperJun 21, 2016arXivDario Amodei, Chris Olah, Jacob Steinhardt, Paul Christiano, John Schulman, Dan Mané

    Rapid progress in machine learning and artificial intelligence (AI) has brought increasing attention to the potential impacts of AI technologies on society. In this paper, we discuss one such poten...

  7. Language models can explain neurons in language models

    PaperMay 9, 2023OpenAISteven Bills, Nick Cammarata, Dan Mossing, Henk Tillman, Leo Gao, Gabriel Goh, Ilya Sutskever, Jan Leike, Jeff Wu, William Saunders

    Understanding the internal mechanisms of massive language models is a critical bottleneck for AI safety and alignment. Given the billions of parameters in modern models, manual human inspection of ...

  8. WebGPT: Browser-assisted question-answering with human feedback

    PaperDec 16, 2021arXivReiichiro Nakano, Jacob Hilton, Suchir Balaji, Jeff Wu, Long Ouyang, Christina Kim, Christopher Hesse, Shantanu Jain, Vineet Kosaraju, William Saunders, Xu Jiang, Karl Cobbe, Tyna Eloundou, Gretchen Krueger, Kevin Button, Matthew Knight, Benjamin Chess, John Schulman

    We introduce a method for fine-tuning language models to interact with a text-based web browser to answer open-ended questions. This model, WebGPT, searches the web, navigates through links, and sy...

  9. Dota 2 with Large Scale Deep Reinforcement Learning

    PaperDec 13, 2019arXivChristopher Berner, Greg Brockman, Brooke Chan, Vicki Cheung, Przemysław Dębiak, Christy Dennison, David Farhi, Quirin Fischer, Shariq Hashme, Chris Hesse, Ilya Sutskever, et al.

    We present OpenAI Five, a system of five neural networks that learned to play the highly complex, imperfect-information esports game Dota 2 entirely through self-play. Dota 2 involves long time hor...

  10. Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    PaperOct 13, 2023arXivOpen X-Embodiment Collaboration (Google DeepMind & Academic Partners)

    Large, diverse datasets have catalyzed breakthroughs in natural language and computer vision, yet robotics has struggled to build generalist models due to the fragmented nature of hardware platform...

  11. Mastering the game of Stratego with model-free multiagent reinforcement learning

    PaperDec 1, 2022ScienceJulien Perolat, Bart De Vylder, Daniel Hennes, Eugene Tarassov, Florian Strub, Vincent de Boer, Paul Muller, Jerome T. Connor, Neil Burch, Thomas Anthony, Stephen McAleer, Romuald Elie, Sarah H. Cen, Zhe Wang, Audrunas Gruslys, Aleksander Malyshev, Mina Khan, Sherjil Ozair, Finbarr Timbers, Toby Pohlen, Tom Eccles, Mark Rowland, Marc Lanctot, Jean-Baptiste Lespiau, Bilal Piot, Shayegan Omidshafiei, Edward Lockhart, Laurent Sifre, Nathalie Beauguerlange, Remi Munos, David Silver, Satinder Singh, Demis Hassabis, Karl Tuyls

    Imperfect information games, where players have hidden information, represent a significant challenge for artificial intelligence. Stratego is a complex, imperfect-information board game with an en...

  12. Mastering the game of Go without human knowledge

    PaperOct 18, 2017NatureDavid Silver, Julian Schrittwieser, Karen Simonyan, Ioannis Antonoglou, Aja Huang, Arthur Guez, Thomas Hubert, Lucas Baker, Matthew Lai, Adrian Bolton, Yutian Chen, Timothy Lillicrap, Fan Hui, Laurent Sifre, George van den Driessche, Thore Graepel, Demis Hassabis

    A long-standing goal of artificial intelligence is an algorithm that learns, tabula rasa, superhuman proficiency in challenging domains. We introduce AlphaGo Zero, an AI that achieves superhuman pe...

  13. Mastering Diverse Domains through World Models

    PaperJan 10, 2023arXivDanijar Hafner, Jurgis Pasukonis, Jimmy Ba, Timothy Lillicrap

    General intelligence requires solving tasks across diverse domains without human intervention. We present DreamerV3, a general and scalable reinforcement learning algorithm that masters a wide rang...

  14. RT-1: Robotics Transformer for Real-World Control at Scale

    PaperDec 13, 2022arXivAnthony Brohan, Noah Brown, Justice Carbajal, Yevgen Chebotar, Joseph Dabis, Chelsea Finn, Keerthana Gopalakrishnan, Karol Hausman, Alex Herzog, Jasmine Hsu, Julian Ibarz, Brian Ichter, Alex Irpan, Tomas Jackson, Google DeepMind

    By transferring knowledge from large, diverse, task-agnostic datasets, modern machine learning models can solve specific downstream tasks either zero-shot or with small task-specific datasets. We i...

  15. Scaling Instructable Agents Across Many Simulated Worlds

    PaperApr 15, 2024arXivSIMA Team, Google DeepMind

    We introduce the Scalable Instructable Multiworld Agent (SIMA), an AI agent capable of following natural-language instructions to carry out tasks in a wide variety of 3D virtual environments and vi...

  16. Solving olympiad geometry without human demonstrations

    PaperJan 17, 2024NatureTrieu Trinh, Yuhuai Wu, Quoc V. Le, He He, Thang Luong

    Proving mathematical theorems requires deep logical reasoning and intuition, representing a grand challenge for AI. We introduce AlphaGeometry, a neuro-symbolic system that solves complex geometry ...

← PreviousPage 1Next →

Top Entities In This Topic

Related Topics

FAQ

What does this cs.AI page rank?

It ranks public content for cs.AI using recent discussion, review, and engagement signals so you can triage faster. This guidance is specific to cs.AI topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

How should I use weekly vs monthly vs all-time?

Use weekly for fast-moving updates, monthly for stable trend confirmation, and all-time for evergreen references. This guidance is specific to cs.AI topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

How can I discover organizations active in cs.AI?

Use the linked entities section to jump to labs, companies, and experts connected to this topic and explore their timelines. This guidance is specific to cs.AI topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

Can I follow this topic for updates?

Yes. Use the follow button on this page to subscribe and track new high-signal activity. This guidance is specific to cs.AI topic page on Attendemia and is written so it still makes sense without reading other sections on the page.