Topic: cs.AI

Track this topic after sign-in.

Short answer

This page shows the most relevant public items for cs.AI, ranked by trend activity and review signal. Use weekly for fast changes, monthly for more stable patterns, and all-time for evergreen picks.

WeeklyMonthlyAll time

← Back to home

  1. OpenAI o1 System Card

    PaperSep 12, 2024OpenAIOpenAI

    We introduce OpenAI o1, a new series of large language models trained with reinforcement learning to perform complex reasoning. o1 models are designed to spend more time thinking before they respon...

  2. GPT-4 Technical Report

    PaperMar 15, 2023arXivOpenAI

    We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT...

  3. Training language models to follow instructions with human feedback

    PaperMar 4, 2022arXivLong Ouyang, Jeffrey Wu, Xu Jiang, Diogo Almeida, Carroll Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan Lowe

    Making language models bigger does not inherently make them better at following a user's intent. For example, large language models can generate outputs that are untruthful, toxic, or simply not he...

  4. A Generalist Agent

    PaperMay 12, 2022arXivScott Reed, Konrad Zolna, Emilio Parisotto, Sergio Gomez Colmenarejo, Nando de Freitas

    Inspired by progress in large-scale language modeling, we apply a similar approach towards building a single generalist agent beyond the realm of text outputs. The agent, which we refer to as Gato,...

  5. GLM-5: From Vibe Coding to Agentic Engineering

    PaperFeb 17, 2026arXivZhipu AI Team, Tsinghua University Researchers

    We present GLM-5, a foundation model designed to bridge the gap between human-guided 'vibe coding' and autonomous 'agentic engineering.' GLM-5 introduces DeepSeek-inspired Sparse Attention (DSA) to...

  6. GSR: Learning Structured Reasoning for Embodied Manipulation

    PaperFeb 10, 2026arXivKewei Hu, Michael Zhang, Hanwen Kang

    We introduce Grounded Scene-graph Reasoning (GSR), a structured reasoning paradigm that explicitly models world-state evolution as transitions over semantically grounded scene graphs. By reasoning ...

  7. Mirage2Matter: A Physically Grounded Gaussian World Model from Video

    PaperFeb 8, 2026arXivZhengqing Gao, Ziwen Li, Xin Wang, Tongliang Liu

    To bridge the simulation-to-real gap, we introduce Mirage2Matter, a physically grounded Gaussian world model that generates high-fidelity embodied training data from multi-view videos. We reconstru...

  8. ZEST: Zero-shot Embodied Skill Transfer for Athletic Robot Control

    PaperFeb 8, 2026arXivEva Mungai, Zach Nobles, Scott Kuindersma, Yeuhi Abe

    We introduce ZEST (Zero-shot Embodied Skill Transfer), a motion-imitation framework that trains policies via RL from diverse sources—mocap, noisy monocular video, and animation—and deploys them to ...

  9. Tiny Recursive Reasoning with Mamba-2 Attention Hybrid

    PaperFeb 12, 2026arXivWenlong Wang, Fergal Reid

    Recent work demonstrates that tiny networks (7M parameters) can achieve strong performance on abstract reasoning through latent recursion. We investigate whether Mamba-2's state space recurrence, i...

  10. Inference-Only Prompt Projection for Safe Text-to-Image Generation

    PaperFeb 9, 2026arXivMinhyuk Lee, Hyekyung Yoon, Myungjoo Kang

    Text-to-Image (T2I) diffusion models enable high-quality synthesis, but deployment demands safeguards. We formalize the tension between safety and alignment through a total variation (TV) lens, yie...

  11. MEM1: A Constant-Memory RL Framework for Long-Horizon Language Agents

    PaperFeb 12, 2026arXivYurong Chen, Yu He, Michael I. Jordan, Fan Yao

    Modern language agents must operate over long-horizon, multi-turn interactions, but most rely on full-context prompting which leads to unbounded memory growth. We introduce MEM1, an end-to-end rein...

← PreviousPage 4Next →

Top Entities In This Topic

Related Topics

FAQ

What does this cs.AI page rank?

It ranks public content for cs.AI using recent discussion, review, and engagement signals so you can triage faster. This guidance is specific to cs.AI topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

How should I use weekly vs monthly vs all-time?

Use weekly for fast-moving updates, monthly for stable trend confirmation, and all-time for evergreen references. This guidance is specific to cs.AI topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

How can I discover organizations active in cs.AI?

Use the linked entities section to jump to labs, companies, and experts connected to this topic and explore their timelines. This guidance is specific to cs.AI topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

Can I follow this topic for updates?

Yes. Use the follow button on this page to subscribe and track new high-signal activity. This guidance is specific to cs.AI topic page on Attendemia and is written so it still makes sense without reading other sections on the page.