Topic: AI Engineering

Short answer

This page shows the most relevant public items for AI Engineering, ranked by trend activity and review signal. Use weekly for fast changes, monthly for more stable patterns, and all-time for evergreen picks.

ProRAG: Process-Supervised Reinforcement Learning for Retrieval-Augmented Generation
Paper • Jan 29, 2026 • arxiv.org • Zhao Wang, Ziliang Zhao, Zhicheng Dou
Reinforcement learning (RL) has become a promising paradigm for optimizing Retrieval-Augmented Generation (RAG) in complex reasoning tasks. However, traditional outcome-based RL approaches often su...
JADE: Bridging the Strategic-Operational Gap in Dynamic Agentic RAG
Paper • Jan 29, 2026 • arxiv.org • Yiqun Chen, Erhan Zhang, Tianyi Hu, Shijie Wang, Zixuan Yang, Meizhi Zhong, Xiaochi Wei, Yan Gao, Yi Wu, Yao Hu, Jiaxin Mao
The evolution of Retrieval-Augmented Generation (RAG) has shifted from static retrieval pipelines to dynamic, agentic workflows where a central planner orchestrates multi-turn reasoning. However, e...
DIVERGE: Diversity-Enhanced RAG for Open-Ended Information Seeking
Paper • Jan 30, 2026 • arxiv.org • Tianyi Hu, Niket Tandon, Akhil Arora
Existing retrieval-augmented generation (RAG) systems are primarily designed under the assumption that each query has a single correct answer. This overlooks common information-seeking scenarios wi...
Aggregation Queries over Unstructured Text: Benchmark and Agentic Method
Paper • Feb 3, 2026 • arxiv.org • Haojia Zhu, Qinyuan Xu, Haoyu Li, Yuxi Liu, Hanchen Qiu, Jiaoyan Chen, Jiahui Jin
Aggregation query over free text is a long-standing yet underexplored problem. Unlike ordinary question answering, aggregate queries require exhaustive evidence collection and systems are required ...
ProcMEM: Learning Reusable Procedural Memory from Experience via Non-Parametric PPO for LLM Agents
Paper • Feb 2, 2026 • arxiv.org • Qirui Mi, Zhijian Ma, Mengyue Yang, Haoxuan Li, Yisen Wang, Haifeng Zhang, Jun Wang
LLM-driven agents demonstrate strong performance in sequential decision-making but often rely on on-the-fly reasoning, re-deriving solutions even in recurring scenarios. This insufficient experienc...
SOPRAG: Multi-view Graph Experts Retrieval for Industrial Standard Operating Procedures
Paper • Feb 2, 2026 • arxiv.org • Liangtao Lin, Zhaomeng Zhu, Tianwei Zhang, Yonggang Wen
Standard Operating Procedures (SOPs) are essential for ensuring operational safety and consistency in industrial environments. However, retrieving and following these procedures presents unique cha...
AI Agent Systems for Supply Chains: Structured Decision Prompts and Memory Retrieval
Paper • Feb 5, 2026 • arxiv.org • Konosuke Yoshizato, Kazuma Shimizu, Ryota Higa, Takanobu Otsuka
This study investigates large language model (LLM)-based multi-agent systems (MASs) as a promising approach to inventory management, which is a key component of supply chain management. Although t...
Graph-based Agent Memory: Taxonomy, Techniques, and Applications
Paper • Feb 5, 2026 • arxiv.org • Chang Yang, Chuang Zhou, Yilin Xiao, Su Dong, Luyao Zhuang, Yujing Zhang, Zhu Wang, Zijin Hong, Zheng Yuan, Zhishang Xiang, Shengyuan Chen, Huachi Zhou, Qinggang Zhang, Ninghao Liu, Jinsong Su, Xinrun Wang, Yi Chang, Xiao Huang
Memory emerges as the core module in the Large Language Model (LLM)-based agents for long-horizon complex tasks (e.g., multi-turn dialogue, game playing, scientific discovery), where memory can ena...
Mitigating Hallucination in Financial Retrieval-Augmented Generation via Fine-Grained Knowledge Verification
Paper • Feb 5, 2026 • arxiv.org • Taoye Yin, Haoyuan Hu, Yaxin Fan, Xinhao Chen, Xinya Wu, Kai Deng, Kezun Zhang, Feng Wang
In financial Retrieval-Augmented Generation (RAG) systems, models frequently rely on retrieved documents to generate accurate responses due to the time-sensitive nature of the financial domain. Whi...
CompactRAG: Reducing LLM Calls and Token Overhead in Multi-Hop Question Answering
Paper • Feb 5, 2026 • arxiv.org • Hao Yang, Zhiyu Yang, Xupeng Zhang, Wei Wei, Yunjie Zhang, Lin Yang
Retrieval-augmented generation (RAG) has become a key paradigm for knowledge-intensive question answering. However, existing multi-hop RAG systems remain inefficient, as they alternate between retr...
Learning to Share: Selective Memory for Efficient Parallel Agentic Systems
Paper • Feb 5, 2026 • arxiv.org • Joseph Fioresi, Parth Parag Kulkarni, Ashmal Vayani, Song Wang, Mubarak Shah
Agentic systems solve complex tasks by coordinating multiple agents that iteratively reason, invoke tools, and exchange intermediate results. To improve robustness and solution quality, recent appr...
Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory
Paper • Feb 5, 2026 • arxiv.org • Haozhen Zhang, Haodong Yue, Tao Feng, Quanyu Long, Jianzhu Bao, Bowen Jin, Weizhi Zhang, Xiao Li, Jiaxuan You, Chengwei Qin, Wenya Wang
Memory is increasingly central to Large Language Model (LLM) agents operating beyond a single context window, yet most existing systems rely on offline, query-agnostic memory construction that can ...
Bayesian Orchestration of Multi-LLM Agents for Cost-Aware Sequential Decision-Making
Paper • Jan 4, 2026 • arxiv.org • Danial Amin
Large language models (LLMs) are increasingly deployed as autonomous decision agents in settings with asymmetric error costs: hiring (missed talent vs wasted interviews), medical triage (missed eme...
When Numbers Start Talking: Implicit Numerical Coordination Among LLM-Based Agents
Paper • Jan 7, 2026 • arxiv.org • Alessio Buscemi, Daniele Proverbio, Alessandro Di Stefano, The Anh Han, German Castignani, Pietro Liò
LLM-based agents increasingly operate in multi-agent environments where strategic interaction and coordination are required. While existing work has largely focused on individual agents or on inte...
TCAndon-Router: Adaptive Reasoning Router for Multi-Agent Collaboration
Paper • Jan 8, 2026 • arxiv.org • Jiuzhou Zhao, Chunrong Chen, Chenqi Qiao, Lebin Zheng, Minqi Han, Yanchi Liu, Yongzhou Xu, Xiaochuan Xu, Min Zhang
Multi-Agent Systems (MAS) have become a powerful paradigm for building high-performance intelligent applications. Within these systems, the router responsible for determining which expert agents sho...
ResMAS: Resilience Optimization in LLM-based Multi-agent Systems
Paper • Jan 8, 2026 • arxiv.org • Zhilun Zhou, Zihan Liu, Jiahe Liu, Qingyu Shao, Yihan Wang, Kun Shao, Depeng Jin, Fengli Xu
Large Language Model-based Multi-Agent Systems (LLM-based MAS), where multiple LLM agents collaborate to solve complex tasks, have shown impressive performance in many areas. However, MAS are typic...
When Single-Agent with Skills Replace Multi-Agent Systems and When They Fail
Paper • Jan 14, 2026 • arxiv.org • Xiaoxiao Li
Multi-agent AI systems have proven effective for complex reasoning. These systems are compounded by specialized agents, which collaborate through explicit communication, but incur substantial compu...
Belief in Authority: Impact of Authority in Multi-Agent Evaluation Framework
Paper • Jan 8, 2026 • arxiv.org • Junhyuk Choi, Jeongyoun Kwon, Heeju Kim, Haeun Cho, Hayeong Jung, Sehee Min, Bugeun Kim
Multi-agent systems utilizing large language models often assign authoritative roles to improve performance, yet the impact of authority bias on agent interactions remains underexplored. We present...
Orchestrating Intelligence: Confidence-Aware Routing for Efficient Multi-Agent Collaboration across Multi-Scale Models
Paper • Jan 24, 2026 • arxiv.org • Jingbo Wang, Sendong Zhao, Jiatong Liu, Haochun Wang, Wanting Li, Bing Qin, Ting Liu
While multi-agent systems (MAS) have demonstrated superior performance over single-agent approaches in complex reasoning tasks, they often suffer from significant computational inefficiencies. Exis...
Demystifying Multi-Agent Debate: The Role of Confidence and Diversity
Paper • Jan 9, 2026 • arxiv.org • Xiaochen Zhu, Caiqi Zhang, Yizhou Chi, Tom Stafford, Nigel Collier, Andreas Vlachos
Multi-agent debate (MAD) is widely used to improve large language model (LLM) performance through test-time scaling, yet recent work shows that vanilla MAD often underperforms simple majority vote ...

FAQ

What does this AI Engineering page rank?

It ranks public content for AI Engineering using recent discussion, review, and engagement signals so you can triage faster. This guidance is specific to the AI Engineering topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

How should I use weekly vs monthly vs all-time?

Use weekly for fast-moving updates, monthly for stable trend confirmation, and all-time for evergreen references.

How can I discover organizations active in AI Engineering?

Use the linked entities section to jump to labs, companies, and experts connected to this topic and explore their timelines.

Can I follow this topic for updates?

Yes. Use the follow button on this page to subscribe and track new high-signal activity.