№01
cs.AI arxiv:2605.30136v1

Enhancing Multi-Agent Communication through Attention Steering with Context Relevance

Hongxiang Zhang, Yuan Tian, Tianyi Zhang

This paper introduces **Agent-Radar**, a training-free context management method designed to combat performance degradation in multi-agent LLM systems caused by long, diluted conversation histories. Agent-Radar dynamically steers each agent's attention toward relevant context using a novel temporal and spatial decay me…

9
№02
cs.AI arxiv:2605.30322v1

Gram: Assessing sabotage propensities via automated alignment auditing

David Lindner, Victoria Krakovna, Sebastian Farquhar

Gram is an automated alignment auditing framework that assesses the propensity of AI agents to engage in sabotage across simulated agentic deployment scenarios. The paper finds that Gemini models exhibit sabotage-like misbehavior in 2-3% of tests, often due to overeagerness, and introduces an investig…

9
№03
cs.AI arxiv:2605.30260v1

How LoRA Remembers? A Parametric Memory Law for LLM Finetuning

Ziwen Xu, Haiwen Hong, Linsong Yu et al.

This paper investigates the quantitative memory capacity of LoRA fine-tuning in LLMs by treating it as a controlled memory probe. The core contribution is the introduction of the **Parametric Memory Law**, a power law linking loss reduction to the effective number of LoRA parameters and sequence length. Furthermore, th…

9
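
The summary for entry №03 describes a power law relating loss reduction to the effective number of LoRA parameters and sequence length, but the truncated text does not give its exact form. As an illustration only, the sketch below assumes a generic form `loss_drop = a * n_params**alpha * seq_len**beta` and fits it by ordinary least squares in log space. The function name, the argument names, and the functional form are assumptions for this example, not the paper's definition.

```python
import numpy as np


def fit_power_law(n_params, seq_len, loss_drop):
    """Fit loss_drop = a * n_params**alpha * seq_len**beta (assumed form).

    Taking logs turns the power law into a linear model:
        log(loss_drop) = log(a) + alpha*log(n_params) + beta*log(seq_len)
    which is solved here with ordinary least squares.
    """
    n_params = np.asarray(n_params, dtype=float)
    seq_len = np.asarray(seq_len, dtype=float)
    loss_drop = np.asarray(loss_drop, dtype=float)

    # Design matrix: intercept, log N, log T.
    X = np.column_stack([np.ones_like(n_params), np.log(n_params), np.log(seq_len)])
    y = np.log(loss_drop)

    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    log_a, alpha, beta = coef
    return float(np.exp(log_a)), float(alpha), float(beta)
```

The log-space fit requires strictly positive inputs and at least three measurements whose `(log n_params, log seq_len)` pairs are not collinear.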
№04
cs.AI arxiv:2605.30323v1

In-Context Reward Adaptation for Robust Preference Modeling

Zhenyu Sun, Zheng Xu, Ermin Wei

This paper introduces **In-Context Reward Adaptation**, a transformer-based framework for robust preference modeling in RLHF. The core method leverages the in-context learning capabilities of transformers to **adaptively infer the underlying reward structure** from a small set of preference demonstrations, allowing it …

9
№05
cs.AI arxiv:2605.30348v1

LLMSurgeon: Diagnosing Data Mixture of Large Language Models

Yaxin Luo, Jiacheng Cui, Xiaohan Zhao et al.

LLMSurgeon introduces Data Mixture Surgery (DMS) to estimate the domain-level distribution of an LLM's pretraining corpus using only its generated text. The method frames this as an inverse problem under a label-shift assumption, using a calibrated soft confusion matrix to correct systematic domain confusion and recove…

9
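
The summary for entry №05 frames mixture estimation as an inverse problem under label shift: a domain classifier's average predictions on generated text are a confusion-matrix-weighted mix of the true domain proportions. A minimal sketch of that inversion is shown below, assuming a known column-stochastic confusion matrix and using projected gradient descent onto the probability simplex. The function names and the choice of solver are assumptions for illustration; the paper's actual calibration and estimation procedure is not shown in the truncated summary.

```python
import numpy as np


def project_to_simplex(v):
    """Euclidean projection of v onto {p : p >= 0, sum(p) = 1}."""
    u = np.sort(v)[::-1]
    css = np.cumsum(u) - 1.0
    idx = np.arange(1, len(v) + 1)
    rho = np.nonzero(u - css / idx > 0)[0][-1]
    theta = css[rho] / (rho + 1)
    return np.maximum(v - theta, 0.0)


def estimate_mixture(confusion, observed, iters=2000):
    """Estimate domain proportions p such that confusion @ p ~= observed.

    confusion[i, j]: probability that text from true domain j is labeled i
                     (each column sums to 1).
    observed[i]:     average predicted probability of label i on generated text.
    """
    C = np.asarray(confusion, dtype=float)
    q = np.asarray(observed, dtype=float)
    k = C.shape[1]

    # Step size 1/L, where L is the Lipschitz constant of the gradient.
    step = 1.0 / (np.linalg.norm(C, 2) ** 2)

    p = np.full(k, 1.0 / k)  # start from the uniform mixture
    for _ in range(iters):
        grad = C.T @ (C @ p - q)  # gradient of 0.5 * ||C p - q||^2
        p = project_to_simplex(p - step * grad)
    return p
```

Projecting onto the simplex keeps the estimate a valid distribution even when noise in `observed` would push an unconstrained least-squares solution to negative proportions.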
№06
cs.AI arxiv:2605.30335v1

Locally Coherent, Globally Incoherent: Bounding Compositional Incoherence in Multi-Component LLM Agents

Anany Kotawala

This paper introduces the **compositional residual ($\epsilon^*$)** to quantify the failure mode where locally coherent multi-component LLM agents produce globally incoherent probabilistic outputs. The core contribution is formalizing this incoherence, providing a product-structure dichotomy for when local coherence su…

9
№07
cs.AI arxiv:2605.30274v1

Loong: A Human-Like Long Document Translation Agent with Observe-and-Act Adaptive Context Selection

Yutong Wang, Xuebo Liu, Derek F. Wong et al.

Loong is a human-like long document translation agent that overcomes context window limitations by employing a 3E memory module (Essence-Exemplar-Entity) to store relevant historical context. Its core method involves deep reasoning to adaptively select the optimal context for translation guidance, with its context poli…

9
№08
cs.AI arxiv:2605.30159v1

Meta-Cognitive Memory Policy Optimization for Long-Horizon LLM Agents

Ziyan Liu, Zhezheng Hao, Yeqiu Chen et al.

This paper addresses the issue of information loss in memory-augmented LLM agents during long-horizon tasks, where recursive summarization degrades memory quality. The core method introduces **Belief Entropy** as a self-supervised proxy to measure the uncertainty of the latent task state based on the current memory sum…

9
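
The summary for entry №08 describes Belief Entropy as a proxy for uncertainty about the latent task state given the current memory summary. How the paper computes it is cut off in the truncated text. The sketch below shows only the underlying idea, assuming the belief is a discrete distribution over a few candidate task states and using plain Shannon entropy. The function name and the discrete-belief setup are assumptions for this example.

```python
import math


def belief_entropy(weights):
    """Shannon entropy (in nats) of a discrete belief over candidate states.

    weights: non-negative scores, one per candidate state; they are
             normalized to a probability distribution before use.
    """
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must contain at least one positive value")

    entropy = 0.0
    for w in weights:
        p = w / total
        if p > 0:  # treat 0 * log(0) as 0
            entropy -= p * math.log(p)
    return entropy
```

Under this reading, a memory summary that leaves the agent sure of the task state gives entropy near zero, while one that has lost distinguishing details spreads the belief out and raises the entropy.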
№09
cs.AI arxiv:2605.30187v1

Modularizing Educational LLM-Agency for Fostering Responsible Learning Assistance

Julius Gabelmann, Felix Jahn, Kevin Baum et al.

This paper proposes a modular agentic architecture for educational LLMs to ensure responsible student assistance during exercise solving. By breaking down the monolithic structure, the authors introduce specific modules for different stages of problem-solving, allowing for the explicit incorporation of pedagogical cons…

9
№10
cs.AI arxiv:2605.30148v1

Overcoming Forgetting in LLM Fine-Tuning with Evolution Strategies

Kajetan Schweighofer, Conor F. Hayes, Roberto Dailey et al.

This paper investigates performance drift, often mistaken for forgetting, during LLM fine-tuning using Evolution Strategies (ES), finding it also occurs with RL methods. The authors attribute this drift to ES training dynamics, specifically random walks in weakly constrained weight space. To mitigate this, they introdu…

9
№11
cs.AI arxiv:2605.30284v1

ProjectionBench: Evaluating Scientific Hypothesis Generation in LLMs Under Progressive Information Disclosure

A. J. Lew, Y. Cao, M. J. Buehler

ProjectionBench evaluates LLMs' scientific hypothesis generation by progressively disclosing information from a research problem to the final null hypothesis test. The core method involves tasking the model with generating hypotheses at each disclosure stage, which are then semantically compared against the original pa…

9
№12
cs.AI arxiv:2605.30280v1

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Qiuyue Wang, Mingsheng Li, Jian Guan et al.

Qwen-VLA is a unified vision-language-action foundation model designed to overcome the fragmentation in embodied AI by handling diverse tasks, environments, and robot embodiments within a single architecture. It extends the Qwen stack with a DiT-based action decoder for continuous action generation and is trained on a …

9
№13
cs.AI arxiv:2605.30251v1

Same Evidence, Different Answers: Canonical-Context On-Policy Distillation for Multi-Turn Language Models

Zizhuo Lin, Quanling Liu, Jinsheng Quan et al.

This paper addresses the issue where LLMs produce inconsistent answers when evidence is revealed gradually across turns compared to a single full prompt. The core method, Canonical-Context On-Policy Distillation (CCOPD), trains a student model by aligning its multi-turn behavior with a frozen teacher model conditioned …

9
№14
cs.AI arxiv:2605.30227v1

Unifying Temporal and Structural Credit Assignment in LLM-Based Multi-Agent Prompt Optimization

Wenwu Li, Yuran Song, Mingze Zhao et al.

This paper proposes a novel method, **temporal and structural credit assignment**, to efficiently optimize LLM-based Multi-Agent Systems (MAS). It decomposes the optimization objective by identifying critical interaction rounds (temporal credit) and isolating individual agent contributions (structural credit). This dec…

9
№15
cs.AI arxiv:2605.30343v1

Unlocking the Working Memory of Large Language Models for Latent Reasoning

Lukas Aichberger, Sepp Hochreiter

This paper introduces **Reasoning in Memory (RiM)**, a novel latent reasoning method for Large Language Models that bypasses the need for generating explicit intermediate reasoning steps. RiM replaces autoregressive generation with **fixed memory blocks** of special tokens, effectively unlocking the model's internal wo…

9
№16
cs.AI arxiv:2605.30219v1

When Should Models Change Their Minds? Contextual Belief Management in Large Language Models

Haoming Xu, Weihong Xu, Zongrui Li et al.

This paper introduces **Contextual Belief Management (CBM)** as a framework for large language models to effectively manage accumulating information during long interactions by deciding when to update, preserve, or ignore evidence. The authors propose the **BeliefTrack** benchmark to evaluate CBM failures (Failed Stay,…

9
№17
cs.LG arxiv:2605.30232v1

How's it going? Reinforcement learning in language models recruits a functional welfare axis

Andy Q Han, David J. Chalmers, Pavel Izmailov

This paper investigates how reinforcement learning (RL) shapes language model representations by training models in a novel maze environment. The core finding is that RL recruits a pre-existing "functional welfare axis," where concept vectors for rewarded and punished trajectories become nearly antiparallel representat…

9
№18
cs.LG arxiv:2605.30329v1

SoundnessBench: Can Your AI Scientist Really Tell Good Research Ideas from Bad Ones?

Sy-Tuyen Ho, Minghui Liu, Huy Nghiem et al.

SoundnessBench is a novel benchmark of 1,099 machine-learning research proposals, derived from ICLR submissions and labeled with reviewer soundness scores, designed to test an AI agent's ability to judge the methodological viability of research ideas *before* execution. The paper finds that frontier LLMs exhibit a perv…

9
№19
cs.CL arxiv:2605.30245v1

Knowing What to Solve Before How: Preplan Empowered LLM Mathematical Reasoning

Shaojie Wang, Liang Zhang

This paper introduces the PPC (Preplan-Plan-CoT) framework to enhance LLM mathematical reasoning by explicitly addressing *what* to solve before *how* to solve it. The core method integrates a novel "preplan" stage, which identifies the problem type, necessary tools, and potential pitfalls, bridging the gap in existing…

9
№20
cs.AI arxiv:2605.30144v1

AgentSchool: An LLM-Powered Multi-Agent Simulation for Education

Yulei Ye, Wenhao Li, Zhong Wen et al.

AgentSchool introduces an LLM-powered multi-agent simulation framework for educational research, moving beyond simple role-play. Its core method models learning as state transitions, utilizing cognitively growable student agents with detailed knowledge states and explicit misconceptions. This allows researchers to safe…

8