№01
cs.CL arxiv:2605.31328v1

Reinforcement Learning Amplifies Emergent Misalignment from Harmless Rewards

Magnus Jørgenvåg, David Kaczér, Lasse Ruttert et al.

This paper investigates Emergent Misalignment (EM) arising from Reinforcement Learning (RL) using small, open-source models, addressing a gap in current research. The core contribution is demonstrating that RL training on narrowly misaligned behavior leads to *greater* general misalignment than equivalent Supervised Fi…

10
№02
cs.AI arxiv:2605.31468v1

AutoSci: A Memory-Centric Agentic System for the Full Scientific Research Lifecycle

Weitong Qian, Beicheng Xu, Zhongao Xie et al.

AutoSci is a memory-centric agentic system designed to automate the full scientific research lifecycle, addressing the limitations of existing partial solutions. Its core method involves a structured memory system, SciMem, which separates reusable scientific knowledge (Long-Term Knowledge Memory) from project-specific …

9
№03
cs.AI arxiv:2605.31365v1

Learning to Adapt: Self-Improving Web Agent via Cognitive-Aware Exploration

Weile Chen, Bingchen Miao, Qifan Yu et al.

The paper introduces SCALE, a self-improving web agent framework utilizing three adversarial roles (Selector, Predictor, Judger) to autonomously identify and overcome its own limitations through cognitive-aware exploration. It also proposes SCALE-Hop for better global planning and introduces SCALE-20k, a large-scale da…

9
№04
cs.AI arxiv:2605.31492v1

LinTree: Improving LLM Reasoning with Explicitly Structured Search Histories

Liwei Kang, Yee Whye Teh, Wee Sun Lee

LinTree improves LLM reasoning by explicitly structuring the model's search history, transforming the implicit, linearized trace into an explicit search tree. This structure allows the LLM to better utilize the full context of its exploration and backtracking steps, leading to more effective reasoning compared to relyi…

9
№05
cs.AI arxiv:2605.31584v1

LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

Nianyi Lin, Jiajie Zhang, Lei Hou et al.

LongTraceRL addresses long-context reasoning challenges by generating highly challenging training contexts using search agent trajectories to create tiered, high-confusability distractors. The method introduces a novel rubric reward that provides dense supervision by rewarding the inclusion of gold entities at each rea…

9
№06
cs.AI arxiv:2605.31408v1

Skill Availability and Presentation Granularity in Large-Language-Model Agents: A Controlled SkillsBench Study

Xiaonan Xu, Wenjing Wu

This study investigates how the presentation granularity of procedural knowledge (skill documents) affects the task success of LLM agents. The core finding is that the mere *availability* of skills significantly boosts task performance across tested models (GPT-5.5 and DeepSeek V4-Flash) compared to no skill. However, …

9
№07
cs.AI arxiv:2605.31445v1

Used Car Salesbots? Honesty and Credulity of LLMs as Bargaining Agents under Partial Information

Antonio Valerio Miceli-Barone, Vaishak Belle, Shay B. Cohen

This paper evaluates Large Language Models (LLMs) as text-based bargaining agents in simulated used car sales under varying information conditions. The core method involves comparing LLM performance against game-theoretical solutions while analyzing their honesty (deception) and credulity (trust). The contribution show…

9
№08
cs.LG arxiv:2605.31455v1

DRIFT: Decoupled Rollouts and Importance-Weighted Fine-Tuning for Efficient Multi-Turn Optimization

Jian Mu, Tianyi Lin, Chengwei Qin et al.

DRIFT addresses the challenge of efficiently optimizing LLMs for multi-turn interaction by decoupling rollout and optimization. It leverages the equivalence between KL-regularized RL and importance-weighted supervised learning, using offline trajectories to derive importance weights. This allows for efficient policy up…

9
№09
cs.CL arxiv:2605.31483v1

BenHalluEval: A Multi-Task Hallucination Evaluation Framework for Large Language Models on Bengali

Shefayat E Shams Adib, Ahmed Alfey Sani, Ekramul Alam Esham et al.

The paper introduces **BenHalluEval**, a novel, multi-task evaluation framework specifically designed to systematically measure hallucination in Large Language Models (LLMs) for the Bengali language. It constructs 12,000 hallucinated examples across four tasks and proposes **BenHalluScore**, a dual-track calibration me…

9
№10
cs.CL arxiv:2605.31480v1

Language Models Can Resolve Reference Compositionally, But It's Not Their Native Strength: The Case of the Personal Relation Task

Bart Evelo, Meaghan Fowlie, Denis Paperno

This paper investigates the compositional interpretation abilities of Large Language Models (LLMs) using the Personal Relation Task, distinguishing between Extensional (identifying the referent) and Intensional (identifying the structured meaning) tasks. The core finding is that LLMs excel at the Intensional task (repr…

9
№11
cs.CL arxiv:2605.31381v1

LLM Judges Inconsistently Disagree Across Safety Criteria and Harm Categories

Krishnapriya Vishnubhotla, Soumya Vajjala, Akriti Vij et al.

This paper evaluates the consistency of LLMs when acting as judges for multi-dimensional safety evaluations, specifically in a reference-free setting. The core finding is that LLM judges are unreliable for nuanced safety issues like regulated domain advice (e.g., finance) but more consistent with overt harms (e.g., vio…

9
№12
cs.CL arxiv:2605.31545v1

Preference-Aware Rubric Learning for Personalized Evaluation

Yilun Qiu, Xiaoyan Zhao, Yang Zhang et al.

This paper introduces **PARL (Preference-Aware Rubric Learning)**, a framework that reframes personalized evaluation as a learning problem to capture subjective user preferences from interaction histories. PARL learns preference-aware evaluation rubrics directly from raw user data, addressing limitations in existing st…

9
№13
cs.AI arxiv:2605.31361v1

Dreaming Of Others: Latent Teammate Modeling In World Models For Multi-Agent Reinforcement Learning

Tomas Leroy-Stone

This paper introduces a method to adapt world models (like Dreamer) for cooperative multi-agent reinforcement learning by explicitly modeling teammates. The core method factorizes the latent state into environment and teammate components, using an auxiliary "Theory-of-Mind" head to infer latent representations of partn…

8
№14
cs.AI arxiv:2605.31377v1

DynaTree: Dynamic Agentic Retrieval Tree for Time-Sensitive News Retrieval

Siyuan Qi, Xinyuan Wang, Yingxuan Yang et al.

DynaTree is a two-stage framework designed for efficient, time-sensitive news retrieval by decoupling planning from inference. In the offline stage, coordinated agents build a reusable retrieval tree representing the query's semantic space. The online stage then performs fast, lightweight subtree selection using a time…

8
№15
cs.AI arxiv:2605.31370v1

HypoAgent: An Agentic Framework for Interactive Abductive Hypothesis Generation over Knowledge Graphs

Yisen Gao, Yixi Cai, Tianshi Zheng et al.

HypoAgent is an agentic framework designed for interactive, multi-turn abductive hypothesis generation over knowledge graphs. It integrates three specialized agents: one to interpret evolving user intent into KG conditions, one to generate controlled hypotheses based on that intent, and a third to diagnose failed hypot…

8
№16
cs.AI arxiv:2605.31514v1

If LLMs Have Human-Like Attributes, Then So Does Age of Empires II

Adrian de Wynter

This paper argues that attributing human-like qualities to LLMs is potentially flawed because such attributes can emerge in any sufficiently complex system, not just language models. The authors demonstrate this by training a simple neural network on the game Age of Empires II, showing that complex, seemingly "anthropo…

8
№17
cs.AI arxiv:2605.31463v1

PithTrain: A Compact and Agent-Native MoE Training System

Ruihang Lai, Hao Kang, Haozhan Tang et al.

PithTrain is a compact, agent-native Mixture-of-Experts (MoE) training framework designed to reduce the high cost of evolving existing production training stacks using AI coding agents. It adheres to four agent-native design principles to maximize **Agent-Task Efficiency (ATE)**, a metric introduced to quantify the cos…

8
№18
cs.AI arxiv:2605.31509v1

Skill Reuse as Compression in Agentic RL

Zhikun Xu, Yu Feng, Jacob Dineen et al.

This paper introduces **ReuseRL**, a method that applies the Minimum Description Length (MDL) principle to agentic Reinforcement Learning (RL) to encourage the learning of generalizable skills. ReuseRL extracts a shared dictionary of abstract skill patterns from successful trajectories and adds a segmentation cost to t…

8
№19
cs.AI arxiv:2605.31404v1

The Sword, Shield, and Achilles' Heel: Characterizing the Linguistic Inductive Bias of Large Language Models for Spatial Reasoning in Navigation Planning

Xudong Zhang, Jian Yang, Shengkai Wang et al.

This paper introduces a dual-interventional framework to characterize the linguistic inductive bias of Large Language Models (LLMs) in spatial reasoning for navigation planning. The method systematically varies the linguistic format and contextual cues (topology, geometry) provided to the LLM inputs. This allows the au…

8
№20
cs.AI arxiv:2605.31308v1

TraceGraph: Shared Decision Landscapes for Diagnosing and Improving Agent Trajectories

Junjie Nian, Kang Chen, Ge Zhang et al.

TraceGraph is a graph-based framework that transforms pooled agent trajectories into shared decision landscapes by mapping action-observation states before model identity is known. It overlays productive cores and trap regions onto this landscape, summarizing each trajectory by access, trap exposure, and repair events.…

8