№01
cs.AI arxiv:2605.23459v1

AI Assurance: A Comprehensive Testing Strategy for Enterprise AI Systems

Chitra Badagi, Divye Singh, Animesh Sen et al.

This paper proposes a comprehensive AI assurance strategy for enterprise AI systems, shifting focus from classical verification to continuous risk reduction. The core method involves treating evaluation as a core engineering discipline, structured around a new AI Failure Taxonomy and a five-layer AI Assurance Pyramid. …

9
№02
cs.AI arxiv:2605.23780v1

Beyond Binary Edits Robust Multimodal Knowledge Editing with Adversarial Subspace Alignment

Haoyuan Wang, Xiaohao Liu, Jiajie Su et al.

This paper introduces Latent Adversarial Robustification (LAR) to improve the generality of intrinsic multimodal knowledge editing in MLLMs. LAR generates adversarial, semantically coherent variants in the latent space to expose fragile editing regions, ensuring that knowledge updates generalize across semantically equ…

9
№03
cs.AI arxiv:2605.23605v1

DiLaDiff: Distilled Latent-Augmented Diffusion for Language Modeling

Jean-Marie Lemercier, Tomas Geffner, Karsten Kreis et al.

DiLaDiff addresses the token correlation issue in diffusion language models by introducing a continuous, semantically rich latent space learned via an autoencoder. This latent space guides a diffusion model, and a subsequent consistency model distills this process into a fast, few-step latent generator. The core contri…

9
№04
cs.AI arxiv:2605.23899v1

From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills

Zisu Huang, Jingwen Xu, Yifan Yang et al.

This paper systematically studies the full lifecycle of model-generated agent skills, spanning experience generation, extraction, and consumption. The core contribution is a utility-grounded evaluation framework applied across five diverse domains to determine when and why these skills succeed or fail. The study finds …

9
№05
cs.AI arxiv:2605.23825v1

It's the humans, not the data: Geopolitical bias in LLMs originates in post-training, amplified by the language of the prompt

Stuart Bladon, Brinnae Bent

This paper demonstrates that geopolitical bias in LLMs primarily originates during the **post-training (fine-tuning/alignment) phase**, contrary to common assumptions about pre-training data. The authors found that models consistently develop biases favoring the region of their developer after post-training, and the ma…

9
№06
cs.AI arxiv:2605.23901v1

LLMs as Noisy Channels: A Shannon Perspective on Model Capacity and Scaling Laws

Xu Ouyang, Deyi Liu, Yuhang Cai et al.

This paper introduces the **Shannon Scaling Law**, modeling LLM training as information transmission over a noisy channel, mapping parameters to bandwidth and data to signal power. This framework explains non-monotonic scaling phenomena like catastrophic forgetting by identifying a fundamental **Shannon capacity**. The…

9
№07
cs.AI arxiv:2605.23723v1

MemAudit: Post-hoc Auditing of Poisoned Agent Memory via Causal Attribution and Structural Anomaly Detection

Zhewen Tan, Yilun Yao, Huiyan Jin et al.

MemAudit is a post-hoc auditing framework designed to identify malicious memories injected into LLM agents' persistent storage. It combines a counterfactual memory influence score to measure each memory's causal contribution to harmful outputs with a memory consistency graph to detect structural anomalies indicative of…

9
№08
cs.AI arxiv:2605.23904v1

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Yifan Yang, Ziyang Gong, Weiquan Huang et al.

SkillOpt introduces a novel method to systematically optimize agent skills by treating the skill itself as an external, trainable state, analogous to weight optimization in deep learning. It employs a dedicated optimizer model to generate bounded, text-based edits (add/delete/replace) to the skill document, accepting o…

9
№09
cs.LG arxiv:2605.23574v1

Push Your Agent: Measuring and Enforcing Quantitative Goal Persistence in Long-Horizon LLM Agents

Yuandao Cai, Yuzhang Zhu, Liyou Gao et al.

This paper introduces **Quantitative Goal Persistence (QGP)**, a metric to measure whether long-horizon LLM agents continue working until an external verifier confirms a specific count of distinct, valid items is achieved. The authors propose **PushBench**, a benchmark focused on artifact collection, to directly measur…

9
№10
cs.LG arxiv:2605.23857v1

Strong Teacher Not Needed? On Distillation in LLM Pretraining

Taiming Lu, Zhuang Liu

This paper investigates the conventional assumption that stronger teachers are necessary for effective knowledge distillation during Large Language Model (LLM) pretraining. The authors demonstrate that even small, undertrained "teachers" can successfully improve larger "students" when the language modeling and distilla…

9
№11
cs.CL arxiv:2605.23454v1

ARES: Automated Rubric Synthesis for Scalable LLM Reinforcement Learning

Xiaoyuan Li, Keqin Bao, Moxin Li et al.

ARES is a framework that automates the creation of question-answer pairs and corresponding question-specific weighted rubrics from raw pretraining documents. This enables scalable reinforcement learning for LLMs by providing instance-level reward supervision for open-ended responses, overcoming the limitations of manua…

9
№12
cs.CL arxiv:2605.23657v1

OpenSkillEval: Automatically Auditing the Open Skill Ecosystem for LLM Agents

Jiahao Ying, Boxian Ai, Wei Tang et al.

OpenSkillEval is an automatic evaluation framework designed to audit the rapidly expanding ecosystem of skills used by LLM agents. It addresses the lack of clarity regarding skill quality and model interaction by automatically constructing realistic task instances across five application domains. The framework's core c…

9
№13
cs.AI arxiv:2605.23772v1

Agentic Proving for Program Verification

Alessandro Sosso, Akhil Arora, Bas Spitters

This paper investigates the capability of agentic AI systems, specifically Claude Code, for program verification using the CLEVER benchmark in Lean 4. The core method involves evaluating the agent's performance across specification generation, implementation certification against ground truth, and end-to-end verificati…

8
№14
cs.AI arxiv:2605.23590v1

Co-ReAct: Rubrics as Step-Level Collaborators for ReAct Agents

Jiazheng Kang, Bowen Zhang, Zixin Song et al.

Co-ReAct introduces a framework where external rubrics act as step-level collaborators to guide ReAct agents during inference, moving beyond their typical role as post-hoc evaluators. By injecting the rubric into the agent's context at each decision point, Co-ReAct provides explicit, actionable targets for evidence see…

8
№15
cs.AI arxiv:2605.23655v1

CVSearch: Empowering Multimodal LLMs with Cognitive Visual Search for High-Resolution Image Perception

Liupeng Li, Haoqian Kang, Zhenyu Lu et al.

CVSearch is a training-free framework that addresses the high-resolution image perception bottleneck in MLLMs by adaptively scheduling search strategies. It employs an "Assess-then-Search" workflow, prioritizing efficient expert-assisted search and only resorting to a novel semantic-aware scanning mechanism upon failur…

8
№16
cs.AI arxiv:2605.23897v1

ETCHR: Editing To Clarify and Harness Reasoning

Beichen Zhang, Yuhong Liu, Jinsong Li et al.

ETCHR addresses the limitations of purely textual reasoning in multimodal LLMs by introducing a novel approach that couples a dedicated image editing model with an understanding model. The core method involves conditioning the image editor on the reasoning question to overcome the editor's inability to map abstract que…

8
№17
cs.AI arxiv:2605.23551v1

Goal-Conditioned Agents that Learn Everything All at Once

Michael Matthews, Matthew Jackson, Michael Beukman et al.

The paper introduces Learning Everything All at Once (LEO), a method for goal-conditioned reinforcement learning that efficiently performs off-policy updates using every observed transition for *all* possible goals simultaneously. LEO achieves this by jointly outputting values and actions for every goal in a single for…

8
№18
cs.AI arxiv:2605.23572v1

HARNESS-LM: A Three-Phase Training Recipe for Harnessing SLMs in Sponsored Search Retrieval

Vipul Gupta, Shikhar Mohan, Lakshya Kumar et al.

HARNESS-LM (HLM) is a three-phase training recipe designed to efficiently transfer the high retrieval quality of large SLM-based models into compact, production-ready student encoders. The method first trains a large teacher model, then distills its knowledge into a small student encoder using an L2 alignment objective…

8
№19
cs.AI arxiv:2605.23867v1

Human Decision-Making with Persuasive and Narrative LLM Explanations

Laura R. Marusich, Mary Grace Kozuch Dhooghe, Jonathan Z. Bakdash et al.

This paper investigates how the persuasiveness of Large Language Model (LLM) narrative explanations affects human decision-making accuracy in classification tasks. The core finding is that the persuasiveness level of these explanations did not significantly improve decision accuracy compared to a simple AI prediction a…

8
№20
cs.AI arxiv:2605.23861v1

Leveraging Foundation Models for Causal Generative Modeling

Aneesh Komanduri, Xintao Wu

This paper introduces **FM-CGM**, a modular framework that leverages pretrained foundation models for visual causal reasoning without requiring explicit causal constraint training. It formalizes the causal pipeline using a concept extractor, manipulator, and counterfactual generator, employing a large reasoning model f…

8