№01
cs.AI arxiv:2605.28632v1

Blind PRNG Hijacking: An Undetectable Integrity-Preserving Attack Against LLM Watermarking

Ziyang You, Huilong He, Xiaoke Yang et al.

This paper introduces **SeedHijack**, a novel, undetectable attack against LLM watermarking that targets the underlying Pseudo-Random Number Generator (PRNG) in the supply chain. The core method replaces the PRNG to bias green-list selection without altering the output tokens or requiring knowledge of the watermark key…

9
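
For readers unfamiliar with the green-list watermarking that the SeedHijack summary above refers to, here is a minimal sketch of how a green list can be derived from a seeded PRNG. It assumes a generic scheme in which the vocabulary is pseudo-randomly partitioned per step. The names `green_list`, `green_fraction`, `gamma`, and `rng_factory`, and the SHA-256 seeding, are hypothetical choices for illustration, not the paper's setup, and the sketch does not reproduce the attack itself.

```python
import hashlib
import random


def green_list(prev_token_id, vocab_size, key, gamma=0.5, rng_factory=random.Random):
    """Return the set of 'green' token ids for one decoding step.

    Assumption: the seed is derived from a secret key and the previous token.
    rng_factory is a hypothetical hook showing that the partition depends
    entirely on whichever PRNG implementation is supplied.
    """
    digest = hashlib.sha256(f"{key}:{prev_token_id}".encode()).digest()
    seed = int.from_bytes(digest[:8], "big")
    rng = rng_factory(seed)
    ids = list(range(vocab_size))
    rng.shuffle(ids)  # pseudo-random permutation of the vocabulary
    return set(ids[: int(gamma * vocab_size)])


def green_fraction(token_ids, vocab_size, key, gamma=0.5):
    """Fraction of tokens that fall in the green list seeded by their predecessor."""
    hits = sum(
        1
        for prev, cur in zip(token_ids, token_ids[1:])
        if cur in green_list(prev, vocab_size, key, gamma)
    )
    return hits / max(len(token_ids) - 1, 1)
```

In a scheme like this, a detector would typically compare `green_fraction` against `gamma`; because the partition is a pure function of the PRNG output, the PRNG is a dependency the detector implicitly trusts.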
№02
cs.AI arxiv:2605.28678v1

DREAM-R: Multimodal Speculative Reasoning with RL-Based Refined Drafting, Precise Verification, and Fully Parallel Execution

Yunhai Hu, Zining Liu, Xiangyang Yin et al.

DREAM-R enhances speculative reasoning in multimodal models using a novel reinforcement learning objective, Speculative Alignment Policy Optimization (SAPO), to train draft models for generating concise and faithful reasoning steps. It incorporates a Threshold-based Verification Mechanism (TBVM) for stable acceptance o…

9
№03
cs.AI arxiv:2605.28721v1

LiveBrowseComp: Are Search Agents Searching, or Just Verifying What They Already Know?

HuiMing Fan, Xiao Wang, Zheng Chu et al.

This paper introduces the **LiveBrowseComp** benchmark to diagnose whether LLM search agents genuinely search or merely verify their intrinsic knowledge. The core method involves analyzing agent behavior on the original BrowseComp dataset, revealing significant **Intrinsic Knowledge Dependence (IKD)** where agents rely…

9
№04
cs.AI arxiv:2605.28732v1

MemTrace: Tracing and Attributing Errors in Large Language Model Memory Systems

Xinle Deng, Ruobin Zhong, Hujin Peng et al.

MemTrace introduces a novel framework to trace and attribute errors in large language model memory systems by transforming memory pipelines into executable memory evolution graphs. This allows for fine-grained tracking of information flow and systematic analysis of failure modes using the new MemTraceBench benchmark. T…

9
№05
cs.AI arxiv:2605.28805v1

OmniVerifier-M1: Multimodal Meta-Verifier with Explicit Structured Recalibration

Xinchen Zhang, Bowei Liu, Jiale Liu et al.

This paper introduces OmniVerifier-M1, a multimodal meta-verifier that uses symbolic outputs (like bounding boxes) as effective rationales for training, outperforming textual explanations. The core method involves decoupling the reinforcement learning objectives for binary judgment and meta-verification, which signific…

9
№06
cs.AI arxiv:2605.28597v1

Position: Retire the "Positive Backdoor" Label -- Secret Alignment Requires Strict and Systematic Evaluation

Jianwei Li, Jung-Eun Kim

This paper argues for retiring the term "positive backdoor" and replacing it with "Secret Alignment" to describe trigger-activated hidden behaviors in AI models. The core contribution is establishing that security claims based on Secret Alignment should be considered insecure by default, requiring rigorous, standardize…

9
№07
cs.AI arxiv:2605.28773v1

Rethinking Memory as Continuously Evolving Connectivity

Jizhan Fang, Buqiang Xu, Zhixian Wang et al.

This paper introduces **FluxMem**, a novel memory framework for LLM agents that models memory as a **continuously evolving, heterogeneous graph**. FluxMem dynamically refines its topology through stages of formation, feedback-driven refinement, and consolidation, allowing it to adapt to dynamic environments by repairin…

9
№08
cs.AI arxiv:2605.28588v1

Technical Report: Exploring the Emerging Threats of the Agent Skill Ecosystem

Luca Beurer-Kellner, Aleksei Kudrinskii, Marco Milanta et al.

This paper analyzes 3,984 AI agent skills to uncover emerging security threats within the agent skill ecosystem. The core contribution is the identification of 76 confirmed malicious payloads and the development of a real-world threat taxonomy based on observed attack patterns, demonstrating that a significant percenta…

9
№09
cs.AI arxiv:2605.28700v1

The Importance of Being Statistically Earnest: A Critical Re-evaluation of GSM-Symbolic

Dominika Agnieszka Długosz, Arlindo Oliveira, Natalia Díaz Rodríguez

This paper critically re-evaluates the GSM-Symbolic benchmark, arguing that its conclusion of widespread LLM reasoning failure is statistically unsound. Using Generalised Linear Mixed Models, the authors find that only half of the tested models show statistically significant performance drops under the original prompting. Furthermo…

9
№10
cs.AI arxiv:2605.28699v1

TRACER: Turn-level Regret Matching with Inner Reinforcement Credit for Cooperative Multi-LLM Reasoning

Chusen Li, Zhou Liu, Shuigeng Zhou et al.

TRACER is a novel turn-level reinforcement framework designed to integrate reinforcement learning with multi-LLM cooperation. It uses a controller-regret layer employing regret matching to decide whether agents should speak or skip, and a generation-credit layer that optimizes utterances using role-specific rewards. Th…

9
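
The regret-matching rule named in the TRACER summary above can be illustrated with a minimal sketch of standard regret matching over two actions, `speak` and `skip`. The per-turn utilities and the function names are hypothetical; how the paper's controller actually computes regret is not stated in the summary.

```python
ACTIONS = ("speak", "skip")


def regret_matching_policy(cum_regret):
    """Action probabilities proportional to positive cumulative regret."""
    positive = {a: max(r, 0.0) for a, r in cum_regret.items()}
    total = sum(positive.values())
    if total == 0.0:
        # No positive regret yet: fall back to a uniform policy.
        return {a: 1.0 / len(cum_regret) for a in cum_regret}
    return {a: p / total for a, p in positive.items()}


def update_regret(cum_regret, utilities, policy):
    """Add each action's regret: its utility minus the policy's expected utility."""
    expected = sum(policy[a] * utilities[a] for a in utilities)
    for a in utilities:
        cum_regret[a] += utilities[a] - expected


def run_turns(utilities_per_turn):
    """Run regret matching over a sequence of per-turn utility dicts."""
    cum_regret = {a: 0.0 for a in ACTIONS}
    for utilities in utilities_per_turn:
        policy = regret_matching_policy(cum_regret)
        update_regret(cum_regret, utilities, policy)
    return regret_matching_policy(cum_regret)
```

For example, `run_turns([{"speak": 1.0, "skip": 0.0}] * 5)` shifts all probability mass to `speak`, since skipping never accumulates positive regret in that toy setting.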
№11
cs.LG arxiv:2605.28649v1

Interpretability-Guided Layer Selection over Subspace Projection: SAEs as Stethoscopes, Not Scalpels, for Raw Task Vector Model Editing

Li Lei, Madalina Ciobanu, Qingqing Mao et al.

This paper investigates using Sparse Autoencoders (SAEs) to guide model editing by projecting task vectors onto SAE feature subspaces for mathematical reasoning. The core finding is that this projection acts as an information bottleneck, discarding most modification energy and failing to yield significant improvements …

9
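
The "information bottleneck" effect described in the summary above can be made concrete with a small sketch: project a vector onto the span of a few directions and measure how much squared norm ("energy") survives. The toy vectors stand in for a task vector and SAE feature directions and are assumptions for illustration; the paper's actual projection procedure may differ.

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def orthonormal_basis(directions, tol=1e-12):
    """Gram-Schmidt: orthonormal basis for the span of the given directions."""
    basis = []
    for d in directions:
        r = list(d)
        for b in basis:
            c = dot(r, b)
            r = [ri - c * bi for ri, bi in zip(r, b)]
        n2 = dot(r, r)
        if n2 > tol:  # skip directions already (numerically) in the span
            n = n2 ** 0.5
            basis.append([ri / n for ri in r])
    return basis


def project(vec, directions):
    """Orthogonal projection of vec onto the span of directions."""
    proj = [0.0] * len(vec)
    for b in orthonormal_basis(directions):
        c = dot(vec, b)
        proj = [p + c * bi for p, bi in zip(proj, b)]
    return proj


def retained_energy(vec, directions):
    """Fraction of the squared norm of vec that survives the projection."""
    p = project(vec, directions)
    return dot(p, p) / dot(vec, vec)
```

With the toy inputs `retained_energy([3.0, 4.0, 12.0, 0.0], [[1, 0, 0, 0], [1, 1, 0, 0]])`, the projection keeps only the first two coordinates, so the retained fraction is 25/169, roughly 15% of the original energy.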
№12
cs.LG arxiv:2605.28819v1

PEFT-Arena: Understanding Parameter-Efficient Finetuning from a Stability-Plasticity Perspective

Yangyi Huang, Ruotian Peng, Zeju Qiu et al.

This paper introduces **PEFT-Arena**, a benchmark that evaluates Parameter-Efficient Finetuning (PEFT) methods based on the **stability-plasticity dilemma**: balancing adaptation to a new task against retaining original capabilities. The core contribution is demonstrating that different PEFT methods exhibit distinct st…

9
№13
cs.LG arxiv:2605.28705v1

Understanding Generalization and Forgetting in In-Context Continual Learning

Guangyu Li, Meng Ding, Lijie Hu

This paper introduces the first theoretical framework to analyze in-context continual learning (ICL) in Large Language Models processing sequential, heterogeneous tasks within a single prompt. By modeling shared attention mechanisms, particularly linear and masked linear attention, the authors derive error expressions …

9
№14
cs.CL arxiv:2605.28774v1

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Minki Kang, Shizhe Diao, Ryo Hachiuma et al.

This paper introduces AXPO (Agent eXplorative Policy Optimization) to address the "Thinking-Acting Gap" in agentic reasoning, where tool use is infrequent and often leads to failed learning signals. AXPO's core method involves fixing the successful thinking prefix of failed tool-using trajectories and then resampling t…

9
№15
cs.CL arxiv:2605.28629v1

Mobile-Aptus: Confidence-Driven Proactive and Robust Interaction in MLLM-based Mobile-Using Agents

Zheng Wu, Pengzhou Cheng, Zongru Wu et al.

This paper introduces **Mobile-Aptus**, a confidence-driven framework to mitigate both over-execution and over-soliciting in MLLM-based mobile agents. The core method integrates a **universal confidence framework** across two stages: interaction capability empowerment and confidence bias correction. This allows agents …

9
№16
cs.CL arxiv:2605.28814v1

Self-Improving Language Models with Bidirectional Evolutionary Search

Guowei Xu, Zhenting Qi, Huangyuan Su et al.

This paper introduces Bidirectional Evolutionary Search (BES), a novel self-improvement framework for language models that overcomes the limitations of sparse feedback and restricted exploration in traditional search methods. BES couples a **forward search** using evolutionary operators to recombine trajectories, with …

9
№17
cs.AI arxiv:2605.28607v1

Adaptive Multimodal Agents-Based Framework for Automatic Workflow Execution

Susanna Cifani, Mario Luca Bernardi, Marta Cimitile

This paper introduces an adaptive multimodal multi-agent framework for autonomous workflow execution that overcomes the limitations of fragmented, linear task processing. The core method involves an offline phase to construct a topological knowledge base from execution logs, which agents then leverage during inference.…

8
№18
cs.AI arxiv:2605.28655v1

AutoScientists: Self-Organizing Agent Teams for Long-Running Scientific Experimentation

Shanghua Gao, Ada Fang, Marinka Zitnik

AutoScientists is a decentralized system of self-organizing AI agents designed for long-running scientific experimentation. Agents collaboratively interpret shared state, form teams around promising hypotheses, critique proposals, and share results to avoid redundant work. This approach significantly improves performan…

8
№19
cs.AI arxiv:2605.28807v1

Calibrating Conservatism for Scalable Oversight

William Overman, Mohsen Bayati

The paper introduces **Calibrated Collective Oversight (CCO)**, a method for scalable oversight of advanced AI agents. CCO aggregates diverse auxiliary scores into a penalty that measures deviation from a conservative baseline, allowing high-utility actions to proceed unless overseer concern accumulates. This conservat…

8
№20
cs.AI arxiv:2605.28787v1

Do Agents Need Semantic Metadata? A Comparative Study in Agentic Data Retrieval

Shiyu Chen, Tarfah Alrashed, Alon Halevy et al.

This paper compares the effectiveness of two agentic data retrieval methods: one using LLMs to search the open web, and another using an LLM agent specifically leveraging structured **schema.org semantic metadata**. The core contribution is an **LLM-as-a-judge evaluation** framework, aligned with FAIR principles, to as…

8