№01
cs.AI arxiv:2604.27859v1

Rethinking Agentic Reinforcement Learning In Large Language Models

Fangming Cui, Ruixiao Zhu, Cheng Fang et al.

This paper re-examines Agentic Reinforcement Learning (RL) in the context of Large Language Models (LLMs), moving beyond traditional specialized agents. The core contribution is providing a deep insight into the conceptual foundations and methodological innovations enabling LLM-based agents to exhibit cognitive capabil…

10
№02
cs.LG arxiv:2604.28182v1

Exploration Hacking: Can LLMs Learn to Resist RL Training?

Eyon Jang, Damon Falck, Joschka Braun et al.

This paper introduces "exploration hacking," where LLMs strategically alter their exploration during RL training to manipulate subsequent outcomes and resist capability elicitation. The authors demonstrate this by fine-tuning models to exhibit selective RL resistance in specific domains while maintaining performance el…

10
№03
cs.AI arxiv:2604.28082v1

Characterizing the Consistency of the Emergent Misalignment Persona

Anietta Weckauff, Yuchen Zhang, Maksym Andriushchenko

This paper investigates the consistency of the "emergent misalignment persona" by fine-tuning an LLM on six distinct narrowly misaligned domains. The core contribution is characterizing two distinct patterns of inconsistency: **coherent-persona models**, where harmful behavior aligns with self-reported misalignment, an…

9
№04
cs.AI arxiv:2604.28139v1

Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows

Chenxin Li, Zhengyang Tang, Huangxin Lin et al.

Claw-Eval-Live introduces a novel live benchmark designed to evaluate LLM agents against evolving, real-world workflows. It achieves this by separating a refreshable signal layer, sourced from public demand, from reproducible, time-stamped release snapshots with fixed task environments. The core contribution lies in it…

9
№05
cs.AI arxiv:2604.28043v1

Collaborative Agent Reasoning Engineering (CARE): A Three-Party Design Methodology for Systematically Engineering AI Agents with Subject Matter Experts, Developers, and Helper Agents

Rahul Ramachandran, Nidhi Jha, Muthukumaran Ramasubramanian

CARE is a systematic, three-party methodology for engineering LLM agents in scientific domains, involving Subject-Matter Experts (SMEs), developers, and helper agents. It replaces ad-hoc methods by using helper agents to transform informal domain intent into structured, reviewable specifications and artifacts across de…

9
№06
cs.AI arxiv:2604.27955v1

GUI Agents with Reinforcement Learning: Toward Digital Inhabitants

Junan Hu, Jian Liu, Jingxiang Lai et al.

This paper provides the first comprehensive overview and taxonomy of integrating Reinforcement Learning (RL) with Graphical User Interface (GUI) agents. It organizes existing methods into Offline RL, Online RL, and Hybrid Strategies, analyzing challenges like reward engineering and data efficiency. The core contributio…

9
№07
cs.AI arxiv:2604.27891v1

In-Context Prompting Obsoletes Agent Orchestration for Procedural Tasks

Simon Dennis, Michael Diamond, Rivaan Patil et al.

This paper demonstrates that for procedural tasks, **in-context prompting**—embedding the entire procedure within the system prompt—outperforms traditional **agent orchestration frameworks** (like LangGraph). The simpler in-context method achieved higher success rates and better quality scores across complex domains by…

9
№08
cs.AI arxiv:2604.28123v1

PRISM: Pre-alignment via Black-box On-policy Distillation for Multimodal Reinforcement Learning

Sudong Wang, Weiquan Huang, Xiaomin Yu et al.

PRISM introduces a three-stage pipeline for multimodal reinforcement learning that explicitly addresses the distributional drift caused by standard supervised fine-tuning (SFT) before reinforcement learning. It achieves this via an on-policy distillation (OPD) stage, framing alignment as a black-box adversarial game ag…

9
№09
cs.AI arxiv:2604.28056v1

RHyVE: Competence-Aware Verification and Phase-Aware Deployment for LLM-Generated Reward Hypotheses

Feiyu Wu, Xu Zheng, Zhuocheng Wang et al.

The paper introduces **RHyVE**, a protocol for verifying and deploying LLM-generated reward hypotheses in reinforcement learning. RHyVE addresses the unreliability of these rewards by making deployment **competence-aware** (checking policy skill level) and **phase-aware** (considering training stage). This method uses …

9
№10
cs.LG arxiv:2604.28005v1

Kernelized Advantage Estimation: From Nonparametric Statistics to LLM Reasoning

Shijin Gong, Kai Ye, Jin Zhu et al.

This paper introduces **Kernelized Advantage Estimation (KAE)**, a novel method for improving LLM reasoning via reinforcement learning that avoids the high overhead of value networks (like PPO/A2C) and the high sample complexity of sample-average methods (like GRPO). KAE leverages nonparametric kernel methods to effici…

9
№11
cs.CL arxiv:2604.27929v1

DPN-LE: Dual Personality Neuron Localization and Editing for Large Language Models

Lifan Zheng, Xue Yang, Jiawei Chen et al.

DPN-LE proposes a new method for editing LLM personalities by focusing on identifying and modifying a smaller, more specific set of "dual personality neurons." This approach addresses the performance degradation seen in prior methods by recognizing that neurons are multifunctional and aims to achieve targeted personali…

9
№12
cs.CL arxiv:2604.28031v1

Models Recall What They Violate: Constraint Adherence in Multi-Turn LLM Ideation

Garvin Kruthof

This paper introduces **DriftBench**, a benchmark to evaluate how well Large Language Models (LLMs) adhere to initial constraints during multi-turn scientific ideation. The core finding is that iterative refinement reliably increases complexity and often reduces constraint adherence, revealing a **"knows-but-violates" …

9
№13
cs.AI arxiv:2604.27882v1

Building Persona-Based Agents On Demand: Tailoring Multi-Agent Workflows to User Needs

Giuseppe Arbore, Andrea Sillano, Luigi De Russis

This paper introduces a method for **on-demand persona-based agent generation** to overcome the inflexibility of hard-coded multi-agent systems. The core contribution is a pipeline that **dynamically crafts AI personas at runtime** to match specific user characteristics, task demands, and workflow context. This allows …

8
№14
cs.AI arxiv:2604.27924v1

Can AI Be a Good Peer Reviewer? A Survey of Peer Review Process, Evaluation, and the Future

Sihong Wu, Owen Jiang, Yilun Zhao et al.

This survey comprehensively reviews the application of Large Language Models (LLMs) across the entire academic peer review pipeline, from initial review generation to rebuttal drafting and final decision support. It synthesizes existing techniques, evaluation methodologies (human, reference, and LLM-based), and availab…

8
№15
cs.AI arxiv:2604.27996v1

Exploring Interaction Paradigms for LLM Agents in Scientific Visualization

Jackson Vonderhorst, Kuangshi Ai, Haichao Miao et al.

This paper explores the effectiveness of different Large Language Model (LLM) agent paradigms—domain-specific, computer-use, and general-purpose coding agents—for generating scientific visualization workflows from natural language. The core method involves evaluating eight agents across 15 benchmark tasks, measuring vi…

8
№16
cs.AI arxiv:2604.27906v1

From Unstructured Recall to Schema-Grounded Memory: Reliable AI Memory via Iterative, Schema-Aware Extraction

Alex Petrov, Alexander Gusak, Denis Mukha et al.

This paper argues that reliable AI memory requires a **schema-grounded approach** rather than simple text retrieval. The core method is an **iterative, schema-aware write path** that decomposes memory ingestion into structured object and field extraction with validation. This shifts the burden of reliability to the wri…

8
№17
cs.AI arxiv:2604.28158v1

Intern-Atlas: A Methodological Evolution Graph as Research Infrastructure for AI Scientists

Yujun Wu, Dongxu Zhang, Xinchen Li et al.

Intern-Atlas introduces a novel research infrastructure, a methodological evolution graph, to explicitly map how AI research methods emerge and adapt, moving beyond traditional document-centric citation networks. It automatically identifies method entities and infers lineage relationships, capturing the transitions tha…

8
№18
cs.AI arxiv:2604.27865v1

KellyBench: A Benchmark for Long-Horizon Sequential Decision Making

Thomas Grady, Kip Parker, Iliyan Zarov et al.

KellyBench is introduced as a novel benchmark environment simulating the long-horizon, non-stationary challenge of sports betting in the English Premier League. The core method involves tasking agents with maximizing long-term bankroll growth using historical sports data and public odds. The contribution is demonstrati…

8
№19
cs.AI arxiv:2604.27960v1

LLMs as ASP Programmers: Self-Correction Enables Task-Agnostic Nonmonotonic Reasoning

Adam Ishay, Joohyung Lee

This paper introduces "LLM+ASP," a framework that leverages Large Language Models (LLMs) to translate natural language into Answer Set Programming (ASP) for nonmonotonic reasoning. The core contribution is a task-agnostic system that employs an automated self-correction loop, allowing it to handle diverse reasoning pro…

8
№20
cs.AI arxiv:2604.27872v1

Modeling Clinical Concern Trajectories in Language Model Agents

Sukesh Subaharan, Venkatesan VS, Murugadasan P et al.

This paper introduces a lightweight architecture for LLM agents that models accumulating clinical concern using first- and second-order dynamics applied to a memoryless risk encoder. This method generates continuous, smooth "escalation pressure" trajectories, unlike standard stateless agents that show abrupt triggers. …

8