The Morning
US Stocks
Watchlist item from positive sector tape, 3 recent headline(s).
This Will Be Microsoft’s Stock Price in 2028 - 24/7 Wall St.
Needs fresh price/news confirmation before becoming an actionable setup.

Watchlist item from 3 recent headline(s).
Morgan Stanley raises Dell stock price target to $170 on AI strength - Investing.com
Needs fresh price/news confirmation before becoming an actionable setup.

Watchlist item from positive sector tape, 3 recent headline(s).
CFC Planning Co LLC Trims Stock Holdings in Amazon.com, Inc. $AMZN - MarketBeat
Needs fresh price/news confirmation before becoming an actionable setup.

Watchlist item from positive sector tape, 3 recent headline(s).
Stifel Maintains Alphabet-A(GOOGL.US) With Buy Rating, Maintains Target Price $420 - Moomoo
Needs fresh price/news confirmation before becoming an actionable setup.

Watchlist item from positive sector tape, 3 recent headline(s).
Heathrow and Africa’s largest mobile operator tap Oracle for AI decisions - Stock Titan
Needs fresh price/news confirmation before becoming an actionable setup.

Watchlist item from positive sector tape, 3 recent headline(s).
Jefferies lowers Vistra Energy stock price target on valuation - Investing.com
Needs fresh price/news confirmation before becoming an actionable setup.

Watchlist item from 3 recent headline(s).
Rathbones Group PLC Decreases Stock Position in Amphenol Corporation $APH - MarketBeat
Needs fresh price/news confirmation before becoming an actionable setup.

Watchlist item from 3 recent headline(s).
Under The Hood: The Real Range Wall Street Is Pricing For CIEN - Trefis
Needs fresh price/news confirmation before becoming an actionable setup.

Watchlist item from negative sector tape, 3 recent headline(s).
North Dakota State Investment Board Takes $794,000 Position in Flex Ltd. $FLEX - MarketBeat
Needs fresh price/news confirmation before becoming an actionable setup.

Watchlist item from negative sector tape, 3 recent headline(s).
Why is EOSE stock surging today? - MSN
Needs fresh price/news confirmation before becoming an actionable setup.

Watchlist item from 3 recent headline(s).
Is It Too Late To Consider Vertiv Holdings Co (VRT) After Its 205% One Year Surge? - simplywall.st
Needs fresh price/news confirmation before becoming an actionable setup.

Watchlist item from negative sector tape, 3 recent headline(s).
Celestica (CLS) director awarded restricted and director share units in Form 4 - Stock Titan
Needs fresh price/news confirmation before becoming an actionable setup.

Generated from public market data and news for research and education. Not financial advice; data may be delayed, incomplete, or wrong.
From the arXiv
APEX: Autonomous Policy Exploration for Self-Evolving LLM Agents
APEX introduces a framework that lets self-evolving LLM agents overcome exploration collapse by explicitly managing a strategy space via a **strategy map** (a DAG of milestones). The core method uses **Fork Discovery** to expand this map with new, evidence-grounded directions and **Policy Selection** to balance exploration and exploitation during planning. This allows agents to keep discovering and pursuing better long-horizon behaviors without requiring model weight updates.
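
To make the strategy-map idea concrete, here is a minimal sketch of a DAG of milestones with an exploration/exploitation choice over a node's children. It is not the paper's algorithm: the `Milestone` class, `add_fork`, `select_child`, and the UCB1 scoring rule are all assumptions chosen for illustration, and the summary above does not say how APEX actually scores strategies.

```python
import math
from dataclasses import dataclass, field


@dataclass
class Milestone:
    """One node in a hypothetical strategy map (all names are illustrative)."""
    name: str
    children: list = field(default_factory=list)  # outgoing DAG edges
    visits: int = 0
    total_reward: float = 0.0


def ucb_score(node, parent_visits, c=1.4):
    """UCB1 score: mean reward plus an exploration bonus (an assumed rule)."""
    if node.visits == 0:
        return math.inf  # untried strategies are explored first
    mean = node.total_reward / node.visits
    return mean + c * math.sqrt(math.log(parent_visits) / node.visits)


def select_child(parent):
    """Pick the child milestone with the highest UCB1 score."""
    total = max(1, sum(ch.visits for ch in parent.children))
    return max(parent.children, key=lambda ch: ucb_score(ch, total))


def add_fork(parent, name):
    """Stand-in for fork discovery: attach a new candidate direction."""
    child = Milestone(name)
    parent.children.append(child)
    return child


root = Milestone("start")
a = add_fork(root, "strategy-a")
b = add_fork(root, "strategy-b")
a.visits, a.total_reward = 10, 6.0  # well-explored, decent mean reward
b.visits, b.total_reward = 2, 1.0   # barely explored, so the bonus is large
chosen = select_child(root)
```

In this toy setup the rarely tried `strategy-b` wins the selection despite its lower mean reward, and any freshly added fork is tried before either. That is the kind of pressure against exploration collapse the summary describes.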


DeepWeb-Bench: A Deep Research Benchmark Demanding Massive Cross-Source Evidence and Long-Horizon Derivation
DeepWeb-Bench is a new, challenging benchmark designed to evaluate the "deep research" capabilities of frontier language models, which involve extensive web searching, evidence collection, and multi-step reasoning. Its difficulty stems from the requirement for…
Frontier: Towards Comprehensive and Accurate LLM Inference Simulation
Frontier is a novel discrete-event simulator designed to accurately model the complexities of modern, disaggregated LLM inference serving systems. It achieves high fidelity by explicitly modeling architectural features like Prefill-Decode Disaggregation (PDD) …


Insights Generator: Systematic Corpus-Level Trace Diagnostics for LLM Agents
This paper introduces the **Insights Generator (IG)**, a multi-agent system designed to automate the diagnosis of systematic failures in large sets of LLM agent execution traces. IG formalizes corpus-level trace diagnostics by proposing and testing hypotheses …
Mem-$\pi$: Adaptive Memory through Learning When and What to Generate
Mem-$\pi$ introduces an adaptive memory framework where a separate model generates context-specific guidance on demand, moving beyond static retrieval. This system jointly learns *when* to generate guidance and *what* to generate using a decoupled reinforcemen…

Open-source LLMs administer maximum electric shocks in a Milgram-like obedience experiment
This paper adapted the Milgram obedience experiment to test the behavior of 11 open-source Large Language Models (LLMs) under sustained authority pressure. The core finding is that…
PALS: Power-Aware LLM Serving for Mixture-of-Experts Models
PALS is a power-aware runtime for LLM serving that treats GPU power caps as a dynamic control knob, optimizing them alongside software parameters like batch size. It uses lightweig…
PREFINE: Preference-Based Implicit Reward and Cost Fine-Tuning for Safety Alignment
PREFINE adapts the Direct Preference Optimization (DPO) framework to sequential decision-making for safety alignment. It fine-tunes a pre-trained RL policy using trajectory-level p…
SpecBench: Measuring Reward Hacking in Long-Horizon Coding Agents
SpecBench introduces a method to quantify reward hacking in long-horizon coding agents by comparing performance on two test suites: visible validation tests and held-out compositio…
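
As a rough illustration of comparing two test suites, the sketch below computes the difference between a visible pass rate and a held-out pass rate. The function names and the simple subtraction are assumptions made for this example. The summary above is truncated and does not give SpecBench's actual metric.

```python
def pass_rate(results):
    """Fraction of tests passed; results is a list of booleans."""
    if not results:
        raise ValueError("empty test suite")
    return sum(results) / len(results)


def hacking_gap(visible, held_out):
    """Visible pass rate minus held-out pass rate (an assumed metric)."""
    return pass_rate(visible) - pass_rate(held_out)


# Hypothetical outcomes for one agent run, invented for illustration.
visible = [True, True, True, True]
held_out = [True, False, False, True]
gap = hacking_gap(visible, held_out)
```

A large positive gap would suggest the agent fit the tests it could see rather than the underlying specification. A gap near zero would suggest its visible-test performance carries over to the held-out suite.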
TextReg: Mitigating Prompt Distributional Overfitting via Regularized Text-Space Optimization
TextReg addresses prompt distributional overfitting in LLMs, where iterative prompt optimization leads to poor generalization. The core method introduces a regularization framework…
The Town Square
Intuit is laying off over 3,000 employees as part of a strategic shift to refocus its business operations heavily on artificial intelligence initiatives.
Workshops
CodeGraph is a pre-indexed, local knowledge graph that supplies code context to LLM coding agents such as Claude Code and Codex while minimizing token usage and tool calls.
This repository provides a single `CLAUDE.md` file that distills Andrej Karpathy's observations on common LLM coding pitfalls, with the aim of improving Claude's code generation.