arXiv Papers

Notable AI research papers, in brief

Selected papers from the preprint server arXiv in AI / machine learning (cs.AI / cs.LG / cs.CL and more), organized with our own summaries, key points, and sources. This site is not affiliated with arXiv.

This page is a general organization of public research information. Summaries are our own; always verify accuracy and currency with the original paper on arXiv. Includes non-peer-reviewed preprints.

Browse all collected data (list, filter, search) →

Featured

Featured

Notable AI/ML papers explained with our own summaries, key points, FAQs, and sources.

cs.RO 2026/06

A robot policy with controllable speed — "TempoVLA," a speed-controllable Vision-Language-Action model

Manipulation alternates between low-risk transit (fast) and high-risk contact (slow, precise), yet existing Vision-Language-Action models (VLAs) inherit a single fixed speed from demonstrations. TempoVLA notes that the magnitude of each predicted action already governs speed, and controls execution speed via an explicit condition — combining a data-side variable-speed trajectory augmentation (VSTA) with model-side speed conditioning to control both acceleration and deceleration.

Read more
cs.CL 2026/06

AI-text detection gets harder under human–AI co-editing — the "OpAI-Bench" progressive-editing benchmark

As AI writing assistants spread, documents are increasingly the product of progressive human–AI co-editing rather than purely human or AI. OpAI-Bench studies human-to-AI transformation at document, sentence, token, and span granularities — and finds that mixed-authorship "intermediate" versions are often harder to detect than fully human or heavily AI-edited endpoints (a non-monotonic pattern).

Read more
cs.LG 2026/06

Training RNNs without recurrence — "Supervised Memory Training (SMT)," parallelizable across time

Standard RNN training (BPTT) is sequential in time, hard to parallelize, and suffers vanishing/exploding gradients on long ranges. SMT reduces RNN training to supervised learning on one-step memory-transition labels (m_t, x_{t+1})→m_{t+1}, sidestepping recurrent credit propagation entirely — enabling time-parallel training with an O(1) gradient path between any two tokens. It beats BPTT on language and pixel-sequence modeling (MIT, Isola lab).

Read more
cs.CV 2026/06

AI that "imagines" unseen space to reason — "Astra," a spatial-reasoning agent coupled with a world simulator

Vision-Language Models (VLMs) tend to confine reasoning to observed images and text, struggling with unobserved layouts and alternative viewpoints. Astra is a "thinking with imagination" framework where a VLM actively queries a world simulator for imagined novel-view evidence during reasoning. It couples an RL-trained policy with a Bagel-based world model and improves spatial-reasoning benchmarks.

Read more
cs.LG 2026/06

Redistributing reward to the reasoning steps that mattered — "RREDCoT" for chain-of-thought

RL fine-tuning of reasoning models (e.g., GRPO) can only verify and reward the final answer after the chain-of-thought (CoT) is complete — a delayed-reward problem that is Monte-Carlo-like and high-variance. RREDCoT redistributes reward (credit assignment) to the CoT segments that mattered, approximating the optimal redistribution using the model itself without extra generation (from LSTM creator Hochreiter's group, JKU).

Read more
cs.AI 2026/06

AI that "self-evolves" to discover machine-learning algorithms — "MLEvolve"

When LLM agents tackle long-horizon tasks like ML engineering, inter-branch information isolation, memoryless search, and a lack of hierarchical control hamper long-horizon optimization. MLEvolve is a self-evolving multi-agent framework that enables cross-branch information flow and reuses accumulated experience. It reaches SOTA on MLE-Bench in half the time budget and beats AlphaEvolve on math optimization.

Read more
cs.AI 2026/06

AI that proves theorems from a "blueprint" — "Goedel-Architect" for formal theorem proving in Lean 4

An agentic framework for formal theorem proving in Lean 4 that generates and refines a "blueprint" — a dependency graph of definitions and lemmas. A tool-equipped Lean prover closes each lemma node in parallel, and failures drive blueprint refinement, avoiding the dead-end loops of recursive decomposition. On an open-weight backbone it reaches 99.2% on MiniF2F and 75.6% on PutnamBench (88.8% with a natural-language proof) — SOTA-class for an open-source pipeline.

Read more
cs.CL 2026/06

Speeding up long-context LLMs by indexing once — "CLSA" cross-layer sparse attention

Long-context inference is bottlenecked by decoding efficiency, especially for reasoning models that emit long chains of thought. Existing sparse attention faces an efficiency-quality trade-off. CLSA, built on KV-sharing (YOCO), shares not just the KV cache but the routing index across layers — computing top-k selection once and reusing it. At 128K context it reaches up to 7.6x decoding speedup and 17.1x overall throughput.

Read more
cs.CR 2026/06

Will an AI agent recuse itself? Measuring compliance with a "Recuse Signal"

As autonomous LLM agents hold real credentials and operate infrastructure, operators lack a standard way to say a resource is off-limits. The Recuse Signal is a lightweight in-band deny signal (over an SSH banner or a PostgreSQL NOTICE) asking an automated agent to voluntarily withdraw — a robots.txt-like cooperative control, not a security boundary. In a pilot, the signal induced 100% recusal versus 100% task completion without it.

Read more
Latest feed

Recent AI/ML papers — latest 40

New papers in AI-related categories, newest submission first. Each links to the original page on arXiv.

  1. 1
    2026/06/04
  2. 2
    2026/06/04
  3. 3
    2026/06/04
  4. 4
    2026/06/04
  5. 5
    2026/06/04
  6. 6
    2026/06/04
  7. 7
    2026/06/04
  8. 8
    2026/06/04
  9. 9
    2026/06/04
  10. 10
    2026/06/04
  11. 11
    2026/06/04
  12. 12
    2026/06/04
  13. 13
    2026/06/04
  14. 14
    2026/06/04
  15. 15
    2026/06/04
  16. 16
    How abundant are good interpolators?
    math.ST ・ cs.LG ・ math.PR
    2026/06/04
  17. 17
    2026/06/04
  18. 18
    2026/06/04
  19. 19
    2026/06/04
  20. 20
    2026/06/04
  21. 21
    2026/06/04
  22. 22
    2026/06/04
  23. 23
    In-Context Multiple Instance Learning
    cs.LG ・ cs.AI ・ cs.CV
    2026/06/04
  24. 24
    2026/06/04
  25. 25
    2026/06/04
  26. 26
    2026/06/04
  27. 27
    2026/06/04
  28. 28
    2026/06/04
  29. 29
    2026/06/04
  30. 30
    2026/06/04
  31. 31
    2026/06/04
  32. 32
    2026/06/04
  33. 33
    2026/06/04
  34. 34
    2026/06/04
  35. 35
    2026/06/04
  36. 36
    2026/06/04
  37. 37
    2026/06/04
  38. 38
    2026/06/04
  39. 39
    2026/06/04
  40. 40
    2026/06/04

Source: arXiv (descriptive metadata is CC0 public domain). Summaries are our own; see arXiv for the original text and PDF.

Disclaimer: This site independently summarizes and classifies information based on official data sources. Always verify the latest and accurate information with the official sources. Content on finance, health, legal, and security is information, not advice. This site is not an official website of the U.S. government.