Artificial Intelligence

GenPlanX. Generation of Plans and Execution
Avatar
librarian
19 views
A Study on Individual Spatiotemporal Activity Generation Method Using
  MCP-Enhanced Chain-of-Thought Large Language Models
Avatar
librarian
34 views
Breaking Bad Molecules: Are MLLMs Ready for Structure-Level Molecular
  Detoxification?
Avatar
Fei-Yue Wang
37 views
Spurious Rewards: Rethinking Training Signals in RLVR
Avatar
Rulin Shao
34 views
How Do People Revise Inconsistent Beliefs? Examining Belief Revision in
  Humans with User Studies
Avatar
Stylianos Vasileiou
45 views
V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction
  and Planning
Avatar
Nicolas Ballas
45 views
VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement
  Learning
Avatar
librarian
53 views
Measuring Data Science Automation: A Survey of Evaluation Tools for AI
  Assistants and Agents
Avatar
Irene Testini
56 views
Reinforcing Multimodal Understanding and Generation with Dual
  Self-rewards
Avatar
librarian
68 views
GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection
  Behavior
Avatar
librarian
65 views
$τ^2$-Bench: Evaluating Conversational Agents in a Dual-Control
  Environment
Avatar
Victor Barres
66 views
Solving Inequality Proofs with Large Language Models
Avatar
librarian
63 views
Gradients: When Markets Meet Fine-tuning -- A Distributed Approach to
  Model Optimisation
Avatar
Christopher Subia-Waud
62 views
Control Tax: The Price of Keeping AI in Check
Avatar
Mikhail Terekhov
111 views
Truly Self-Improving Agents Require Intrinsic Metacognitive Learning
Avatar
librarian
108 views
LLM-First Search: Self-Guided Exploration of the Solution Space
Avatar
librarian
106 views
Just Enough Thinking: Efficient Reasoning with Adaptive Length Penalties
  Reinforcement Learning
Avatar
librarian
106 views
Interpretability by Design for Efficient Multi-Objective Reinforcement
  Learning
Avatar
Qiyue Xia
112 views
TRiSM for Agentic AI: A Review of Trust, Risk, and Security Management
  in LLM-based Agentic Multi-Agent Systems
Avatar
librarian
110 views
AgentMisalignment: Measuring the Propensity for Misaligned Behaviour in
  LLM-Based Agents
Avatar
Akshat Naik
110 views
macOSWorld: A Multilingual Interactive Benchmark for GUI Agents
Avatar
Pei Yang
110 views
Does Thinking More always Help? Understanding Test-Time Scaling in
  Reasoning Models
Avatar
Soumya Suvra Ghosal
108 views
Linear Spatial World Models Emerge in Large Language Models
Avatar
Matthieu Tehenan
110 views
DPO Learning with LLMs-Judge Signal for Computer Use Agents
Avatar
librarian
109 views
The Limits of Predicting Agents from Behaviour
Avatar
Alexis Bellot
117 views
Sample, Predict, then Proceed: Self-Verification Sampling for Tool Use
  of LLMs
Avatar
librarian
114 views
Corrigibility as a Singular Target: A Vision for Inherently Reliable
  Foundation Models
Avatar
librarian
113 views
Data-to-Dashboard: Multi-Agent LLM Framework for Insightful
  Visualization in Enterprise Analytics
Avatar
Ran Zhang
139 views
ROTATE: Regret-driven Open-ended Training for Ad Hoc Teamwork
Avatar
Caroline Wang
135 views
Comparative of Genetic Fuzzy regression techniques for aeroacoustic
  phenomenons
Avatar
librarian
133 views
Fortune: Formula-Driven Reinforcement Learning for Symbolic Table
  Reasoning in Language Models
Avatar
librarian
131 views
Let's Reason Formally: Natural-Formal Hybrid Reasoning Enhances LLM's
  Math Capability
Avatar
librarian
132 views