Computer Science

AutoMind: Adaptive Knowledgeable Agent for Automated Data Science
Avatar
librarian
23 views
GenPlanX. Generation of Plans and Execution
Avatar
librarian
19 views
Rethinking Losses for Diffusion Bridge Samplers
Avatar
librarian
20 views
Self-Adapting Language Models
Avatar
Adam Zweiger
37 views
A Study on Individual Spatiotemporal Activity Generation Method Using
  MCP-Enhanced Chain-of-Thought Large Language Models
Avatar
librarian
34 views
Breaking Bad Molecules: Are MLLMs Ready for Structure-Level Molecular
  Detoxification?
Avatar
Fei-Yue Wang
35 views
Spurious Rewards: Rethinking Training Signals in RLVR
Avatar
Rulin Shao
34 views
LLMail-Inject: A Dataset from a Realistic Adaptive Prompt Injection
  Challenge
Avatar
librarian
39 views
Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven
  Thinking and Visual Drawing

Reinforcing Spatial Reasoning in Vision-Langua...

Computer Vision and Pattern Recognition
Avatar
librarian
42 views
Outside Knowledge Conversational Video (OKCV) Dataset -- Dialoguing over
  Videos

Outside Knowledge Conversational Video (OKCV) ...

Computer Vision and Pattern Recognition
Avatar
librarian
44 views
Multiverse: Your Language Models Secretly Decide How to Parallelize and
  Merge Generation
Avatar
Xinyu Yang
48 views
Causal Sufficiency and Necessity Improves Chain-of-Thought Reasoning
Avatar
librarian
46 views
How Do People Revise Inconsistent Beliefs? Examining Belief Revision in
  Humans with User Studies
Avatar
Stylianos Vasileiou
45 views
V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction
  and Planning
Avatar
Nicolas Ballas
45 views
VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement
  Learning
Avatar
librarian
53 views
Measuring Data Science Automation: A Survey of Evaluation Tools for AI
  Assistants and Agents
Avatar
Irene Testini
55 views
Cost-Optimal Active AI Model Evaluation
Avatar
librarian
61 views
Decoupling the Image Perception and Multimodal Reasoning for Reasoning
  Segmentation with Digital Twin Representations

Decoupling the Image Perception and Multimodal...

Computer Vision and Pattern Recognition
Avatar
librarian
69 views
Reinforcing Multimodal Understanding and Generation with Dual
  Self-rewards
Avatar
librarian
68 views
GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection
  Behavior
Avatar
librarian
65 views
$τ^2$-Bench: Evaluating Conversational Agents in a Dual-Control
  Environment
Avatar
Victor Barres
66 views