Computation and Language

Toward Generalist Autonomous Research via Hypothesis-Tree Refinement

Toward Generalist Autonomous Research via Hypo...

Computation and Language

Jiajie Jin

10 views

End-to-End Context Compression at Scale

End-to-End Context Compression at Scale

Computation and Language

librarian

24 views

SPADE-Bench: Evaluating Spontaneous Strategic Deception in Agents via Plan-Action Divergence

SPADE-Bench: Evaluating Spontaneous Strategic ...

Computation and Language

librarian

30 views

Rethinking Memory as Continuously Evolving Connectivity

Rethinking Memory as Continuously Evolving Con...

Computation and Language

librarian

48 views

MeMo: Memory as a Model

MeMo: Memory as a Model

Computation and Language

Ryan Quek

80 views

The Impossibility Triangle of Long-Context Modeling

The Impossibility Triangle of Long-Context Mod...

Computation and Language

librarian

58 views

GiVA: Gradient-Informed Bases for Vector-Based Adaptation

GiVA: Gradient-Informed Bases for Vector-Based...

Computation and Language

Neeraj Gangwar

78 views

A Multimodal Text- and Graph-Based Approach for Open-Domain Event Extraction from Documents

A Multimodal Text- and Graph-Based Approach fo...

Computation and Language

librarian

83 views

Chat2Workflow: A Benchmark for Generating Executable Visual Workflows with Natural Language

Chat2Workflow: A Benchmark for Generating Exec...

Computation and Language

librarian

85 views

CD2CR: Co-reference Resolution Across Documents and Domains

CD2CR: Co-reference Resolution Across Document...

Computation and Language

k-m-smit2

79 views

Demystifying OPD: Length Inflation and Stabilization Strategies for Large Language Models

Demystifying OPD: Length Inflation and Stabili...

Computation and Language

librarian

128 views

ClawBench: Can AI Agents Complete Everyday Online Tasks?

ClawBench: Can AI Agents Complete Everyday Onl...

Computation and Language

librarian

114 views

Synthetic Sandbox for Training Machine Learning Engineering Agents

Synthetic Sandbox for Training Machine Learnin...

Computation and Language

Yuhang Zhou

119 views

Grounded Token Initialization for New Vocabulary in LMs for Generative Recommendation

Grounded Token Initialization for New Vocabula...

Computation and Language

Daiwei Chen

133 views

AstroConcepts: A Large-Scale Multi-Label Classification Corpus for Astrophysics

AstroConcepts: A Large-Scale Multi-Label Class...

Computation and Language

librarian

100 views

AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents

AgentSwing: Adaptive Parallel Context Manageme...

Computation and Language

librarian

103 views

F2LLM-v2: Inclusive, Performant, and Efficient Embeddings for a Multilingual World

F2LLM-v2: Inclusive, Performant, and Efficient...

Computation and Language

librarian

186 views

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Nemotron-Cascade 2: Post-Training LLMs with Ca...

Computation and Language

librarian

171 views

Learning When to Attend: Conditional Memory Access for Long-Context LLMs

Learning When to Attend: Conditional Memory Ac...

Computation and Language

Aditya Chattopadhyay

121 views

Knowledge Distillation with Structured Chain-of-Thought for Text-to-SQL

Knowledge Distillation with Structured Chain-o...

Computation and Language

Khushboo Thaker

123 views

SciMDR: Benchmarking and Advancing Scientific Multimodal Document Reasoning

SciMDR: Benchmarking and Advancing Scientific ...

Computation and Language

librarian

133 views

Instruction set for the representation of graphs

Instruction set for the representation of graphs

Computation and Language

Ezequiel López-Rubio

116 views

Monitoring Emergent Reward Hacking During Generation via Internal Activations

Monitoring Emergent Reward Hacking During Gene...

Computation and Language

librarian

125 views

CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning

CHIMERA: Compact Synthetic Data for Generaliza...

Computation and Language

Xinyu Zhu

138 views

Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling

Team of Thoughts: Efficient Test-time Scaling ...

Computation and Language

librarian

134 views

Attention Is All You Need

Attention Is All You Need

Computation and Language

Dr. Murat ALTUN

146 views

Multi-LLM Thematic Analysis with Dual Reliability Metrics: Combining Cohen's Kappa and Semantic Similarity for Qualitative Research Validation

Multi-LLM Thematic Analysis with Dual Reliabil...

Computation and Language

Nilesh Jain

141 views

UltraLogic: Enhancing LLM Reasoning through Large-Scale Data Synthesis and Bipolar Float Reward

UltraLogic: Enhancing LLM Reasoning through La...

Computation and Language

librarian

211 views

Attention Is All You Need

Attention Is All You Need

Computation and Language

Salman

211 views

Memory in the Age of AI Agents

Memory in the Age of AI Agents

Computation and Language

librarian

246 views

Non-Resolution Reasoning: A Framework for Preserving Semantic Ambiguity in Language Models

Non-Resolution Reasoning: A Framework for Pres...

Computation and Language

Kei Saito

221 views

Latent Collaboration in Multi-Agent Systems

Latent Collaboration in Multi-Agent Systems

Computation and Language

librarian

251 views

Web analytics