Computation and Language

AutoMind: Adaptive Knowledgeable Agent for Automated Data Science
Avatar
librarian
23 views
Causal Sufficiency and Necessity Improves Chain-of-Thought Reasoning
Avatar
librarian
46 views
Constrained Entropic Unlearning: A Primal-Dual Framework for Large
  Language Models
Avatar
librarian
99 views
Critique-GRPO: Advancing LLM Reasoning with Natural Language and
  Numerical Feedback
Avatar
librarian
98 views
ATLAS: Learning to Optimally Memorize the Context at Test Time
Avatar
librarian
141 views
ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning
  Engineering
Avatar
librarian
126 views
LoLA: Low-Rank Linear Attention With Sparse Caching
Avatar
librarian
125 views
DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural
  Language and Reinforcement Learning
Avatar
Jiahao Xu
122 views
Learning Composable Chains-of-Thought
Avatar
librarian
122 views
"KAN you hear me?" Exploring Kolmogorov-Arnold Networks for Spoken
  Language Understanding
Avatar
Alkis Koudounas
135 views
THiNK: Can Large Language Models Think-aloud?
Avatar
Yongan Yu
129 views
Do Large Language Models Excel in Complex Logical Reasoning with Formal
  Language?
Avatar
Jin Jiang
127 views
MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent
  Systems
Avatar
librarian
123 views
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs
  via Reinforcement Learning
Avatar
librarian
129 views
Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous
  Concept Space
Avatar
librarian
123 views
A Federated Splitting Framework for LLMs: Security, Efficiency, and
  Adaptability
Avatar
librarian
121 views
VerifyBench: Benchmarking Reference-based Reward Systems for Large
  Language Models
Avatar
librarian
120 views
BIM-GPT: a Prompt-Based Virtual Assistant Framework for BIM Information
  Retrieval
Avatar
Hervé Onguéné
141 views
Learning Dynamics in Continual Pre-Training for Large Language Models
Avatar
librarian
129 views
ComPO: Preference Alignment via Comparison Oracles
Avatar
librarian
129 views
Reasoning Models Don't Always Say What They Think
Avatar
Yanda Chen
143 views
Whisper-LM: Improving ASR Models with Language Models for Low-Resource
  Languages
Avatar
Hussein Kedir
146 views
Attention Is All You Need

Attention Is All You Need

Computation and Language
Avatar
경택 오
205 views
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Avatar
yorba
174 views
Attention Is All You Need

Attention Is All You Need

Computation and Language
Avatar
Ilya Baimetov
415 views
A Pipeline For Discourse Circuits From CCG
Avatar
ScienceCast Board
324 views
All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated
  Text
Avatar
Yael Flax
331 views
Meta-path Augmented Response Generation
Avatar
ScienceCast Board
317 views
CliNER 2.0: Accessible and Accurate Clinical Concept Extraction
Avatar
Sasa Pure
298 views
A Hybrid Architecture for Multi-Party Conversational Systems
Avatar
priaon-flag
307 views
Analyzing the Structure of Attention in a Transformer Language Model
Avatar
levymoshe16
328 views
Direct Neural Machine Translation with Task-level Mixture of Experts
  models
Avatar
Isidora Tourni
324 views