Optimizing Length Compression in Large Reasoning Models

0upvotes

By: Zhengxiang Cheng, Dongping Chen, Mingyang Fu, Tianyi Zhou

Large Reasoning Models (LRMs) have achieved remarkable success, yet they often suffer from producing unnecessary and verbose reasoning chains. We identify a core aspect of this issue as "invalid thinking" -- models tend to repeatedly double-check their work after having derived the correct answer. To address this specific inefficiency, we move beyond the general principles of Efficacy and Efficiency to propose two new, fine-grained principles... more

Artificial IntelligenceJune 18, 2025 2:51am

Comments (0)
Views (13)

VideoPDE: Unified Generative PDE Solving via Video Inpainting Diffusion Models

0upvotes

By: Edward Li, Zichen Wang, Jiahe Huang, Jeong Joon Park

We present a unified framework for solving partial differential equations (PDEs) using video-inpainting diffusion transformer models. Unlike existing methods that devise specialized strategies for either forward or inverse problems under full or partial observation, our approach unifies these tasks under a single, flexible generative framework. Specifically, we recast PDE-solving as a generalized inpainting problem, e.g., treating forward pre... more

Machine LearningJune 17, 2025 1:11pm

4 SciCasts by .

Comments (0)
Views (16)

TimeMaster: Training Time-Series Multimodal LLMs to Reason via Reinforcement Learning

0upvotes

By: Junru Zhang, Lang Feng, Xu Guo, Yuhan Wu, Yabo Dong, Duanqing Xu

Time-series reasoning remains a significant challenge in multimodal large language models (MLLMs) due to the dynamic temporal patterns, ambiguous semantics, and lack of temporal priors. In this work, we introduce TimeMaster, a reinforcement learning (RL)-based method that enables time-series MLLMs to perform structured, interpretable reasoning directly over visualized time-series inputs and task prompts. TimeMaster adopts a three-part structu... more

Machine LearningJune 17, 2025 2:39am

Comments (0)
Views (6)

Attribution-guided Pruning for Compression, Circuit Discovery, and Targeted Correction in LLMs

0upvotes

By: Sayed Mohammad Vakilzadeh Hatefi, Maximilian Dreyer, Reduan Achtibat, Patrick Kahardipraja, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

Large Language Models (LLMs) are central to many contemporary AI applications, yet their extensive parameter counts pose significant challenges for deployment in memory- and compute-constrained environments. Recent works in eXplainable AI (XAI), particularly on attribution methods, suggest that interpretability can also enable model compression by identifying and removing components irrelevant to inference. In this paper, we leverage Layer-wi... more

Machine LearningJune 17, 2025 2:38am

4 SciCasts by .