Artificial Intelligence

Beyond Runtime Enforcement: Shield Synthesis as Defensibility Analysis for Adversarial Networks
librarian
1 view
AgentBeats: Agentifying Agent Assessment for Openness, Standardization, and Reproducibility
librarian
9 views
EurekAgent: Agent Environment Engineering is All You Need For Autonomous Scientific Discovery
librarian
22 views
Agents-K1: Towards Agent-native Knowledge Orchestration
librarian
8 views
Nonslop: A Gamified Experiment in Human-AI Collaborative Writing
Maria Edwards
15 views
Towards Responsibly Non-Compliant Machines
librarian
12 views
The Impossibility of Eliciting Latent Knowledge
librarian
15 views
A Five-Plane Reference Architecture for Runtime Governance of Production AI Agents
Krti Tallam
14 views
PROJECTMEM: A Local-First, Event-Sourced Memory and Judgment Layer for AI Coding Agents
librarian
14 views
StatefulDiscovery: Evidence-Calibrated Claim Formation in Open-Ended Scientific Discovery
12531182
12 views
Embodied-BenchClaw: An Autonomous Multi-Agent System for Embodied Spatial Intelligence Benchmark Construction
librarian
10 views
ABC-Bench: An Agentic Bio-Capabilities Benchmark for Biosecurity
librarian
15 views
CIAware-Bench: Benchmarking Control Intervention Awareness Across Frontier LLMs
librarian
15 views
Null-Space Constrained Low-Rank Adaptation for Response-Specified Large Language Model Unlearning
librarian
15 views
Workflow-GYM: Towards Long-Horizon Evaluation of Computer-use Agentic tasks in Real-World Professional Fields
librarian
18 views
ReasonAlloc: Hierarchical Decoding-Time KV Cache Budget Allocation for Reasoning Models
librarian
18 views
AutoPDE: Reliable Agentic PDE Solving via Explicitly Represented Solver Strategies
librarian
13 views
Frontier Coding Agents Use Metaprogramming to Adapt to Unfamiliar Programming Languages
librarian
13 views
Moonshine: An Autonomous Mathematical Research Agent Centered on Conjecture Generation
librarian
8 views
WorldKernel: A World Model is the Coupling Kernel of Admissible Possible Worlds
librarian
9 views
Recalling Too Well: Sycophancy Evaluation and Mitigation in Memory-Augmented Models
librarian
9 views
(Auto)formalization is supposed to be easy: Trellis process semantics for spelling out rigorous proofs
librarian
21 views
SIGA: Self-Evolving Coding-Agent Adapters for Scientific Simulation
librarian
24 views
Proxy Reward Internalization and Mechanistic Exploitation: A Learned Precursor to Reward Hacking and Its Generalization
Mohammad Beigi
22 views
SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research
librarian
25 views
Evaluation Cards: An Interpretive Layer for AI Evaluation Reporting
librarian
27 views
From 0-to-1 to 1-to-N: Reproducible Engineering Evidence for MetaAI Recursive Self-Design
librarian
22 views
Optical Reasoning: Rethinking Images as an Expressive Reasoning Medium Beyond Text
Yutong Bian
26 views
TokenMizer: Graph-Structured Session Memory for Long-Horizon LLM Context Management
Shweta Mishra
92 views
Vortex: Efficient and Programmable Sparse Attention Serving for AI Agents
Zhuoming Chen
40 views
Benchmark Everything Everywhere All at Once
librarian
31 views
Goedel-Architect: Streamlining Formal Theorem Proving with Blueprint Generation and Refinement
librarian
31 views