Hugging Face Daily Papers

75 items · Foundation Models & Frontier AI Labs · site ↗

ArcANE: Do Role-Playing Language Agents Stay in Character at the Right Time? Hugging Face Daily Papers now
Seoul National University Hugging Face Daily Papers now
TIDE: Proactive Multi-Problem Discovery via Template-Guided Iteration Hugging Face Daily Papers now
AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints Hugging Face Daily Papers now
University of Illinois at Urbana-Champaign Hugging Face Daily Papers now
VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding Hugging Face Daily Papers now
RobotValues: Evaluating Household Robots When Human Values Conflict Hugging Face Daily Papers now
Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation Hugging Face Daily Papers now
University of Zurich, Department of Computational Linguistics Hugging Face Daily Papers now
LoomVideo: Unifying Multimodal Inputs into Video Generation and Editing Hugging Face Daily Papers now
Personal AI Agent for Camera Roll VQA Hugging Face Daily Papers now
Rethinking Continual Experience Internalization for Self-Evolving LLM Agents Hugging Face Daily Papers now
Cosmos 3: Omnimodal World Models for Physical AI Hugging Face Daily Papers yest
Audio Interaction Model Hugging Face Daily Papers yest
Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories Hugging Face Daily Papers yest
Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning Hugging Face Daily Papers yest
Qwen-Image-Flash: Beyond Objective Design Hugging Face Daily Papers yest
M^3Eval: Multi-Modal Memory Evaluation through Cognitively-Grounded Video Tasks Hugging Face Daily Papers yest
OVO-S-Bench: A Hierarchical Benchmark for Streaming Spatial Intelligence in Multimodal LLMs Hugging Face Daily Papers yest
Intern Large Models Hugging Face Daily Papers yest
Echo-Infinity: Learning Evolving Memory for Real-Time Infinite Video Generation Hugging Face Daily Papers yest
ThoughtFold: Folding Reasoning Chains via Introspective Preference Learning Hugging Face Daily Papers yest
Streaming Communication in Multi-Agent Reasoning Hugging Face Daily Papers yest
Benchmarks are Not Enough: RAMP for Runtime Assessing of Agentic Models in Production Systems Hugging Face Daily Papers yest
OCC-RAG: Optimal Cognitive Core for Faithful Question Answering Hugging Face Daily Papers yest
Trust Region On-Policy Distillation Hugging Face Daily Papers yest
From Activation to Causality: Discovery of Causal Visual Representations in the Human Brain Hugging Face Daily Papers yest
Massachusetts Institute of Technology Hugging Face Daily Papers yest
Humanoid-GPT: Scaling Data and Structure for Zero-Shot Motion Tracking Hugging Face Daily Papers yest
KVarN: Variance-Normalized KV-Cache Quantization Mitigates Error Accumulation in Reasoning Tasks Hugging Face Daily Papers yest
HUAWEI Computing Systems Lab Hugging Face Daily Papers yest
A Local Perturbation Theory for Cross-Domain Interference and Recovery in Multi-Domain RL Hugging Face Daily Papers yest
MIRA: Mid-training Rubric Anchoring for Source-Aware Data Selection Hugging Face Daily Papers yest
World Models Meet Language Models: On the Complementarity of Concrete and Abstract Reasoning Hugging Face Daily Papers yest
AutoMedBench: Towards Medical AutoResearch with Agentic AI Models Hugging Face Daily Papers yest
University of California, Santa Cruz Hugging Face Daily Papers yest
Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs Hugging Face Daily Papers Jun 2
On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters Hugging Face Daily Papers Jun 2
A Matter of TASTE: Improving Coverage and Difficulty of Agent Benchmarks Hugging Face Daily Papers Jun 2
Technion Israel institute of technology Hugging Face Daily Papers Jun 2
K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts Hugging Face Daily Papers Jun 2
Carnegie Mellon University Hugging Face Daily Papers Jun 2
Draft-OPD: On-Policy Distillation for Speculative Draft Models Hugging Face Daily Papers Jun 2
Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding Hugging Face Daily Papers Jun 2
Shanghai Jiao Tong University Hugging Face Daily Papers Jun 2
Linear Ensembles Wash Away Watermarks: On the Fragility of Distributional Perturbations in LLMs Hugging Face Daily Papers Jun 2
King's College London Hugging Face Daily Papers Jun 2
VLMs are Good Teachers for Video Reasoning via Adaptive Test-Time Optimization Hugging Face Daily Papers Jun 2
GrepSeek: Training Search Agents for Direct Corpus Interaction Hugging Face Daily Papers Jun 1
University of Massachusetts Amherst Hugging Face Daily Papers Jun 1
COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation Hugging Face Daily Papers Jun 1
Trust-Region Behavior Blending for On-Policy Distillation Hugging Face Daily Papers Jun 1
Representation Forcing for Bottleneck-Free Unified Multimodal Models Hugging Face Daily Papers Jun 1
SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue Hugging Face Daily Papers Jun 1
Mellum2 Technical Report Hugging Face Daily Papers Jun 1
LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards Hugging Face Daily Papers Jun 1
Knowledge Engineer Group @ Tsinghua University Hugging Face Daily Papers Jun 1
GGT-100K: Generative Ground Truth for Generalizable Real-World Image Restoration Hugging Face Daily Papers Jun 1
Function2Scene: 3D Indoor Scene Layout from Functional Specifications Hugging Face Daily Papers Jun 1
Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer Hugging Face Daily Papers Jun 1
LongDS-Bench: On the Failure of Long-Horizon Agentic Data Analysis Hugging Face Daily Papers Jun 1
Hide-and-Seek in Trajectories: Discovering Failure Signals for VLA Runtime Monitoring Hugging Face Daily Papers Jun 1
University of Wisconsin-Madison Hugging Face Daily Papers Jun 1
AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security Hugging Face Daily Papers May 31
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments Hugging Face Daily Papers May 31
OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources Hugging Face Daily Papers May 31
CollectionLoRA: Collecting 50 Effects in 1 LoRA via Multi-Teacher On-Policy Distillation Hugging Face Daily Papers May 31
minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models Hugging Face Daily Papers May 31
YoCausal: How Far is Video Generation from World Model? A Causality Perspective Hugging Face Daily Papers May 31
Why Far Looks Up: Probing Spatial Representation in Vision-Language Models Hugging Face Daily Papers May 31
GenClaw: Code-Driven Agentic Image Generation Hugging Face Daily Papers May 31
How LoRA Remembers? A Parametric Memory Law for LLM Finetuning Hugging Face Daily Papers May 31
EarlyTom: Early Token Compression Completes Fast Video Understanding Hugging Face Daily Papers May 31
Native Audio-Visual Alignment for Generation Hugging Face Daily Papers May 31
UniSteer: Text-Guided Flow Matching in Activation Space for Versatile LLM Steering Hugging Face Daily Papers May 31

Keyboard

j / k
move between items
Space
expand / collapse
o
open original
s
save / unsave
m
mark read
/
focus search
?
this help