arXiv cs.AI (Artificial Intelligence)

75 items · Knowledge Graphs, Ontologies & Intent/Semantic Modeling

How Far Did They Go? The Persuasive Tactics of Covert LLM Agents in a Discontinued Field Experiment arXiv cs.AI (Artificial Intelligence) 8h
What Should Agents Say? Action-state Communication for Efficient Multi-Agent Systems arXiv cs.AI (Artificial Intelligence) 8h
I Know What You Meme, Even If it Emerged Today: Understanding Evolving Memes through Open-World Knowledge Acquisition arXiv cs.AI (Artificial Intelligence) 8h
GITCO: Gated Inference-Time Context Optimization in TSFMs arXiv cs.AI (Artificial Intelligence) 8h
Uncertainty Aware Functional Behavior Prediction and Material Fatigue Assessment for Circular Factory arXiv cs.AI (Artificial Intelligence) 8h
SentinelBench: A Benchmark for Long-Running Monitoring Agents arXiv cs.AI (Artificial Intelligence) 8h
An interpretable and trustworthy AI framework for large-scale longitudinal structure-pain association studies using data from the Osteoarthritis Initiative (OAI) arXiv cs.AI (Artificial Intelligence) 8h
Synthetic Contrastive Reasoning for Multi-Table Q&A arXiv cs.AI (Artificial Intelligence) 8h
Stability vs. Manipulability: Evaluating Robustness Under Post-Decision Interaction in LLM Judges arXiv cs.AI (Artificial Intelligence) 8h
Residual Modeling for High-Fidelity Learned Compression of Scientific Data arXiv cs.AI (Artificial Intelligence) 8h
LeanMarathon: Toward Reliable AI Co-Mathematicians through Long-Horizon Lean Autoformalization arXiv cs.AI (Artificial Intelligence) 8h
Harnessing Generalist Agents for Contextualized Time Series arXiv cs.AI (Artificial Intelligence) 8h
Agents' Last Exam arXiv cs.AI (Artificial Intelligence) 8h
Mutation Without Variation: Convergence Dynamics in LLM-Driven Program Evolution arXiv cs.AI (Artificial Intelligence) 8h
A Motivational Architecture for Conversational AGI arXiv cs.AI (Artificial Intelligence) 8h
Toward Pre-Deployment Assurance for Enterprise AI Agents: Ontology-Grounded Simulation and Trust Certification arXiv cs.AI (Artificial Intelligence) yest
Stumbling Into AI Emotional Dependence: How Routine AI Interactions Reshape Human Connection arXiv cs.AI (Artificial Intelligence) yest
Thinking Through Signs: PEEL as a Semiotic Scaffolding for Epistemically Accountable AI-Enabled Research arXiv cs.AI (Artificial Intelligence) yest
SMAC-Talk: A Natural Language Extension of the StarCraft Multi-Agent Challenge for Large Language Models arXiv cs.AI (Artificial Intelligence) yest
Consensus is Strategically Insufficient: Reasoning-Trace Disagreement as a Knowledge-Representation Signal arXiv cs.AI (Artificial Intelligence) yest
VAMPS: Visual-Assisted Mathematical Problem Solving Benchmark arXiv cs.AI (Artificial Intelligence) yest
StepPRM-RTL: Stepwise Process-Reward Guided LLM Fine-Tuning for Enhanced RTL Synthesis arXiv cs.AI (Artificial Intelligence) yest
Can Generalist Agents Automate Data Curation? arXiv cs.AI (Artificial Intelligence) yest
Characterizing initial human-AI proof formalization workflows arXiv cs.AI (Artificial Intelligence) yest
The Saturation Trap and the Subjectivity of Intervention Timing: Why Affect-Based Triggers and LLM Judges Fail to Time Interventions on Autonomous Agents arXiv cs.AI (Artificial Intelligence) yest
Exploring Cross-Scenario Generality of Agentic Memory Systems: Diagnostics and a Strong Baseline arXiv cs.AI (Artificial Intelligence) yest
The Digital Apprentice: A Framework for Human-Directed Agentic AI Development arXiv cs.AI (Artificial Intelligence) yest
Online Skill Learning for Web Agents via State-Grounded Dynamic Retrieval arXiv cs.AI (Artificial Intelligence) yest
Not All Errors Are Equal: Consequence-Aware Reasoning Compute Allocation arXiv cs.AI (Artificial Intelligence) yest
Trivium: Temporal Regret as a First-Class Objective for Causal-Memory Controllers arXiv cs.AI (Artificial Intelligence) yest
Visual Graph Scaffolds for Structural Reasoning in Large Language Models arXiv cs.AI (Artificial Intelligence) Jun 3
AURA: Action-Gated Memory for Robot Policies at Constant VRAM arXiv cs.AI (Artificial Intelligence) Jun 3
Evaluating Transformer and LSTM Frameworks for Prediction in Ungauged Basins arXiv cs.AI (Artificial Intelligence) Jun 3
BehaviorBench: Modeling Real-World User Decisions from Behavioral Traces arXiv cs.AI (Artificial Intelligence) Jun 3
ChatHealthAI: Aligning Electronic Health Record Representations with Large Language Models for Grounded Clinical Reasoning arXiv cs.AI (Artificial Intelligence) Jun 3
Traj-Evolve: A Self-Evolving Multi-Agent System for Patient Trajectory Modeling in Lung Cancer Early Detection arXiv cs.AI (Artificial Intelligence) Jun 3
An Exploration of Collision-based Enemy Morphology Generation arXiv cs.AI (Artificial Intelligence) Jun 3
Thinking Past the Answer: Evaluating Harmful Overthinking in Large Reasoning Models arXiv cs.AI (Artificial Intelligence) Jun 3
Toward a Modular Architecture for Embedded AI Agent Systems at the Edge arXiv cs.AI (Artificial Intelligence) Jun 3
Don't Gamble, GAMBLe: An Analytical Framework for AI-Driven Research Systems arXiv cs.AI (Artificial Intelligence) Jun 3
When Helping Hurts and How to Fix It: Multi-Agent Debate for Data Cleaning arXiv cs.AI (Artificial Intelligence) Jun 3
Handoff Debt: The Rediscovery Cost When Coding Agents Take Over Interrupted Tasks arXiv cs.AI (Artificial Intelligence) Jun 3
Large AI Models in Dental Healthcare: From General-Purpose Systems to Domain-Specific Foundation Models arXiv cs.AI (Artificial Intelligence) Jun 3
What Benchmarks Don't Measure: The Case for Evaluating Abstention Competence in Autonomous Agents arXiv cs.AI (Artificial Intelligence) Jun 3
WISE-HAR: A Generalizable Ensemble Deep Learning Framework for WiFi-Based Human Activity Recognition arXiv cs.AI (Artificial Intelligence) Jun 3
Position Paper: Post-Solve Robustness in Decision Engines: Feasible Regions and Smoothness Under Perturbations arXiv cs.AI (Artificial Intelligence) Jun 2
Emergent Collaborative Deliberation in Multi-Model AI Systems: A BFT-Derived Protocol for Epistemic Synthesis arXiv cs.AI (Artificial Intelligence) Jun 2
Deliberative Curation: A Protocol for Multi-Agent Knowledge Bases arXiv cs.AI (Artificial Intelligence) Jun 2
Agents on a Tree: Pathwise Coordination for Multi-Objective Molecular Optimization arXiv cs.AI (Artificial Intelligence) Jun 2
Optimal Transport-based Permutation-Invariant Bayesian Optimization of Offshore Wind Farm Layouts arXiv cs.AI (Artificial Intelligence) Jun 2
MindGames Arena Generalization Track: In2AI Solution with Delayed Per-Step Reward Attribution arXiv cs.AI (Artificial Intelligence) Jun 2
Universal Quantum Transformer arXiv cs.AI (Artificial Intelligence) Jun 2
Grokers: Bottom-Up Inductive Comprehension and Write-Time Intelligence over Typed Knowledge Graphs arXiv cs.AI (Artificial Intelligence) Jun 2
Product-Aware Deep Autoencoders for Robust Process Monitoring in Multi-Product Cyber-Physical Systems arXiv cs.AI (Artificial Intelligence) Jun 2
On the evolution of the concept of probability as a mirror of the evolution of reason arXiv cs.AI (Artificial Intelligence) Jun 2
Evaluating Interactive Reasoning in Large Language Models: A Hierarchical Benchmark with Executable Games arXiv cs.AI (Artificial Intelligence) Jun 2
A Multi-AI-agent Framework Enabling End-to-end Finite Element Analysis for Solid Mechanics Problems arXiv cs.AI (Artificial Intelligence) Jun 2
CAST: Non-Privileged Clipped Asymmetric Self-Teaching with Advantage Flipping for GRPO arXiv cs.AI (Artificial Intelligence) Jun 2
TIGER: Traceable Inference with Graph-Based Evidence Routing for Mitigating Hallucinations in Multimodal Generation arXiv cs.AI (Artificial Intelligence) Jun 2
MindZero: Learning Online Mental Reasoning With Zero Annotations arXiv cs.AI (Artificial Intelligence) Jun 2
PhyDrawGen: Physically Grounded Diagram Generation from Natural Language arXiv cs.AI (Artificial Intelligence) Jun 1
Physically Viable World Models: A Case for Query-Conditioned Embodied AI arXiv cs.AI (Artificial Intelligence) Jun 1
Transforming and Encoding FTS for SAT Solving: What Helps, What Hurts (Extended Version) arXiv cs.AI (Artificial Intelligence) Jun 1
Procedural Generation of First Person Shooter Maps using Map-Elites arXiv cs.AI (Artificial Intelligence) Jun 1
Uncertainty-Aware and Temporally Regulated Expert Advice in Reinforcement Learning for Autonomous Driving arXiv cs.AI (Artificial Intelligence) Jun 1
Harness Updating Is Not Harness Benefit: Disentangling Evolution Capabilities in Self-Evolving LLM Agents arXiv cs.AI (Artificial Intelligence) Jun 1
EHRBench: An Automated and Reliable EHR-based Benchmark for Clinical Decision Making with LLMs arXiv cs.AI (Artificial Intelligence) Jun 1
Structure-Induced Information for Rerooting Levin Tree Search arXiv cs.AI (Artificial Intelligence) Jun 1
Healthcare Mechanisms from Policy-as-Code Search under Strategic Provider Response arXiv cs.AI (Artificial Intelligence) Jun 1
MAVEN: Improving Generalization in Agentic Tool Calling arXiv cs.AI (Artificial Intelligence) Jun 1
Generating Graph-like Rules for Knowledge Graph Reasoning via Diffusion Models arXiv cs.AI (Artificial Intelligence) Jun 1
Learning Agent-Compatible Context Management for Long-Horizon Tasks arXiv cs.AI (Artificial Intelligence) Jun 1
PReMISE: Policy Rubrics as Measurement Specifications for LLM Judges arXiv cs.AI (Artificial Intelligence) Jun 1
Planner-Centric Reinforcement Learning for Deep Research with Structure-Aware Reward arXiv cs.AI (Artificial Intelligence) Jun 1
SLAT: Segment-Level Adaptive Trimming for Efficient CoT Reasoning arXiv cs.AI (Artificial Intelligence) Jun 1
