arXiv cs.AI (Artificial Intelligence)

75 items · Knowledge Graphs, Ontologies & Intent/Semantic Modeling · site ↗

How Far Did They Go? The Persuasive Tactics of Covert LLM Agents in a Discontinued Field Experiment

arXiv cs.AI (Artificial Intelligence) 8h

What Should Agents Say? Action-state Communication for Efficient Multi-Agent Systems

arXiv cs.AI (Artificial Intelligence) 8h

I Know What You Meme, Even If it Emerged Today: Understanding Evolving Memes through Open-World Knowledge Acquisition

arXiv cs.AI (Artificial Intelligence) 8h

GITCO: Gated Inference-Time Context Optimization in TSFMs

arXiv cs.AI (Artificial Intelligence) 8h

Uncertainty Aware Functional Behavior Prediction and Material Fatigue Assessment for Circular Factory

arXiv cs.AI (Artificial Intelligence) 8h

SentinelBench: A Benchmark for Long-Running Monitoring Agents

arXiv cs.AI (Artificial Intelligence) 8h

An interpretable and trustworthy AI framework for large-scale longitudinal structure-pain association studies using data from the Osteoarthritis Initiative (OAI)

arXiv cs.AI (Artificial Intelligence) 8h

Synthetic Contrastive Reasoning for Multi-Table Q&A

arXiv cs.AI (Artificial Intelligence) 8h

Stability vs. Manipulability: Evaluating Robustness Under Post-Decision Interaction in LLM Judges

arXiv cs.AI (Artificial Intelligence) 8h

Residual Modeling for High-Fidelity Learned Compression of Scientific Data

arXiv cs.AI (Artificial Intelligence) 8h

LeanMarathon: Toward Reliable AI Co-Mathematicians through Long-Horizon Lean Autoformalization

arXiv cs.AI (Artificial Intelligence) 8h

Harnessing Generalist Agents for Contextualized Time Series

arXiv cs.AI (Artificial Intelligence) 8h

Agents' Last Exam

arXiv cs.AI (Artificial Intelligence) 8h

Mutation Without Variation: Convergence Dynamics in LLM-Driven Program Evolution

arXiv cs.AI (Artificial Intelligence) 8h

A Motivational Architecture for Conversational AGI

arXiv cs.AI (Artificial Intelligence) 8h

Toward Pre-Deployment Assurance for Enterprise AI Agents: Ontology-Grounded Simulation and Trust Certification

arXiv cs.AI (Artificial Intelligence) yest

Stumbling Into AI Emotional Dependence: How Routine AI Interactions Reshape Human Connection

arXiv cs.AI (Artificial Intelligence) yest

Thinking Through Signs: PEEL as a Semiotic Scaffolding for Epistemically Accountable AI-Enabled Research

arXiv cs.AI (Artificial Intelligence) yest

SMAC-Talk: A Natural Language Extension of the StarCraft Multi-Agent Challenge for Large Language Models

arXiv cs.AI (Artificial Intelligence) yest

Consensus is Strategically Insufficient: Reasoning-Trace Disagreement as a Knowledge-Representation Signal

arXiv cs.AI (Artificial Intelligence) yest

VAMPS: Visual-Assisted Mathematical Problem Solving Benchmark

arXiv cs.AI (Artificial Intelligence) yest

StepPRM-RTL: Stepwise Process-Reward Guided LLM Fine-Tuning for Enhanced RTL Synthesis

arXiv cs.AI (Artificial Intelligence) yest

Can Generalist Agents Automate Data Curation?

arXiv cs.AI (Artificial Intelligence) yest

Characterizing initial human-AI proof formalization workflows

arXiv cs.AI (Artificial Intelligence) yest

The Saturation Trap and the Subjectivity of Intervention Timing: Why Affect-Based Triggers and LLM Judges Fail to Time Interventions on Autonomous Agents

arXiv cs.AI (Artificial Intelligence) yest

Exploring Cross-Scenario Generality of Agentic Memory Systems: Diagnostics and a Strong Baseline

arXiv cs.AI (Artificial Intelligence) yest

The Digital Apprentice: A Framework for Human-Directed Agentic AI Development

arXiv cs.AI (Artificial Intelligence) yest

Online Skill Learning for Web Agents via State-Grounded Dynamic Retrieval

arXiv cs.AI (Artificial Intelligence) yest

Not All Errors Are Equal: Consequence-Aware Reasoning Compute Allocation

arXiv cs.AI (Artificial Intelligence) yest

Trivium: Temporal Regret as a First-Class Objective for Causal-Memory Controllers

arXiv cs.AI (Artificial Intelligence) yest

Visual Graph Scaffolds for Structural Reasoning in Large Language Models

arXiv cs.AI (Artificial Intelligence) Jun 3

AURA: Action-Gated Memory for Robot Policies at Constant VRAM

arXiv cs.AI (Artificial Intelligence) Jun 3

Evaluating Transformer and LSTM Frameworks for Prediction in Ungauged Basins

arXiv cs.AI (Artificial Intelligence) Jun 3

BehaviorBench: Modeling Real-World User Decisions from Behavioral Traces

arXiv cs.AI (Artificial Intelligence) Jun 3

ChatHealthAI: Aligning Electronic Health Record Representations with Large Language Models for Grounded Clinical Reasoning

arXiv cs.AI (Artificial Intelligence) Jun 3

Traj-Evolve: A Self-Evolving Multi-Agent System for Patient Trajectory Modeling in Lung Cancer Early Detection

arXiv cs.AI (Artificial Intelligence) Jun 3

An Exploration of Collision-based Enemy Morphology Generation

arXiv cs.AI (Artificial Intelligence) Jun 3

Thinking Past the Answer: Evaluating Harmful Overthinking in Large Reasoning Models

arXiv cs.AI (Artificial Intelligence) Jun 3

Toward a Modular Architecture for Embedded AI Agent Systems at the Edge

arXiv cs.AI (Artificial Intelligence) Jun 3

Don't Gamble, GAMBLe: An Analytical Framework for AI-Driven Research Systems

arXiv cs.AI (Artificial Intelligence) Jun 3

When Helping Hurts and How to Fix It: Multi-Agent Debate for Data Cleaning

arXiv cs.AI (Artificial Intelligence) Jun 3

Handoff Debt: The Rediscovery Cost When Coding Agents Take Over Interrupted Tasks

arXiv cs.AI (Artificial Intelligence) Jun 3

Large AI Models in Dental Healthcare: From General-Purpose Systems to Domain-Specific Foundation Models

arXiv cs.AI (Artificial Intelligence) Jun 3

What Benchmarks Don't Measure: The Case for Evaluating Abstention Competence in Autonomous Agents

arXiv cs.AI (Artificial Intelligence) Jun 3

WISE-HAR: A Generalizable Ensemble Deep Learning Framework for WiFi-Based Human Activity Recognition

arXiv cs.AI (Artificial Intelligence) Jun 3

Position Paper: Post-Solve Robustness in Decision Engines: Feasible Regions and Smoothness Under Perturbations

arXiv cs.AI (Artificial Intelligence) Jun 2

Emergent Collaborative Deliberation in Multi-Model AI Systems: A BFT-Derived Protocol for Epistemic Synthesis

arXiv cs.AI (Artificial Intelligence) Jun 2

Deliberative Curation: A Protocol for Multi-Agent Knowledge Bases

arXiv cs.AI (Artificial Intelligence) Jun 2

Agents on a Tree: Pathwise Coordination for Multi-Objective Molecular Optimization

arXiv cs.AI (Artificial Intelligence) Jun 2

Optimal Transport-based Permutation-Invariant Bayesian Optimization of Offshore Wind Farm Layouts

arXiv cs.AI (Artificial Intelligence) Jun 2

MindGames Arena Generalization Track: In2AI Solution with Delayed Per-Step Reward Attribution

arXiv cs.AI (Artificial Intelligence) Jun 2

Universal Quantum Transformer

arXiv cs.AI (Artificial Intelligence) Jun 2

Grokers: Bottom-Up Inductive Comprehension and Write-Time Intelligence over Typed Knowledge Graphs

arXiv cs.AI (Artificial Intelligence) Jun 2

Product-Aware Deep Autoencoders for Robust Process Monitoring in Multi-Product Cyber-Physical Systems

arXiv cs.AI (Artificial Intelligence) Jun 2

On the evolution of the concept of probability as a mirror of the evolution of reason

arXiv cs.AI (Artificial Intelligence) Jun 2

Evaluating Interactive Reasoning in Large Language Models: A Hierarchical Benchmark with Executable Games

arXiv cs.AI (Artificial Intelligence) Jun 2

A Multi-AI-agent Framework Enabling End-to-end Finite Element Analysis for Solid Mechanics Problems

arXiv cs.AI (Artificial Intelligence) Jun 2

CAST: Non-Privileged Clipped Asymmetric Self-Teaching with Advantage Flipping for GRPO

arXiv cs.AI (Artificial Intelligence) Jun 2

TIGER: Traceable Inference with Graph-Based Evidence Routing for Mitigating Hallucinations in Multimodal Generation

arXiv cs.AI (Artificial Intelligence) Jun 2

MindZero: Learning Online Mental Reasoning With Zero Annotations

arXiv cs.AI (Artificial Intelligence) Jun 2

PhyDrawGen: Physically Grounded Diagram Generation from Natural Language