CRAFT-E: A Neuro-Symbolic Framework for Embodied Affordance Grounding

#1 CRAFT-E: A Neuro-Symbolic Framework for Embodied Affordance Grounding [PDF] [Copy] [Kimi] [REL]

Authors: Zhou Chen, Joe Lin, Carson Bulgin, Sathyanarayanan N. Aakur

Assistive robots operating in unstructured environments must understand not only what objects are, but what they can be used for. This requires grounding language-based action queries to objects that both afford the requested function and can be physically retrieved. Existing approaches often rely on black-box models or fixed affordance labels, limiting transparency, controllability, and reliability for human-facing applications. We introduce CRAFT-E, a modular neuro-symbolic framework that composes a structured verb-property-object knowledge graph with visual-language alignment and energy-based grasp reasoning. The system generates interpretable grounding paths that expose the factors influencing object selection and incorporates grasp feasibility as an integral part of affordance inference. We further construct a benchmark dataset with unified annotations for verb-object compatibility, segmentation, and grasp candidates, and deploy the full pipeline on a physical robot. CRAFT-E achieves competitive performance in static scenes, ImageNet-based functional retrieval, and real-world trials involving 20 verbs and 39 objects. The framework remains robust under perceptual noise and provides transparent, component-level diagnostics. By coupling symbolic reasoning with embodied perception, CRAFT-E offers an interpretable and customizable alternative to end-to-end models for affordance-grounded object selection, supporting trustworthy decision-making in assistive robotic systems.

Subjects: Robotics , Artificial Intelligence

Publish: 2025-12-03 19:54:27 UTC

2512.04231

#1 CRAFT-E: A Neuro-Symbolic Framework for Embodied Affordance Grounding [PDF] [Copy] [Kimi] [REL]