Benchmarking graph construction by large language models for coherence-driven inference

2502.13953

Total: 1

#1 Benchmarking graph construction by large language models for coherence-driven inference [PDF²] [Copy] [Kimi¹²] [REL]

Authors: Steve Huntsman, Jewell Thomas

We devise an algorithm to generate propositions that objectively instantiate graphs supporting coherence-driven inference. We also benchmark the ability of large language models (LLMs) to reconstruct coherence graphs from (a simple transformation of) propositions expressed in natural language, with promising results from a single prompt to reasoning-optimized LLMs. For example, o1/3/4-mini achieve perfect reconstruction half of the time on sparse graphs. Coherence-driven inference on consistency evaluations by LLMs may advance machine cognition capabilities.

Subject: Artificial Intelligence

Publish: 2025-02-19 18:53:16 UTC

2502.13953

#1 Benchmarking graph construction by large language models for coherence-driven inference [PDF2] [Copy] [Kimi12] [REL]

#1 Benchmarking graph construction by large language models for coherence-driven inference [PDF²] [Copy] [Kimi¹²] [REL]