AGITB: A Signal-Level Benchmark for Evaluating Artificial General Intelligence

#1 AGITB: A Signal-Level Benchmark for Evaluating Artificial General Intelligence [PDF¹] [Copy] [Kimi³] [REL]

Despite major advances in machine learning, current artificial intelligence systems continue to fall short of human-like general intelligence. Existing evaluation frameworks, which are centered on language or perception tasks, fail to capture generality at its core and offer no guidance. The Artificial General Intelligence Testbed (AGITB) is a novel, freely available benchmarking suite consisting of thirteen core requirements, twelve of which are implemented as fully automatable tests designed to assess low-level cognitive precursors through binary signal prediction. AGITB requires models to forecast temporal sequences without pretraining, symbolic manipulation, or semantic grounding. The framework isolates core computational invariants-such as determinism, sensitivity, and generalization-that align with principles of biological information processing. Engineered to resist brute-force and memorization-based approaches, AGITB presumes no prior knowledge and demands learning from first principles. While humans pass all tests, no current AI system has met the full AGITB criteria, underscoring its potential as a rigorous, interpretable, and actionable benchmark for guiding and evaluating progress toward artificial general intelligence. A reference implementation of AGITB is available on GitHub.

Subject: Artificial Intelligence

Publish: 2025-04-06 10:01:15 UTC

2504.04430

#1 AGITB: A Signal-Level Benchmark for Evaluating Artificial General Intelligence [PDF1] [Copy] [Kimi3] [REL]

#1 AGITB: A Signal-Level Benchmark for Evaluating Artificial General Intelligence [PDF¹] [Copy] [Kimi³] [REL]