Less is More: Recursive Reasoning
with Tiny Networks
The Problem
Why Do Giant LLMs Fail at Logic?
LLMs like GPT-5 and Claude Sonnet 4.5 are masters of
pattern matching and fluent text generation. However,
they hit a wall with structured reasoning tasks like
Sudoku, complex mazes, or ARC-AGI puzzles.
The core issue is their autoregressive nature—they
predict the next token without a built-in mechanism to
backtrack or correct a flawed logical step.
A single early mistake in a Sudoku grid cascades,
invalidating the entire solution. Scaling up data and
parameters doesn't fix this fundamental architectural
limitation.
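To make the failure mode concrete, here is a toy Python sketch of greedy autoregressive decoding; model is a placeholder for any next-token predictor, not a real API. Once a token is emitted, every later prediction conditions on it, and no step exists to revisit or repair it.

```python
# Toy sketch: greedy autoregressive decoding has no repair step.
# `model` is a stand-in for any next-token predictor (an assumption,
# not a real API); it maps the tokens so far to the next token.

def greedy_decode(model, prompt_tokens, max_steps=81):
    tokens = list(prompt_tokens)
    for _ in range(max_steps):
        next_token = model(tokens)  # most likely next token, given all prior ones
        tokens.append(next_token)   # committed forever: no backtracking exists
    return tokens

# If `model` places a wrong digit in an early Sudoku cell, every one of
# the remaining ~80 predictions conditions on that mistake.
```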
Source: Medium
Hierarchical Reasoning Models
Before TRMs, Hierarchical Reasoning Models (HRMs)
offered a promising path. HRMs used two small networks
working in a hierarchy: a fast "thinking" network and a
slower "conceptual" network that guided it.
By running these two networks in a nested, recursive
loop, HRMs showed that smaller models could rival
much larger ones on logical tasks. However, the
two-network design was complex, biologically
metaphorical, and computationally tricky to tune.
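A minimal sketch of that two-network recursion, assuming a fast update function f_low and a slow one f_high; the names, the T-step schedule, and the decode readout are illustrative choices, not the authors' exact code:

```python
# Illustrative HRM-style loop: the fast "thinking" network takes several
# small steps, then the slow "conceptual" network updates once to guide it.
# f_low, f_high, and decode are placeholders; T and n_cycles are assumed.

def hrm_solve(f_low, f_high, decode, x, z_low, z_high, T=4, n_cycles=8):
    for _ in range(n_cycles):
        for _ in range(T):
            z_low = f_low(x, z_low, z_high)  # fast, fine-grained reasoning
        z_high = f_high(z_high, z_low)       # slow, high-level guidance
    return decode(z_high)                    # read the answer off the slow state
```

Two separate networks and two coupled hidden states are exactly what made the scheme hard to tune.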
Source: Medium
The Breakthrough: TRMs
TRMs strip down the HRM concept to its elegant core.
They replace the two-network hierarchy with a single,
tiny network that operates in a recursive loop. This
network maintains two simple pieces of information:
y (the solution): The current best answer (e.g., a
Sudoku grid).
z (the reasoning state): A latent memory of its
current "thought process."
With each iteration, the network refines both y and z,
inching closer to the correct solution.
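A minimal sketch of that loop under the same hedges: net stands for the single tiny network, and the loop counts are illustrative, not the paper's exact hyperparameters. (In the paper, the answer update conditions only on y and z; both calls share one signature here for simplicity.)

```python
# Illustrative TRM-style recursion: one tiny network refines both the
# latent reasoning state z and the current answer y. `net` and the
# loop counts below are placeholders, not the paper's configuration.

def trm_refine(net, x, y, z, n_latent=6, n_cycles=3):
    for _ in range(n_cycles):
        for _ in range(n_latent):
            z = net(x, y, z)  # revise the latent "thought process"
        y = net(x, y, z)      # revise the current best answer from z
    return y, z
```

Because the same weights are reused at every step, effective reasoning depth grows with the number of iterations while the parameter count stays tiny.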
Source: Less is More: Recursive Reasoning with Tiny Networks
The Stunning Results
Tiny Model, Giant-Killing Performance
The empirical results are striking. On classic reasoning
benchmarks, a TRM with just ~7 million parameters
dramatically outperforms billion-parameter LLMs:
Sudoku-Extreme: TRM achieved 87.4% accuracy, while
state-of-the-art models like DeepSeek R1, Claude 3, and
o3-mini scored a shocking 0%.

ARC-AGI Puzzles: On this benchmark for fluid intelligence,
the TRM scored 44.6%, nearly tripling the performance of
DeepSeek R1 (15.8%) and other large models.

Maze-Hard: For complex pathfinding, TRM reached 85.3%
accuracy, significantly outperforming its predecessor
HRM (74.5%).
Source: Less is More: Recursive Reasoning with Tiny Networks