Quick answer

AI Summary: Achieves high reasoning performance on mobile-grade hardware using a hybrid architecture.

Tiny Recursive Reasoning with Mamba-2 Attention Hybrid

Authors
Wenlong Wang · Fergal Reid

ABSTRACT

Recent work demonstrates that tiny networks (7M parameters) can achieve strong performance on abstract reasoning through latent recursion. We investigate whether Mamba-2's state space recurrence, itself a form of iterative refinement, preserves reasoning capability when it replaces Transformer blocks in a recursive scaffold. Maintaining parameter parity (6.8M), we find that the Mamba-2 hybrid improves pass@2 on ARC-AGI-1 by 2.0% and consistently outperforms the Transformer-based baseline at higher K values. Our results validate that SSM-based operators are viable candidates in recursive design, establishing a first step toward understanding the best mixing strategies for 'more thinking time' over 'bigger models'.
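
The abstract reports results as pass@2 and at higher K values. As a reading aid, here is a minimal sketch of how a pass@K score of this kind is commonly computed for grid-prediction tasks: a task counts as solved if any of the first K ranked candidate outputs matches the target grid exactly. The function name `pass_at_k`, the list-of-lists grid representation, and the assumption that candidates arrive already ranked are all illustrative choices. The abstract does not describe the authors' evaluation protocol, so this should not be read as their procedure.

```python
from typing import List, Sequence

# Assumption: a grid is a list of rows of integer colour codes.
Grid = List[List[int]]


def pass_at_k(
    ranked_candidates: Sequence[Sequence[Grid]],
    targets: Sequence[Grid],
    k: int,
) -> float:
    """Fraction of tasks whose target appears among the first k candidates.

    ranked_candidates[i] holds the model's guesses for task i, best first.
    How the guesses are produced and ranked is outside this sketch.
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    if len(ranked_candidates) != len(targets):
        raise ValueError("need one candidate list per target")
    if not targets:
        raise ValueError("need at least one task")

    solved = sum(
        # Exact match only: every cell of the grid must agree.
        any(candidate == target for candidate in candidates[:k])
        for candidates, target in zip(ranked_candidates, targets)
    )
    return solved / len(targets)


# Toy data (hypothetical): three tasks, solved at rank 1, 2, and 3.
targets = [[[1, 0], [0, 1]], [[2, 2]], [[3]]]
ranked = [
    [[[1, 0], [0, 1]], [[0, 0], [0, 0]]],
    [[[2, 0]], [[2, 2]]],
    [[[0]], [[1]], [[3]]],
]

scores = {k: pass_at_k(ranked, targets, k) for k in (1, 2, 3)}
```

With this toy data the score rises as K grows, since each extra attempt can only add solved tasks. Under this definition pass@K is non-decreasing in K.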
