상세 보기
ReMi-ReMath: A Reverse-Mutual Reasoning Framework for Enhancing Mathematical Thinking in Small Language Models
Citations
SCOPUS
0초록
We introduce Reverse-Mutual Reasoning for Mathematical Thinking (ReMi-ReMath), a mutual reasoning framework that enhances mathematical reasoning in small language models without requiring fine-tuning. Building on the mutual reasoning structure of rStar, ReMi-ReMath expands it by incorporating backward reasoning-based discriminative reconstruction and a self-evaluation system that measures logical and semantic consistency, rather than simple answer matching. Across benchmarks, ReMi-ReMath improves performance by 3-5%p over rStar using 3.8B-scale models, demonstrating that combining mutual reasoning and reverse reconstruction establishes a new paradigm for quantitatively verifying reasoning validity beyond answer-centric evaluations.
키워드
mathematical reasoning; mutual reasoning; reverse reasoning; self-verification; small language model
- 제목
- ReMi-ReMath: A Reverse-Mutual Reasoning Framework for Enhancing Mathematical Thinking in Small Language Models
- 저자
- Kim, Dae Kyoo; Moon, Sang Kyu; Park, Eunil
- 발행일
- 2026
- 유형
- Conference Paper
- 저널명
- WWW 2026 - Proceedings of the ACM Web Conference 2026
- 페이지
- 8461 ~ 8464