Models & Research

A Three-Phase Factual Recall Circuit in Gemma-2B and Gemma-12B-IT

· June 24, 2026
A Three-Phase Factual Recall Circuit in Gemma-2B and Gemma-12B-IT

What changed

A new analysis of the Gemma-2B and Gemma-12B-IT transformer models reveals a detailed, three-phase circuit underlying factual recall. This circuit shows how facts are stored, routed, and extracted as these models process information. Activation patching exposes that the residual stream, rather than the attention mechanism, carries the bulk of the workload for recalling and outputting facts from the model’s layers.

Why builders should care

This finding shifts focus from attention heads alone to the broader residual stream during fact retrieval. For developers working on transformer architectures, it means that optimizing or debugging models should consider the residual pathways more closely. This insight also challenges some prevailing assumptions about where knowledge “lives” inside transformer layers, urging builders to rethink strategies for interpretability and targeted model editing.

The practical takeaway

Builders should factor in a multi-phase process for factual recall: storage within specific components, routing via the residual stream, and final readout. Tools and diagnostics focusing solely on attention might miss how and why a model produces particular facts. Upgrading model editing techniques or accuracy audits will require monitoring the residual stream modifications, which carry the factual content forward across layers.

What to watch next

Watch for new tooling and research that incorporate this three-phase circuit perspective, especially methods that enhance model transparency or reduce hallucination by intervening in the residual stream. Advances in model editing using activation patching could become standard practice for large transformer models. Also, expect growing scrutiny over which model components handle factual accuracy, potentially shifting investment and development priorities in AI explainability.

AI Quick Briefs Editorial Desk

Stay ahead of AI Get the most important AI news delivered to your inbox — free.