Everyone assumes alignment gets harder as models scale. We measured it — across 102 models from 16 families — and found a phase transition: below a critical scale, alignment and capabilities fight. Above it, they cooperate. The transition is engineerable. That changes the game.
Coupling is how strongly two capabilities move together. We measure it between reasoning and truthfulness as models scale. Below a critical size, they’re anticorrelated — improving one hurts the other. Above it, they reinforce. This transition is sharp, reproducible across every family we tested, and invisible to loss curves. The same mathematics governs phase transitions in superconductors and sleep-stage dynamics in neuroscience.
The transition point varies by family, architecture, and training data — it’s a design parameter, not a physical constant. Three levers shift it: data curation, model width, and architecture.
Add a truth-direction vector at one layer (quarter-depth). The model’s output changes — zero retraining. These are real activation-level results from TransformerLens, not prompt engineering. Verified on GPT-2, Pythia-160M, and Pythia-410M.
This runs via cape-steer — an open-source CLI that works on any open-weight model. Auto-detects architecture, steers at quarter-depth. Full demo → GitHub →
Enter benchmarks, get your model’s alignment phase. Or steer any open-weight model from the command line. The physics is open.
Enter your model’s benchmark scores. Get its alignment phase, coupling trajectory, h-field diagnostic, and concrete interventions. Phase classification, ODE trajectory fitting, frontier analysis, and activation-level steering demo — all in one tool.
Activation-level alignment correction for any open-weight model. Auto-detects architecture, finds the coupling bottleneck at quarter-depth (layer nl/4), and steers the model’s hidden state toward truth. Zero retraining. Works on CPU.
Memory as an energy landscape. Retrieval is Boltzmann-weighted — temperature controls whether you explore (high T, creative) or exploit (low T, precise). Memories deepen with use. Offline consolidation merges, prunes, and strengthens — the same dynamics as biological sleep.
Not just for agents. The energy landscape applies to any system with persistent memory: conversational AI, knowledge bases, research tools, clinical note systems, education platforms. The physics is domain-agnostic.
Two papers on arXiv (under review). More in preparation across multiple domains.
We look for mathematical structure in complex systems — drawing from physics, dynamical systems, network theory, information theory, and whatever else the problem needs. When we find structure, we build tools on it.
Current focus: AI scaling laws. We discovered that the coupling between model capabilities undergoes a phase transition at a critical scale, and that transition is predictable, measurable, and actionable.
Founded by Adil Amin. Based in Milwaukee, WI.
Interested in collaboration, consulting, preprints, or early access?