The Correction Triangle: A New DPO Data Format for Cognitively Integrated AI
Most DPO datasets are pairs: prompt + good response vs bad response. That’s binary thinking. Laeka proposes the Correction Triangle: prompt + flawed response WITH DIAGNOSIS + superior response WITH EXPLANATION.
The diagnosis matters. When an LLM learns why a response fails—missing nuance, logical gap, ethical lapse—it doesn’t just memorize “prefer this one.” It internalizes the structure of better thinking.
Why Diagnosis Changes Everything
Standard DPO treats preference as a black box. The model learns patterns but not principles. Add diagnosis and you’re teaching reasoning about reasoning. The inferior response comes with a reason it’s inferior. The superior response explains the correction.
This produces stronger training signal. Models trained on diagnostic DPO show better generalization to novel prompts. They don’t overfit to surface-level patterns.
The Format
Each triangle consists of three elements:
1. Prompt: The original instruction or question.
2. Flawed Response + Diagnosis: A response that fails, plus structured annotation of why—missing key information, logical inconsistency, tone mismatch, scope creep.
3. Superior Response + Explanation: The better answer, annotated with the principle or reasoning that makes it superior.
Concrete Example
Prompt: “Explain quantum entanglement to a high school student.”
Flawed + Diagnosis: “Two particles become linked so they affect each other instantly across any distance.” [Diagnosis: Oversimplifies; creates false impression of faster-than-light communication; misses the philosophical weirdness that makes entanglement interesting.]
Superior + Explanation: “Quantum entanglement means two particles can be correlated in a way that classical physics can’t explain. Measuring one instantly affects what you know about the other—but you can’t use this to send information faster than light. The weirdness is that this correlation seems to exist even though nothing physical travels between them.” [Explanation: Addresses the core mystery; clarifies the common misconception about superluminal signaling; invites wonder rather than just stating facts.]
Why Laeka Chose This
The Correction Triangle turns preference data into reasoning data. Every pair is now a teaching moment. The model learns not just what’s good but how good emerges from understanding.
This aligns with cognitively integrated AI principles: training through clarity, diagnosis, and explanation rather than brute-force preference optimization.
Laeka Research — laeka.org