Longterm Wiki

Supervised Fine-Tuning / Instruction Tuning

Alignment Trainingmature

Foundational alignment method: fine-tuning on human-written demonstrations of desired behavior.

First Proposed: 2022 (Ouyang et al.)
Cluster: Alignment Training

Tags

function:specificationstage:trainingscope:technique