Supervised Fine-Tuning / Instruction Tuning
Alignment Training
mature
Foundational alignment method: fine-tuning on human-written demonstrations of desired behavior.
First Proposed: 2022 (Ouyang et al.)
Cluster: Alignment Training
Tags
function:specification, stage:training, scope:technique