InstructGPT

How ChatGPT is Trained

ChatGPT is trained in three main stages:

  • Generative pretraining
  • Supervised fine-tuning (SFT)
  • Reinforcement learning from human feedback (RLHF)
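
The ordering of the three stages can be sketched as a simple pipeline in which each stage takes the model produced by the previous one. This is only an illustrative outline, not the actual training code: ToyModel, the stage functions, and train are hypothetical names, and each stage body is a placeholder that records its name instead of doing any real optimization.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class ToyModel:
    """Hypothetical stand-in for a language model; it only records which stages ran."""
    history: List[str] = field(default_factory=list)


def generative_pretraining(model: ToyModel) -> ToyModel:
    # Stage 1 (placeholder): next-token prediction on a large unlabeled text corpus.
    model.history.append("generative pretraining")
    return model


def supervised_fine_tuning(model: ToyModel) -> ToyModel:
    # Stage 2 (placeholder): fine-tuning on human-written prompt/response demonstrations.
    model.history.append("supervised fine-tuning (SFT)")
    return model


def rlhf(model: ToyModel) -> ToyModel:
    # Stage 3 (placeholder): optimization against human preference feedback.
    model.history.append("reinforcement learning from human feedback (RLHF)")
    return model


# The stages run in this fixed order; each one starts from the previous stage's output.
STAGES: List[Callable[[ToyModel], ToyModel]] = [
    generative_pretraining,
    supervised_fine_tuning,
    rlhf,
]


def train(model: ToyModel) -> ToyModel:
    """Apply every stage in sequence and return the resulting model."""
    for stage in STAGES:
        model = stage(model)
    return model


if __name__ == "__main__":
    for step, name in enumerate(train(ToyModel()).history, start=1):
        print(f"Stage {step}: {name}")
```

The point of the sketch is the dependency between stages: supervised fine-tuning starts from the pretrained model, and RLHF starts from the fine-tuned one.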