A base model is a pretrained language model that predicts and generates text, while an instruct model is the same kind of model further tuned to follow human instructions more reliably.
This distinction helps you choose the right model for the job:
In practice, most product teams start with an instruct model unless they specifically need to adapt a base model themselves.
A base model is usually trained first on a large text corpus with a simple objective: predict the next token. That pretraining teaches broad language patterns, facts, and some reasoning behavior, but it does not by itself make the model especially good at obeying user instructions.
An instruct model starts from that base model and is then fine-tuned on instruction-following examples. This often includes supervised fine-tuning on prompt-response pairs, and sometimes additional alignment methods such as preference optimization or RLHF. The goal is to make the model more helpful, more direct, and less likely to ignore the user’s intent.
So the relationship is usually:
The exact training recipe varies by lab and model family, so “instruct model” is a practical label, not a single formal algorithm.
If you ask:
“Write a polite email declining a meeting.”
Example instruct-style output:
Hi Sam, thanks for the invitation. I’m not able to make the meeting, but I’d be happy to review the notes afterward.