PaPoo
cover

What is a base model vs instruct model?

A base model is a pretrained language model that predicts and generates text, while an instruct model is the same kind of model further tuned to follow human instructions more reliably.

Why it matters

This distinction helps you choose the right model for the job:

In practice, most product teams start with an instruct model unless they specifically need to adapt a base model themselves.

How it works

A base model is usually trained first on a large text corpus with a simple objective: predict the next token. That pretraining teaches broad language patterns, facts, and some reasoning behavior, but it does not by itself make the model especially good at obeying user instructions.

An instruct model starts from that base model and is then fine-tuned on instruction-following examples. This often includes supervised fine-tuning on prompt-response pairs, and sometimes additional alignment methods such as preference optimization or RLHF. The goal is to make the model more helpful, more direct, and less likely to ignore the user’s intent.

So the relationship is usually:

  1. Pretrain a base model
  2. Fine-tune it for instruction following
  3. Deploy the tuned model as an instruct model

The exact training recipe varies by lab and model family, so “instruct model” is a practical label, not a single formal algorithm.

Tiny concrete example

If you ask:

“Write a polite email declining a meeting.”

Example instruct-style output:

Hi Sam, thanks for the invitation. I’m not able to make the meeting, but I’d be happy to review the notes afterward.

Common pitfalls / when NOT to use it

Related terms

Related terms

同じ著者の記事