Model RLHFed to follow instructions follows instructions, even when we might not...

px43 · on Feb 19, 2024

I think people might have forgotten that LLMs before InstructGPT came around could be weirdly opinionated jerks. There was this whole effort to train them so that we could actually give them instructions. It's probably a hell of a lot more useful to have an LLM that will just go with whatever weird stuff the human says rather than try to fight them on it.

https://openai.com/research/instruction-following