The new version of GPT-3 is much better behaved (and should be less toxic)

“This work takes an important step in the right direction,” says Douwe Kiela, a researcher at Hugging Face, an AI company that works on open-source language models. He suggests that the feedback-driven training process could be repeated over many rounds, improving the model further each time. Leike says OpenAI could do this by building on customer feedback. InstructGPT […]