A type of machine learning that mimics the way humans learn, by taking feedback from the environment. GPT, one of the most sophisticated language models ever created, has also embraced the power of reinforcement learning. To tune the model, human annotators played a vital role by providing examples of the desired behavior and ranking the model outputs. By leveraging actionable feedback from humans, GPT-3 has evolved into a potent AI tool that can complete complex tasks in a matter of seconds.