Quick answer
AI Summary: The official technical report for GPT-4, detailing its multimodal capabilities, state-of-the-art benchmark performances, and the predictable scaling infrastructure used to train it.
AI Summary: The official technical report for GPT-4, detailing its multimodal capabilities, state-of-the-art benchmark performances, and the predictable scaling infrastructure used to train it.
We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based model pre-trained to predict the next token in a document. The post-training alignment process results in improved performance on measures of factuality and adherence to desired behavior. A core component of this project was developing infrastructure and optimization methods that behave predictably across a wide range of scales.
Share your opinion to help other learners triage faster.
Write a reviewInvite someone by email to share an invited review for GPT-4 Technical Report.