Quick answer

AI Summary: The official technical report for GPT-4, detailing its multimodal capabilities, state-of-the-art benchmark performances, and the predictable scaling infrastructure used to train it.

Paper2023-03-15•Source ↗•100 attns263 checkouts

Claim

GPT-4 Technical Report

Authors

Discuss with Grok

OpenAI

ABSTRACT

We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based model pre-trained to predict the next token in a document. The post-training alignment process results in improved performance on measures of factuality and adherence to desired behavior. A core component of this project was developing infrastructure and optimization methods that behave predictably across a wide range of scales.

#gpt-4 #llms company:openai-research #cs-ai #cs-cl

Review Snapshot

Explore ratings

4.2

★★★★★

5 ratings

5 star

40%

4 star

40%

3 star

20%

2 star

1 star

Recommendation

80%

recommend this content.

Review this content

Share your opinion to help other learners triage faster.

Write a review

Invite a reviewer

Invite someone by email to share an invited review for GPT-4 Technical Report.

Author Inquiries

Public questions about this content. Attendemia will route your question to the author. Vote on the most important ones. No guarantee of response.

Post an inquiry

Sort by: Most helpful