Quick answer

AI Summary: Details the architecture and performance of Gemma, a family of highly capable, open-weight 2B and 7B language models derived from the flagship Gemini technology.

Paper · 2024-02-21

Gemma: Open Models Based on Gemini Research and Technology

Authors

Gemma Team

Google DeepMind

ABSTRACT

We introduce Gemma, a family of lightweight, state-of-the-art open models built from the same research and technology used to create the Gemini models. Gemma models are offered in two sizes: a 7 billion parameter model suited for efficient deployment on consumer GPUs and TPUs, and a 2 billion parameter model designed for CPU and on-device applications. Both models demonstrate strong performance across text generation, reasoning, and coding benchmarks, outperforming similarly sized open models like Llama 2. We release the pre-trained weights, instruction-tuned checkpoints, and a comprehensive responsible generative AI toolkit to foster innovation and safe deployment within the open-source community.

#open-source #llms lab:deep-mind-ai #gemma #cs-cl
