Topic: Generative AI

Short answer

This page shows the most relevant public items for Generative AI, ranked by trend activity and review signal. Use weekly for fast changes, monthly for more stable patterns, and all-time for evergreen picks.

WeeklyMonthlyAll time

← Back to home

  1. Diffusion Models Beat GANs on Image Synthesis

    PaperMay 11, 2021arXivPrafulla Dhariwal, Alex Nichol

    We show that diffusion models can achieve image sample quality superior to the current state-of-the-art generative models. We achieve this on unconditional image synthesis by finding a better archi...

  2. Improving Image Generation with Better Captions

    PaperOct 19, 2023OpenAIJames Betker, Gabriel Goh, Li Jing, Tim Brooks, Jianfeng Wang, Linjie Li, Long Ouyang, Juntang Zhuang, Joyce Lee, Yufei Guo, Wesam Manassra, Prafulla Dhariwal, Casey Chu, Yunxing Jiao, Aditya Ramesh

    Current text-to-image models often struggle to faithfully follow detailed or complex prompts, frequently ignoring specific attributes or object relationships. We propose that this issue stems from ...

  3. Consistency Models

    PaperMar 2, 2023arXivYang Song, Prafulla Dhariwal, Mark Chen, Ilya Sutskever

    Diffusion models have achieved significant success in image, audio, and video generation, but they depend on an iterative generation process that causes slow sampling and precludes real-time applic...

  4. Sora: Video generation models as world simulators

    PaperFeb 15, 2024OpenAI Technical ReportTim Brooks, Bill Peebles, Connor Holmes, Will DePue, Yufei Guo, Li Jing, David Schnurr, Joe Taylor, Troy Luhman, Eric Luhman, Clarence Ng, Ricky Wang, Aditya Ramesh

    We explore the large-scale training of generative models on video data. Specifically, we train text-conditional diffusion models jointly on videos and images of highly variable durations, resolutio...

  5. Improved Denoising Diffusion Probabilistic Models

    PaperFeb 18, 2021arXivAlex Nichol, Prafulla Dhariwal

    Denoising diffusion probabilistic models (DDPMs) have recently demonstrated high-quality image generation, but they suffer from notoriously slow sampling times and sub-optimal log-likelihoods. We p...

  6. Zero-Shot Text-to-Image Generation

    PaperFeb 24, 2021arXivAditya Ramesh, Mikhail Pavlov, Gabriel Goh, Scott Gray, Chelsea Voss, Alec Radford, Mark Chen, Ilya Sutskever

    Text-to-image generation has traditionally focused on finding better modeling assumptions for training on a fixed dataset. We describe a simple approach for this task based on a transformer that au...

  7. Hierarchical Text-Conditional Image Generation with CLIP Latents

    PaperApr 13, 2022arXivAditya Ramesh, Prafulla Dhariwal, Alex Nichol, Casey Chu, Mark Chen

    Contrastive models like CLIP have been shown to learn robust representations of images that capture both semantics and style. To leverage these representations for image generation, we propose a tw...

  8. Generative Agents for the Continuous Evolution of Target-Binding Proteins

    PaperFeb 25, 2026bioRxivSamuel H. A. von der Dunk, Liliana M. Dávalos, Ard A. Louis

    Directed evolution in the laboratory is constrained by physical limits on library size and mutation rates. We present an entirely *in silico* framework utilizing Generative Agents that simulate the...

  9. Jukebox: A Generative Model for Music

    PaperApr 30, 2020arXivPrafulla Dhariwal, Heewoo Jun, Christine Payne, Jong Wook Kim, Alec Radford, Ilya Sutskever

    We introduce Jukebox, a generative model that produces high-fidelity, highly diverse music with singing in the raw audio domain. We model music as a sequence of discrete tokens by using a multi-sca...

Related Topics

company:openai-research (9)cs.CV (7)Diffusion Models (5)cs.LG (2)Audio Processing (1)