Topic: cs.SD

Short answer

This page shows the most relevant public items for cs.SD, ranked by trend activity and review signal. Use weekly for fast changes, monthly for more stable patterns, and all-time for evergreen picks.

WeeklyMonthlyAll time
Current weekPast week2 weeks ago

← Back to home

  1. Robust Speech Recognition via Large-Scale Weak Supervision

    PaperDec 6, 2022arXivAlec Radford, Jong Wook Kim, Tao Xu, Greg Brockman, Christine McLeavey, Ilya Sutskever

    We study the capabilities of speech processing systems trained simply to predict large amounts of transcripts of audio on the internet. When scaled to 680,000 hours of multilingual and multitask su...

  2. Jukebox: A Generative Model for Music

    PaperApr 30, 2020arXivPrafulla Dhariwal, Heewoo Jun, Christine Payne, Jong Wook Kim, Alec Radford, Ilya Sutskever

    We introduce Jukebox, a generative model that produces high-fidelity, highly diverse music with singing in the raw audio domain. We model music as a sequence of discrete tokens by using a multi-sca...

Related Topics

company:openai-research (2)Audio Processing (1)Generative AI (1)Whisper (1)Speech Recognition (1)