Topic: Machine Learning

Track this topic after sign-in.

Short answer

This page shows the most relevant public items for Machine Learning, ranked by trend activity and review signal. Use weekly for fast changes, monthly for more stable patterns, and all-time for evergreen picks.

WeeklyMonthlyAll time

← Back to home

  1. Attention Is All You Need

    PaperJun 12, 2017arXivAshish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin

    The dominant sequence transduction models are based on complex recurrent or convolutional neural networks that include an encoder and a decoder. The best performing models also connect the encoder ...

  2. Diffusion Alignment as Variational Expectation-Maximization

    PaperFeb 13, 2026arXivZijing Ou, Jacob Si, Junyi Zhu, Yingzhen Li

    Diffusion alignment aims to optimize diffusion models for downstream objectives. While existing methods based on RL achieve success, they often suffer from reward over-optimization and mode collaps...

  3. Cog-RAG: Giving RAG a Brain That Thinks Before It Retrieves

    BlogFeb 17, 2026TowardsaiFlorian June

    Traditional Retrieval-Augmented Generation (RAG) is becoming a commodity; the next frontier is 'Cog-RAG.' This post details a new architecture where an agentic 'brain' evaluates a query, identifies...

  4. Learning to Reason Faithfully through Step-Level Faithfulness Maximization

    PaperFeb 28, 2026arXivRunquan Gui, Yafu Li, Xiaoye Qu, Ziyan Liu, Yeqiu Cheng, Yu Cheng

    Large Language Models frequently produce correct final answers based on flawed or unfaithful intermediate reasoning steps. This paper proposes Step-Level Faithfulness Maximization, a training parad...

  5. Practical Machine Learning for Computer Vision

    BookAug 24, 2021AmazonValliappa Lakshmanan, Martin Görner, Ryan Gillard

    Employing machine learning models to extract information from images can be daunting for software developers. This book provides intuitive explanations of visual architectures alongside practical c...

  6. DeepSeek R1 and the Era of Reasoning Swarms

    BlogJan 15, 2026MediumElena Rossi

    The release of advanced reasoning models has completely shifted how developers build autonomous systems in early 2026. This post details how open weights models are replacing expensive proprietary ...

  7. Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision

    PaperDec 14, 2023arXivCollin Burns, Pavel Izmailov, Jan Hendrik Kirchner, Bowen Baker, Leo Gao, Leopold Aschenbrenner, Yining Chen, Adrien Ecoffet, Manas Joglekar, Jan Leike, Ilya Sutskever, Jeff Wu

    As AI models become increasingly capable, we will eventually face the challenge of superalignment: how can humans supervise AI systems that are much smarter than them? To study this empirically tod...

  8. Training Compute-Optimal Large Language Models

    PaperMar 29, 2022arXivJordan Hoffmann, Sebastian Borgeaud, Arthur Mensch, Elena Buchatskaya, Laurent Sifre

    We investigate the optimal model size and number of tokens for training a transformer language model under a given compute budget. We find that current large language models are significantly under...

  9. A hyperparameter benchmark of VAE-based methods for scRNA-seq

    PaperFeb 10, 2026bioRxivEduardo da Veiga Beltrame

    This paper presents a systematic benchmark of architectural hyperparameters for variational autoencoder (VAE) methods in single-cell RNA-seq batch integration. The study compares scVI, MrVI, and LD...

  10. High-accuracy sampling for diffusion models and log-concave distributions

    PaperFeb 1, 2026arXivFan Chen, Sinho Chewi, Constantinos Daskalakis, Alexander Rakhlin

    We present algorithms for diffusion model sampling which obtain δ-error in polylog(1/δ) steps, given access to eO(δ)-accurate score estimates in L2. This is an exponential improvement over all prev...

  11. VISA: Value Injection via Shielded Adaptation for Personalized LLM Alignment

    PaperMar 4, 2026arXivJiawei Chen, Tianzhuo Yang, Guoxi Zhang, Jiaming Ji, Yaodong Yang, Juntao Dai

    Aligning large language models to individual user values without compromising the core safety parameters of the foundation model is notoriously difficult. This paper introduces VISA, a shielded ada...

  12. Neural Turing Machines

    PaperDec 10, 2014arxiv.orgAlex Graves, Greg Wayne, Ivo Danihelka

    We extend the capabilities of neural networks by coupling them to external memory resources, which they can interact with by attentional processes. The combined system is analogous to a Turing Mach...

  13. Distributed Representations of Words and Phrases and their Compositionality

    PaperOct 16, 2013arxiv.orgTomas Mikolov, Ilya Sutskever, Kai Chen, Greg Corrado, Jeffrey Dean

    The recently introduced continuous Skip-gram model is an efficient method for learning high-quality distributed vector representations that capture a large number of precise syntactic and semantic ...

  14. Efficient Estimation of Word Representations in Vector Space

    PaperSep 7, 2013arxiv.orgTomas Mikolov, Kai Chen, Greg Corrado, Jeffrey Dean

    We propose two novel model architectures for computing continuous vector representations of words from very large data sets. The quality of these representations is measured in a word similarity ta...

← PreviousPage 1Next →

Top Entities In This Topic

Related Topics

FAQ

What does this Machine Learning page rank?

It ranks public content for Machine Learning using recent discussion, review, and engagement signals so you can triage faster. This guidance is specific to Machine Learning topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

How should I use weekly vs monthly vs all-time?

Use weekly for fast-moving updates, monthly for stable trend confirmation, and all-time for evergreen references. This guidance is specific to Machine Learning topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

How can I discover organizations active in Machine Learning?

Use the linked entities section to jump to labs, companies, and experts connected to this topic and explore their timelines. This guidance is specific to Machine Learning topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

Can I follow this topic for updates?

Yes. Use the follow button on this page to subscribe and track new high-signal activity. This guidance is specific to Machine Learning topic page on Attendemia and is written so it still makes sense without reading other sections on the page.