Topic: Machine Learning

Track this topic after sign-in.

Short answer

This page shows the most relevant public items for Machine Learning, ranked by trend activity and review signal. Use weekly for fast changes, monthly for more stable patterns, and all-time for evergreen picks.

Weekly Monthly All time

← Back to home

Attention Is All You Need
Paper • Jun 12, 2017 • arXiv • Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin
The dominant sequence transduction models are based on complex recurrent or convolutional neural networks that include an encoder and a decoder. The best performing models also connect the encoder ...
Diffusion Alignment as Variational Expectation-Maximization
Paper • Feb 13, 2026 • arXiv • Zijing Ou, Jacob Si, Junyi Zhu, Yingzhen Li
Diffusion alignment aims to optimize diffusion models for downstream objectives. While existing methods based on RL achieve success, they often suffer from reward over-optimization and mode collaps...
Cog-RAG: Giving RAG a Brain That Thinks Before It Retrieves
Blog • Feb 17, 2026 • Towardsai • Florian June
Traditional Retrieval-Augmented Generation (RAG) is becoming a commodity; the next frontier is 'Cog-RAG.' This post details a new architecture where an agentic 'brain' evaluates a query, identifies...
Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling
Paper • Mar 4, 2026 • arXiv • Yong Liu, Xingjian Su, Shiyu Wang, Haoran Zhang
While text and image foundation models have scaled massively, time-series forecasting has lagged behind due to architectural constraints. Timer-S1 introduces a billion-parameter foundation model sp...
Learning to Reason Faithfully through Step-Level Faithfulness Maximization
Paper • Feb 28, 2026 • arXiv • Runquan Gui, Yafu Li, Xiaoye Qu, Ziyan Liu, Yeqiu Cheng, Yu Cheng
Large Language Models frequently produce correct final answers based on flawed or unfaithful intermediate reasoning steps. This paper proposes Step-Level Faithfulness Maximization, a training parad...
Practical Machine Learning for Computer Vision
Book • Aug 24, 2021 • Amazon • Valliappa Lakshmanan, Martin Görner, Ryan Gillard
Employing machine learning models to extract information from images can be daunting for software developers. This book provides intuitive explanations of visual architectures alongside practical c...
DeepSeek R1 and the Era of Reasoning Swarms
Blog • Jan 15, 2026 • Medium • Elena Rossi
The release of advanced reasoning models has completely shifted how developers build autonomous systems in early 2026. This post details how open weights models are replacing expensive proprietary ...
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
Paper • Dec 14, 2023 • arXiv • Collin Burns, Pavel Izmailov, Jan Hendrik Kirchner, Bowen Baker, Leo Gao, Leopold Aschenbrenner, Yining Chen, Adrien Ecoffet, Manas Joglekar, Jan Leike, Ilya Sutskever, Jeff Wu
As AI models become increasingly capable, we will eventually face the challenge of superalignment: how can humans supervise AI systems that are much smarter than them? To study this empirically tod...
Training Compute-Optimal Large Language Models
Paper • Mar 29, 2022 • arXiv • Jordan Hoffmann, Sebastian Borgeaud, Arthur Mensch, Elena Buchatskaya, Laurent Sifre
We investigate the optimal model size and number of tokens for training a transformer language model under a given compute budget. We find that current large language models are significantly under...
PeptiVerse: A Unified Multi-Agent Platform for Therapeutic Peptide Property Prediction
Paper • Feb 20, 2026 • bioRxiv • L. Zheng, J. Drucker, A. Chen, W. Wang
Designing therapeutic peptides requires balancing binding affinity with complex ADMET (absorption, distribution, metabolism, excretion, and toxicity) properties, a task that overwhelms traditional ...
A hyperparameter benchmark of VAE-based methods for scRNA-seq
Paper • Feb 10, 2026 • bioRxiv • Eduardo da Veiga Beltrame
This paper presents a systematic benchmark of architectural hyperparameters for variational autoencoder (VAE) methods in single-cell RNA-seq batch integration. The study compares scVI, MrVI, and LD...
Automatic pain face analysis in mice: Applied to a varied dataset
Paper • Feb 16, 2026 • bioRxiv • Anonymous Team
The paper introduces an automated deep learning tool for the real-time assessment of pain in mice using the Mouse Grimace Scale (MGS). The model was trained on a large and diverse dataset with non-...
CellAwareGNN: Single-Cell Knowledge Graph Foundation Model for Drug Indication
Paper • Feb 23, 2026 • bioRxiv • Zhang, X., Jeong, E., Yan, C., Feng, Y., Lyu, L., Guo, X., Chen, Y.
CellAwareGNN is a new graph neural network (GNN) foundation model that integrates single-cell enhanced knowledge graphs for drug indication prediction. By modeling the complex relationships between...
High-accuracy sampling for diffusion models and log-concave distributions
Paper • Feb 1, 2026 • arXiv • Fan Chen, Sinho Chewi, Constantinos Daskalakis, Alexander Rakhlin
We present algorithms for diffusion model sampling which obtain δ-error in polylog(1/δ) steps, given access to eO(δ)-accurate score estimates in L2. This is an exponential improvement over all prev...
VISA: Value Injection via Shielded Adaptation for Personalized LLM Alignment
Paper • Mar 4, 2026 • arXiv • Jiawei Chen, Tianzhuo Yang, Guoxi Zhang, Jiaming Ji, Yaodong Yang, Juntao Dai
Aligning large language models to individual user values without compromising the core safety parameters of the foundation model is notoriously difficult. This paper introduces VISA, a shielded ada...
Identifying Intervenable and Interpretable Features via Orthogonality Regularization
Paper • Feb 17, 2026 • arXiv • Moritz Miller, Florent Draye, Bernhard Schölkopf
This paper addresses the fundamental challenge of 'feature disentanglement' in modern deep learning. We propose an Orthogonality Regularization technique to identify features that are both interpre...
Everyday Machine Learning in 2026: The Benefits You Actually Feel
Blog • Feb 14, 2026 • Medium • Shane Collins
In 2026, ML has moved from the cloud to our pockets, handling the 'normal messiness' of real life. This post explores the small, invisible ways AI has cleaned up our lives, from transcripts that ha...
Neural Turing Machines
Paper • Dec 10, 2014 • arxiv.org • Alex Graves, Greg Wayne, Ivo Danihelka
We extend the capabilities of neural networks by coupling them to external memory resources, which they can interact with by attentional processes. The combined system is analogous to a Turing Mach...
Distributed Representations of Words and Phrases and their Compositionality
Paper • Oct 16, 2013 • arxiv.org • Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg Corrado, Jeffrey Dean
The recently introduced continuous Skip-gram model is an efficient method for learning high-quality distributed vector representations that capture a large number of precise syntactic and semantic ...
Efficient Estimation of Word Representations in Vector Space
Paper • Sep 7, 2013 • arxiv.org • Tomas Mikolov, Kai Chen, Greg Corrado, Jeffrey Dean
We propose two novel model architectures for computing continuous vector representations of words from very large data sets. The quality of these representations is measured in a word similarity ta...

← PreviousPage 1Next →

FAQ

What does this Machine Learning page rank?

It ranks public content for Machine Learning using recent discussion, review, and engagement signals so you can triage faster. This guidance is specific to Machine Learning topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

How should I use weekly vs monthly vs all-time?

Use weekly for fast-moving updates, monthly for stable trend confirmation, and all-time for evergreen references. This guidance is specific to Machine Learning topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

How can I discover organizations active in Machine Learning?

Use the linked entities section to jump to labs, companies, and experts connected to this topic and explore their timelines. This guidance is specific to Machine Learning topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

Can I follow this topic for updates?

Yes. Use the follow button on this page to subscribe and track new high-signal activity. This guidance is specific to Machine Learning topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

Topic: Machine Learning

Short answer

Attention Is All You Need

Diffusion Alignment as Variational Expectation-Maximization

Cog-RAG: Giving RAG a Brain That Thinks Before It Retrieves

Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling

Learning to Reason Faithfully through Step-Level Faithfulness Maximization

Practical Machine Learning for Computer Vision

DeepSeek R1 and the Era of Reasoning Swarms

Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision

Training Compute-Optimal Large Language Models

PeptiVerse: A Unified Multi-Agent Platform for Therapeutic Peptide Property Prediction

A hyperparameter benchmark of VAE-based methods for scRNA-seq

Automatic pain face analysis in mice: Applied to a varied dataset

CellAwareGNN: Single-Cell Knowledge Graph Foundation Model for Drug Indication

High-accuracy sampling for diffusion models and log-concave distributions

VISA: Value Injection via Shielded Adaptation for Personalized LLM Alignment

Identifying Intervenable and Interpretable Features via Orthogonality Regularization

Everyday Machine Learning in 2026: The Benefits You Actually Feel

Neural Turing Machines

Distributed Representations of Words and Phrases and their Compositionality

Efficient Estimation of Word Representations in Vector Space

Top Entities In This Topic

Related Topics

FAQ

What does this Machine Learning page rank?

How should I use weekly vs monthly vs all-time?

How can I discover organizations active in Machine Learning?

Can I follow this topic for updates?