Topic: Machine Learning

Track this topic after sign-in.

Short answer

This page shows the most relevant public items for Machine Learning, ranked by trend activity and review signal. Use weekly for fast changes, monthly for more stable patterns, and all-time for evergreen picks.

Weekly Monthly All time

← Back to home

Learning Compact Vision Tokens for Efficient Large Multimodal Models
Paper • Jun 8, 2025 • arxiv.org • Hao Tang, Chengchao Shen
Large multimodal models (LMMs) suffer significant computational challenges due to the high cost of Large Language Models (LLMs) and the quadratic complexity of processing long vision token sequence...
Diversity-Guided MLP Reduction for Efficient Large Vision Transformers
Paper • Sep 22, 2025 • arxiv.org • Chengchao Shen, Hourun Zhu, Gongfan Fang, Jianxin Wang, Xinchao Wang
Transformer models achieve excellent scaling property, where the performance is improved with the increment of model capacity. However, large-scale model parameters lead to an unaffordable cost of ...
Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs) | Attendemia
Other • Nov 23, 2015 • attendemia.com • Unknown Author
We introduce the "exponential linear unit" (ELU) which speeds up learning in deep neural networks and leads to higher classification accuracies. Like rectifi...
Intriguing properties of neural networks
Paper • Feb 19, 2014 • arxiv.org • Christian Szegedy, Wojciech Zaremba, Ilya Sutskever, Joan Bruna, Dumitru Erhan, Ian Goodfellow, Rob Fergus
Deep neural networks are highly expressive models that have recently achieved state of the art performance on speech and visual recognition tasks. While their expressiveness is the reason they succ...
Recurrent Neural Network Regularization
Paper • Feb 19, 2015 • arxiv.org • Wojciech Zaremba, Ilya Sutskever, Oriol Vinyals
We present a simple regularization technique for Recurrent Neural Networks (RNNs) with Long Short-Term Memory (LSTM) units. Dropout, the most successful technique for regularizing neural networks, ...
Addressing the Rare Word Problem in Neural Machine Translation
Paper • May 30, 2015 • arxiv.org • Minh-Thang Luong, Ilya Sutskever, Quoc V. Le, Oriol Vinyals, Wojciech Zaremba
Neural Machine Translation (NMT) is a new approach to machine translation that has shown promising results that are comparable to traditional approaches. A significant weakness in conventional NMT ...
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
Paper • Dec 11, 2014 • arxiv.org • Junyoung Chung, Caglar Gulcehre, KyungHyun Cho, Yoshua Bengio
In this paper we compare different types of recurrent units in recurrent neural networks (RNNs). Especially, we focus on more sophisticated units that implement a gating mechanism, such as a long s...
Recurrent Models of Visual Attention
Paper • Jun 24, 2014 • arxiv.org • Volodymyr Mnih, Nicolas Heess, Alex Graves, Koray Kavukcuoglu
Applying convolutional neural networks to large images is computationally expensive because the amount of computation scales linearly with the number of image pixels. We present a novel recurrent n...
A Neural Conversational Model
Paper • Jul 22, 2015 • arxiv.org • Oriol Vinyals, Quoc Le
Conversational modeling is an important task in natural language understanding and machine intelligence. Although previous approaches exist, they are often restricted to specific domains (e.g., boo...
Visualizing and Understanding Recurrent Networks
Paper • Nov 17, 2015 • arxiv.org • Andrej Karpathy, Justin Johnson, Li Fei-Fei
Recurrent Neural Networks (RNNs), and specifically a variant with Long Short-Term Memory (LSTM), are enjoying renewed interest as a result of successful applications in a wide range of machine lear...
Understanding Neural Networks Through Deep Visualization
Paper • Jun 22, 2015 • arxiv.org • Jason Yosinski, Jeff Clune, Anh Nguyen, Thomas Fuchs, Hod Lipson
Recent years have produced great advances in training large, deep neural networks (DNNs), including notable successes in training convolutional neural networks (convnets) to recognize natural image...
Learning Deconvolution Network for Semantic Segmentation
Paper • May 17, 2015 • arxiv.org • Hyeonwoo Noh, Seunghoon Hong, Bohyung Han
We propose a novel semantic segmentation algorithm by learning a deconvolution network. We learn the network on top of the convolutional layers adopted from VGG 16-layer net. The deconvolution netw...
Character-Aware Neural Language Models
Paper • Dec 1, 2015 • arxiv.org • Yoon Kim, Yacine Jernite, David Sontag, Alexander M. Rush
We describe a simple neural language model that relies only on character-level inputs. Predictions are still made at the word-level. Our model employs a convolutional neural network (CNN) and a hig...
Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks
Paper • May 30, 2015 • arxiv.org • Kai Sheng Tai, Richard Socher, Christopher D. Manning
Because of their superior ability to preserve sequence information over time, Long Short-Term Memory (LSTM) networks, a type of recurrent neural network with a more complex computational unit, have...
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
Paper • Feb 15, 2016 • arxiv.org • Song Han, Huizi Mao, William J. Dally
Neural networks are both computationally intensive and memory intensive, making them difficult to deploy on embedded systems with limited hardware resources. To address this limitation, we introduc...
Ask Me Anything: Dynamic Memory Networks for Natural Language Processing
Paper • Mar 5, 2016 • arxiv.org • Ankit Kumar, Ozan Irsoy, Peter Ondruska, Mohit Iyyer, James Bradbury, Ishaan Gulrajani, Victor Zhong, Romain Paulus, Richard Socher
Most tasks in natural language processing can be cast into question answering (QA) problems over language input. We introduce the dynamic memory network (DMN), a neural network architecture which p...
Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks
Paper • Dec 31, 2015 • arxiv.org • Jason Weston, Antoine Bordes, Sumit Chopra, Alexander M. Rush, Bart van Merriënboer, Armand Joulin, Tomas Mikolov
One long-term goal of machine learning research is to produce methods that are applicable to reasoning and natural language, in particular building an intelligent dialogue agent. To measure progres...
Deep Networks with Stochastic Depth
Paper • Jul 28, 2016 • arxiv.org • Gao Huang, Yu Sun, Zhuang Liu, Daniel Sedra, Kilian Weinberger
Very deep convolutional networks with hundreds of layers have led to significant reductions in error on competitive benchmarks. Although the unmatched expressiveness of the many layers can be highl...
What makes for effective detection proposals?
Paper • Aug 1, 2015 • arxiv.org • Jan Hosang, Rodrigo Benenson, Piotr Dollár, Bernt Schiele
Current top performing object detectors employ detection proposals to guide the search for objects, thereby avoiding exhaustive sliding window search across images. Despite the popularity and wides...
Reading Text in the Wild with Convolutional Neural Networks
Paper • Dec 4, 2014 • arxiv.org • Max Jaderberg, Karen Simonyan, Andrea Vedaldi, Andrew Zisserman
In this work we present an end-to-end system for text spotting -- localising and recognising text in natural scene images -- and text based image retrieval. This system is based on a region proposa...

← PreviousPage 13Next →

FAQ

What does this Machine Learning page rank?

It ranks public content for Machine Learning using recent discussion, review, and engagement signals so you can triage faster. This guidance is specific to Machine Learning topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

How should I use weekly vs monthly vs all-time?

Use weekly for fast-moving updates, monthly for stable trend confirmation, and all-time for evergreen references. This guidance is specific to Machine Learning topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

How can I discover organizations active in Machine Learning?

Use the linked entities section to jump to labs, companies, and experts connected to this topic and explore their timelines. This guidance is specific to Machine Learning topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

Can I follow this topic for updates?

Yes. Use the follow button on this page to subscribe and track new high-signal activity. This guidance is specific to Machine Learning topic page on Attendemia and is written so it still makes sense without reading other sections on the page.

Topic: Machine Learning

Short answer

Learning Compact Vision Tokens for Efficient Large Multimodal Models

Diversity-Guided MLP Reduction for Efficient Large Vision Transformers

Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs) | Attendemia

Intriguing properties of neural networks

Recurrent Neural Network Regularization

Addressing the Rare Word Problem in Neural Machine Translation

Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling

Recurrent Models of Visual Attention

A Neural Conversational Model

Visualizing and Understanding Recurrent Networks

Understanding Neural Networks Through Deep Visualization

Learning Deconvolution Network for Semantic Segmentation

Character-Aware Neural Language Models

Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

Ask Me Anything: Dynamic Memory Networks for Natural Language Processing

Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks

Deep Networks with Stochastic Depth

What makes for effective detection proposals?

Reading Text in the Wild with Convolutional Neural Networks

Top Entities In This Topic

Related Topics

FAQ

What does this Machine Learning page rank?

How should I use weekly vs monthly vs all-time?

How can I discover organizations active in Machine Learning?

Can I follow this topic for updates?