AI research atlas / v2

Learn AI papers in the right order.

Start with landmark ideas, move through foundations, then branch into LLMs, GenAI, agents, systems, and safety, following a reading path that keeps the field from feeling random.

10 learning tracks / Full-paper reader / ChatGPT handoff

Recommended first: Landmark papers

Build the mental timeline before going deep.

Then specialize: LLMs, GenAI, safety

Move from foundations to modern systems.

Read mode: PDF + resources

Path-first: No more random paper hopping

Research-native: arXiv links, PDFs, resources

Study loop: Track reading and discuss in ChatGPT

Learning path

Where to start, and what to read next

Start with landmarks

Orientation / 1-2 weeks

Start Here

Read the papers everyone keeps referencing so the rest of the map has anchors.

Know the landmark names / Build historical context / Pick a direction

Foundations / 2-4 weeks

Classical ML

Learn the statistical and probabilistic ideas that still sit under modern models.

Bayesian thinking / Model evaluation / Uncertainty

Foundations / 1-2 weeks

Optimization

Understand the training mechanics behind gradient-based learning.

Gradient descent / Generalization / Training stability

Builder / 3-5 weeks

Deep Learning Core

Move through representation learning, CNNs, residual networks, and scaling patterns.

CNN intuition / Representation learning / Benchmark culture

Builder / 3-6 weeks

Sequence Models and LLMs

Study attention, transformers, language modeling, instruction tuning, and evaluation.

Attention / Pretraining / Instruction following

Specialist / 3-6 weeks

Generative AI

Compare GANs, diffusion, autoregressive generation, and modern GenAI workflows.

Diffusion / GANs / Generation tradeoffs

Specialist / 2-4 weeks

Multimodal and Retrieval

Connect language with images, retrieval, embeddings, and real-world knowledge access.

Vision-language / Embeddings / Retrieval

Specialist / 3-5 weeks

RL and Agents

Learn decision making, feedback, policy learning, and agent-style systems.

Policies / Rewards / Exploration

Practitioner / 2-4 weeks

Systems and Scaling

Understand the infrastructure and engineering papers behind large-scale training.

Distributed training / Serving / Efficiency

Practitioner / 2-4 weeks

Safety and Interpretability

Study robustness, alignment, transparency, and how to reason about model behavior.

Alignment / Robustness / Interpretability

Research library

Probabilistic ML

Showing papers for this learning path. Open any paper card to read the full paper and related resources.

40 papers shown

unread / 2017

ImageNet classification with deep convolutional neural networks

We trained a large, deep convolutional neural network to classify the 1.2 million high-resolution images in the ImageNet LSVRC-2010 contest into the 1000 different classes. On the test data, we achieved top-1 and top-5 error rates of 37.5% and 17.0%, respectively, which is considerably better than the previous state-of-the-art. The neural network, which has 60 million parameters and 650,000 neurons, consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully connected layers with a final 1000-way softmax. To make training faster, we used non-saturating neurons and a very efficient GPU implementation of the convolution operation. To reduce overfitting in the fully connected layers we employed a recently developed regularization method called "dropout" that proved to be very effective. We also entered a variant of this model in the ILSVRC-2012 competition and achieved a winning top-5 test error rate of 15.3%, compared to 26.2% achieved by the second-best entry.
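A quick way to sanity-check the "60 million parameters" figure in the abstract is to count weights layer by layer. The sketch below is illustrative only: the kernel sizes and layer widths are assumptions taken from the architecture described in the published paper (they are not listed on this page), and the names `LAYERS`, `layer_parameters`, and `total_parameters` are hypothetical helpers.

```python
# Each entry is (name, inputs per unit, number of units).
# Shapes are assumptions from the paper's architecture description:
# five convolutional layers followed by three fully connected layers.
# conv2, conv4, and conv5 see only half of the previous layer's channels
# (48 or 192) because the original model was split across two GPUs.
LAYERS = [
    ("conv1", 11 * 11 * 3, 96),
    ("conv2", 5 * 5 * 48, 256),
    ("conv3", 3 * 3 * 256, 384),
    ("conv4", 3 * 3 * 192, 384),
    ("conv5", 3 * 3 * 192, 256),
    ("fc6", 6 * 6 * 256, 4096),
    ("fc7", 4096, 4096),
    ("fc8", 4096, 1000),  # feeds the final 1000-way softmax
]


def layer_parameters(fan_in, units):
    """Return the weight count plus one bias per unit."""
    return fan_in * units + units


def total_parameters(layers):
    """Sum the parameter counts over all layers."""
    return sum(layer_parameters(fan_in, units) for _, fan_in, units in layers)


if __name__ == "__main__":
    for name, fan_in, units in LAYERS:
        print(f"{name}: {layer_parameters(fan_in, units):,}")
    print(f"total: {total_parameters(LAYERS):,}")
```

Under these assumed shapes the total comes to roughly 61 million, in line with the abstract, and almost all of it sits in the three fully connected layers. That is one way to read why the abstract mentions applying dropout to exactly those layers.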

Architecture

Learning Paradigms

Applications

Trust and Deployment

Probabilistic ML

ImageNet classification with deep convolutional neural networks

Auto-Encoding Variational Bayes

Gaussian Processes for Machine Learning

Least angle regression

Artificial neural networks for solving ordinary and partial differential equations

MerLin: A Discovery Engine for Photonic and Hybrid Quantum Machine Learning

Learning with Embedded Linear Equality Constraints via Variational Bayesian Inference

Fourier Learning Machines: Nonharmonic Fourier-Based Neural Networks for Scientific Machine Learning

ALERT-Transformer: Bridging Asynchronous and Synchronous Machine Learning for Real-Time Event-based Spatio-Temporal Data

Generalizing Machine Learning Evaluation through the Integration of Shannon Entropy and Rough Set Theory

Privacy-preserving machine learning for healthcare: open challenges and future perspectives

Physics-Inspired Interpretability Of Machine Learning Models

Multi-annotator Deep Learning: A Probabilistic Framework for Classification

Changing Data Sources in the Age of Machine Learning for Official Statistics

Active learning for data streams: a survey

Learning Curves for Decision Making in Supervised Machine Learning: A Survey

Uncertain Bayesian Networks: Learning from Incomplete Data

Explanatory machine learning for sequential human teaching

Public Policymaking for International Agricultural Trade using Association Rules and Ensemble Machine Learning

Teaching Uncertainty Quantification in Machine Learning through Use Cases

DOME: Recommendations for supervised machine learning validation in biology

On-Device Machine Learning: An Algorithms and Learning Theory Perspective

Bayesian Differential Privacy for Machine Learning

A Unifying Bayesian View of Continual Learning

Deep Bayesian Multi-Target Learning for Recommender Systems

ReinBo: Machine Learning pipeline search and configuration with Bayesian Optimization embedded Reinforcement Learning

The Scientific Method in the Science of Machine Learning

Unsupervised Representation Learning with Minimax Distance Measures

A Benchmark Study of Machine Learning Models for Online Fake News Detection

TapNet: Neural Network Augmented with Task-Adaptive Projection for Few-Shot Learning

Automatic Machine Learning by Pipeline Synthesis using Model-Based Reinforcement Learning and a Grammar

MEMe: An Accurate Maximum Entropy Method for Efficient Approximations in Large-Scale Machine Learning

Reproducibility in Machine Learning for Health

Towards Quantification of Bias in Machine Learning for Healthcare: A Case Study of Renal Failure Prediction

ML-Schema: Exposing the Semantics of Machine Learning with Schemas and Ontologies

Learning Representations from Dendrograms

Progressive Sampling-Based Bayesian Optimization for Efficient and Automatic Machine Learning Model Selection

A Framework for Implementing Machine Learning on Omics Data

Stop Explaining Black Box Machine Learning Models for High Stakes Decisions and Use Interpretable Models Instead

On the Importance of Strong Baselines in Bayesian Deep Learning