AI research atlas / v2

Learn AI papers in the right order.

Start with landmark ideas, move through foundations, then branch into LLMs, GenAI, agents, systems, and safety with a reading path that keeps the field from feeling random.

Start roadmap My reading

10 learning tracksFull-paper readerChatGPT handoff

Recommended firstLandmark papers

Build the mental timeline before going deep.

Then specializeLLMs, GenAI, safety

Move from foundations to modern systems.

Read modePDF + resources

Path-firstNo more random paper hopping

Research-nativearXiv links, PDFs, resources

Study loopTrack reading and discuss in ChatGPT

Learning path

Where to start, and what to read next

Start with landmarks

Orientation / 1-2 weeks

Start Here

Read the papers everyone keeps referencing so the rest of the map has anchors.

Know the landmark namesBuild historical contextPick a direction

Open papers

Foundations / 2-4 weeks

Classical ML

Learn the statistical and probabilistic ideas that still sit under modern models.

Bayesian thinkingModel evaluationUncertainty

Open papers

Foundations / 1-2 weeks

Optimization

Understand the training mechanics behind gradient-based learning.

Gradient descentGeneralizationTraining stability

Open papers

Builder / 3-5 weeks

Deep Learning Core

Move through representation learning, CNNs, residual networks, and scaling patterns.

CNN intuitionRepresentation learningBenchmark culture

Open papers

Builder / 3-6 weeks

Sequence Models and LLMs

Study attention, transformers, language modeling, instruction tuning, and evaluation.

AttentionPretrainingInstruction following

Open papers

Specialist / 3-6 weeks

Generative AI

Compare GANs, diffusion, autoregressive generation, and modern GenAI workflows.

DiffusionGANsGeneration tradeoffs

Open papers

Specialist / 2-4 weeks

Multimodal and Retrieval

Connect language with images, retrieval, embeddings, and real-world knowledge access.

Vision-languageEmbeddingsRetrieval

Open papers

Specialist / 3-5 weeks

RL and Agents

Learn decision making, feedback, policy learning, and agent-style systems.

PoliciesRewardsExploration

Open papers

Practitioner / 2-4 weeks

Systems and Scaling

Understand the infrastructure and engineering papers behind large-scale training.

Distributed trainingServingEfficiency

Open papers

Practitioner / 2-4 weeks

Safety and Interpretability

Study robustness, alignment, transparency, and how to reason about model behavior.

AlignmentRobustnessInterpretability

Open papers

Research library

Foundation Models

Showing papers for this learning path. Open any paper card to read the full paper and related resources.

40 papers shown

unread2017

The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration. The best performing models also connect the encoder and decoder through an attention mechanism. We propose a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely. Experiments on two machine translation tasks show these models to be superior in quality while being more parallelizable and requiring significantly less time to train. Our model achieves 28.4 BLEU on the WMT 2014 English-to-German translation task, improving over the existing best results, including ensembles by over 2 BLEU. On the WMT 2014 English-to-French translation task, our model establishes a new single-model state-of-the-art BLEU score of 41.8 after training for 3.5 days on eight GPUs, a small fraction of the training costs of the best models from the literature. We show that the Transformer generalizes well to other tasks by applying it successfully to English constituency parsing both with large and limited training data.

Learn AI papers in the right order.

Where to start, and what to read next

Start Here

Classical ML

Optimization

Deep Learning Core

Sequence Models and LLMs

Generative AI

Multimodal and Retrieval

RL and Agents

Systems and Scaling

Safety and Interpretability

Architecture

Learning Paradigms

Applications

Trust and Deployment

Foundation Models

Attention is All you Need

Very Deep Convolutional Networks for Large-Scale Image Recognition

MizAR 60 for Mizar 50

AI-Assisted Pipeline for Dynamic Generation of Trustworthy Health Supplement Content at Scale

Random sample consensus

The qualitative content analysis process

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems

Segment Anything

Review of deep learning: concepts, CNN architectures, challenges, applications, future directions

Attention Is All You Need

Learning Transferable Visual Models From Natural Language Supervision

Machine Learning: Algorithms, Real-World Applications and Research Directions

LLaMA: Open and Efficient Foundation Language Models

Language Models are Few-Shot Learners

Is Space-Time Attention All You Need for Video Understanding?

Llama 2: Open Foundation and Fine-Tuned Chat Models

A Survey on Evaluation of Large Language Models

Sparks of Artificial General Intelligence: Early experiments with GPT-4

Artificial Intelligence, Machine Learning and Deep Learning in Advanced Robotics, A Review

A Survey on Distributed Machine Learning

Attention Is All You Need In Speech Separation

RCSB Protein Data Bank (RCSB.org): delivery of experimentally-determined PDB structures alongside one million computed structure models of proteins from artificial intelligence/machine learning

Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment

Artificial Intelligence, Machine Learning, Deep Learning, and Cognitive Computing: What Do These Terms Mean and How Will They Impact Health Care?

Scaling Distributed Machine Learning with In-Network Aggregation

Building Trust in Artificial Intelligence, Machine Learning, and Robotics

Artificial intelligence, machine learning and health systems

Channel Attention Is All You Need for Video Frame Interpolation

Ethical and Bias Considerations in Artificial Intelligence (AI)/Machine Learning.

Artificial intelligence, machine learning and deep learning

Artificial Intelligence, Machine Learning, Automation, Robotics, Future of Work and Future of Humanity: A Review and Research Agenda

Attention is all you need: utilizing attention in AI-enabled drug discovery

A Review of Further Directions for Artificial Intelligence, Machine Learning, and Deep Learning in Smart Logistics

Has the Future Started? The Current Growth of Artificial Intelligence, Machine Learning, and Deep Learning

Deliberative Alignment: Reasoning Enables Safer Language Models

Artificial Intelligence, Machine Learning, and Deep Learning in Structural Engineering: A Scientometrics Review of Trends and Best Practices

The need for a system view to regulate artificial intelligence/machine learning-based software as medical device

Promising Artificial Intelligence‐Machine Learning‐Deep Learning Algorithms in Ophthalmology

The state-of-the-art on Intellectual Property Analytics (IPA): A literature review on artificial intelligence, machine learning and deep learning methods for analysing intellectual property (IP) data