AI research atlas / v2

Learn AI papers in the right order.

Start with landmark ideas, move through foundations, then branch into LLMs, GenAI, agents, systems, and safety with a reading path that keeps the field from feeling random.

Start roadmap My reading

10 learning tracksFull-paper readerChatGPT handoff

Recommended firstLandmark papers

Build the mental timeline before going deep.

Then specializeLLMs, GenAI, safety

Move from foundations to modern systems.

Read modePDF + resources

Path-firstNo more random paper hopping

Research-nativearXiv links, PDFs, resources

Study loopTrack reading and discuss in ChatGPT

Learning path

Where to start, and what to read next

Start with landmarks

Orientation / 1-2 weeks

Start Here

Read the papers everyone keeps referencing so the rest of the map has anchors.

Know the landmark namesBuild historical contextPick a direction

Open papers

Foundations / 2-4 weeks

Classical ML

Learn the statistical and probabilistic ideas that still sit under modern models.

Bayesian thinkingModel evaluationUncertainty

Open papers

Foundations / 1-2 weeks

Optimization

Understand the training mechanics behind gradient-based learning.

Gradient descentGeneralizationTraining stability

Open papers

Builder / 3-5 weeks

Deep Learning Core

Move through representation learning, CNNs, residual networks, and scaling patterns.

CNN intuitionRepresentation learningBenchmark culture

Open papers

Builder / 3-6 weeks

Sequence Models and LLMs

Study attention, transformers, language modeling, instruction tuning, and evaluation.

AttentionPretrainingInstruction following

Open papers

Specialist / 3-6 weeks

Generative AI

Compare GANs, diffusion, autoregressive generation, and modern GenAI workflows.

DiffusionGANsGeneration tradeoffs

Open papers

Specialist / 2-4 weeks

Multimodal and Retrieval

Connect language with images, retrieval, embeddings, and real-world knowledge access.

Vision-languageEmbeddingsRetrieval

Open papers

Specialist / 3-5 weeks

RL and Agents

Learn decision making, feedback, policy learning, and agent-style systems.

PoliciesRewardsExploration

Open papers

Practitioner / 2-4 weeks

Systems and Scaling

Understand the infrastructure and engineering papers behind large-scale training.

Distributed trainingServingEfficiency

Open papers

Practitioner / 2-4 weeks

Safety and Interpretability

Study robustness, alignment, transparency, and how to reason about model behavior.

AlignmentRobustnessInterpretability

Open papers

Research library

Large Language Models

Showing papers for this learning path. Open any paper card to read the full paper and related resources.

40 papers shown

unread2000

R: A Language and Environment for Statistical Computing

Most R novices will start with Appendix A [A sample session], page 80.This should give some familiarity with the style of R sessions and more importantly some instant feedback on what actually happens.Many users will come to R mainly for its graphical facilities.

Learn AI papers in the right order.

Where to start, and what to read next

Start Here

Classical ML

Optimization

Deep Learning Core

Sequence Models and LLMs

Generative AI

Multimodal and Retrieval

RL and Agents

Systems and Scaling

Safety and Interpretability

Architecture

Learning Paradigms

Applications

Trust and Deployment

Large Language Models

R: A Language and Environment for Statistical Computing

Attention is All you Need

Long Short-Term Memory

Very Deep Convolutional Networks for Large-Scale Image Recognition

MizAR 60 for Mizar 50

Gradient-based learning applied to document recognition

AI-Assisted Pipeline for Dynamic Generation of Trustworthy Health Supplement Content at Scale

The Pascal Visual Object Classes (VOC) Challenge

Distributed Representations of Words and Phrases and their Compositionality

Neural Machine Translation by Jointly Learning to Align and Translate

Sequence to Sequence Learning with Neural Networks

TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems

Review of deep learning: concepts, CNN architectures, challenges, applications, future directions

Attention Is All You Need

Training language models to follow instructions with human feedback

Machine Learning: Algorithms, Real-World Applications and Research Directions

LLaMA: Open and Efficient Foundation Language Models

Language Models are Few-Shot Learners

Large language models encode clinical knowledge

Is Space-Time Attention All You Need for Video Understanding?

Llama 2: Open Foundation and Fine-Tuned Chat Models

LoRA: Low-Rank Adaptation of Large Language Models

A Survey on Evaluation of Large Language Models

GPT-4 Technical Report

Sparks of Artificial General Intelligence: Early experiments with GPT-4

A Survey on Distributed Machine Learning

Attention Is All You Need In Speech Separation

Autonomous chemical research with large language models

RCSB Protein Data Bank (RCSB.org): delivery of experimentally-determined PDB structures alongside one million computed structure models of proteins from artificial intelligence/machine learning

Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment

Scaling Distributed Machine Learning with In-Network Aggregation

Channel Attention Is All You Need for Video Frame Interpolation

Chain-Of-Thought Prompting Elicits Reasoning in Large Language Models

Ethical and Bias Considerations in Artificial Intelligence (AI)/Machine Learning.

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Attention is all you need: utilizing attention in AI-enabled drug discovery

Has the Future Started? The Current Growth of Artificial Intelligence, Machine Learning, and Deep Learning

Deliberative Alignment: Reasoning Enables Safer Language Models

The state-of-the-art on Intellectual Property Analytics (IPA): A literature review on artificial intelligence, machine learning and deep learning methods for analysing intellectual property (IP) data

Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Translation