AI research atlas / v2

Learn AI papers in the right order.

Start with landmark ideas, move through foundations, then branch into LLMs, GenAI, agents, systems, and safety with a reading path that keeps the field from feeling random.

Start roadmap My reading

10 learning tracksFull-paper readerChatGPT handoff

Recommended firstLandmark papers

Build the mental timeline before going deep.

Then specializeLLMs, GenAI, safety

Move from foundations to modern systems.

Read modePDF + resources

Path-firstNo more random paper hopping

Research-nativearXiv links, PDFs, resources

Study loopTrack reading and discuss in ChatGPT

Learning path

Where to start, and what to read next

Start with landmarks

Orientation / 1-2 weeks

Start Here

Read the papers everyone keeps referencing so the rest of the map has anchors.

Know the landmark namesBuild historical contextPick a direction

Open papers

Foundations / 2-4 weeks

Classical ML

Learn the statistical and probabilistic ideas that still sit under modern models.

Bayesian thinkingModel evaluationUncertainty

Open papers

Foundations / 1-2 weeks

Optimization

Understand the training mechanics behind gradient-based learning.

Gradient descentGeneralizationTraining stability

Open papers

Builder / 3-5 weeks

Deep Learning Core

Move through representation learning, CNNs, residual networks, and scaling patterns.

CNN intuitionRepresentation learningBenchmark culture

Open papers

Builder / 3-6 weeks

Sequence Models and LLMs

Study attention, transformers, language modeling, instruction tuning, and evaluation.

AttentionPretrainingInstruction following

Open papers

Specialist / 3-6 weeks

Generative AI

Compare GANs, diffusion, autoregressive generation, and modern GenAI workflows.

DiffusionGANsGeneration tradeoffs

Open papers

Specialist / 2-4 weeks

Multimodal and Retrieval

Connect language with images, retrieval, embeddings, and real-world knowledge access.

Vision-languageEmbeddingsRetrieval

Open papers

Specialist / 3-5 weeks

RL and Agents

Learn decision making, feedback, policy learning, and agent-style systems.

PoliciesRewardsExploration

Open papers

Practitioner / 2-4 weeks

Systems and Scaling

Understand the infrastructure and engineering papers behind large-scale training.

Distributed trainingServingEfficiency

Open papers

Practitioner / 2-4 weeks

Safety and Interpretability

Study robustness, alignment, transparency, and how to reason about model behavior.

AlignmentRobustnessInterpretability

Open papers

Research library

Graph Learning

Showing papers for this learning path. Open any paper card to read the full paper and related resources.

40 papers shown

unread2023

As a present to Mizar on its 50th anniversary, we develop an AI/TP system that automatically proves about 60% of the Mizar theorems in the hammer setting. We also automatically prove 75% of the Mizar theorems when the automated provers are helped by using only the premises used in the human-written Mizar proofs. We describe the methods and large-scale experiments leading to these results. This includes in particular the E and Vampire provers, their ENIGMA and Deepire learning modifications, a number of learning-based premise selection methods, and the incremental loop that interleaves growing a corpus of millions of ATP proofs with training increasingly strong AI/TP systems on them. We also present a selection of Mizar problems that were proved automatically.

Jakubův, Jan, Chvalovský, Karel, Goertzel, Zarathustra 75,444

Learn AI papers in the right order.

Where to start, and what to read next

Start Here

Classical ML

Optimization

Deep Learning Core

Sequence Models and LLMs

Generative AI

Multimodal and Retrieval

RL and Agents

Systems and Scaling

Safety and Interpretability

Architecture

Learning Paradigms

Applications

Trust and Deployment

Graph Learning

MizAR 60 for Mizar 50

Graph Attention Networks

Semi-Supervised Classification with Graph Convolutional Networks

Attention is all you need to solve chiral superconductivity

Master GAN: Multiple Attention is all you Need: A Multiple Attention Guided Super Resolution Network for Dems

Only 5% Attention Is All You Need: Efficient Long-range Document-level Neural Machine Translation

Composable Assurance for AI Alignment: A Framework for Propagating Formal Safety Properties Through MLOps

Newton-Puiseux Analysis for Interpretability and Calibration of Complex-Valued Neural Networks

Multiclass Graph-Based Large Margin Classifiers: Unified Approach for Support Vectors and Neural Networks

Transformers are Graph Neural Networks

NeuroCoreX: An Open-Source FPGA-Based Spiking Neural Network Emulator with On-Chip Learning

Tackling the Curse of Dimensionality with Physics-Informed Neural Networks

Learning Active Subspaces and Discovering Important Features with Gaussian Radial Basis Functions Neural Networks

The Deep Arbitrary Polynomial Chaos Neural Network or how Deep Artificial Neural Networks could benefit from Data-Driven Homogeneous Chaos Theory

Random perfect matchings in regular graphs

Acute Lymphoblastic Leukemia Detection Using Hypercomplex-Valued Convolutional Neural Networks

Modern graph neural networks do worse than classical greedy algorithms in solving combinatorial optimization problems like maximum independent set

Dual Accuracy-Quality-Driven Neural Network for Prediction Interval Generation

MECCH: Metapath Context Convolution-based Heterogeneous Graph Neural Networks

Cover and Hitting Times of Hyperbolic Random Graphs

Social Influence Prediction with Train and Test Time Augmentation for Graph Neural Networks

SiReN: Sign-Aware Recommendation Using Graph Neural Networks

Continual Learning for Recurrent Neural Networks: an Empirical Evaluation

CiwGAN and fiwGAN: Encoding information in acoustic data to model lexical learning with Generative Adversarial Networks

Spiking Inception Module for Multi-layer Unsupervised Spiking Neural Networks

Adaptive Propagation Graph Convolutional Network

On Information Plane Analyses of Neural Network Classifiers -- A Review

A Learning Framework for n-bit Quantized Neural Networks toward FPGAs

mFI-PSO: A Flexible and Effective Method in Adversarial Image Generation for Deep Neural Networks

Graph Convolutional Neural Networks with Node Transition Probability-based Message Passing and DropNode Regularization

Analyzing the Performance of Graph Neural Networks with Pipe Parallelism

Hierarchical Attentional Hybrid Neural Networks for Document Classification

Fast and Deep Graph Neural Networks

A Recurrent Probabilistic Neural Network with Dimensionality Reduction Based on Time-series Discriminant Component Analysis

Leveraging Dependency Forest for Neural Medical Relation Extraction

k-hop Graph Neural Networks

A Review on Neural Network Models of Schizophrenia and Autism Spectrum Disorder

Using Neural Networks for Relation Extraction from Biomedical Literature

A Neural Network-Evolutionary Computational Framework for Remaining Useful Life Estimation of Mechanical Systems

Missing Data Imputation with Adversarially-trained Graph Convolutional Networks