AI research atlas / v2

Learn AI papers in the right order.

Start with landmark ideas, move through foundations, then branch into LLMs, GenAI, agents, systems, and safety with a reading path that keeps the field from feeling random.

Start roadmap My reading

10 learning tracksFull-paper readerChatGPT handoff

Recommended firstLandmark papers

Build the mental timeline before going deep.

Then specializeLLMs, GenAI, safety

Move from foundations to modern systems.

Read modePDF + resources

Path-firstNo more random paper hopping

Research-nativearXiv links, PDFs, resources

Study loopTrack reading and discuss in ChatGPT

Learning path

Where to start, and what to read next

Start with landmarks

Orientation / 1-2 weeks

Start Here

Read the papers everyone keeps referencing so the rest of the map has anchors.

Know the landmark namesBuild historical contextPick a direction

Open papers

Foundations / 2-4 weeks

Classical ML

Learn the statistical and probabilistic ideas that still sit under modern models.

Bayesian thinkingModel evaluationUncertainty

Open papers

Foundations / 1-2 weeks

Optimization

Understand the training mechanics behind gradient-based learning.

Gradient descentGeneralizationTraining stability

Open papers

Builder / 3-5 weeks

Deep Learning Core

Move through representation learning, CNNs, residual networks, and scaling patterns.

CNN intuitionRepresentation learningBenchmark culture

Open papers

Builder / 3-6 weeks

Sequence Models and LLMs

Study attention, transformers, language modeling, instruction tuning, and evaluation.

AttentionPretrainingInstruction following

Open papers

Specialist / 3-6 weeks

Generative AI

Compare GANs, diffusion, autoregressive generation, and modern GenAI workflows.

DiffusionGANsGeneration tradeoffs

Open papers

Specialist / 2-4 weeks

Multimodal and Retrieval

Connect language with images, retrieval, embeddings, and real-world knowledge access.

Vision-languageEmbeddingsRetrieval

Open papers

Specialist / 3-5 weeks

RL and Agents

Learn decision making, feedback, policy learning, and agent-style systems.

PoliciesRewardsExploration

Open papers

Practitioner / 2-4 weeks

Systems and Scaling

Understand the infrastructure and engineering papers behind large-scale training.

Distributed trainingServingEfficiency

Open papers

Practitioner / 2-4 weeks

Safety and Interpretability

Study robustness, alignment, transparency, and how to reason about model behavior.

AlignmentRobustnessInterpretability

Open papers

Research library

AI Safety

Showing papers for this learning path. Open any paper card to read the full paper and related resources.

40 papers shown

unread1998

Gradient-based learning applied to document recognition

Multilayer neural networks trained with the back-propagation algorithm constitute the best example of a successful gradient based learning technique. Given an appropriate network architecture, gradient-based learning algorithms can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters, with minimal preprocessing. This paper reviews various methods applied to handwritten character recognition and compares them on a standard handwritten digit recognition task. Convolutional neural networks, which are specifically designed to deal with the variability of 2D shapes, are shown to outperform all other techniques. Real-life document recognition systems are composed of multiple modules including field extraction, segmentation recognition, and language modeling. A new learning paradigm, called graph transformer networks (GTN), allows such multimodule systems to be trained globally using gradient-based methods so as to minimize an overall performance measure. Two systems for online handwriting recognition are described. Experiments demonstrate the advantage of global training, and the flexibility of graph transformer networks. A graph transformer network for reading a bank cheque is also described. It uses convolutional neural network character recognizers combined with global training techniques to provide record accuracy on business and personal cheques. It is deployed commercially and reads several million cheques per day.

Learn AI papers in the right order.

Where to start, and what to read next

Start Here

Classical ML

Optimization

Deep Learning Core

Sequence Models and LLMs

Generative AI

Multimodal and Retrieval

RL and Agents

Systems and Scaling

Safety and Interpretability

Architecture

Learning Paradigms

Applications

Trust and Deployment

AI Safety

Gradient-based learning applied to document recognition

Training language models to follow instructions with human feedback

GPT-4 Technical Report

Autonomous chemical research with large language models

Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Deliberative Alignment: Reasoning Enables Safer Language Models

LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B

An accelerated alignment method for analyzing time sequences of industrial alarm floods

Bergeron: Combating Adversarial Attacks through a Conscience-Based Alignment Framework

Jailbreaking and Mitigation of Vulnerabilities in Large Language Models

Improving LLM Safety Alignment with Dual-Objective Optimization

An Overview of Trustworthy AI: Advances in IP Protection, Privacy-Preserving Federated Learning, Security Verification, and GAI Safety Alignment

Agentic AI Frameworks: Architectures, Protocols, and Design Challenges

Objective metrics for ethical AI: a systematic literature review

Think in Safety: Unveiling and Mitigating Safety Alignment Collapse in Multimodal Large Reasoning Model

Breach By A Thousand Leaks: Unsafe Information Leakage in 'Safe' AI Responses

Ensuring Safety and Trust: Analyzing the Risks of Large Language Models in Medicine

Context Is All You Need: A Hybrid Attention-Based Method for Detecting Code Design Patterns

When AIs Judge AIs: The Rise of Agent-as-a-Judge Evaluation for LLMs

Enhancing Security in Large Language Models: A Comprehensive Review of Prompt Injection Attacks and Defenses

Harnessing Metacognition for Safe and Responsible AI

Smoothed Embeddings for Robust Language Models

Guardians of the Agentic System: Preventing Many Shots Jailbreak with Agentic System

The ethical evaluation of large language models and its optimization

SafeTuneBed: A Toolkit for Benchmarking LLM Safety Alignment in Fine-Tuning

LLMs as Medical Safety Judges: Evaluating Alignment with Human Annotation in Patient-Facing QA

Building Trust in Artificial Intelligence: A Systematic Review through the Lens of Trust Theory

Position: AI Safety Must Embrace an Antifragile Perspective

Noise Injection Systemically Degrades Large Language Model Safety Guardrails

Diagnosing Hallucination Risk in AI Surgical Decision-Support: A Sequential Framework for Sequential Validation

Toward Constitutional Autonomy in AI Systems: A Theoretical Framework for Aligned Agentic Intelligence

Balancing Safety and Helpfulness in Healthcare AI Assistants through Iterative Preference Alignment

Co-Alignment: Rethinking Alignment as Bidirectional Human-AI Cognitive Adaptation

Position: The Complexity of Perfect AI Alignment - Formalizing the RLHF Trilemma

Strong Preferences Affect the Robustness of Preference Models and Value Alignment

LSR: Linguistic Safety Robustness Benchmark for Low-Resource West African Languages

Overview of PAN 2026: Voight-Kampff Generative AI Detection, Text Watermarking, Multi-Author Writing Style Analysis, Generative Plagiarism Detection, and Reasoning Trajectory Detection

FGD-Align: Pluralistic Alignment for Large Language Models via Fuzzy Group Decision-Making

A Review of AI Safety and Trustworthiness in Autonomous Vehicles