AI research atlas / v2

Learn AI papers in the right order.

Start with landmark ideas, move through foundations, then branch into LLMs, GenAI, agents, systems, and safety with a reading path that keeps the field from feeling random.

10 learning tracks, Full-paper reader, ChatGPT handoff

Recommended first: Landmark papers

Build the mental timeline before going deep.

Then specialize: LLMs, GenAI, safety

Move from foundations to modern systems.

Read mode: PDF + resources

Path-first: No more random paper hopping

Research-native: arXiv links, PDFs, resources

Study loop: Track reading and discuss in ChatGPT

Learning path

Where to start, and what to read next

Start with landmarks

Orientation / 1-2 weeks

Start Here

Read the papers everyone keeps referencing so the rest of the map has anchors.

Know the landmark names, Build historical context, Pick a direction

Foundations / 2-4 weeks

Classical ML

Learn the statistical and probabilistic ideas that still sit under modern models.

Bayesian thinking, Model evaluation, Uncertainty

Foundations / 1-2 weeks

Optimization

Understand the training mechanics behind gradient-based learning.

Gradient descent, Generalization, Training stability

Builder / 3-5 weeks

Deep Learning Core

Move through representation learning, CNNs, residual networks, and scaling patterns.

CNN intuition, Representation learning, Benchmark culture

Builder / 3-6 weeks

Sequence Models and LLMs

Study attention, transformers, language modeling, instruction tuning, and evaluation.

Attention, Pretraining, Instruction following

Specialist / 3-6 weeks

Generative AI

Compare GANs, diffusion, autoregressive generation, and modern GenAI workflows.

Diffusion, GANs, Generation tradeoffs

Specialist / 2-4 weeks

Multimodal and Retrieval

Connect language with images, retrieval, embeddings, and real-world knowledge access.

Vision-language, Embeddings, Retrieval

Specialist / 3-5 weeks

RL and Agents

Learn decision making, feedback, policy learning, and agent-style systems.

Policies, Rewards, Exploration

Practitioner / 2-4 weeks

Systems and Scaling

Understand the infrastructure and engineering papers behind large-scale training.

Distributed training, Serving, Efficiency

Practitioner / 2-4 weeks

Safety and Interpretability

Study robustness, alignment, transparency, and how to reason about model behavior.

Alignment, Robustness, Interpretability

Research library

Information Retrieval

Showing papers for this learning path. Open any paper card to read the full paper and related resources.

40 papers shown

2021

Review of deep learning: concepts, CNN architectures, challenges, applications, future directions

In the last few years, the deep learning (DL) computing paradigm has been deemed the gold standard in the machine learning (ML) community. Moreover, it has gradually become the most widely used computational approach in the field of ML, achieving outstanding results on several complex cognitive tasks, matching or even beating human performance. One of the benefits of DL is the ability to learn from massive amounts of data. The DL field has grown fast in the last few years and has been used extensively to address a wide range of traditional applications. More importantly, DL has outperformed well-known ML techniques in many domains, e.g., cybersecurity, natural language processing, bioinformatics, robotics and control, and medical information processing, among many others. Although several works have reviewed the state of the art in DL, each tackled only one aspect of DL, which leads to an overall lack of knowledge about it. Therefore, in this contribution, we propose a more holistic approach in order to provide a more suitable starting point from which to develop a full understanding of DL. Specifically, this review attempts to provide a more comprehensive survey of the most important aspects of DL, including enhancements recently added to the field. In particular, this paper outlines the importance of DL and presents the types of DL techniques and networks. It then presents convolutional neural networks (CNNs), the most utilized DL network type, and describes the development of CNN architectures together with their main features, e.g., starting with the AlexNet network and closing with the High-Resolution network (HR.Net). Finally, we present the challenges and suggested solutions to help researchers understand the existing research gaps, followed by a list of the major DL applications. Computational tools including FPGA, GPU, and CPU are summarized along with a description of their influence on DL. The paper ends with the evolution matrix, benchmark datasets, and summary and conclusion.

Architecture

Learning Paradigms

Applications

Trust and Deployment

Information Retrieval

Review of deep learning: concepts, CNN architectures, challenges, applications, future directions

Yes, "Attention Is All You Need", for Exemplar based Colorization

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

Attention is all you need to solve chiral superconductivity

LLandMark: A Multi-Agent Framework for Landmark-Aware Multimodal Interactive Video Retrieval

Lightweight and Direct Document Relevance Optimization for Generative Information Retrieval

Satisfactory Medical Consultation based on Terminology-Enhanced Information Retrieval and Emotional In-Context Learning

Rankers, Judges, and Assistants: Towards Understanding the Interplay of LLMs in Information Retrieval Evaluation

RAGPart & RAGMask: Retrieval-Stage Defenses Against Corpus Poisoning in Retrieval-Augmented Generation

Investigating LLM Variability in Personalized Conversational Information Retrieval

Explainable Information Retrieval in the Audit Domain

Newton-Puiseux Analysis for Interpretability and Calibration of Complex-Valued Neural Networks

Accessibility in Information Retrieval

Interactions with Generative Information Retrieval Systems

Retrieval Augmented Structured Generation: Business Document Information Extraction As Tool Use

DESIRE-ME: Domain-Enhanced Supervised Information REtrieval using Mixture-of-Experts

Retrieving Comparative Arguments using Ensemble Methods and Neural Information Retrieval

Generative AI for Software Metadata: Overview of the Information Retrieval in Software Engineering Track at FIRE 2023

Advancing continual lifelong learning in neural information retrieval: definition, dataset, framework, and empirical evaluation

Tackling the Curse of Dimensionality with Physics-Informed Neural Networks

Is Cross-modal Information Retrieval Possible without Training?

Towards Proactive Information Retrieval in Noisy Text with Wikipedia Concepts

Information Retrieval from the Digitized Books

Online Information Retrieval Evaluation using the STELLA Framework

Modern graph neural networks do worse than classical greedy algorithms in solving combinatorial optimization problems like maximum independent set

FAIR: Fairness-Aware Information Retrieval Evaluation

Neural ranking models for document retrieval

IITP@COLIEE 2019: Legal Information Retrieval using BM25 and BERT

Can Information Flows Suggest Targets for Interventions in Neural Circuits?

Match Your Words! A Study of Lexical Matching in Neural Information Retrieval

On Information Plane Analyses of Neural Network Classifiers -- A Review

Declarative Experimentation in Information Retrieval using PyTerrier

Evaluating Information Retrieval Systems for Kids

Reading Protocol: Understanding what has been Read in Interactive Information Retrieval Tasks

Leveraging Dependency Forest for Neural Medical Relation Extraction

Using Neural Networks for Relation Extraction from Biomedical Literature

Random Pairwise Shapelets Forest

Towards Theoretical Understanding of Weak Supervision for Information Retrieval

Improving Generalization of Deep Neural Networks by Leveraging Margin Distribution

Overcoming low-utility facets for complex answer retrieval