arXiv / 2019

A Unifying Bayesian View of Continual Learning

Sebastian Farquhar, Yarin Gal

Foundation Models, Large Language Models, Popular and Landmark Papers, Probabilistic ML

Some machine learning applications require continual learning - where data comes in a sequence of datasets, each is used for training and then permanently discarded. From a Bayesian perspective, continual learning seems straightforward: Given the model posterior one would simply use this as the prior for the next task. However, exact posterior evaluation is intractable with many models, especially with Bayesian neural networks (BNNs). Instead, posterior approximations are often sought. Unfortunately, when posterior approximations are used, prior-focused approaches do not succeed in evaluations designed to capture properties of realistic continual learning use cases. As an alternative to prior-focused methods, we introduce a new approximate Bayesian derivation of the continual learning loss. Our loss does not rely on the posterior from earlier tasks, and instead adapts the model itself by changing the likelihood term. We call these approaches likelihood-focused. We then combine prior- and likelihood-focused methods into one objective, tying the two views together under a single unifying framework of approximate Bayesian continual learning.
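The "posterior becomes the next prior" recursion described in the abstract can be illustrated with a toy conjugate model in which the exact posterior is tractable. The sketch below is not the paper's method and involves no neural network; the `Beta` class, the `continual_posterior` helper, and the coin-flip datasets are hypothetical names and data chosen only for illustration. It shows that, when the posterior is exact, processing datasets one at a time and discarding each gives the same result as training on all the data at once.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class Beta:
    """Beta(alpha, beta) belief over the bias of a coin (illustrative model)."""
    alpha: float
    beta: float

    def update(self, flips: Sequence[int]) -> "Beta":
        """Return the exact posterior after observing a dataset of 0/1 flips."""
        heads = sum(flips)
        tails = len(flips) - heads
        return Beta(self.alpha + heads, self.beta + tails)

    def mean(self) -> float:
        """Posterior mean of the coin bias."""
        return self.alpha / (self.alpha + self.beta)


def continual_posterior(prior: Beta, tasks: Sequence[Sequence[int]]) -> Beta:
    """Process datasets in sequence, reusing each posterior as the next prior."""
    belief = prior
    for dataset in tasks:
        # The posterior after this dataset is the prior for the next one;
        # the dataset itself is not needed again after this step.
        belief = belief.update(dataset)
    return belief


# Hypothetical sequence of three small datasets (1 = heads, 0 = tails).
tasks = [[1, 1, 0], [0, 0, 0, 1], [1, 1]]
prior = Beta(1.0, 1.0)  # uniform prior, an assumption for this example

sequential = continual_posterior(prior, tasks)
batch = prior.update([flip for dataset in tasks for flip in dataset])

print(sequential, batch, sequential == batch)
```

With an exact posterior the sequential and batch results coincide. The difficulty the abstract points to arises when the posterior must be approximated, as with Bayesian neural networks, so that this equivalence no longer holds.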

0 citations, 0 influential

Preview prompt

You are my AI/ML research paper instructor. I want to deeply understand the paper below.

First, teach it in layers:
1. One-paragraph intuition.
2. Problem statement and why it mattered.
3. Key method, architecture, or algorithm.
4. Important equations or mechanisms, explained intuitively.
5. Experiments and evidence.
6. Limitations, assumptions, and failure modes.
7. How this paper influenced later AI/ML/Deep Learning/GenAI work.
8. A 30-minute study plan with checkpoints.
9. Quiz me with 5 questions and wait for my answers.

When something is not available in the attached context, say what is missing and infer carefully.

### Paper attached as context
Title: A Unifying Bayesian View of Continual Learning
Authors: Sebastian Farquhar, Yarin Gal
Year: 2019
Venue: arXiv
Categories: Foundation Models, Large Language Models, Popular and Landmark Papers, Probabilistic ML
Citations: 0
Paper URL: https://arxiv.org/abs/1902.06494v1
Open PDF: https://arxiv.org/pdf/1902.06494v1

Learning resources:
- PDF: arXiv PDF (https://arxiv.org/pdf/1902.06494v1)
- arXiv: arXiv abstract page (https://arxiv.org/abs/1902.06494v1)
- Google Scholar: Google Scholar references (https://scholar.google.com/scholar?q=A%20Unifying%20Bayesian%20View%20of%20Continual%20Learning)
- Papers with Code: Papers with Code search (https://paperswithcode.com/search?q=A%20Unifying%20Bayesian%20View%20of%20Continual%20Learning)
- YouTube: YouTube explanations (https://www.youtube.com/results?search_query=A%20Unifying%20Bayesian%20View%20of%20Continual%20Learning+paper+explained)