Iliad Intensive Curriculum

Overview

We ask: if AGI is a learning machine, what can we say about it in principle? We draw on statistical learning theory and algorithmic information theory. We first explore three fundamental barriers that any powerful learning system must overcome: approximation (can the hypothesis class express the truth?), generalization (can finite data distinguish truth from noise?), and optimization (can the learner find good hypotheses efficiently?). We also examine why these barriers are in tension with one another. We then introduce Solomonoff induction, which resolves the first two barriers via a universal hypothesis class and a simplicity prior but fails completely on the third, because it is uncomputable. Along the way, we cover the bias-variance tradeoff, the no free lunch theorem, and cryptographic hardness arguments for why efficient universal learning is impossible in the worst case. This is necessarily a lightning tour of large fields; the goal is to establish a shared conceptual vocabulary and theoretical reference point for the rest of the deep learning theory cluster to build on.
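
For orientation, two of the objects named above can be written down compactly. What follows is a sketch in standard textbook notation, not necessarily the notation used in the lecture or readings. The symbols are assumptions introduced here: f is the true function, \hat{f} is the predictor fit to a random training set, \sigma^2 is the noise variance, U is a universal monotone Turing machine, and \ell(p) is the length of program p in bits.

```latex
% Bias-variance decomposition of expected squared error at a point x,
% assuming y = f(x) + \varepsilon with E[\varepsilon] = 0 and Var(\varepsilon) = \sigma^2.
% The expectation is over training sets and noise.
\mathbb{E}\big[(y - \hat{f}(x))^2\big]
  = \underbrace{\big(\mathbb{E}[\hat{f}(x)] - f(x)\big)^2}_{\text{bias}^2}
  + \underbrace{\operatorname{Var}\big(\hat{f}(x)\big)}_{\text{variance}}
  + \underbrace{\sigma^2}_{\text{irreducible noise}}

% Solomonoff's universal prior over finite binary strings x:
% sum over (minimal) programs p whose output on U begins with x.
M(x) = \sum_{p \,:\, U(p) = x\ast} 2^{-\ell(p)}

% Prediction of the next symbol by conditioning.
M(x_{t+1} \mid x_{1:t}) = \frac{M(x_{1:t}\, x_{t+1})}{M(x_{1:t})}
```

The weight 2^{-\ell(p)} is the simplicity prior: shorter programs count for more. Evaluating the sum would require knowing which programs produce x, which runs into the halting problem; this is where the optimization barrier shows up.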

Prerequisites

Probability theory basics: probability distributions, conditional probability, Bayes' theorem
Comfort with mathematical proofs and formal definitions
Some familiarity with the concept of Turing machines and computability (at the level of "know what a Turing machine is, why it's a universal model of computation, and what it means for a function to be computable")
No machine learning background assumed; no coding required
Helpful but not required: basic statistical concepts (bias, variance, estimators)

Content

Fast track

The content is already a very high-level overview, but for the absolute basics, read the lecture slides and Sections 2-2.8 of Shane Legg's PhD thesis. Ideally this would still involve a fair amount of discussion (possibly with LLMs).

Main content

Main lecture:

Readings and exercises (in order):

Bitter Lesson
Introduction to Solomonoff induction
- Shane Legg's PhD thesis (Sections 2-2.8)
Solomonoff induction exercises
- Solutions here
Understanding the Bias-Variance tradeoff
The No Free Lunch theorems and their Razor
Understanding Machine Learning: From Theory to Algorithms
- Section 8.4 (Hardness of learning)

Principles of Learning
