Iliad Intensive Curriculum

Overview

We turn from learning in principle to learning in practice. Deep learning appears to overcome all three barriers from the Principles of Learning module in ways that classical statistical learning theory could not predict or even actively counterpredicted — overparameterized networks generalize despite having the capacity to memorize, SGD finds good solutions on non-convex landscapes, and networks compactly represent functions in millions of dimensions. Beyond these, deep learning exhibits additional empirical phenomena that the classical framework doesn’t even address: learned representations converge across different architectures and training setups, models display in-context learning abilities that were never explicitly trained, etc. We survey these mysteries and explore candidate explanations, including the hypothesis that deep learning may be overcoming these barriers with similar mechanisms to Solomonoff induction. As with Day B.1, this is a broad lightning overview; the subsequent case study days (SLT, training dynamics, data attribution) each develop one specific line of attack on these mysteries in depth.

Prerequisites

Module on Principles of Learning: the three barriers (approximation, generalization, optimization), Solomonoff induction and the simplicity prior, no free lunch, bias-variance tradeoff
Basic understanding of deep learning, sufficient to read non-specialist ML papers
Knowledge of mechanistic interpretability (Day C.2) is very helpful motivation but not logically necessary

Content

Fast track

Read the lecture slides for the overall framing, then read "Deep Learning as Program Synthesis" (skipping the background section on Solomonoff induction, which was covered on the Principles of Learning day, and optionally deferring the "path forward" section). This gives a high level overview of various empirical mysteries. Then skim as many papers on the list as you have time/interest (possibly none).

Main content

Lecture:

Core readings (likely cover a smaller subset depending on time / audience interest):

Overview
- Deep Learning as Program Synthesis
  - Note that this post presents an opinionated hypothesis (deep learning is performing something analogous to Solomonoff induction) alongside relatively consensus discussion of empirical mysteries. The post is largely being shared for the latter, though students may find the hypothesis itself useful pedagogically
Approximation
- Approximation is expensive, but the lunch is cheap
- Why and When Can Deep – but Not Shallow – Networks Avoid the Curse of Dimensionality: a Review
Generalization
Optimization
- Hidden Progress in Deep Learning: SGD Learns Parities Near the Computational Limit
- SGD learning on neural networks: leap complexity and saddle-to-saddle dynamics
Representational alignment
- The Platonic Representation Hypothesis
In-context learning
- In-context Learning and Induction Heads

Learn more

The Scaling Hypothesis
- Quite polemical. Nevertheless, very influential "ideas piece"
Getting aligned on representational alignment
- A broader overview of the representational alignment phenomenon
Exact solutions to the nonlinear dynamics of learning in deep linear neural networks
- Classic paper which finds stagewise learning in neural networks with linear activation function; will be covered later in the course
Stagewise Development in Neural Networks
- A nice paper investigating stagewise learning in small LLMs. Requires some SLT knowledge
Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs
- Training on "evil" data in narrow domains (like, code with vulnerabilities) generalizes to "evil" more broadly (praising Hitler, etc)

Mysteries of Deep Learning

What you’ll learn