Episode Details

Back to Episodes
The Bitter Lesson for Biology — Adam Green on Virtual Cells and Scaling Laws

The Bitter Lesson for Biology — Adam Green on Virtual Cells and Scaling Laws

Episode 3 Published 3 hours ago
Description

Markov Biosciences, a startup in San Francisco, is betting that biology is about to have its GPT moment. In this episode, founder Adam Green explains the "bitter lesson" for biology, the idea borrowed from Richard Sutton that large unbiased datasets and the right training objective tend to outcompete models with hard-coded rules and human priors. Adam thinks, in particular, that the virtual cell field took a wrong turn by spending hundreds of millions of dollars collecting expensive perturbation data. Green’s counterargument is that the data needed to train useful virtual cells is not limiting, but rather compute (and the loss function) are. By treating single-cell RNA-seq as a ranking problem rather than raw counts (a century-old idea traceable to a 1927 psychophysics paper), they found that virtual cells pre-trained on plain observational data show clean scaling laws, getting monotonically better at predicting unseen perturbations as the models grow, and beating a state-of-the-art model built specifically for that task.


00:00 - Cold open and introduction 

01:58 - The first clinical prediction from a virtual cell

05:38 - What is a "virtual cell," really? 

08:01 - Single-cell RNA-seq biases and the urns analogy

23:29 - The bitter lesson for biology

30:55 - Geometric Plackett-Luce: the right loss function

59:26 Trop2 deep dive

1:11:16 - Top-down vs. bottom-up biology, mechinterp, and control as the goal 


Readings and mentions: 

Listen Now