Understanding how neural networks learn features during training, and how this feature learning affects their capacity to generalise, is an important open question. In our recent pre-print, we provide a sharp analysis of how two-layer neural networks learn features from data, and thereby improve over the kernel regime, after being trained with a single gradient descent step.
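To make the setting concrete, here is a minimal NumPy sketch of the kind of training protocol described above: a two-layer network whose first layer takes a single (large) gradient step, after which the second layer is fit on the updated features. All names, the toy target, and the hyperparameters are our own illustrative choices, not taken from the pre-print.

```python
import numpy as np

rng = np.random.default_rng(0)
d, p, n = 20, 100, 500             # input dim, hidden width, samples

# Toy single-index target (an assumption for illustration only)
X = rng.standard_normal((n, d)) / np.sqrt(d)
y = np.tanh(X @ rng.standard_normal(d))

# Two-layer network f(x) = a . relu(W x)
W = rng.standard_normal((p, d)) / np.sqrt(d)   # first-layer init
a = rng.standard_normal(p) / np.sqrt(p)        # second-layer init
relu = lambda z: np.maximum(z, 0.0)

# One gradient step on W for the squared loss; a large learning
# rate eta moves W away from initialisation, i.e. the features
# change, unlike in the kernel (lazy) regime.
pre = X @ W.T                                  # (n, p) pre-activations
err = relu(pre) @ a - y                        # residuals
grad_W = ((err[:, None] * (pre > 0) * a).T @ X) / n
eta = 10.0
W1 = W - eta * grad_W

# Fit the second layer on the updated features via ridge regression.
Phi = relu(X @ W1.T)
a1 = np.linalg.solve(Phi.T @ Phi + 1e-3 * np.eye(p), Phi.T @ y)
mse = np.mean((Phi @ a1 - y) ** 2)
```

Comparing `mse` against the same ridge fit on the features at initialisation gives a quick empirical sense of the gap between the one-step and kernel regimes.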