1 Comment
User's avatar
Awath Abdat's avatar

This is a cool technique, thank you for this article. Makes me curious how you would do it for other kinds of models besides classification, how their loss functions would look like when training the student model

Expand full comment