Mixture-of-Depths: Dynamically allocating compute in transformer-based language models
Apr 4, 2024

Tech Frontier

Information

Published: April 4, 2024
Type: audio
Language: EN
Author: Julien Rineau