manleyroberts / sparsefinder-mlm Goto Github PK
View Code? Open in Web Editor NEWThis project forked from deep-spin/sparsefinder-mlm
Code for training and evaluating MLM models for our paper "Predicting Attention Sparsity in Transformers"
License: MIT License