meshidenn / megatron-deepspeed-jp-abci
This project is forked from rioyokotalab/megatron-deepspeed-ylab
Ongoing research on training transformer language models at scale, including BERT and GPT-2
License: Other