Megatron-LM by Nvidia August 16, 2019 by admin Research on training transformer language models at scale, including BERT: https://github.com/NVIDIA/Megatron-LM