Introduction

Repo with my attempt to implement Transformer networks with PyTorch.

References

- Annotated Transformer
- Building Transformer XL from scratch
- PyTorch Docs: torch.nn.Transformer
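As a point of comparison for the torch.nn.Transformer module listed in the references, here is a minimal sketch of running PyTorch's built-in encoder-decoder on random tensors. It does not reflect this repo's own implementation; the helper name `run_reference_transformer` and all sizes (model width, head count, layer counts, sequence lengths) are assumptions picked only to keep the example small.

```python
import torch
import torch.nn as nn


def run_reference_transformer(seed: int = 0) -> torch.Tensor:
    """Run PyTorch's built-in nn.Transformer once on random inputs.

    All hyperparameters below are illustrative assumptions, not values
    taken from this repo.
    """
    torch.manual_seed(seed)

    model = nn.Transformer(
        d_model=32,            # embedding width (assumed)
        nhead=4,               # attention heads; must divide d_model
        num_encoder_layers=2,
        num_decoder_layers=2,
        dim_feedforward=64,
        dropout=0.0,
        batch_first=True,      # tensors are (batch, sequence, feature)
    )
    model.eval()

    src = torch.rand(2, 10, 32)  # batch of 2, source length 10
    tgt = torch.rand(2, 7, 32)   # batch of 2, target length 7

    # Causal mask so each target position only attends to earlier positions.
    tgt_mask = model.generate_square_subsequent_mask(7)

    with torch.no_grad():
        return model(src, tgt, tgt_mask=tgt_mask)


out = run_reference_transformer()
print(tuple(out.shape))
```

The output has one vector of width `d_model` per target position, so its shape follows the target tensor rather than the source. Comparing shapes and masking behaviour against this built-in module can be a convenient sanity check when writing a from-scratch version.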