pytorch-transformer Attention is all you need implementation YouTube video with full step-by-step implementation: https://www.youtube.com/watch?v=ISNdQcPhsts