This page is a hub for my transformer-related notes.

Core Routes

  • tokenization and representation learning
  • self-attention and sequence modeling
  • scaling, inference efficiency, and system design

Connected Routes

  • COMPSCI 714 is the most direct course path into this cluster
  • Soft Computing Explained is a good contrast point if you want the older probabilistic and approximate-reasoning view
  • Computational Graphics is where geometric intuition and visual structure may reconnect with representation learning later