COMPSCI 714 · AI Architecture and Design

This course focuses on the design and implementation of modern AI systems — transformer architectures, large language models, training dynamics, and system-level considerations in deep learning.

Notes here focus on:

  • architectural reasoning: why design choices are made, not just what they are
  • implementation-level understanding — building models from scratch to expose internals
  • connecting theory to real systems (GPT-2, attention mechanisms, tokenization)

Linked notes:

Bridges Back And Outward