transformers
2 pieces
Apr 13, 2025 · 9 min read
Math Foundations of Transformers and MoE Layers
A thorough explanation of the equations powering classic transformer structures and Mixture-of-Experts for advanced deep learning workflows.
ai-systems-engineering, transformers, mixture-of-experts, machine-learning
Jan 26, 2025 · 2 min read
Understanding Tokenizers and Embedders in LLM Pipelines
A deep dive into the role, structure, and training of tokenizers and embedders in modern language models like GPT, BERT, and T5.
machine-learning-fundamentals, machine-learning, transformers, tokenizers