CS336 Notes: Lecture 15 - Alignment, SFT and RLHF
Post-training for helpful assistants: supervised fine-tuning on instructions, safety tuning, RLHF with preference data, PPO vs DPO, and the challenges of learning from human feedback.
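The PPO vs DPO contrast mentioned above hinges on DPO replacing the RL loop with a closed-form loss on preference pairs. A minimal sketch of the per-pair DPO objective, assuming the log-probabilities of the chosen and rejected responses are already computed (function and argument names here are illustrative, not from the lecture):

```python
import math

def dpo_loss(logp_w, logp_l, ref_logp_w, ref_logp_l, beta=0.1):
    """Direct Preference Optimization loss for one preference pair.

    logp_w, logp_l         : policy log-probs of the chosen / rejected response
    ref_logp_w, ref_logp_l : frozen reference-model log-probs of the same responses
    beta                   : strength of the implicit KL constraint to the reference
    """
    # Implicit reward margin: how much more the policy prefers y_w over y_l,
    # measured relative to the reference model.
    margin = beta * ((logp_w - ref_logp_w) - (logp_l - ref_logp_l))
    # Negative log-sigmoid of the margin (Bradley-Terry preference likelihood).
    return -math.log(1.0 / (1.0 + math.exp(-margin)))
```

When the policy matches the reference, the margin is zero and the loss is log 2; as the policy shifts probability mass toward the chosen response relative to the reference, the loss falls, which is the sense in which DPO optimizes the same preference objective as RLHF without a separate reward model or PPO rollouts.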