References — From Tokens to Embodied Minds

Courses

Stanford CS336 — Language Modeling from Scratch, Spring 2025, Percy Liang & Tatsunori Hashimoto. cs336.stanford.edu/spring2025. Full lectures on YouTube (Reddit announcement, July 11, 2025).
Stanford CS25 — Transformers United V5. web.stanford.edu/class/cs25.
Stanford CS231n — Deep Learning for Computer Vision, Spring 2024. cs231n.stanford.edu.
Stanford ENGR319 — Foundations of Generative AI. online.stanford.edu.
CMU 11-868 — Large Language Model Systems, Spring 2025, Lei Li. llmsystem.github.io/llmsystem2025spring.
Berkeley CS 285 — Deep Reinforcement Learning, Fall 2024, Sergey Levine. rail.eecs.berkeley.edu/deeprlcourse.
Berkeley CS 294-277 — Robots that Learn, Spring 2026, Jitendra Malik. Original announcement: Malik on X, Jan 6, 2025.
MIT 6.S191 — Introduction to Deep Learning, 2025. introtodeeplearning.com.
NVIDIA Deep Learning Institute — Robotics Fundamentals and Generative AI Teaching Kit. nvidia.com/training.
500 Global VC Unlocked: Silicon Valley (Aug 24 – Sept 4, 2026, Stanford campus, $25K). 500.co/vcu/stanford.
500 Global VC Unlocked: AI Edition. 500.co/vcu/ai.

Vaswani et al., Attention Is All You Need, arXiv:1706.03762, June 12, 2017.
He et al., Deep Residual Learning for Image Recognition, arXiv:1512.03385, Dec 10, 2015.
Dosovitskiy et al., An Image is Worth 16×16 Words (ViT), arXiv:2010.11929, Oct 22, 2020.
Su et al., RoFormer: Enhanced Transformer with Rotary Position Embedding, arXiv:2104.09864, April 20, 2021.
Schulman et al., Proximal Policy Optimization Algorithms, arXiv:1707.06347, July 20, 2017.
Haarnoja et al., Soft Actor-Critic, arXiv:1801.01290, Jan 4, 2018.
Hoffmann et al., Training Compute-Optimal LLMs (Chinchilla), arXiv:2203.15556, March 29, 2022.

Dao et al., FlashAttention-2, arXiv:2307.08691, July 17, 2023.
Kwon et al., PagedAttention / vLLM, arXiv:2309.06180, Sept 12, 2023.
Zheng et al., SGLang, arXiv:2312.07104, Dec 12, 2023.
Frantar et al., GPTQ, arXiv:2210.17323, Oct 31, 2022.
Leviathan et al., Speculative Decoding, arXiv:2211.17192, Nov 30, 2022.
Rafailov et al., Direct Preference Optimization, arXiv:2305.18290, May 29, 2023.
DeepSeek-AI, DeepSeek-V3 Technical Report, arXiv:2412.19437, Dec 27, 2024.
DeepSeek-AI, DeepSeek-R1, arXiv:2501.12948, Jan 22, 2025.
Meta AI, The Llama 3 Herd, arXiv:2407.21783, July 31, 2024.
Penedo et al., The FineWeb Datasets, arXiv:2406.17557, June 25, 2024.
Zhao et al., PyTorch FSDP, arXiv:2304.11277, April 21, 2023.
Liu et al., Ring Attention, arXiv:2310.01889, Oct 3, 2023.
Anthropic, Introducing Contextual Retrieval, Sept 19, 2024.
Microsoft Research, GraphRAG, arXiv:2404.16130, April 24, 2024.
Greshake et al., Indirect Prompt Injection, arXiv:2302.12173, Feb 23, 2023.
Amodei et al., Concrete Problems in AI Safety, arXiv:1606.06565, June 21, 2016.

Red Hat Developer, Introduction to distributed inference with llm-d, Nov 21, 2025.
Red Hat Developer, llm-d: Kubernetes-native distributed inferencing, May 20, 2025.
llm-d project. llm-d.ai.
vLLM documentation. docs.vllm.ai.
Triton documentation. triton-lang.org.
PyTorch Autograd Mechanics. docs.pytorch.org.
NVIDIA H100 Architecture Whitepaper, March 22, 2022.
NVIDIA Blackwell Architecture Brief, March 18, 2024.

Lynch & Park, Modern Robotics: Mechanics, Planning, and Control, Cambridge University Press, 2017. hades.mech.northwestern.edu.
Tobin et al., Domain Randomization, arXiv:1703.06907, March 20, 2017.
Chi et al., Diffusion Policy, arXiv:2303.04137, March 7, 2023.
Brohan et al., RT-2, robotics-transformer2.github.io, 2023.
Kim et al., OpenVLA, arXiv:2406.09246, June 13, 2024. Project: openvla.github.io.
Physical Intelligence, π0, HF blog, Feb 4, 2025.
Physical Intelligence, π0.5: a VLA with Open-World Generalization, arXiv:2504.16054, April 22, 2025.
Hugging Face, SmolVLA, HF blog, June 3, 2025. Model: lerobot/smolvla_base.
NVIDIA Research, GR00T N1, March 18, 2025.
NVIDIA Research, GR00T N1.5, research.nvidia.com/labs/gear/gr00t-n1_5, June 11, 2025.
NVIDIA Isaac Lab. isaac-sim.github.io/IsaacLab.
NVIDIA Isaac GR00T GitHub. github.com/NVIDIA/Isaac-GR00T.
Hugging Face LeRobot. github.com/huggingface/lerobot.
MuJoCo documentation. mujoco.readthedocs.io.
Qi et al., Visual Adversarial Examples Jailbreak Aligned LLMs, arXiv:2306.13213, June 22, 2023.

Goodfellow, Bengio & Courville, Deep Learning, MIT Press, 2016. deeplearningbook.org.
Sutton & Barto, Reinforcement Learning: An Introduction, 2nd ed., MIT Press, 2018. incompleteideas.net/book.
Cover & Thomas, Elements of Information Theory, 2nd ed., Wiley, 2006.