From Sand to Superintelligence · Reference
Reference
Bibliography
152 sources cited across the book — papers, manuals, archives, whitepapers — organized by chapter and unique URL.
Chapter 1 · The Mineral
- vast quantities of silicahttps://investornews.com/critical-minerals-rare-earths/cmi-masterclass-the-re…
Chapter 2 · Fire and Carbon
- Producing one ton of metallurgical-grade siliconhttps://www.energycentral.com/energy-biz/post/mining-and-refining-pure-silico…
Chapter 3 · The Nine-Nines Problem
- Over the course of several dayshttps://fpt-semiconductor.com/blogs/a-guidance-to-silicon-wafer-manufacturing…
Chapter 4 · Growing a Perfect Crystal
- A century laterhttps://www.waferworld.com/post/silicon-wafer-manufacturing-from-sand-to-silicon
Chapter 5 · From Log to Mirror
- Wafer slicing is a high-volume arthttps://www.sumcosi.com/english/products/process/step_02.html
Chapter 6 · Designing the Impossible
- nanosheet transistorshttps://www.tomshardware.com/tech-industry/semiconductors/tsmc-begins-quietly…
Chapter 8 · Light at 13.5 Nanometers
- Decades of lithographyhttps://www.uprtek.com/en/blogs/photolithography
- The whole process is repeated fifty thousand times per secondhttps://eureka.patsnap.com/report-what-challenges-do-euv-lithography-processe…
- A modern EUV scannerhttps://www.asml.com/news/stories/2021/semiconductor-manufacturing-process-steps
- high-numerical-aperture EUVhttps://www.eenewseurope.com/en/tsmc-shuns-high-na-euv-lithography/
Chapter 12 · CoWoS and the 2.5D Revolution
- By HBM4, the interface has doubled to 2,048 bits.https://introl.com/blog/nvidia-vera-rubin-platform-8-exaflops-infrastructure
- CoWoShttps://anysilicon.com/cowos-package/
- Roadmaps are paced by it.https://newsletter.semianalysis.com/p/vera-rubin-extreme-co-design-an-evolution
Chapter 13 · The Vera Rubin Superchip
- The whole module contains roughly seventeen thousand individual componentshttps://nvidianews.nvidia.com/news/rubin-platform-ai-supercomputer
- Olympus Arm v9 architecturehttps://developer.nvidia.com/blog/inside-the-nvidia-rubin-platform-six-new-ch…
Chapter 14 · The NVL72 Rack
- In the NVL72 designhttps://www.cnbc.com/2026/02/25/first-look-at-nvidias-ai-system-vera-rubin-an…
- 260 terabytes per secondhttps://www.signalintegrityjournal.com/articles/4183-nvidia-kicks-off-the-nex…
- Assembly time per tray drops from ~2 hours to about 5 minutes.https://developer.nvidia.com/blog/nvidia-vera-rubin-pod-seven-chips-five-rack…
Chapter 15 · Burn-In and Reliability
- Burn-inhttps://resources.system-analysis.cadence.com/blog/msa2020-conduct-burn-in-te…
- HTOLhttps://www.kessystemsinc.com/resources/the-pivotal-role-of-burn-in-testing-i…
- second-generation RAS enginehttps://www.nvidia.com/en-us/data-center/technologies/rubin/
Chapter 16 · The AI Factory
- DGX SuperPOD with DGX Vera Rubin NVL72https://blogs.nvidia.com/blog/dgx-superpod-rubin/
- An AI factory's electrical servicehttps://blog.se.com/datacenter/2026/04/09/building-ai-factories-why-integrate…
- 10× compared to Blackwellhttps://nvidianews.nvidia.com/news/rubin-platform-ai-supercomputer
Chapter 17 · The Electron's Choice
- Pure silicon's behaviour is conditionalhttps://www.nobelprize.org/prizes/physics/1956/summary/
- 1.12 electron-voltshttps://www.pveducation.org/pvcdrom/pn-junctions/band-gap
- low as 1013 dopant atoms per cubic centimeterhttps://en.wikipedia.org/wiki/Doping_(semiconductor)
- Bell Labs in work culminating in February 1940https://www.computerhistory.org/siliconengine/silicon-pn-junction-is-discovered/
Chapter 18 · The Transistor as a Valve
- 336 billionhttps://nvidianews.nvidia.com/news/rubin-platform-ai-supercomputer
- 0.4–0.7 V in modern deviceshttps://nanohub.org/resources/5780/download/2009.01.20-ece606-l28.pdf
- 15 nanometershttps://semiwiki.com/semiconductor-services/the-international-roadmap-for-dev…
- Dennard scaling's slow deathhttps://en.wikipedia.org/wiki/Dennard_scaling
Chapter 19 · From Switch to Logic
- Claude Shannon's 1937 master's thesishttps://en.wikipedia.org/wiki/A_Symbolic_Analysis_of_Relay_and_Switching_Circ…
- CMOS logichttps://www.computerhistory.org/siliconengine/cmos-circuits-eclipse-bipolar-a…
- ten billion such additions per second per corehttps://www.intel.com/content/www/us/en/developer/articles/technical/intel-sd…
Chapter 20 · Adders, Latches, Memory
- carry-lookaheadhttps://en.wikipedia.org/wiki/Carry-lookahead_adder
- carry-selecthttps://en.wikipedia.org/wiki/Carry-select_adder
- multipliershttps://www.cs.cmu.edu/~410/doc/segments/book.pdf
- tens of megabytes of cachehttps://www.tomshardware.com/news/intel-core-i9-13900k-review
Chapter 21 · The Clock
- tens of millions in a typical CPUhttps://en.wikipedia.org/wiki/Clock_signal
- 10% of the chip's total powerhttps://patents.google.com/patent/US20060277509A1/en
- Pentium 4 Prescott (2004) ran a 31-stage pipelinehttps://en.wikipedia.org/wiki/Pentium_4
- branch predictorhttps://en.wikipedia.org/wiki/Branch_predictor
Chapter 22 · Fetch, Decode, Execute
- AMD Zen 5https://www.amd.com/en/products/cpu/amd-ryzen-9-7950x
- M4 in a new MacBookhttps://www.apple.com/newsroom/2024/10/new-macbook-pro-features-m4-family-of-…
- µopshttps://en.wikipedia.org/wiki/Micro-operation
- von Neumann architecturehttps://en.wikipedia.org/wiki/Von_Neumann_architecture
Chapter 23 · From Transistors to ISA
- Intel x86-64 manualhttps://www.intel.com/content/www/us/en/developer/articles/technical/intel-sd…
- Arm ARMv9 referencehttps://developer.arm.com/documentation/ddi0487/latest
- Hennessy and Patterson's Computer Architecture textbookhttps://www.cs.cmu.edu/~410/doc/hennessy-patterson.pdf
- Pentiumhttps://en.wikipedia.org/wiki/Pentium_(original)
- PTX (Parallel Thread Execution)https://docs.nvidia.com/cuda/parallel-thread-execution/
Chapter 24 · Memory's Pyramid
- Computer Architecture: A Quantitative Approachhttps://www.elsevier.com/books/computer-architecture/hennessy/978-0-12-811905-1
- Intel's optimization manualhttps://www.intel.com/content/www/us/en/docs/intrinsics-guide/index.html
- 64 byteshttps://en.wikipedia.org/wiki/CPU_cache#Cache_entries
- subfield of OS performance workhttps://en.wikipedia.org/wiki/Translation_lookaside_buffer
Chapter 25 · Boot
- 0xFFFFFFF0https://wiki.osdev.org/Real_mode
- UEFIhttps://uefi.org/specifications
- GRUBhttps://www.gnu.org/software/grub/
- The Linux x86 boot protocolhttps://www.kernel.org/doc/html/latest/x86/boot.html
- kernel boothttps://www.kernel.org/doc/html/latest/admin-guide/bootconfig.html
- systemdhttps://systemd.io/
Chapter 26 · The OS as Conductor
- Modern Operating Systems by Tanenbaumhttps://www.cs.vu.nl/~ast/books/mos3/
- The Linux mmap man pagehttps://man7.org/linux/man-pages/man7/mmap.7.html
- three hundred and fifty system callshttps://man7.org/linux/man-pages/man2/syscalls.2.html
- Completely Fair Schedulerhttps://www.kernel.org/doc/html/latest/scheduler/sched-design-CFS.html
Chapter 27 · The Translation Stack
- CPython's ast modulehttps://docs.python.org/3/library/ast.html
- its own ASThttps://clang.llvm.org/docs/IntroductionToTheClangAST.html
- LLVM IRhttps://llvm.org/docs/LangRef.html
- LLVM's developer meetingshttps://llvm.org/devmtg/
- ELFhttps://refspecs.linuxbase.org/elf/elf.pdf
- Intel's optimization guidehttps://www.intel.com/content/www/us/en/develop/documentation/cpp-compiler-de…
Chapter 28 · The GPU's Different Mind
- The CUDA C++ Programming Guidehttps://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html
- NVIDIA's Volta whitepaperhttps://images.nvidia.com/content/volta-architecture/pdf/volta-architecture-w…
- Hopperhttps://resources.nvidia.com/en-us-tensor-core/nvidia-tensor-core-gpu-datasheet
- Tritonhttps://github.com/openai/triton
- FlashAttentionhttps://github.com/Dao-AILab/flash-attention
- occupancy calculatorshttps://developer.nvidia.com/cuda-toolkit
Chapter 29 · A Neural Network Lives in Numbers
- a Llama modelhttps://huggingface.co/meta-llama
- a Mistral modelhttps://huggingface.co/mistralai
- Attention Is All You Needhttps://arxiv.org/abs/1706.03762
- FlashAttentionhttps://github.com/Dao-AILab/flash-attention
- mixed-precision training paperhttps://docs.nvidia.com/deeplearning/performance/mixed-precision-training/ind…
Chapter 30 · A Thought, Token by Token
- tiktokenhttps://github.com/openai/tiktoken
Chapter 31 · The Second Wire
- stock tickerhttps://www.britannica.com/technology/stock-ticker
- patenthttps://www.loc.gov/collections/alexander-graham-bell-papers/articles-and-ess…
- Internet Protocolhttps://www.rfc-editor.org/rfc/rfc791
- "Information Management: A Proposal"https://www.w3.org/History/1989/proposal.html
- REST APIshttps://en.wikipedia.org/wiki/REST
- Radarhttps://radar.cloudflare.com/
- Routing layers like OpenRouterhttps://openrouter.ai/
Chapter 32 · Tokens on the Wire
- tiktokenhttps://github.com/openai/tiktoken
- SentencePiecehttps://github.com/google/sentencepiece
- Anthropichttps://www.anthropic.com/pricing
- OpenAIhttps://openai.com/pricing
- OpenAI's embeddings guidehttps://platform.openai.com/docs/guides/embeddings
- Sentence-Transformershttps://www.sbert.net/
- OpenAI's function-calling APIhttps://platform.openai.com/docs/guides/function-calling
- Anthropic's tool-usehttps://docs.anthropic.com/en/docs/build-with-claude/tool-use
Chapter 33 · Latency Is Cognition
- Nielsen's classic response-time workhttps://www.nngroup.com/articles/response-times-3-important-limits/
- Speculative decodinghttps://arxiv.org/abs/2305.13245
- Leviathan et al. (2022)https://arxiv.org/abs/2211.17192
Chapter 34 · Agents
- Anthropic's Claude with computer usehttps://www.anthropic.com/news/claude-3-5-sonnet
- OpenAI's Operatorhttps://openai.com/index/operator/
- OpenHandshttps://github.com/All-Hands-AI/OpenHands
- SWE-bench Verifiedhttps://www.swebench.com/
- AgentBenchhttps://arxiv.org/abs/2308.03688
- Cursorhttps://www.cursor.com/
- GitHub Copilot Workspacehttps://github.com/features/copilot
- Aiderhttps://aider.chat/
- Intercom's Finhttps://www.intercom.com/fin
Chapter 35 · Swarm
- multi-agent systemshttps://en.wikipedia.org/wiki/Multi-agent_system
- AutoGenhttps://github.com/microsoft/autogen
- LangGraphhttps://github.com/langchain-ai/langgraph
- Multi-Agent Research Systemhttps://www.anthropic.com/news/research
- Du et al. (2023)https://arxiv.org/abs/2305.14325
- OpenRouterhttps://openrouter.ai/
- Agent2Agent (A2A) protocolhttps://google.github.io/A2A/
Chapter 36 · Protocols of Trust
- OpenAIhttps://platform.openai.com/docs/guides/function-calling
- Anthropichttps://docs.anthropic.com/en/docs/build-with-claude/tool-use
- Model Context Protocolhttps://modelcontextprotocol.io/
- ODBChttps://en.wikipedia.org/wiki/Open_Database_Connectivity
- Agent2Agent (A2A) protocolhttps://google.github.io/A2A/
- Agent Communication Languageshttps://arxiv.org/abs/2402.08164
- Decentralized identifiershttps://www.w3.org/TR/did-core/
Chapter 37 · The Memory Commons
- FAISShttps://github.com/facebookresearch/faiss
- Pineconehttps://www.pinecone.io/
- Weaviatehttps://weaviate.io/
- Qdranthttps://qdrant.tech/
- pgvectorhttps://www.pgvector.org/
- chunking strategyhttps://arxiv.org/abs/2312.10997
- Knowledge graphshttps://en.wikipedia.org/wiki/Knowledge_graph
- GraphRAGhttps://github.com/microsoft/graphrag
Chapter 38 · The Browser Becomes the Worker
- robotic process automationhttps://en.wikipedia.org/wiki/Robotic_process_automation
- computer usehttps://www.anthropic.com/news/claude-3-5-sonnet
- Operatorhttps://openai.com/index/operator/
- WebDriverhttps://www.w3.org/TR/webdriver2/
- Chrome DevTools Protocolhttps://chromedevtools.github.io/devtools-protocol/
- WebArenahttps://webarena.dev/
Chapter 39 · Markets of Models
- OpenRouterhttps://openrouter.ai/
- Togetherhttps://www.together.ai/
- Fireworkshttps://fireworks.ai/
- Replicatehttps://www.replicate.com/
- Groqhttps://groq.com/
- Anthropic's Haiku 4.5https://www.anthropic.com/news/claude-haiku-4-5
- OpenAI's smaller GPT-5 variantshttps://openai.com/pricing
Chapter 40 · The Compounding
- Metcalfe's lawhttps://spectrum.ieee.org/metcalfes-law-is-wrong
- Cursorhttps://www.cursor.com/
- Shumailov et al. (2023)https://arxiv.org/abs/2305.17493
Chapter 41 · Where Value Reroutes
- BLShttps://www.bls.gov/
- Indeed Hiring Labhttps://www.hiringlab.org/
- long-tail publishershttps://www.theatlantic.com/