From Sand to Superintelligence · Reference

Reference

Bibliography

152 sources cited across the book — papers, manuals, archives, whitepapers — organized by chapter and unique URL.

Chapter 1 · The Mineral

vast quantities of silicahttps://investornews.com/critical-minerals-rare-earths/cmi-masterclass-the-re…

Chapter 2 · Fire and Carbon

Producing one ton of metallurgical-grade siliconhttps://www.energycentral.com/energy-biz/post/mining-and-refining-pure-silico…

Chapter 3 · The Nine-Nines Problem

Over the course of several dayshttps://fpt-semiconductor.com/blogs/a-guidance-to-silicon-wafer-manufacturing…

Chapter 4 · Growing a Perfect Crystal

A century laterhttps://www.waferworld.com/post/silicon-wafer-manufacturing-from-sand-to-silicon

Chapter 5 · From Log to Mirror

Wafer slicing is a high-volume arthttps://www.sumcosi.com/english/products/process/step_02.html

Chapter 6 · Designing the Impossible

nanosheet transistorshttps://www.tomshardware.com/tech-industry/semiconductors/tsmc-begins-quietly…

Chapter 8 · Light at 13.5 Nanometers

Decades of lithographyhttps://www.uprtek.com/en/blogs/photolithography
The whole process is repeated fifty thousand times per secondhttps://eureka.patsnap.com/report-what-challenges-do-euv-lithography-processe…
A modern EUV scannerhttps://www.asml.com/news/stories/2021/semiconductor-manufacturing-process-steps
high-numerical-aperture EUVhttps://www.eenewseurope.com/en/tsmc-shuns-high-na-euv-lithography/

Chapter 12 · CoWoS and the 2.5D Revolution

By HBM4, the interface has doubled to 2,048 bits.https://introl.com/blog/nvidia-vera-rubin-platform-8-exaflops-infrastructure
CoWoShttps://anysilicon.com/cowos-package/
Roadmaps are paced by it.https://newsletter.semianalysis.com/p/vera-rubin-extreme-co-design-an-evolution

Chapter 13 · The Vera Rubin Superchip

The whole module contains roughly seventeen thousand individual componentshttps://nvidianews.nvidia.com/news/rubin-platform-ai-supercomputer
Olympus Arm v9 architecturehttps://developer.nvidia.com/blog/inside-the-nvidia-rubin-platform-six-new-ch…

Chapter 14 · The NVL72 Rack

In the NVL72 designhttps://www.cnbc.com/2026/02/25/first-look-at-nvidias-ai-system-vera-rubin-an…
260 terabytes per secondhttps://www.signalintegrityjournal.com/articles/4183-nvidia-kicks-off-the-nex…
Assembly time per tray drops from ~2 hours to about 5 minutes.https://developer.nvidia.com/blog/nvidia-vera-rubin-pod-seven-chips-five-rack…

Chapter 15 · Burn-In and Reliability

Burn-inhttps://resources.system-analysis.cadence.com/blog/msa2020-conduct-burn-in-te…
HTOLhttps://www.kessystemsinc.com/resources/the-pivotal-role-of-burn-in-testing-i…
second-generation RAS enginehttps://www.nvidia.com/en-us/data-center/technologies/rubin/

Chapter 16 · The AI Factory

DGX SuperPOD with DGX Vera Rubin NVL72https://blogs.nvidia.com/blog/dgx-superpod-rubin/
An AI factory's electrical servicehttps://blog.se.com/datacenter/2026/04/09/building-ai-factories-why-integrate…
10× compared to Blackwellhttps://nvidianews.nvidia.com/news/rubin-platform-ai-supercomputer

Chapter 17 · The Electron's Choice

Pure silicon's behaviour is conditionalhttps://www.nobelprize.org/prizes/physics/1956/summary/
1.12 electron-voltshttps://www.pveducation.org/pvcdrom/pn-junctions/band-gap
low as 1013 dopant atoms per cubic centimeterhttps://en.wikipedia.org/wiki/Doping_(semiconductor)
Bell Labs in work culminating in February 1940https://www.computerhistory.org/siliconengine/silicon-pn-junction-is-discovered/

Chapter 18 · The Transistor as a Valve

336 billionhttps://nvidianews.nvidia.com/news/rubin-platform-ai-supercomputer
0.4–0.7 V in modern deviceshttps://nanohub.org/resources/5780/download/2009.01.20-ece606-l28.pdf
15 nanometershttps://semiwiki.com/semiconductor-services/the-international-roadmap-for-dev…
Dennard scaling's slow deathhttps://en.wikipedia.org/wiki/Dennard_scaling

Chapter 19 · From Switch to Logic

Claude Shannon's 1937 master's thesishttps://en.wikipedia.org/wiki/A_Symbolic_Analysis_of_Relay_and_Switching_Circ…
CMOS logichttps://www.computerhistory.org/siliconengine/cmos-circuits-eclipse-bipolar-a…
ten billion such additions per second per corehttps://www.intel.com/content/www/us/en/developer/articles/technical/intel-sd…

Chapter 20 · Adders, Latches, Memory

carry-lookaheadhttps://en.wikipedia.org/wiki/Carry-lookahead_adder
carry-selecthttps://en.wikipedia.org/wiki/Carry-select_adder
multipliershttps://www.cs.cmu.edu/~410/doc/segments/book.pdf
tens of megabytes of cachehttps://www.tomshardware.com/news/intel-core-i9-13900k-review

Chapter 21 · The Clock

tens of millions in a typical CPUhttps://en.wikipedia.org/wiki/Clock_signal
10% of the chip's total powerhttps://patents.google.com/patent/US20060277509A1/en
Pentium 4 Prescott (2004) ran a 31-stage pipelinehttps://en.wikipedia.org/wiki/Pentium_4
branch predictorhttps://en.wikipedia.org/wiki/Branch_predictor

Chapter 22 · Fetch, Decode, Execute

AMD Zen 5https://www.amd.com/en/products/cpu/amd-ryzen-9-7950x
M4 in a new MacBookhttps://www.apple.com/newsroom/2024/10/new-macbook-pro-features-m4-family-of-…
µopshttps://en.wikipedia.org/wiki/Micro-operation
von Neumann architecturehttps://en.wikipedia.org/wiki/Von_Neumann_architecture

Chapter 23 · From Transistors to ISA

Intel x86-64 manualhttps://www.intel.com/content/www/us/en/developer/articles/technical/intel-sd…
Arm ARMv9 referencehttps://developer.arm.com/documentation/ddi0487/latest
Hennessy and Patterson's Computer Architecture textbookhttps://www.cs.cmu.edu/~410/doc/hennessy-patterson.pdf
Pentiumhttps://en.wikipedia.org/wiki/Pentium_(original)
PTX (Parallel Thread Execution)https://docs.nvidia.com/cuda/parallel-thread-execution/

Chapter 24 · Memory's Pyramid

Computer Architecture: A Quantitative Approachhttps://www.elsevier.com/books/computer-architecture/hennessy/978-0-12-811905-1
Intel's optimization manualhttps://www.intel.com/content/www/us/en/docs/intrinsics-guide/index.html
64 byteshttps://en.wikipedia.org/wiki/CPU_cache#Cache_entries
subfield of OS performance workhttps://en.wikipedia.org/wiki/Translation_lookaside_buffer

Chapter 25 · Boot

0xFFFFFFF0https://wiki.osdev.org/Real_mode
UEFIhttps://uefi.org/specifications
GRUBhttps://www.gnu.org/software/grub/
The Linux x86 boot protocolhttps://www.kernel.org/doc/html/latest/x86/boot.html
kernel boothttps://www.kernel.org/doc/html/latest/admin-guide/bootconfig.html
systemdhttps://systemd.io/

Chapter 26 · The OS as Conductor

Modern Operating Systems by Tanenbaumhttps://www.cs.vu.nl/~ast/books/mos3/
The Linux mmap man pagehttps://man7.org/linux/man-pages/man7/mmap.7.html
three hundred and fifty system callshttps://man7.org/linux/man-pages/man2/syscalls.2.html
Completely Fair Schedulerhttps://www.kernel.org/doc/html/latest/scheduler/sched-design-CFS.html

Chapter 27 · The Translation Stack

CPython's ast modulehttps://docs.python.org/3/library/ast.html
its own ASThttps://clang.llvm.org/docs/IntroductionToTheClangAST.html
LLVM IRhttps://llvm.org/docs/LangRef.html
LLVM's developer meetingshttps://llvm.org/devmtg/
ELFhttps://refspecs.linuxbase.org/elf/elf.pdf
Intel's optimization guidehttps://www.intel.com/content/www/us/en/develop/documentation/cpp-compiler-de…

Chapter 28 · The GPU's Different Mind

The CUDA C++ Programming Guidehttps://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html
NVIDIA's Volta whitepaperhttps://images.nvidia.com/content/volta-architecture/pdf/volta-architecture-w…
Hopperhttps://resources.nvidia.com/en-us-tensor-core/nvidia-tensor-core-gpu-datasheet
Tritonhttps://github.com/openai/triton
FlashAttentionhttps://github.com/Dao-AILab/flash-attention
occupancy calculatorshttps://developer.nvidia.com/cuda-toolkit

Chapter 29 · A Neural Network Lives in Numbers

a Llama modelhttps://huggingface.co/meta-llama
a Mistral modelhttps://huggingface.co/mistralai
Attention Is All You Needhttps://arxiv.org/abs/1706.03762
FlashAttentionhttps://github.com/Dao-AILab/flash-attention
mixed-precision training paperhttps://docs.nvidia.com/deeplearning/performance/mixed-precision-training/ind…

Chapter 30 · A Thought, Token by Token

tiktokenhttps://github.com/openai/tiktoken

Chapter 31 · The Second Wire

stock tickerhttps://www.britannica.com/technology/stock-ticker
patenthttps://www.loc.gov/collections/alexander-graham-bell-papers/articles-and-ess…
Internet Protocolhttps://www.rfc-editor.org/rfc/rfc791
"Information Management: A Proposal"https://www.w3.org/History/1989/proposal.html
REST APIshttps://en.wikipedia.org/wiki/REST
Radarhttps://radar.cloudflare.com/
Routing layers like OpenRouterhttps://openrouter.ai/

Chapter 32 · Tokens on the Wire

tiktokenhttps://github.com/openai/tiktoken
SentencePiecehttps://github.com/google/sentencepiece
Anthropichttps://www.anthropic.com/pricing
OpenAIhttps://openai.com/pricing
OpenAI's embeddings guidehttps://platform.openai.com/docs/guides/embeddings
Sentence-Transformershttps://www.sbert.net/
OpenAI's function-calling APIhttps://platform.openai.com/docs/guides/function-calling
Anthropic's tool-usehttps://docs.anthropic.com/en/docs/build-with-claude/tool-use

Chapter 33 · Latency Is Cognition

Nielsen's classic response-time workhttps://www.nngroup.com/articles/response-times-3-important-limits/
Speculative decodinghttps://arxiv.org/abs/2305.13245
Leviathan et al. (2022)https://arxiv.org/abs/2211.17192

Chapter 34 · Agents

Anthropic's Claude with computer usehttps://www.anthropic.com/news/claude-3-5-sonnet
OpenAI's Operatorhttps://openai.com/index/operator/
OpenHandshttps://github.com/All-Hands-AI/OpenHands
SWE-bench Verifiedhttps://www.swebench.com/
AgentBenchhttps://arxiv.org/abs/2308.03688
Cursorhttps://www.cursor.com/
GitHub Copilot Workspacehttps://github.com/features/copilot
Aiderhttps://aider.chat/
Intercom's Finhttps://www.intercom.com/fin

Chapter 35 · Swarm

multi-agent systemshttps://en.wikipedia.org/wiki/Multi-agent_system
AutoGenhttps://github.com/microsoft/autogen
LangGraphhttps://github.com/langchain-ai/langgraph
Multi-Agent Research Systemhttps://www.anthropic.com/news/research
Du et al. (2023)https://arxiv.org/abs/2305.14325
OpenRouterhttps://openrouter.ai/
Agent2Agent (A2A) protocolhttps://google.github.io/A2A/

Chapter 36 · Protocols of Trust

OpenAIhttps://platform.openai.com/docs/guides/function-calling
Anthropichttps://docs.anthropic.com/en/docs/build-with-claude/tool-use
Model Context Protocolhttps://modelcontextprotocol.io/
ODBChttps://en.wikipedia.org/wiki/Open_Database_Connectivity
Agent2Agent (A2A) protocolhttps://google.github.io/A2A/
Agent Communication Languageshttps://arxiv.org/abs/2402.08164
Decentralized identifiershttps://www.w3.org/TR/did-core/

Chapter 37 · The Memory Commons

FAISShttps://github.com/facebookresearch/faiss
Pineconehttps://www.pinecone.io/
Weaviatehttps://weaviate.io/
Qdranthttps://qdrant.tech/
pgvectorhttps://www.pgvector.org/
chunking strategyhttps://arxiv.org/abs/2312.10997
Knowledge graphshttps://en.wikipedia.org/wiki/Knowledge_graph
GraphRAGhttps://github.com/microsoft/graphrag

Chapter 38 · The Browser Becomes the Worker

robotic process automationhttps://en.wikipedia.org/wiki/Robotic_process_automation
computer usehttps://www.anthropic.com/news/claude-3-5-sonnet
Operatorhttps://openai.com/index/operator/
WebDriverhttps://www.w3.org/TR/webdriver2/
Chrome DevTools Protocolhttps://chromedevtools.github.io/devtools-protocol/
WebArenahttps://webarena.dev/

Chapter 39 · Markets of Models

OpenRouterhttps://openrouter.ai/
Togetherhttps://www.together.ai/
Fireworkshttps://fireworks.ai/
Replicatehttps://www.replicate.com/
Groqhttps://groq.com/
Anthropic's Haiku 4.5https://www.anthropic.com/news/claude-haiku-4-5
OpenAI's smaller GPT-5 variantshttps://openai.com/pricing

Chapter 40 · The Compounding

Metcalfe's lawhttps://spectrum.ieee.org/metcalfes-law-is-wrong
Cursorhttps://www.cursor.com/
Shumailov et al. (2023)https://arxiv.org/abs/2305.17493

Chapter 41 · Where Value Reroutes

BLShttps://www.bls.gov/
Indeed Hiring Labhttps://www.hiringlab.org/
long-tail publishershttps://www.theatlantic.com/