Math Resources

Theoretical Computer Science
For cognitive context, see Computability and General Culture in Computer Science and AI.
Introductory
Petzold, Code
Brookshear & Brylow, Computer science: an overview
Dewdney, The New Turing Omnibus
Intermediate
Rosen, Discrete mathematics and its applications
Graham, Knuth & Patahsnik, Concrete mathematics
Sipser, Introduction to the theory of computation
Girard, Lafont & Taylor, Proofs and types
Garey & Johnson, Computers and intractability
Arora & Barak, Computational complexity
Goldreich, Computational complexity
Kleinberg & Tardos, Algorithm design
Motwani & Raghavan, Randomized algorithms
Miztenmacher & Upfal, Probabilty and computing
Aho, Hopcroft, Ullman, Data structures and algorithms
Aho, Hopcroft, Ullman, Design and analysis of computer algorithms
Pierce, Types and programming languages
Aho et al, Compilers
Advanced
Abelson & Sussman, Structure and interpretation of computer programs
Cormen et al, Introduction to algorithms
Knuth, The art of computer programming
Pierce (ed), Advanced topics in types and programming languages
Cook & Nguyen, Logical foundations of proof complexity
Krajicek, Bounded arithmetic, propositional logic and complexity theory
Networks
Tanenbaum & Weatherall, Computer networks
Concurrent and distributed
Bryant& O'Hallaron, Computer systems: a programmer's perspective
Ben Ari, Principles of concurrent and distributed programming
Herlihy et al, The Art of Multiprocessor Programming
Raynal, Concurrent Programming
Attya & Welch, Distributed Computing
van Steen & Tannenbaum, Distributed systems
Lynch, Distributed algorithms
Cachin et al, Introduction to Reliable and Secure Distributed Programming
High-performance computing
Eijkhout, The art of HPC
Fox et al, Parallel computing works
Hager & Wellein, Introduction to high performance computing for scientists and engineers
Sterling et al, High performance computing
Herlihi & Shavit, The art of multiprocessor programming
Jeffers & Reinders, High performance parallelism pearls (vols 1 &2)
Dongarra et al, The sourcebook of parallel computing
Operating systems
Silberschatz et al, Operating Systems Concepts
Tanenbaum & Bos, Modern operating systems
McHoes & Flynn, Understanding operating systems (vols 1 & 2)
Cryptography
Hoffstein et al, An introduction to mathematical cryptography
Katz & Lyndell, Introduction to modern cryptography
Goldreich, Foundations of cryptography (vols 1 & 2)
Stinson, Cryptography
Menezes et al, Handbook of applied cryptography
Computer architecture
Tanenbaum & Austin, Structured computer organization
Henessy & Patterson, Computer architecture
Henessy & Patterson, Computer organization and design
Artificial intelligence
Foundational
Russell & Norvig, Artificial Intelligence: A Modern Approach
Multi-agent systems
Woolridge, An Introduction to MultiAgent Systems
Weiss, Multiagent systems
Poole & Macworth, Artificial Intelligence: Foundations of Computational Agents
Shoham & Leyton-Brown, Multiagent systems
Woolridge, Reasoning about rational agents
Nisan et al, Algorithmic game theory
Cai & Lin, Directed Cooperation of Multi-Agent Systems
Pareschi & Toscani, Interacting multiagent systems
Xiang, Probabilistic reasoning in multiagent systems
Piccoli, Control of multi-agent systems
Wang et al, Mathematics of multiagent learning systems
Krishnan, AI Agents: Evolution, Architecture, and Real-World Applications
Applied
Wolfram, What is ChatGPT doing ... and why does it work ?
Amidi & Amidi, Super study guide: transformers and large language models
Ksmath et al, Large language models: a deep dive
Raschka, Build a large language model
Alto, Building LLM powered applications
Phoenix & Taylor, Prompt engineering for generative AI
Berryman & Ziegler, Prompt engineering for LLMs
Huyen, Designing machine learning systems
Huyen, AI engineering

Machine learning & artificial neural networks
For cognitive context, see Computer Science and General culture in computer science & AI.
Also see the Student project on Statistical Physics applied to SLT and BL.
Machine learning
History
Introductory
Deisenroth, Faisal, Ong, Mathematics for machine learning
Strang, Linear algebra and learning from data
MacKay, Information theory, inference and learning algorithms
James et al, An introduction to statistical learning
Kearns and Vazirani, An introduction to computational learning theory
Devroye et al, A probabilistic theory of pattern recognition
Intermediate
Mitchell, Machine learning
Wilmott, Machine learning
Shalev-Shwartz & Ben-David, Understanding machine learning
Zhang, Mathematical analysis of machine learning algorithms
Mohri, Rostamizadeh & Ameet Talwalkar, Foundations of machine learning
Murphy, Machine learning: a probabilistic perspective
Bishop, Pattern recognition and machine learning
Hastie, Tibschirani & Friedman, The elements of statistical learning
Vapnik, The nature of statistical learning theory
Kaplan, Notes on contemporary machine learning for physicists
Advanced
Mumford & Desolneux, Pattern theory
Grohs & Kutyniok (eds), Mathematical aspects of deep learning
Huang, Statistical mechanics of neural networks
Singular Learning Theory
Watanabe, Mathematical Theory of Bayesian Statistics for Unknown Information Source
Watanabe, Mathematical theory of Bayesian statistics
Watanabe, Algebraic geometry and statistical learning theory
- For more references, see the student project on Singular learning theory and statistical inference.
Artificial neural networks
History
Domingo, The master algorithm
Philipp Schmitt's, Blueprints for intelligence: a visual history of ANNs
Anderson & Rosenfeld, Talking nets: an oral history of neural networks
Wang & Raj, On the origin of deep learning
Schmidhuber, Deep learning in neural networks: an overview
Juergen Schmidhuber's homepage
Introductory
Classical
Rojas, Neural networks: a systematic introduction
Minski & Papert, Perceptrons: an introduction to computational geometry
Anthony & Bartlett, Neural network learning: theoretical foundations
Murphy, Probabilistic machine learning: an introduction
Deep learning
Prince, Understanding deep learning
Calin, Deep learning architectures: a mathematical approach
Bishop & Bishop, Deep learning: Foundations and concepts
Goodfellow, Bengio & Courville, Deep learning
Roberts, Yaida & Hanin, The principles of deep learning theory (free draft)
Intermediate
Haykin, Neural networks and learning machines
Aggarwal, Neural networks and deep learning
Advanced
Bronstein, Bruna, Cohen & Velickovic, Geometric deep learning
Attention mechanism
IBM think | wiki | GfG | medium | adaline
Niu et al, A review of the attention mechanism of deep learning
Soydaner, Attention mechanism in neural networks: where it comes and where it goes
Guo, Attention mechanisms in computer vision: a survey
Sun et al, Efficient attention mechanisms for large language models: a survey
Hernandez & Amigo, Attention mechanisms and their applications to complex systems
Ruan & Zhang, Towards understanding how attention mechanism works in deep learning
...
Transformers
Vaswami et al, Attention is all you need
Phuong and Hutter, Formal algorithms for transformers
Lin et al, A survey of transformers
Fnu et al, Understanding the architecture of vision transformer and its variants: A review
Gumaan, Universal Approximation Theorem for a Single-Layer Transformer
Omidi et al, Memory-augmented transformers
GPTs:
...
MoE
Zhang et al, Mixture of Experts in Large Language Models
LLMs & RLLMs
Zhang et al, Survey of Large Language Models in Extended Reality
Liu et al, A Comprehensive Evaluation on Quantization Techniques for Large Language Models
Zhang et al, A Survey of Reinforcement Learning for Large Reasoning Models
LLM alignment, explainability & interp
Pan et al, A Survey on Training-free Alignment of Large Language Models
Palikhe et al, Towards Transparent AI: A Survey on Explainable Language Models
...
Multimodal LLMs
Jaegle et al, Perceiver: general perception with iterated attention
Jaegle et al, PerceiverIO: A general architecture for structured inputs and outputs
Song et al, How to bridge the gap between modalities
Bae et al, Graph perceiver IO: a general architecture for graph-structured data
Carolan et al, A review of multimodal large language and vision models
Kibria et al: Decoding the multimodal maze
Chen et al: A survey of multimodal hallucination evaluation and deception
Xu et al, MARS2025 challenge
...
Embodied
Sanghai & Brown, Advances in transformers for robotics applications: a review
Shao et al, Large VLM-based Vision-Language-Action Models for Robotic Manipulation: A Survey
Applications to science
Hu et al, A survey of scientific large language models
Wei et al, From AI for Science to Agentic Science: A Survey on Autonomous Scientific Discovery
Ma et al, A Survey of Deep Learning for Geometry Problem Solving
...

General culture in computer science & AI
For structured references see here.
I. Basic notions on computer architecture and function
a. Instruction set architecture
The von Neumann and Harvard models
a. Low level programming language
-Assembly language; assembler; disassembler
b. High level programming language
II. Networking
LAN: Ethernet (over fiber or twisted pair) | WIFI
IoT: Thread and Zigbee protocols | The Matter standard (wiki) from CSA (wiki)
III. Free and Open Source Software
People and movement
The GNU Manifesto | The Cathedral and the Bazaar
IoT
Air Gradient | Home Assistant (wiki) and Nabu Casa
IV. Programming
1. The Church-Turing thesis
2. Classical models of deterministic computation
c. General recursive functions
3. The Curry-Howard correspondence
a. Declarative: logical, functional, AML, Query, Regexp
b. Imperative: procedural, object-oriented
6. System programming; system programming language
9. API
a. Evolutionary programming
b. Genetic programming
12. History
V. Concurrent and distributed computing
1. Concurrency
b. Process
c. Thread
2. Real time operating systems (RTOS)
-HBM
-NPU
d. IoT
4. High performance computing (supercomputers & clusters)
a. History | FLOPS | Orders | CSNET | NSFNET
-Europe (Euro HPC | PRACE) | Japan | China
- Frontier | Aurora | El Capitan | Jupiter (wiki, JS @FZJ) | Fugaku
c. Resources: NESRC | ACM's SIGHPC | HPCwire | InsideHPC | Top500
d. Companies: HPE | IBM | NVIDIA | Intel | AMD | Dell | Atos (Eviden) | Fujitsu
e. Frontier: Zettascale | Neuromorphic | Superconducting | Beyond CMOS | Unconventional | Hypercomputation
VI. Influential people
1. Alan Turing
7. Bill Joy
VII. Resources
Berkeley | MIT | Carnegie Mellon | Stanford | Cambridge | Oxford | Melbourne
B. Artificial intelligence
For structured introductions to AI see:
Russell & Norvig: Artificial intelligence: a modern approach
I. Basic notions and techniques
1. The Dartmouth workshop of 1956
2. The cognitive revolution
5. Natural language processing
8. The common sense knowledge problem
9. AI winter
10. Distributed AI
c. Swarms
e. Artificial society | Social simulation | Computational sociology
f. Intelligent agents and agentic AI
-Communication, orchestration & protocols
g. Agentic Web (a review)
-Protocols: MCP | A2A | ACP (IBM think) | ANP | Agora
-Agentic browsers: Open AI Operator (wiki) | Fellou | Perplexity Comet | Opera Neon | Chrome add-on
11. Neural networks
a. Neuronal models & computational neuroscience:
-Connectionism | Blue Brain (wiki) | Brain/MINDS (wiki) | MICrONS (wiki) | China Brain Project
-Biological neuron models | Neural backpropagation
-Neural learning & computation: Hebbian | BCM | Generalized Hebbian (Oja's rule) | Synaptic plasticity (STDP, SNNs)
b. Artificial neurons | perceptrons
-Feedforward (MLP/FCN | Autoencoders | TDNN | CNN & CapsNet | PNN | RBFN | ITNN)
-SVM
-GMDH-based (MNN, Polynomial, Abductive)
-Transformers | Attention is all you need (paper)
12. Machine learning
a. Paradigms (supervised | unsupervised | reinforcement | self-supervised)
b. Techniques: transfer | feature (geometric) | hyperparameter optimization | MTL | ZSL | OSL | EBL | auto | meta | NAS | neuroevolution
c. Backpropagation | Attention
13. Deep Learning
-Foundation models | Stanford CRFM
-Test time computation: intro for RLMs
14. Generative AI
-SLMs
-LLMs
-RLMs
-GANs
16. Narrow AI
17. The AI boom
-ML companies & labs: DeepMind (Gemini) | Open AI (ChatGPT) | Anthropic (Claude) | MetaAI (Llama) | IBM (WatsonX, Granite) | HuggingFace | Mistral | DeepSeek (homepage) | Alibaba Cloud (Qwen) | Microsoft (365 Copilot & AI2)
-Hyperscale computing | Hyperscalers
-Wafer-scale integration | Cerebras (wiki)
18. History
-Online briefs: GfG | SCI | AI100 | IBM
-Nilsson, The quest for artificial intelligence
-Mitchell, Artificial intelligence: a guide for thinking humans
-Fradkov, Early history of machine learning
II. Ethics and safety
2. Mechanistic interpretability
3. Alignment
-General:
Christian, The alignment problem: machine learning and human values
-Groups:
ARC (Berkeley) | Ada Lovelace Institute (UK) | The Future Society (US & EU)
5. Deception
6. Machine ethics and friendly AI
7. Safety
The UK AI safety summit (UK, 2023)| The Bletchley Declaration
The AI Seoul Summit (South Korea, 2024) | The Seoul Declaration
The AI Action Summit (France, 2025) | The Paris Statement
The First International AI Safety Report
The AI Impact Summit (India, 2026)
AISI centers
US: CAISI and AISIC (National, @NIST) | CHAN (UC Berkeley)| CAIS (San Francisco) | CAISER (Oak Ridge) | AI Action Plan
EU: European AI Office| AI Board
Some national AISIs: UK | Japan | South Korea | Singapore
Other:
US: IAPS | CSET | CHAI | Redwood | Palisade | FMF | Epoch | ARC & METR | ARI | CivAI | FAR | Apart | LawAI | AIFutures
UK: CLTR | GovAI | Apollo | CLTR | Forethought | MATS | ARENA
Global: TFS | SAIF | Constellation | 80000hours
Some papers:
TASRA: a Taxonomy and Analysis of Societal-Scale Risks from AI (2023)
Two Types of AI Existential Risk: Decisive and Accumulative (2024)
Mechanistic Interpretability for AI Safety -- A Review (2024)
Safety Cases: How to Justify the Safety of Advanced AI Systems (2024)
Gradual disempowerment: Systemic Existential Risks from Incremental AI Development (2025)
AI-Enabled Coups: How a Small Group Could Use AI to Seize Power (2025)
III. Governance and policy
Regulation of AI | Global governance [GAIGO]
EU: The EU AI Act (law, wiki) ] | AI Innovation Package [AI Factories, AI Gigafactories , Gen4AIEU, NoE]| The Continent Action Plan | European approach to AI | European AI Alliance | AI Forum
US: America's AI Action Plan | The Stargate project (wiki)
China: Global AI Governance Initiative | Global AI Governance Action Plan
UN: Independent international scientific panel on AI
IV. Frontier, speculative and philosophical
General interest books:
Kissinger, Schmidt & Huttenlocher, The age of AI and our human future
Suleyman & Bhaksar, The coming wave
Russell, Human compatible
Ord: The precipice | revisted
Togelius, Artificial general intelligence
Bennett, A brief history of intelligence
Kurzweil, The singularity is nearer
1. Artificial General Intelligence (AGI)
a. The Turing test
d. Philosophy of AI and Philosophy of mind
-The Chinese room argument and hard problem of consciousness
-Unified Theories of Cognition and The Conscious Mind
-The binding problem
-Self-awareness and NCC
-The CRUM hypothesis
-Statistical learning and SLA
3. Cognitive models and architectures
- SoM, MDM, GWT, MPF and DCM, DC
-Conceptual spaces, cognitive maps, image schemas
-ACT-R, Soar, 4CAPS, DUAL, CLARION, LIDA, SDM
-A review
-Computational ToM (also here)
6. Recursive self-improvement and Technological singularity
Some essays:
Situational Awareness | AI Futures
V. Socio-political
2. National Security
-Hendricks, Schmidt & Wang's Superintelligence Strategy (expert version)
-5GW
4. Social
-Predictive analytics and policing
-Lawbots, computational law and legal informatics
-Automated journalism | Virtual assistants | Machine translation | AVs
-AI art: visual | music | film | architecture | games
-Digital twins | digital cloning | deepfakes | companions
-UBI
5. Ontological security
-Ontological security theory in International relations
-MIRI (wiki)/Less Wrong
7. Ideologies and movements
CEA | EV | Open Philantropy
c. TESCREAL
d. Luddism and neo-Luddism
-The precautionary principle and progress
-Movements: Post-development | Anarcho-primitivism | Degrowth | Collapsology | Survivalism
-Essays: Industrial Society and Its Future | Why the Future Doesn't Need Us
VI. Influential people
1. Alan Turing
5. Yann Le Cun
8. Ray Kurzweil
9. Eric Schmidt
VII. Resources
IEE Spectrum | IBM Think | ASCl's State of AI
CSAIL (MIT) | Alan Turing Institute (UK) | HAI & SAIL (Stanford)

Outline | Techniques | Technology forecasts | Foresight | Emerging technologies
Biases: optimism | scope | discounting | availability | conjunction | overconfidence
Global risks and future problems (current issues)
-Limits: LTG & 30y update (recent) | Degrowth movement | Green growth
-Catastrophic (scenarios): BAS, FTL, FLI
-Existential: CSER, SERI, ERO, MIRI
Some resources
Organizations: WSF | IFTF | Millenium Project
Think tanks: CHT | Roots of Progress | New Atlantis | Foresight
Journals: TFSC | Futures | JFS

Persistent algebraic topology and topological data analysis
Introductory
Carlsson, Persistent Homology and Applied Homotopy Theory, arXiv:2004.00738 [math.AT]
Huber, Persistent homology in data science
Edelsbrunner and Harrer, Persistent homology – a survey
Zomorodian and Carlsson, Computing persistent homology
Comprehensive
Oudot, Persistence Theory: From Quiver Representations to Data Analysis
Edelsbrunner and Harrer, Computational topology: an introduction
Persistent homotopy
Jardine, Persistent homotopy theory
Some applications to physics
M. Biagetti, A. Cole, G. Shiu, The Persistence of Large Scale Structures I: Primordial non-Gaussianity, JCAP 04 (2021) 061
A. Cole, M. Biagetti, G. Shiu, Topological Echoes of Primordial Physics in the Universe at Large Scales, arXiv:2012.03616 [astro-ph.CO]
J. H. T. Yip et al, Cosmology with Persistent Homology: a Fisher Forecast, JCAP 09 (2024) 034.