Old notes repo
AI
Foundations
Deep learning foundations
Reinforcement learning foundations
Machine learning foundations
Misc ML foundations
Optimization
Probabilistic generative modeling foundations
AI research
Transformer research
Large language models (LLMs)
More LLMs
Multimodal vision-language models
Image generation
Diffusion (DDPM)
Image processing
Video generation
Audio generation
Speech generation
Speech processing
General deep learning research
Agent reasoning
Deep/formal reasoning
Reinforcement learning research
Building models (applied ML advice)
ML systems research and engineering
LLM scaling
LLM scaling algorithms (systems)
LLM scaling cases / data
Inference performance optimization
GPU computing
Megatron-DeepSpeed
gpt-neox
gpt-neox MoE design doc
MegaBlocks
Applied LLMs
More specific codebases
llama2.c
Sweep codebase
AI research questions
Small models / scaling down LLMs
Datasets
LLM evals
AI organizations
Non-LLM deep learning and software engineering
Misc data or statistical algorithms
LLM applications
LLMs for coding
Codex
Sourcegraph Cody
axolotl
LLM code snippets
Dev scratch journal
Applied Stable Diffusion (classic)
Computer science and engineering
Algorithms and data structures
Systems
Database systems
Operating systems
Dataflow systems
Computer architecture
Programming languages
Compilers
Information retrieval
Sandboxing
Server sandboxing:
https://yz.mit.edu/posts/server-sandboxing/
Browser sandboxing
Distributed systems
CRDTs
Operational transform
Editing DAGs concurrently
Concepts
Cache maintenance:
https://yz.mit.edu/posts/cache-maintenance/
Commutativity
Vectorization
Concept relationships and bottom-up derivations
Math
Math notes
Statistics
Probability problems
Linear algebra
Theorem provers
Control systems and theory
Algorithmic information theory
Signal processing
Development
General development
SQL alternatives
Node.js
Python
Python package management
PyTorch
Pandas
Web development
MIME, multipart, binary encodings