
MAX Learning 🔥

Learn the MAX Graph API through progressive, hands-on examples

Why This Repository?

MAX can deliver impressive speedups for ML inference. The official documentation provides excellent API references and tutorials. This repository complements that documentation by offering:

Progressive learning path - Six examples building from basics (relu(x * 2 + 1)) to production transformers
Learn by reading code - Minimal versions show pure MAX Graph API without abstractions
Working implementations - Tested, runnable code you can study and modify immediately
Real performance data - Benchmarks with correctness validation (e.g., 5.58x speedup on DistilBERT)
Production insights - What works, what doesn't, and why (including GPU findings)
Testing patterns - 49 pytest tests showing how to validate MAX implementations

Who is this for?
Developers who learn best by studying and running progressively complex examples, complementing the official tutorials.

What you'll learn:
How to build computational graphs with the MAX Python API - from simple element-wise operations through to production transformers.


About this repository: I'm learning MAX myself and documenting that journey through working examples. This isn't an authoritative guide - it's one developer's exploration of MAX, shared in the hope others find it useful. Corrections and improvements welcome!

Status

Version: 0.3.0
Stage: Public Release (January 2026)
Focus: Python MAX Graph API examples

Note on Mojo: This repository focuses on the Python MAX Graph API, which is the current production path for building graphs. A Mojo MAX Graph API existed previously but was deprecated in May 2025. See examples/mojo/01_elementwise for details on the current state and architecture.

Learning Path: Examples in Order

Each example includes both a minimal version (no abstractions, pure MAX Graph API) and a full version (with configuration and helpers).

1️⃣ Element-wise Operations

Path: examples/python/01_elementwise/
Operation: y = relu(x * 2.0 + 1.0)
Learn: Basic graph construction, operations (mul, add, relu)
Status: ✅ Works on CPU and Apple Silicon GPU
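
The minimal version is only a few lines. The sketch below shows the general shape of the build-then-execute flow; exact signatures and constant handling vary between MAX releases, so treat elementwise_minimal.py as the working reference.

# Sketch only - API details differ slightly across MAX releases.
import numpy as np
from max import engine
from max.dtype import DType
from max.graph import Graph, TensorType, ops

with Graph("elementwise", input_types=[TensorType(DType.float32, shape=(4,))]) as graph:
    x = graph.inputs[0]
    graph.output(ops.relu(x * 2.0 + 1.0))   # y = relu(x * 2 + 1)

session = engine.InferenceSession()
model = session.load(graph)
print(model.execute(np.array([-1.5, -0.5, 0.5, 1.5], dtype=np.float32)))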

2️⃣ Linear Layer

Path: examples/python/02_linear_layer/
Operation: y = relu(x @ W^T + b)
Learn: Matrix operations (matmul, transpose), parameter handling
Status: ✅ Works on CPU, ❌ Apple Silicon GPU (matmul kernel missing)
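
Before wiring this up in MAX, a plain NumPy reference of what the graph should compute makes a useful sanity check (the shapes below are illustrative, not the example's actual sizes):

import numpy as np

def linear_relu_reference(x, w, b):
    # NumPy reference for y = relu(x @ W^T + b)
    return np.maximum(x @ w.T + b, 0.0)

x = np.random.rand(2, 4).astype(np.float32)   # batch of 2, 4 input features
w = np.random.rand(3, 4).astype(np.float32)   # 3 output units
b = np.zeros(3, dtype=np.float32)
print(linear_relu_reference(x, w, b).shape)   # (2, 3)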

3️⃣ DistilBERT Sentiment

Path: examples/python/03_distilbert_sentiment/
Model: Full transformer (6 layers, 66M parameters)
Learn: Production model loading, tokenisation, multi-layer architecture
Performance: 5.58x speedup vs PyTorch on M1 CPU
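
Tokenisation is handled with the Hugging Face Transformers library (listed under Key Dependencies). A sketch along these lines produces the input_ids and attention_mask the graph consumes; the checkpoint name here is an assumption (the standard SST-2 fine-tuned DistilBERT), so check the example for the one actually used:

from transformers import AutoTokenizer

# Checkpoint name assumed for illustration.
tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased-finetuned-sst-2-english")
encoded = tokenizer("MAX made this model noticeably faster.", return_tensors="np")
print(encoded["input_ids"].shape, encoded["attention_mask"].shape)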

4️⃣ MLP Regression

Path: examples/python/03_mlp_regression/
Model: Multi-layer perceptron (3 hidden layers)
Learn: Sequential layers, housing price prediction
Benchmarks: MAX vs PyTorch comparison included

5️⃣ CNN MNIST

Path: examples/python/04_cnn_mnist/
Model: Convolutional neural network (2 conv + 2 dense layers)
Learn: Convolutions, pooling, flattening, digit classification
Benchmarks: MAX vs PyTorch comparison included
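
A PyTorch sketch of roughly this architecture shows what the MAX graph mirrors; the channel and layer sizes below are illustrative assumptions, not necessarily the values used in the example:

import torch
import torch.nn as nn

class MnistCnn(nn.Module):
    # 2 conv + 2 dense layers, as described above; sizes are assumptions.
    def __init__(self):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),   # 28x28 -> 14x14
            nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),  # 14x14 -> 7x7
        )
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Linear(32 * 7 * 7, 128), nn.ReLU(),
            nn.Linear(128, 10),                    # 10 digit classes
        )

    def forward(self, x):
        return self.classifier(self.features(x))

print(MnistCnn()(torch.randn(1, 1, 28, 28)).shape)   # torch.Size([1, 10])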

6️⃣ RNN Forecast (WIP)

Path: examples/python/05_rnn_forecast/
Status: 🚧 Parked due to MAX Graph API limitations with sequence processing

Quick Start

# Install dependencies
pixi install

# Run examples - start with minimal versions to learn MAX Graph API
pixi run python examples/python/01_elementwise/elementwise_minimal.py
pixi run python examples/python/02_linear_layer/linear_layer_minimal.py

# Or use pixi tasks for full versions with configuration
pixi run example-elementwise-cpu   # 1️⃣ Element-wise: mul, add, relu
pixi run example-elementwise-gpu   # 1️⃣ Same ops on Apple Silicon GPU
pixi run example-linear            # 2️⃣ Linear layer: matmul + bias + relu
pixi run example-distilbert        # 3️⃣ DistilBERT transformer
pixi run example-mlp               # 4️⃣ MLP regression
pixi run example-cnn               # 5️⃣ CNN MNIST classifier

# Run tests (49 tests total)
pixi run test-python               # Full pytest suite
pixi run test-mojo                 # Mojo tests

# Run benchmarks (generates MD + JSON + CSV reports)
pixi run benchmark-elementwise     # 1️⃣ Element-wise: CPU vs GPU
pixi run benchmark-linear          # 2️⃣ Linear layer: CPU vs GPU  
pixi run benchmark-distilbert      # 3️⃣ DistilBERT: MAX vs PyTorch
pixi run benchmark-mlp             # 4️⃣ MLP: MAX vs PyTorch
pixi run benchmark-cnn             # 5️⃣ CNN: MAX vs PyTorch
pixi run benchmark-all             # Run all benchmarks

# Cleanup benchmark reports
pixi run clean-reports-all         # Remove all benchmark reports

Performance Highlights

DistilBERT (M1 CPU)

  • MAX: 45.88ms mean, 21.80 req/sec
  • PyTorch: 255.85ms mean, 3.91 req/sec
  • Speedup: 5.58x faster with 85% better P95 latency
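
The throughput and speedup figures follow directly from the mean latencies:

max_ms, torch_ms = 45.88, 255.85            # mean latency per request
print(f"{1000 / max_ms:.2f} req/sec")       # ~21.80
print(f"{torch_ms / max_ms:.2f}x speedup")  # ~5.58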

MLP Regression (M1 CPU, batch=2048)

  • MAX: 142ms per batch
  • PyTorch: 0.56ms per batch
  • Note: PyTorch significantly faster on this workload (~253x)

CNN MNIST (M1 CPU, batch=256)

  • PyTorch: ~5x faster than MAX
  • Note: Both produce identical predictions (correctness validated)

Apple Silicon GPU

  • Element-wise operations working (first reported MAX Graph inference on Apple Silicon GPU)
  • Matrix operations blocked - matmul kernel not yet available
  • See docs/APPLE_SILICON_GPU_FINDINGS.md for details

Repository Structure

├── src/python/
│   ├── max_*/                     # MAX implementations (distilbert, mlp, cnn, rnn)
│   ├── utils/                     # Shared utilities (paths, benchmarks)
│   └── pyproject.toml             # Package configuration
├── src/mojo/                          # (Empty - ready for MAX Graph Mojo modules)
├── examples/mojo/
│   └── lexicon_baseline/          # v0.1.0 pure Mojo baseline (non-MAX Graph)
├── examples/python/
│   ├── 01_elementwise/            # Element-wise ops (minimal + full)
│   ├── 02_linear_layer/           # Linear layer (minimal + full)
│   ├── 03_distilbert_sentiment/   # DistilBERT transformer
│   ├── 03_mlp_regression/         # MLP for housing prices
│   ├── 04_cnn_mnist/              # CNN digit classifier
│   └── 05_rnn_forecast/           # RNN (WIP)
├── tests/python/                  # pytest suite (49 tests)
│   ├── 01_elementwise/
│   ├── 02_linear_layer/
│   ├── 03_distilbert/
│   ├── 03_mlp/
│   └── 04_cnn_mnist/
├── benchmarks/
│   ├── 01_elementwise/            # CPU vs GPU
│   ├── 02_linear_layer/           # CPU vs GPU
│   ├── 03_distilbert/             # MAX vs PyTorch
│   ├── 03_mlp/                    # MAX vs PyTorch
│   └── 04_cnn/                    # MAX vs PyTorch
├── docs/
│   ├── MAX_FRAMEWORK_GUIDE.md     # Comprehensive MAX guide
│   ├── PROJECT_STATUS.md          # Current status & learnings
│   └── APPLE_SILICON_GPU_FINDINGS.md  # GPU experiments
└── models/                        # Downloaded models (gitignored)

Completed Milestones

✅ v0.1.0 - Lexicon-based Baseline

  • Pure Mojo sentiment classifier
  • Simple lexicon-based approach
  • Benchmarking foundation

✅ v0.2.0 - MAX Graph DistilBERT

  • Full MAX Graph implementation of DistilBERT
  • 5.58x speedup over PyTorch on M1
  • Comprehensive documentation & guides
  • Numbered examples for learning
  • Apple Silicon GPU experiments (element-wise ops working)

✅ v0.3.0 - Community Release

  • Six progressive examples: element-wise → linear → DistilBERT → MLP → CNN → RNN (WIP)
  • Minimal examples: Self-contained code highlighting MAX Graph API without abstractions
  • Comprehensive testing: 49 pytest tests with correctness validation
  • Performance benchmarks: MAX vs PyTorch comparisons for all models
  • Package restructuring: All Python modules installable, no sys.path manipulation
  • Systematic benchmarking: TOML configs, MD/JSON/CSV reports with machine IDs
  • Australian spelling throughout documentation
  • Ready for community feedback

Future Directions

  • Larger models: LLaMA, Mistral via MAX Pipeline API
  • Batch inference: Throughput optimisation
  • Quantisation: INT8/INT4 experiments
  • More GPU work: When matmul kernels available for Apple Silicon

Requirements

  • MAX 25.1.0 or later
  • Pixi package manager
  • Python 3.11+ (for MAX Python API)

Key Dependencies

  • MAX Engine: Graph compilation and inference
  • Transformers: Model and tokenizer loading
  • PyTorch: For benchmarking comparisons
  • pytest: Testing framework

Recommended Learning Path

  1. Start with minimal examples: Run pixi run python examples/python/01_elementwise/elementwise_minimal.py to see pure MAX Graph API
  2. Progress through numbered examples: Work through 1️⃣ → 6️⃣ in order, each building on previous concepts
  3. Read the guides: docs/MAX_FRAMEWORK_GUIDE.md explains MAX concepts in depth
  4. Run benchmarks: See real performance comparisons and correctness validation
  5. Review tests: Study tests/python/ to see validation patterns (a sketch of the typical pattern follows this list)
  6. Explore GPU findings: Understand current Apple Silicon GPU capabilities and limitations
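
A typical correctness test compares the MAX output against a plain NumPy (or PyTorch) reference. The helper name below is a hypothetical stand-in for the repository's actual wrappers, so read it as a pattern rather than real test code:

import numpy as np

def test_linear_layer_matches_reference():
    x = np.random.rand(8, 4).astype(np.float32)
    w = np.random.rand(3, 4).astype(np.float32)
    b = np.random.rand(3).astype(np.float32)

    expected = np.maximum(x @ w.T + b, 0.0)    # NumPy reference
    actual = run_max_linear_layer(x, w, b)     # hypothetical wrapper around the MAX graph

    np.testing.assert_allclose(actual, expected, rtol=1e-5, atol=1e-6)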

Sponsorship

This project is sponsored by DataBooth as part of our exploration of high-performance AI infrastructure.

Acknowledgements

  • Modular team for creating MAX and Mojo, and for their helpful responses on Discord
  • MAX documentation - particularly the MLP tutorial which inspired example 04
  • Community projects - mojo-toml and mojo-dotenv used in the lexicon baseline
  • Community feedback on early versions helped shape the structure and focus

See docs/ACKNOWLEDGEMENTS.md for detailed attributions.

Licence

MIT Licence - see LICENCE file for details
