You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: modules/module1/content.md
+5-5Lines changed: 5 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,7 +1,7 @@
1
1
# Module 1: Foundations of GPU Programming with CUDA and HIP
2
2
*Heterogeneous Data Parallel Computing*
3
3
4
-
> Environment note: Examples are validated in containers using CUDA 12.9.1 (Ubuntu 22.04) and ROCm latest (rocm/dev-ubuntu-22.04:latest). Using Docker is recommended for a consistent setup.
4
+
> Environment note: Examples are validated in containers using CUDA 12.9.1 (Ubuntu 22.04) and ROCm 7.0 (Ubuntu 24.04). The advanced build system automatically detects your GPU vendor and optimizes accordingly. Using Docker is recommended for a consistent setup.
5
5
6
6
## Learning Objectives
7
7
After completing this module, you will be able to:
Copy file name to clipboardExpand all lines: modules/module2/content.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,7 +1,7 @@
1
1
# Module 2: Advanced GPU Memory Management and Optimization
2
2
*Mastering GPU Memory Hierarchies and Performance Optimization*
3
3
4
-
> Environment note: Examples are tested in Docker containers with CUDA 12.9.1 and ROCm latest (rocm/dev-ubuntu-22.04:latest). Prefer Docker for reproducible builds.
4
+
> Environment note: Examples are tested in Docker containers with CUDA 12.9.1 (Ubuntu 22.04) and ROCm 7.0 (Ubuntu 24.04). The improved build system automatically optimizes memory access patterns. Prefer Docker for reproducible builds.
5
5
6
6
## Learning Objectives
7
7
After completing this module, you will be able to:
> Environment note: Use the provided Docker images (CUDA 12.9.1, ROCm latest) for consistent toolchains across platforms.
4
+
> Environment note: Use the provided Docker images (CUDA 12.9.1 on Ubuntu 22.04, ROCm 7.0 on Ubuntu 24.04) with automatic GPU detection for consistent toolchains across platforms.
5
5
6
6
## Learning Objectives
7
7
After completing this module, you will be able to:
> Environment note: Examples are validated with CUDA 12.9.1 and ROCm latest in Docker containers. Multi-GPU sections may require appropriate hardware and drivers.
3
+
> Environment note: Examples are validated with CUDA 12.9.1 (Ubuntu 22.04) and ROCm 7.0 (Ubuntu 24.04) in Docker containers. Multi-GPU sections may require appropriate hardware and drivers. Auto-detection build system optimizes for your platform.
Copy file name to clipboardExpand all lines: modules/module5/content.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,6 +1,6 @@
1
1
# Module 5: Performance Considerations and GPU Optimization
2
2
3
-
> Environment note: Examples and profiling workflows are validated using Docker images with CUDA 12.9.1 and ROCm latest (rocm/dev-ubuntu-22.04:latest) for consistent toolchains.
3
+
> Environment note: Examples and profiling workflows are validated using Docker images with CUDA 12.9.1 (Ubuntu 22.04) and ROCm 7.0 (Ubuntu 24.04) for consistent toolchains. Enhanced build system includes profiling integrations.
4
4
5
5
## Table of Contents
6
6
1.[Introduction to GPU Performance Optimization](#introduction)
Copy file name to clipboardExpand all lines: modules/module6/content.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,6 +1,6 @@
1
1
# Module 6: Fundamental Parallel Algorithms - Comprehensive Guide
2
2
3
-
> Environment note: The examples and benchmarks in this module are tested in Docker with CUDA 12.9.1 and ROCm latest to ensure reproducibility.
3
+
> Environment note: The examples and benchmarks in this module are tested in Docker with CUDA 12.9.1 (Ubuntu 22.04) and ROCm 7.0 (Ubuntu 24.04) to ensure reproducibility. Recent algorithm fixes improve performance.
> Environment note: Use the provided Docker environment (CUDA 12.9.1, ROCm latest) for consistent builds and tools across platforms.
3
+
> Environment note: Use the provided Docker environment (CUDA 12.9.1 on Ubuntu 22.04, ROCm 7.0 on Ubuntu 24.04) for consistent builds and tools across platforms. Recent algorithmic pattern fixes included.
0 commit comments