Portable GPU Programming
Shared Memory Usage
GitHub/fluidnumerics/gpu-programming
Portable GPU Programming
GitHub/fluidnumerics/gpu-programming
Home
Hardware
Hardware
GPU Accelerated Platforms
Estimating Performance
GPU Specifications Table
OpenMP GPU Offloading
OpenMP GPU Offloading
Basics
HIP and HIPFort
HIP and HIPFort
Basics
Benchmarking your hardware
Benchmarking your hardware
PCI
Memory
Compute
Multi-GPU Communications
Performance Topics
Performance Topics
Coalesced Memory Addressing
Shared Memory Usage
Occupancy
Asynchronous Operations
Multi-GPU Topics
Multi-GPU Topics
Task Affinity
GPU Direct Communications
Debugging Applications
Debugging Applications
Basics
Debugging with roc-gdb
Profiling Applications
Profiling Applications
Basics
Profiling with rocprof
For System Administrators
For System Administrators
Build amdclang/flang with AMD and Nvidia bitcodes
Installing OpenMPI with AMD GPU Support
Mentored Sprints
Mentored Sprints
About
Emergent Phenomena Revealed in Subatomic Matter (EmPRiSM)
Codelabs
Codelabs
Build a GPU Accelerated Application with HIP in C/C++
Build a GPU Accelerated Application with HIPFort in Fortran
Build a GPU Accelerated Application with OpenMP in C/C++
Build a GPU Accelerated Application with OpenMP in Fortran
Shared Memory Usage