CUDA Notes¶

This page summarizes CUDA-specific setup and failure modes.

Compatibility Checklist¶

Useful Python checks:

import torch
print(torch.cuda.is_available())
print(torch.version.cuda)

torch.cuda.is_available() is False:
- install matching CUDA-enabled torch wheel
- verify driver installation
Native backend loads but CUDA symbols are unavailable:
- rebuild csrc with a valid CUDA toolkit and visible nvcc
- check CMake output for CUDA detection messages
Runtime illegal memory access or launch failures:
- validate tensor shapes and bounds for source/receiver indices
- reduce workload size and reproduce with one shot for isolation
Performance lower than expected:
- test storage_mode=device first
- profile with realistic n_shots and nt
- verify kernels are not falling back to Python backend