triton Triton Inference Server Register a CUDA shared memory region Register a CUDA shared memory region for use with inference requests. GitHub