Tech Apr 20, 2026 9 min Running TRELLIS.2 on Apple Silicon MPS: a CUDA-free port A port that replaces TRELLIS.2's CUDA-only libraries (flash_attn, nvdiffrast, sparse 3D convolution) with pure-PyTorch equivalents and runs Microsoft's 4B image-to-3D model on an M4 Pro in about 3.5 minutes without any NVIDIA GPU. AppleSilicon MPS PyTorch 3D ローカルLLM ML