- Cango Inc. (San Francisco, CA)
- …GPU infrastructure and business teams to ensure timely product delivery. Lead performance engineering efforts including NCCL tuning, NUMA binding, CUDA kernel ... optimization. Drive cross-team collaboration (GPU kernel, compiler , distributed system, frontend APIs) to ensure system stability and scalability. Organize… more