• Cango Inc. (San Francisco, CA)
    …GPU infrastructure and business teams to ensure timely product delivery. Lead performance engineering efforts including NCCL tuning, NUMA binding, CUDA kernel ... optimization. Drive cross-team collaboration (GPU kernel, compiler , distributed system, frontend APIs) to ensure system stability and scalability. Organize… more
    job goal (01/14/26)
    - Save Job - Related Jobs - Block Source