- Genmo Inc. (San Francisco, CA)
- …of AI and pushing the boundaries of what's possible in video generation. We're seeking a GPU Performance Engineer to squeeze every last FLOP from our H100 ... serving stack to its absolute limits. The Role You'll be our performance optimization expert, using advanced profiling tools to identify bottlenecks and implementing… more
- Genmo Inc. (San Francisco, CA)
- A video generation research lab is seeking an experienced GPU Performance Engineer to optimize their model serving stack and maximize performance on H100 ... have over 5 years of systems programming experience, a strong foundation in GPU architecture, and proficiency with tools like Nsight Systems and nvprof. This… more
- 10X Recruiting Partners (San Francisco, CA)
- …their client's team in San Francisco. This role focuses on optimizing GPU virtualization performance and involves significant problem-solving and ownership of ... production systems. Ideal candidates should have expert-level C++ skills and a strong background in low-level systems. They will collaborate directly with the CTO on advanced systems challenges, offering a unique opportunity in a fast-paced, early-stage… more
- 10X Recruiting Partners (San Francisco, CA)
- …Software Engineer (C++ Systems) to join a client's team focused on GPU virtualization. The role requires optimizing performance at the systems level and ... involves complex challenges across production systems. Ideal candidates have strong C++ skills, a related degree, and experience with low-level systems. This position offers substantial ownership and the opportunity to tackle significant technical challenges… more
- Liquid AI (San Francisco, CA)
- A leading AI technology firm is seeking individuals experienced in writing GPU kernels to join their dynamic team. The role involves optimizing architectures for ... various model sizes while actively contributing to the enhancement of inference pipelines. Candidates should have proficiency in CUDA, C/C++, and PyTorch. This position offers the chance to work hands-on with state-of-the-art technology, all within a… more
- Advanced Micro Devices (San Jose, CA)
- …perspectives. Join us as we shape the future of AI and beyond. Principal / Senior GPU Software Performance Engineer - Post‑Training THE ROLE Drive the ... compiler, and model teams to land durable improvements. PREFERRED EXPERIENCE Proven GPU performance engineering for deep learning (ROCm/HIP, Triton, or similar).… more
- Advanced Micro Devices (San Jose, CA)
- A leading technology company is seeking a Principal / Senior GPU Software Performance Engineer in San Jose, CA. This role involves optimizing GPU ... systems. The ideal candidate will have strong skills in GPU performance engineering and experience with deep learning frameworks, particularly PyTorch.… more
- Smallest Inc. (San Francisco, CA)
- Role We're hiring a GPU Optimization Engineer who understands GPUs at a deep, architectural level - someone who knows exactly how to squeeze every last ... You'll Do Optimize model architectures (ASR, TTS, SLMs) for maximum performance on specific GPU hardware Profile models end-to-end to identify GPU … more
- Prima Mente (San Francisco, CA)
- …for multi-omics data processing at scale (1000+ samples) Optimising cost and performance , leveraging GPU acceleration where it matters Work with experimental ... in London, San Francisco and Dubai. Role focus - GPU -accelerated bioinformatics Architect, build, and own scalable production pipelines...at the frontier of AI and biology. You're an engineer , not an analyst. You thrive pushing the boundaries… more
- OpenAI (San Francisco, CA)
- …fleet team focuses on running the world's largest, most reliable, and frictionless GPU fleet to support OpenAI's general purpose model training and deployment. Work ... frameworks and deployment systems Ensuring fast model startup times though high performance snapshot delivery across blob storage down to hardware caching Much more!… more
- San Francisco Compute Co. (San Francisco, CA)
- …located in San Francisco is looking for a dedicated professional to manage GPU training clusters. You will ensure the smooth operation of high- performance ... for hardware management. Applicants must have experience with Linux, networking fundamentals, and GPU clusters. The role offers a salary ranging from $170k to $300k… more
- Recruiting From Scratch (San Francisco, CA)
- …in San Francisco, CA. The role involves building and optimizing a high- performance C++ GPU virtualization library and debugging complex distributed systems. ... Ideal candidates will demonstrate strong experience in modern C++ and performance engineering in a startup environment. This position offers a competitive salary and… more
- Vizcom (San Francisco, CA)
- …design technology company in San Francisco is seeking a Senior Software Engineer for Backend (Systems / Infrastructure). You will architect and deliver backend ... scalability as demand grows. This role involves optimizing APIs, managing GPU workloads, and collaborating with cross-functional teams. Ideal candidates have 5-8… more
- OpenAI (San Francisco, CA)
- An innovative company is seeking a talented software engineer to join their dynamic Inference team. This role involves designing and implementing infrastructure for ... large-scale multimodal models, focusing on high- performance delivery of audio and image inputs. You'll collaborate closely with researchers and product teams to push… more
- Relace (San Francisco, CA)
- A tech company specializing in code generation is seeking an Infrastructure Engineer in San Francisco. You will design and operate systems for high- performance ... infrastructure, work with cloud technologies, and optimize systems for deployment and performance . Candidates should have at least 2 years of experience producing… more
- Databricks Inc. (San Francisco, CA)
- …a research engineer to enhance deep learning techniques and optimize performance on NVIDIA architectures. Ideal candidates will have a PhD in Computer Science ... and experience with CUDA and distributed training frameworks. Join our diverse team and take part in groundbreaking AI developments within an inclusive and supportive environment. #J-18808-Ljbffr more
- Menlo Ventures (San Francisco, CA)
- …Kernel, you will own the design, implementation, optimization, and correctness of the high- performance GPU kernels powering our GenAI inference stack. You will ... MLP, softmax, layernorm, memory management) optimized for various hardware backends ( GPU , accelerators) Drive the performance roadmap for kernel-level… more
- 10X Recruiting Partners (San Francisco, CA)
- …systems from day one and tackling technically demanding challenges at the forefront of GPU infrastructure. What You'll Do Optimize performance of our C++ GPU ... smooth as possible. We're seeking a highly skilled Software Engineer (C++ Systems) to join our client's team and...This is a hardcore C++ systems role focused on GPU virtualization, performance tuning, production debugging, and… more
- Gimlet Labs, Inc (San Francisco, CA)
- …and hardware with previous successful exits. Gimlet Labs is seeking a Software Engineer focused on AI Performance . You will be researching and implementing ... GPU kernel layers Profiling, benchmarking, and analyzing system performance , identifying bottlenecks and optimization opportunities in execution runtimes… more
- Liquid AI (San Francisco, CA)
- …Experience: CUDA CUTLASS C/C++ PyTorch/Triton What You'll Actually Do: Write high- performance GPU kernels for inference workloads Optimize alternative ... locations. This Role Is For You If: You have experience writing high- performance , custom GPU kernels for training or inference You have an understanding of… more