- Epoch Biodesign (San Bruno, CA)
- …on climate and kitchen change. We are seeking an experienced Firmware Engineer to lead vision system development and touchscreen interface implementation for our ... raw image frames, perform preprocessing (ISP, color conversions), and manage AI inference models either locally or via cloud services. Collaborate with hardware… more
- DatologyAI (Redwood City, CA)
- …looking for an engineer with deep experience building and operating large-scale training and inference systems. You will design, implement, and maintain the ... researchers to productionize new models and features quickly and safely. Optimize training and inference pipelines for performance, reliability, and cost. Ensure… more
- F. Hoffmann-La Roche AG (South San Francisco, CA)
- Senior/Principal Software Engineer, AI Enablement (Full stack) ... optimise workflows. We also work on scaling up model training and inference, evaluating the quality of...to meet the scientific needs. The Opportunity: As a software engineer in AI Enablement with a… more
- quadric.io, Inc (Burlingame, CA)
- …executes both NN graph code and conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the key bridge between the world of AI/LLM ... general purpose neural processing unit (GPNPU) architecture. Quadric's co-optimized software and hardware is targeted to run neural network...models and Quadric unique platforms. The AI Inference Engineer at Quadric will [1] port… more
- Amazon (San Francisco, CA)
- Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference Job ID: 3067759 | Amazon.com Services LLC The Annapurna Labs team at Amazon Web ... Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and...ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference… more
- Menlo Ventures (San Francisco, CA)
- About This Role As a software engineer for GenAI inference, you will help design, develop, and optimize the inference engine that powers Databricks' ... are fast, scalable, and efficient. Your work will touch the full GenAI inference stack - from kernels and runtimes to orchestration and memory management. What… more
- OpenAI (San Francisco, CA)
- …and low-latency connection management. Have 5+ years of experience as a software engineer and systems architect working on high-scale, high-reliability ... About the Team Our Inference team brings OpenAI's most capable research and.... About the Role We're looking for a senior engineer to design and build the load balancer that… more
- OpenAI (San Francisco, CA)
- …tighter coordination with product and research. About the Role We're looking for a software engineer to help us serve OpenAI's multimodal models at scale. You'll ... About the Team OpenAI's Inference team powers the deployment of our most...work is inherently cross-functional: you'll collaborate directly with researchers training these models and with product teams defining new… more
- Amazon (San Francisco, CA)
- Software Development Engineer, AI/ML, AWS Neuron, Model Inference The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software ... integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and… more
- Hamilton Barnes Associates Limited (San Francisco, CA)
- …of H100s, H200s, and B200s, ready to go for experimentation, full-scale model training, or inference. Our client operates high-performance GPU clusters powering ... the most advanced AI workloads worldwide. They're now building a serverless inference platform, beginning with cost-efficient batch inference and expanding into… more
- Capital One (San Francisco, CA)
- …Java, or Golang* Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware ... in engineering and mathematics, and your expertise in hardware, software, and AI enable you to see and exploit...developing AI and ML algorithms or technologies (e.g., LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python,… more
- Menlo Ventures (Burlingame, CA)
- …and scale that was not previously possible. You will build massively distributed training and inference pipelines, core MLOps tools and frameworks, and optimize ... team of proven drug hunters, deep learning researchers, and software engineers united by a common mission - drive...iteration on scalable and robust distributed infrastructure for ML training, inference, and evaluation. Support model… more
- Genesis Therapeutics Inc. (Burlingame, CA)
- …and scale that was not previously possible. You will build massively distributed training and inference pipelines, core MLOps tools and frameworks, and optimize ... team of proven drug hunters, deep learning researchers, and software engineers united by a common mission - drive...iteration on scalable and robust distributed infrastructure for ML training, inference, and evaluation. Support model… more
- Menlo Ventures (Burlingame, CA)
- …performance, optimizing the scalability and efficiency of every part of the training and inference pipeline. Ship state-of-the-art models to production, working ... we're a tight-knit team of proven deep learning researchers, software engineers, and drug discovery pioneers. Our shared mission...Role This role is for a highly-skilled ML Research Engineer who thrives at the intersection of fundamental research… more
- Quadric Inc. (Burlingame, CA)
- …between development engineering and hands‑on users in the field. The AI Application Engineer will [1] integrate Quadric product and software stack into AI/LLM ... general purpose neural processing unit (GPNPU) architecture. Quadric's co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and… more
- quadric.io, Inc (Burlingame, CA)
- …an innovative general purpose neural processing unit (GPNPU) architecture. Quadric's co-optimized software and hardware is targeted to run neural network (NN) ... inference workloads in a wide variety of edge and...C++ DSP and control code. Role: The Corporate Applications Engineer is the key bridge between development engineering and… more
- Crusoe Energy Systems LLC (San Francisco, CA)
- …This Role: The Crusoe Cloud Managed AI team seeks an ambitious and experienced Senior Software Engineer to join their team. You'll have a pivotal role in shaping ... large-scale, production-level services. (Preferred) Familiarity with AI infrastructure, including training, inference, and ETL pipelines. (Preferred) Contributions… more
- Crusoe Energy Systems LLC (San Francisco, CA)
- …Generative AI (Large Language Models, Multimodal). Familiarity with AI infrastructure, including training, inference, and ETL pipelines. Software Engineering ... About This Role: As a Senior Staff Software Engineer on the Managed AI...shaping the architecture and scalability of our next-generation AI inference platform. You will lead the design and implementation… more
- Menlo Ventures (San Francisco, CA)
- About This Role As a staff software engineer for GenAI Performance and Kernel, you will own the design, implementation, optimization, and correctness of the ... high-performance GPU kernels powering our GenAI inference stack. You will lead development of highly-tuned, low-level compute paths, manage trade-offs between… more
- Rockstar (San Francisco, CA)
- …promise is simple: they make your AI system better. They are hiring a Backend Software Engineer (ML Infrastructure) to help design, build, and scale the core ... ML workloads, including fine-tuning and reinforcement learning. Build distributed training and inference pipelines that are efficient, fault-tolerant,… more