  • DatologyAI (Redwood City, CA)
    …looking for an engineer with deep experience building and operating large-scale training and inference systems. You will design, implement, and maintain the ... researchers to productionize new models and features quickly and safely. Optimize training and inference pipelines for performance, reliability, and cost. Ensure… more
    job goal (01/13/26)
  • Capital One (Fredericksburg, VA)
    …Java, or Golang Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware ... Lead AI Engineer (FM Hosting, LLM Inference) Overview...support AI software components including foundation model training, large language model inference, similarity search,… more
    job goal (01/12/26)
  • Amazon (San Francisco, CA)
    Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference Job ID: 3067759 | Amazon.com Services LLC The Annapurna Labs team at Amazon Web ... Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and...ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference… more
    job goal (01/13/26)
  • Menlo Ventures (San Francisco, CA)
    About This Role As a software engineer for GenAI inference, you will help design, develop, and optimize the inference engine that powers Databricks' ... are fast, scalable, and efficient. Your work will touch the full GenAI inference stack - from kernels and runtimes to orchestration and memory management. What… more
    job goal (01/13/26)
  • NVIDIA Corporation (Santa Clara, CA)
    Senior Deep Learning Software Engineer, Inference. Locations: US, CA, Santa Clara: ... Requisition ID: JR2002670. NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for...and Python experience is a plus. Prior experience with training, deploying or optimizing the inference of… more
    job goal (01/13/26)
  • OpenAI (San Francisco, CA)
    …and low-latency connection management. Have 5+ years of experience as a software engineer and systems architect working on high-scale, high-reliability ... About the Team Our Inference team brings OpenAI's most capable research and.... About the Role We're looking for a senior engineer to design and build the load balancer that… more
    job goal (01/13/26)
  • Amazon (Seattle, WA)
    …cloud-scale machine-learning accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is ... Overview AWS Neuron is the complete software stack for the AWS Inferentia and Trainium...programming language Fundamentals of machine learning models, their architecture, training and inference lifecycles along with work… more
    job goal (01/13/26)
  • OpenAI (San Francisco, CA)
    …tighter coordination with product and research. About the Role We're looking for a software engineer to help us serve OpenAI's multimodal models at scale. You'll ... About the Team OpenAI's Inference team powers the deployment of our most...work is inherently cross-functional: you'll collaborate directly with researchers training these models and with product teams defining new… more
    job goal (01/13/26)
  • Amazon (San Francisco, CA)
    Software Development Engineer, AI/ML, AWS Neuron, Model Inference The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software ... integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and… more
    job goal (01/12/26)
  • Google Inc. (Sunnyvale, CA)
    Software Engineer III, Infrastructure, Inference Control Plane. Google, Sunnyvale, CA, USA. Bachelor's degree or equivalent practical ... goes on and is growing every day. As a software engineer, you will work on a...push technology forward. The mission of Vertex AI Online Inference Infrastructure team is to build a model serving… more
    job goal (01/13/26)
  • jobr.pro (Sunnyvale, CA)
    …UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google's needs with ... Large Language Models (LLM) and other Machine Learning (ML) models for inference. Experience building GPU-related software. Experience with compilers or ML… more
    job goal (01/13/26)
  • quadric.io, Inc (Burlingame, CA)
    …executes both NN graph code and conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the key bridge between the world of AI/LLM ... general purpose neural processing unit (GPNPU) architecture. Quadric's co-optimized software and hardware is targeted to run neural network...models and Quadric unique platforms. The AI Inference Engineer at Quadric will [1] port… more
    job goal (01/13/26)
  • Hamilton Barnes Associates Limited (San Francisco, CA)
    …of H100s, H200s, and B200s, ready to go for experimentation, full-scale model training, or inference. Our client operates high-performance GPU clusters powering ... the most advanced AI workloads worldwide. They're now building a serverless inference platform, beginning with cost-efficient batch inference and expanding into… more
    job goal (01/13/26)
  • NVIDIA Corporation (Santa Clara, CA)
    Senior Technical Marketing Engineer - AI Inference at Scale. Locations: US, ... platforms integrate CPUs, GPUs, DPUs, networking, and a full-stack software ecosystem to power AI at scale. We are...scale. We are looking for a Senior Technical Marketing Engineer to join our growing accelerated computing product team.… more
    job goal (01/12/26)
  • Red Hat (Boston, MA)
    …closely with our product and research teams to scale SOTA deep learning products and software. As an ML Ops engineer, you will work closely with our technical ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings...in the technology industry work here. Whether we're building software, championing our products, or training new… more
    job goal (01/13/26)
  • Red Hat, Inc. (Boston, MA)
    …closely with our product and research teams to scale SOTA deep learning products and software. As an ML Ops engineer, you will work closely with our technical ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings...and research teams to manage training and deployment pipelines, create DevOps and CI/CD infrastructure,… more
    job goal (01/13/26)
  • Capital One (San Francisco, CA)
    …Java, or Golang. Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware ... in engineering and mathematics, and your expertise in hardware, software, and AI enable you to see and exploit...developing AI and ML algorithms or technologies (e.g., LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python,… more
    job goal (01/13/26)
  • The Association of Technology, Management and Applied… (Morgan Hill, CA)
    …development efficiencies, providing technical thought leadership based on conducting multiple software implementations, and applying both depth and breadth in a ... of relevant experience required. Experience in Model Ops and design, software development with proven effectiveness in delivering technology in fast-paced,… more
    job goal (01/12/26)
  • Minimal (Seattle, WA)
    …is seeking a C++ Software Engineer to optimize AI training and inference systems, and enhance machine learning infrastructure. The ideal candidate ... a BS in a technical field and at least 2 years of software development experience. Responsibilities include designing systems for machine learning workloads and… more
    job goal (01/12/26)
  • Google Inc. (Sunnyvale, CA)
    …JAX. 3 years of experience in software development for machine learning model inference or machine learning model training, and 1 year of experience with ML ... Senior Software Engineer, Machine Learning, Kernel...model performance for large scale training and inference through tuning and optimization at both software… more
    job goal (01/13/26)