• Epoch Biodesign (San Bruno, CA)
    …on climate and kitchen change. We are seeking an experienced Firmware Engineer to lead vision system development and touchscreen interface implementation for our ... raw image frames, perform preprocessing (ISP, color conversions), and manage AI inference models either locally or via cloud services. Collaborate with hardware… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Virtue AI (San Francisco, CA)
    …we're looking for passionate builders to join our core team. What You'll Do As an Inference Engineer , you will own how models are served in production. Your job ... workloads. You will: Serve and optimize LLM, embedding, and other ML models' inference across multiple model families Design and operate inference APIs with… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • F. Hoffmann-La Roche AG (South San Francisco, CA)
    Senior/Principal Software Engineer , AI Enablement (Full stack) page is loaded Senior/Principal Software Engineer , AI Enablement (Full stack) Apply ... our stakeholders, power data-driven science and accelerate decision-making. The Engineering - AI Enablement group within DDC is accountable...to meet the scientific needs. The Opportunity: As a software engineer in AI Enablement with a… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • quadric.io, Inc (Burlingame, CA)
    …executes both NN graph code and conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the key bridge between the world of AI/LLM ... models and Quadric unique platforms. The AI Inference Engineer at Quadric will [1] port...Engineering . 5+ years of experience in AI/LLM model inference and deployment frameworks/tools experience with model quantization (PTQ,… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Amazon (San Francisco, CA)
    Senior Software Development Engineer , AI/ML, AWS Neuron, Model Inference Job ID: 3067759 | Amazon.com Services LLC The Annapurna Labs team at Amazon Web ... principles. - Proficiency in debugging, profiling, and implementing best software engineering practices in large-scale systems. PREFERRED QUALIFICATIONS… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Apple Inc. (San Francisco, CA)
    Senior Software Engineer , Model Inference ...or related field (or equivalent experience). 5+ years in software engineering focused on ML inference ... deliver measurable results at global scale. Description As a Software Engineer on the Apple Maps team,...will lead the design and implementation of large-scale, high-performance inference services that support a wide range of models… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Databricks Inc. (San Francisco, CA)
    Staff Software Engineer - GenAI inference P-1285 About This Role As a staff software engineer for GenAI inference , you will lead the ... architecture, development, and optimization of the inference engine that powers Databricks Foundation Model API. You'll...BS/MS/PhD in Computer Science or a related field. Strong software engineering background (6+ years or equivalent)… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Menlo Ventures (San Francisco, CA)
    About This Role As a software engineer for GenAI inference , you will help design, develop, and optimize the inference engine that powers Databricks' ... and efficient. Your work will touch the full GenAI inference stack - from kernels and runtimes to orchestration...BS/MS/PhD in Computer Science, or a related field Strong software engineering background (3+ years or equivalent)… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • OpenAI (San Francisco, CA)
    …missing to get the job done. Have at least 5 years of professional software engineering experience. Have or can quickly gain familiarity with PyTorch, NVidia ... About the Team Our Inference team brings OpenAI's most capable research and.... About the Role We are looking for an engineer who wants to take the world's largest and… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Menlo Ventures (San Francisco, CA)
    …and contribute to our innovative projects. Position Overview We are looking for a Software Engineer to work at the forefront of deploying our cutting-edge AI ... of our embodied systems. You will be responsible for optimizing AI inference processes from lightweight to billion-parameter models, ensuring our robots operate with… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Amazon (San Francisco, CA)
    Software Development Engineer , AI/ML, AWS Neuron, Model Inference The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software ... computing principles. Proficiency in debugging, profiling, and implementing best software engineering practices in large‑scale systems. Preferred Qualifications… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • Pulse (San Francisco, CA)
    …experience is a plus About the Role Specialize in low-latency, high-throughput inference for OCR and multimodal models. Own profiling, batching, and autoscaling ... across single-tenant and multi-tenant environments. Responsibilities Build inference services with smart batching and caching Optimize kernels, tokenization, and… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Menlo Ventures (San Francisco, CA)
    …cloud platforms. You may be a good fit if you: Have significant software engineering experience, particularly with distributed systems Are results-oriented, with ... to build beneficial AI systems. About the role Our Inference team is responsible for building and maintaining the...by serving our models via the industry's largest compute-agnostic inference deployments. We are responsible for the entire stack… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • quadric.io, Inc (Burlingame, CA)
    …an innovative general purpose neural processing unit (GPNPU) architecture. Quadric's co-optimized software and hardware is targeted to run neural network (NN) ... inference workloads in a wide variety of edge and...conventional C++ DSP and control code. Role The Full-Stack Engineer is key to making the Quadric product and… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • Quadric Inc. (Burlingame, CA)
    …an innovative general purpose neural processing unit (GPNPU) architecture. Quadric's co‑optimized software and hardware is targeted to run neural network (NN) ... inference workloads in a wide variety of edge and...conventional C++ DSP and control code. Role: The Full‑Stack Engineer is key to making the Quadric product and… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • Databricks Inc. (San Francisco, CA)
    A leading AI-focused technology company in San Francisco is seeking a Software Engineer for GenAI inference . In this role, you'll design, develop, and ... optimize the inference engine powering the Foundation Model API. You will...focusing on large-scale LLM applications. A strong background in software engineering , distributed systems, and machine learning… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Capital One (San Francisco, CA)
    …developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, ... engineering and mathematics, and your expertise in hardware, software , and AI enable you to see and exploit...developing AI and ML algorithms or technologies (eg LLM Inference , Similarity Search and VectorDBs, Guardrails, Memory) using Python,… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Menlo Ventures (Burlingame, CA)
    …intersection of machine learning, physics, and computational chemistry, as well as engineering robust software systems that enable running large scale ... team of proven drug hunters, deep learning researchers, and software engineers united by a common mission - drive...experienced ML engineers to join the team and lead engineering efforts focused on driving forward our ML research… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Genesis Therapeutics Inc. (Burlingame, CA)
    …intersection of machine learning, physics, and computational chemistry, as well as engineering robust software systems that enable running large scale ... team of proven drug hunters, deep learning researchers, and software engineers united by a common mission - drive...ML infrastructure engineers to join the team and lead engineering efforts focused on driving forward our ML research… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Mvp VC (San Francisco, CA)
    A cutting-edge aerospace company in San Francisco is seeking a skilled software engineer to optimize and integrate the Ultimate Edge SDK for embedded platforms. ... NVIDIA hardware. Required qualifications include a Master's in Computer Engineering , expertise in C++/Python, and familiarity with containerization technologies.… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source