  • Eluvio (Berkeley, CA)
    …a highly focused and expert team of systems, networking, application, and video software engineers, AI scientists, ML engineers, and security specialists working ... content routing, just-in-time code execution, and inline frame accurate multi-modal content AI. We are headquartered in Berkeley, CA. Our technology supports ground… more
    job goal (01/13/26)
  • Menlo Ventures (San Francisco, CA)
    …and contribute to our innovative projects. Position Overview We are looking for a Software Engineer to work at the forefront of deploying our cutting-edge ... capabilities of our embodied systems. You will be responsible for optimizing AI inference processes from lightweight to billion-parameter models, ensuring our… more
    job goal (01/13/26)
  • Amazon (San Francisco, CA)
    Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference Job ID: 3067759 | Amazon.com Services LLC The Annapurna Labs team at Amazon Web ... Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and...scientists, system engineers, and product managers to deliver state-of-the-art inference capabilities for Generative AI applications. Your… more
    job goal (01/13/26)
  • Amazon (San Francisco, CA)
    Software Development Engineer, AI/ML, AWS Neuron, Model Inference The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software ... of applied scientists, system engineers, and product managers to deliver state-of-the-art inference capabilities for Generative AI applications. Your work will… more
    job goal (01/12/26)
  • Amazon (San Francisco, CA)
    …technology company in Herndon, Virginia is seeking a Senior Software Development Engineer to work on AI/ML projects. You will design and optimize machine ... learning models for deployment on custom hardware accelerators, ensuring maximum performance. Ideal candidates will have over 5 years of experience, strong Python and C++ skills, and knowledge in machine learning principles. This role fosters a collaborative… more
    job goal (01/13/26)
  • Amazon (San Francisco, CA)
    A leading e-commerce platform in San Francisco is seeking a Software Development Engineer to develop and optimize machine learning models for custom hardware ... accelerators. This role involves performance tuning, debugging, and close collaboration with customers to enhance their models on AWS's services. The ideal candidate has strong programming skills in C++ and Python, along with a solid understanding of machine… more
    job goal (01/12/26)
  • Virtue AI (San Francisco, CA)
    About Virtue AI Virtue AI sets the standard for advanced...to join our core team. What You'll Do As an Inference Engineer, you will own how models are ... Built on decades of foundational and award-winning research in AI security, its AI-native architecture unifies automated...Serve and optimize LLM, embedding, and other ML models' inference across multiple model families Design and operate… more
    job goal (01/13/26)
  • Databricks Inc. (San Francisco, CA)
    Staff Software Engineer - GenAI inference P-1285 About This Role As a staff software engineer for GenAI inference, you will lead the ... architecture, development, and optimization of the inference engine that powers Databricks Foundation Model API. You'll bridge research advances and production… more
    job goal (01/13/26)
  • Menlo Ventures (San Francisco, CA)
    About This Role As a software engineer for GenAI inference, you will help design, develop, and optimize the inference engine that powers Databricks' ... are fast, scalable, and efficient. Your work will touch the full GenAI inference stack - from kernels and runtimes to orchestration and memory management. What… more
    job goal (01/13/26)
  • San Francisco Compute Co. (San Francisco, CA)
    …GPU offtake. About the Role As an engineer working on Large Scale Inference you will build and scale software systems that accept, distribute, and purchase ... solutions to maximize compute utilization Create automated compute purchasing software to optimally fulfill inference job demand...fit for you if You enjoy the craftsmanship of software You're a thoughtful high-agency engineer Have… more
    job goal (01/13/26)
  • OpenAI (San Francisco, CA)
    …and low-latency connection management. Have 5+ years of experience as a software engineer and systems architect working on high-scale, high-reliability ... . About the Role We're looking for a senior engineer to design and build the load balancer that...will sit at the very front of our research inference stack - routing the world's largest AI more
    job goal (01/13/26)
  • Capital One (San Francisco, CA)
    …developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, ... in engineering and mathematics, and your expertise in hardware, software, and AI enable you to see...AI and ML algorithms or technologies (e.g. LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python,… more
    job goal (01/13/26)
  • OpenAI (San Francisco, CA)
    About the Team Our Inference team brings OpenAI's most capable research and technology to the world through our products. We empower consumers, enterprise and ... developers alike to use and access our state-of-the-art AI models, allowing them to do things that they've...to before. We focus on performant and efficient model inference, as well as accelerating research progression via model… more
    job goal (01/13/26)
  • OpenAI (San Francisco, CA)
    …tighter coordination with product and research. About the Role We're looking for a software engineer to help us serve OpenAI's multimodal models at scale. You'll ... world-class developer experience while pushing the boundaries of what AI can do. We're expanding into multimodal inference, building the infrastructure needed to serve models that… more
    job goal (01/13/26)
  • San Francisco Compute Co. (San Francisco, CA)
    …technology firm in San Francisco is seeking an engineer for Large Scale Inference. You will build and scale software systems to optimize compute for ... inference workloads. The ideal candidate enjoys software craftsmanship, is a strong communicator, and has an appreciation for reliable systems. The role offers… more
    job goal (01/13/26)
  • OpenAI (San Francisco, CA)
    An innovative company is seeking a talented software engineer to join their dynamic Inference team. This role involves designing and implementing ... researchers and product teams to push the boundaries of AI technology, ensuring reliable production services. If you thrive...enjoy tackling complex challenges, this opportunity offers a chance to make a significant impact in the AI landscape.… more
    job goal (01/13/26)
  • Menlo Ventures (San Francisco, CA)
    A technology-focused public benefit corporation in San Francisco seeks a skilled software engineer to join the inference team. This role involves building ... systems that power AI models like Claude, focusing on maximizing efficiency and enabling groundbreaking research. Ideal candidates have a background in distributed… more
    job goal (01/13/26)
  • Menlo Ventures (San Francisco, CA)
    …researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role Our Inference team is responsible ... Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be...by serving our models via the industry's largest compute-agnostic inference deployments. We are responsible for the entire stack… more
    job goal (01/12/26)
  • OpenAI (San Francisco, CA)
    A leading AI research company in San Francisco seeks an engineer to optimize their powerful AI models for high-volume production environments. The ideal ... candidate has over 5 years of software engineering experience, strong familiarity with ML architectures, and experience with distributed systems. This role involves… more
    job goal (01/13/26)
  • Arcade (San Francisco, CA)
    A pioneering technology company in San Francisco seeks a Software Engineer to build scalable backend systems and enhance generative AI workflows. The ideal ... design efficient architecture for model execution and collaborate closely with product and AI teams. This role offers the opportunity to innovate in a fast-paced… more
    job goal (01/13/26)