- quadric.io, Inc (Burlingame, CA)
- …GPNPU executes both NN graph code and conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the key bridge between the world ... AI /LLM models and Quadric unique platforms. The AI Inference Engineer at Quadric...; [3] profile and benchmark the model performance. This senior technical role demands deep knowledge of AI… more
- quadric.io, Inc (Burlingame, CA)
- A pioneering tech company is looking for an experienced AI Inference Engineer to bridge AI models and advanced processing platforms. This role requires ... expertise in AI model algorithms, strong C/C++ and Python skills, and...experience with deployment frameworks. You will optimize and benchmark AI models, ensuring efficient deployment in edge devices. The… more
- Menlo Ventures (San Francisco, CA)
- A technology-focused public benefit corporation in San Francisco seeks a skilled software engineer to join the inference team. This role involves building ... systems that power AI models like Claude, focusing on maximizing efficiency and enabling groundbreaking research. Ideal candidates have a background in distributed… more
- Amazon (San Francisco, CA)
- …technology company in Herndon, Virginia is seeking a Senior Software Development Engineer to work on AI /ML projects. You will design and optimize machine ... learning models for deployment on custom hardware accelerators, ensuring maximum performance. Ideal candidates will have over 5 years of experience, strong Python and C++ skills, and knowledge in machine learning principles. This role fosters a collaborative… more
- Capital One (San Francisco, CA)
- …financial services provider in San Francisco is seeking a Technical Specialist to develop AI and ML solutions. You'll need a strong foundation in engineering and at ... 4 years of experience programming in Python and deploying AI on cloud platforms. The ability to optimize solutions...The ability to optimize solutions and a passion for AI research are essential. This role offers a competitive… more
- Etched.ai, Inc. (San Jose, CA)
- A tech company specializing in AI inference seeks a technical team member to build installation guides, SDK flows, and reproducible performance metrics. This ... role requires deep knowledge of ML inference infrastructure, software engineering, and the ability to work collaboratively across teams. Candidates should possess… more
- Mvp VC (San Francisco, CA)
- A cutting-edge aerospace company in San Francisco is seeking a skilled software engineer to optimize and integrate the Ultimate Edge SDK for embedded platforms. Key ... responsibilities include collaborating on performance tuning and ensuring efficient deployment on NVIDIA hardware. Required qualifications include a Master's in Computer Engineering, expertise in C++/Python, and familiarity with containerization technologies.… more
- Loft Orbital Solutions (San Francisco, CA)
- A leading space technology company in San Francisco is seeking a skilled engineer to contribute to the development and optimization of the Ultimate Edge SDK. The ... role focuses on integrating ONNX-based runtimes and optimizing performance across embedded platforms. Candidates should have a master's degree and solid experience in C++ or Python, along with familiarity with embedded systems. This position offers a salary of… more
- Amazon (San Francisco, CA)
- Senior Software Development Engineer , AI /ML, AWS Neuron, Model Inference Job ID: 3067759 | Amazon.com Services LLC The Annapurna Labs team at Amazon Web ... of applied scientists, system engineers, and product managers to deliver state-of-the-art inference capabilities for Generative AI applications. Your work will… more
- OpenAI (San Francisco, CA)
- A leading AI research company in San Francisco seeks an engineer to optimize their powerful AI models for high-volume production environments. The ideal ... candidate has over 5 years of software engineering experience, strong familiarity with ML architectures, and experience with distributed systems. This role involves collaboration with researchers and focus on performance optimization. Compensation ranges from… more
- Amazon (San Francisco, CA)
- Software Development Engineer , AI /ML, AWS Neuron, Model Inference The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software ... of applied scientists, system engineers, and product managers to deliver state‑of‑the‑art inference capabilities for Generative AI applications. Your work will… more
- Hamilton Barnes Associates Limited (San Francisco, CA)
- …most advanced AI workloads worldwide. They're now building a serverless inference platform, beginning with cost-efficient batch inference and expanding into ... Join a stealth-mode hyperscale data center startup building an AI and cloud platform, powered by thousands of H100s, H200s, and B200s, ready to go for… more
- Icon Ventures (San Francisco, CA)
- …an AI ‑driven learning coach that's recognized as best‑in‑class. About the Role As Sr . Staff Applied AI Engineer , you will be the hands‑on technical ... Future of Learning Join us to design and deliver AI -powered learning tools that scale across the world and...pipelines), ensure robust evaluation and responsible deployment, and mentor senior engineers to multiply impact across the org. We're… more
- Menlo Ventures (San Francisco, CA)
- …researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role Our Inference team is responsible ... Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be...by serving our models via the industry's largest compute-agnostic inference deployments. We are responsible for the entire stack… more
- OpenAI (San Francisco, CA)
- …research progression via model inference . About the Role We're looking for a senior engineer to design and build the load balancer that will sit at ... the very front of our research inference stack - routing the world's largest ...management. Have 5+ years of experience as a software engineer and systems architect working on high-scale, high-reliability infrastructure.… more
- Comfy (San Francisco, CA)
- …platform company in San Francisco is seeking a talented individual to optimize model inference for their advanced visual AI product. The ideal candidate will ... engage in building efficient AI models and tackling complex challenges. The role requires...limits. Join a dynamic team focused on creating innovative AI solutions and shaping the future of visual generative… more
- Amazon (San Francisco, CA)
- Sr . System Development Engineer , High-Performance Accelerator Servers for AI /ML Do you want to shape the future of Generative AI at AWS? Join the team ... building the foundation of the world's most advanced cloud for AI training and inference - where multi-billion-parameter models come to life at scale. Here,… more
- Inference (San Francisco, CA)
- A technology company in San Francisco is seeking a Senior Full-Stack Engineer to develop frontend features for its AI platform. Responsibilities include ... optimizing performance, mentoring team members, and creating user-centric applications. Candidates should have 5+ years of experience with React, Tailwind, and Typescript. A competitive salary of $120,000 - $180,000 plus equity and benefits is offered.… more
- Disney (San Francisco, CA)
- …and deployment, ensuring that cutting‑edge AI solutions operate reliably at scale. As a Sr ML Ops Engineer , you will act as the backbone of our AI ... Skywalker Sound Development Group is seeking a highly skilled Sr ML Ops Engineer to build and...and maintain the infrastructure powering our machine learning and AI frameworks. This position is crucial in enabling seamless… more
- F. Hoffmann-La Roche AG (South San Francisco, CA)
- Senior /Principal Software Engineer , AI ...We also work on scaling up model training and inference , evaluating the quality of AI /ML models ... Enablement (Full stack) page is loaded Senior /Principal Software Engineer , AI Enablement (Full stack) Apply locations South San Francisco Basel time type… more