- Develop Health Inc. (Menlo Park, CA)
- …now scaling rapidly following a major funding round. About The Role: We're hiring an AI Engineer to take models from prototype to production and drive real ... Develop Health is on a mission to use AI to radically accelerate access to life‑saving medications. ... performance. Deploy models into production: design APIs, integrate inference into our Python‑based backend, and implement observability and…
- quadric.io, Inc (Burlingame, CA)
- …GPNPU executes both NN graph code and conventional C++ DSP and control code. Role: The AI Inference Engineer at Quadric is the key bridge between the world ... of AI/LLM models and Quadric's unique platforms. The AI Inference Engineer at Quadric will [1] port AI models to the Quadric platform; [2] optimize the…
- quadric.io, Inc (Burlingame, CA)
- A pioneering tech company is looking for an experienced AI Inference Engineer to bridge AI models and advanced processing platforms. This role requires ... expertise in AI model algorithms, strong C/C++ and Python skills, and ... experience with deployment frameworks. You will optimize and benchmark AI models, ensuring efficient deployment in edge devices. The…
- Quadric Inc. (Burlingame, CA)
- A leading technology company in California is seeking an AI Inference Engineer to bridge AI models with unique platforms. Key responsibilities include ... should have a Bachelor's or Master's degree, 5+ years' experience in AI frameworks, and proficiency in C/C++ and Python. Competitive benefits included, such…
- Pantera Capital (San Francisco, CA)
- …Full time. Location Type: Hybrid. Department: AI. We are looking for an AI Inference engineer to join our growing team. Our current stack is Python, Rust, ... learning models for real-time inference. Responsibilities: Develop APIs for AI inference that will be used by both internal and external customers. Benchmark…
- Pantera Capital (San Francisco, CA)
- A financial technology firm in San Francisco is seeking an experienced AI Inference Engineer to develop APIs for AI inference used by both internal ... and external customers. Candidates should have experience with machine learning systems and deep learning frameworks like PyTorch, and familiarity with LLM architectures. The role supports a hybrid work environment and offers a competitive salary, equity, and…
- NVIDIA Corporation (Santa Clara, CA)
- Senior Technical Marketing Engineer - AI Inference at Scale. We are looking for a Senior Technical Marketing Engineer to join our growing accelerated computing product team. ... a consistent, high-impact go-to-market strategy. This role will focus on AI inference at scale, ensuring that customers…
- Pantera Capital (Palo Alto, CA)
- …a skilled engineer to optimize traffic platforms that support production inference engines. The ideal candidate will have strong expertise in Kubernetes and ... A leading AI technologies company in Palo Alto is seeking ... critical for enhancing the efficiency and stability of our AI infrastructure. Candidates should be ready to tackle complex…
- Etched.ai, Inc. (San Jose, CA)
- A tech company specializing in AI inference seeks a technical team member to build installation guides, SDK flows, and reproducible performance metrics. This ... role requires deep knowledge of ML inference infrastructure, software engineering, and the ability to work collaboratively across teams. Candidates should possess…
- Etched.ai, Inc. (San Jose, CA)
- A transformative AI technology company in California is looking for a Software Engineer to join its Burn-in Testing team, ensuring the reliability of ... high-performance inference server hardware. The ideal candidate will design and execute burn-in test suites, analyze results, and collaborate with engineering teams.…
- Amazon (Cupertino, CA)
- Software Development Engineer AI/ML, Inference Serving, AWS Neuron. AWS Neuron is the software stack powering AWS Inferentia and Trainium machine learning ... accelerators, designed to deliver high-performance, low-cost inference at scale. The Neuron Serving team develops infrastructure to serve modern machine learning…
- d-Matrix (Santa Clara, CA)
- …Performance Engineer specializing in performance analysis and optimization across AI hardware and software layers. The ideal candidate will possess strong ... A leading AI innovation company is seeking a Senior Principal ... This hybrid position requires a commitment to fostering an inclusive environment while delivering cutting-edge solutions in generative AI.…
- Menlo Ventures (San Francisco, CA)
- A technology-focused public benefit corporation in San Francisco seeks a skilled software engineer to join the inference team. This role involves building ... systems that power AI models like Claude, focusing on maximizing efficiency and enabling groundbreaking research. Ideal candidates have a background in distributed…
- NVIDIA Corporation (Santa Clara, CA)
- A leading technology company is seeking a TensorRT-LLM Software Development Engineer. This role involves developing inferencing software for deep learning ... Applicants should be proactive and have solid technical skills, particularly in C/C++ programming and AI frameworks like TensorFlow and PyTorch.
- MongoDB (Palo Alto, CA)
- …next-generation, AI-powered applications. About the Role: We're looking for a Lead Engineer, Inference Platform to join our team building the inference ... deeply integrated into Atlas and optimized for developer experience. As a Lead Engineer, Inference Platform, you'll be hands-on with design and implementation,…
- jobr.pro (Sunnyvale, CA)
- …leading technology company in the United States is seeking a Software Engineer to develop next-generation technologies. You will work on critical projects, ... optimizing ML infrastructure and contributing to advanced AI solutions. The ideal candidate has a strong background in software development, machine learning, and is…
- Menlo Ventures (San Francisco, CA)
- …capabilities of our embodied systems. You will be responsible for optimizing AI inference processes from lightweight to billion-parameter models, ensuring our ... unforeseen compute and hardware constraints. Responsibilities: Develop and optimize runtime AI inference pipelines for real-world robotic deployment. Build…
- Amazon (San Francisco, CA)
- A leading technology company in Herndon, Virginia is seeking a Senior Software Development Engineer to work on AI/ML projects. You will design and optimize ... machine learning models for deployment on custom hardware accelerators, ensuring maximum performance. Ideal candidates will have over 5 years of experience, strong Python and C++ skills, and knowledge in machine learning principles. This role fosters a…
- Amazon (San Francisco, CA)
- …leading e-commerce platform in San Francisco is seeking a Software Development Engineer to develop and optimize machine learning models for custom hardware ... accelerators. This role involves performance tuning, debugging, and close collaboration with customers to enhance their models on AWS's services. The ideal candidate has strong programming skills in C++ and Python, along with a solid understanding of machine…
- HP IQ (Palo Alto, CA)
- Machine Learning Engineer - Fine-Tuning and On-device AI. HP IQ is HP's new AI innovation lab. Combining startup agility with HP's global scale, we're ... latency‑sensitive environments. Preferred Qualifications: Experience with multi‑agent systems or AI assistant orchestration. Familiarity with advanced inference…