- Virtue AI (San Francisco, CA)
- …we're looking for passionate builders to join our core team. What You'll Do As an Inference Engineer, you will own how models are served in production. Your job ... workloads. You will: Serve and optimize LLM, embedding, and other ML models' inference across multiple model families Design and operate inference APIs with…
- BlackLine (Pleasanton, CA)
- …environments. This position requires a strong background in machine learning, software engineering, and operations to ensure the successful deployment, ... 2001, BlackLine has become a leading provider of cloud software that automates and controls the entire financial close...BlackLine! Make Your Mark: As a Machine Learning Operations Engineer, you will play a pivotal role in bridging…
- F. Hoffmann-La Roche Gruppe (Pleasanton, CA)
- …to come. Join Roche, where every voice matters. The Position Principal DevOps Engineer - ML/AI Algorithms Developing software is great, but developing ... a purpose is even better! As a Principal DevOps Engineer - ML/AI Algorithms, you will work on products...for DevOps, paving the way for seamless and efficient software delivery processes. Location This role can be based…
- BlackLine (Pleasanton, CA)
- …Since being founded in 2001, BlackLine has become a leading provider of cloud software that automates and controls the entire financial close process. Our vision is ... and Grow at BlackLine! Make Your Mark: The Principal AI/ML Operations Engineer leads the architecture, automation, and operationalization of both machine learning…
- Amazon (San Francisco, CA)
- Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference Job ID: 3067759 | Amazon.com Services LLC The Annapurna Labs team at Amazon Web ... principles. - Proficiency in debugging, profiling, and implementing best software engineering practices in large-scale systems. PREFERRED QUALIFICATIONS…
- Apple Inc. (San Francisco, CA)
- Senior Software Engineer, Model Inference ...or related field (or equivalent experience). 5+ years in software engineering focused on ML inference ... deliver measurable results at global scale. Description As a Software Engineer on the Apple Maps team,...will lead the design and implementation of large-scale, high-performance inference services that support a wide range of models…
- Databricks Inc. (San Francisco, CA)
- Staff Software Engineer - GenAI inference P-1285 About This Role As a staff software engineer for GenAI inference, you will lead the ... architecture, development, and optimization of the inference engine that powers Databricks Foundation Model API. You'll...BS/MS/PhD in Computer Science or a related field. Strong software engineering background (6+ years or equivalent)…
- Menlo Ventures (San Francisco, CA)
- About This Role As a software engineer for GenAI inference, you will help design, develop, and optimize the inference engine that powers Databricks' ... and efficient. Your work will touch the full GenAI inference stack - from kernels and runtimes to orchestration...BS/MS/PhD in Computer Science, or a related field Strong software engineering background (3+ years or equivalent)…
- OpenAI (San Francisco, CA)
- …missing to get the job done. Have at least 5 years of professional software engineering experience. Have or can quickly gain familiarity with PyTorch, NVIDIA ... About the Team Our Inference team brings OpenAI's most capable research and.... About the Role We are looking for an engineer who wants to take the world's largest and…
- Menlo Ventures (San Francisco, CA)
- …and contribute to our innovative projects. Position Overview We are looking for a Software Engineer to work at the forefront of deploying our cutting-edge AI ... of our embodied systems. You will be responsible for optimizing AI inference processes from lightweight to billion-parameter models, ensuring our robots operate with…
- Amazon (San Francisco, CA)
- Software Development Engineer, AI/ML, AWS Neuron, Model Inference The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software ... computing principles. Proficiency in debugging, profiling, and implementing best software engineering practices in large‑scale systems. Preferred Qualifications…
- Pulse (San Francisco, CA)
- …experience is a plus About the Role Specialize in low-latency, high-throughput inference for OCR and multimodal models. Own profiling, batching, and autoscaling ... across single-tenant and multi-tenant environments. Responsibilities Build inference services with smart batching and caching Optimize kernels, tokenization, and…
- Menlo Ventures (San Francisco, CA)
- …cloud platforms. You may be a good fit if you: Have significant software engineering experience, particularly with distributed systems Are results-oriented, with ... to build beneficial AI systems. About the role Our Inference team is responsible for building and maintaining the...by serving our models via the industry's largest compute-agnostic inference deployments. We are responsible for the entire stack…
- Databricks Inc. (San Francisco, CA)
- A leading AI-focused technology company in San Francisco is seeking a Software Engineer for GenAI inference. In this role, you'll design, develop, and ... optimize the inference engine powering the Foundation Model API. You will...focusing on large-scale LLM applications. A strong background in software engineering, distributed systems, and machine learning…
- Capital One (San Francisco, CA)
- …developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, ... engineering and mathematics, and your expertise in hardware, software, and AI enable you to see and exploit...developing AI and ML algorithms or technologies (e.g., LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python,…
- Etched.ai, Inc. (San Jose, CA)
- …flows, and reproducible performance metrics. This role requires deep knowledge of ML inference infrastructure, software engineering, and the ability to work ... A tech company specializing in AI inference seeks a technical team member to build installation guides, SDK…
- Etched.ai, Inc. (San Jose, CA)
- A transformative AI technology company in California is looking for a Software Engineer to join its Burn-in Testing team, ensuring the reliability of ... high-performance inference server hardware. The ideal candidate will design and...execute burn-in test suites, analyze results, and collaborate with engineering teams. Applicants should have proficiency in scripting, a…
- Mvp VC (San Francisco, CA)
- A cutting-edge aerospace company in San Francisco is seeking a skilled software engineer to optimize and integrate the Ultimate Edge SDK for embedded platforms. ... NVIDIA hardware. Required qualifications include a Master's in Computer Engineering, expertise in C++/Python, and familiarity with containerization technologies.…
- OpenAI (San Francisco, CA)
- …high-volume production environments. The ideal candidate has over 5 years of software engineering experience, strong familiarity with ML architectures, and ... experience with distributed systems. This role involves collaboration with researchers and focus on performance optimization. Compensation ranges from $325K to $490K.
- F. Hoffmann-La Roche AG (South San Francisco, CA)
- Senior/Principal Software Engineer, AI Enablement (Full stack) ... our stakeholders, power data-driven science and accelerate decision-making. The Engineering - AI Enablement group within DDC is accountable...to meet the scientific needs. The Opportunity: As a software engineer in AI Enablement with a…