- Virtue AI (San Francisco, CA)
- …we're looking for passionate builders to join our core team. What You'll Do: As an Inference Engineer, you will own how models are served in production. Your job ... workloads. You will: serve and optimize LLM, embedding, and other ML models' inference across multiple model families; design and operate inference APIs with…
- Amazon (San Francisco, CA)
- Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference. Job ID: 3067759 | Amazon.com Services LLC. The Annapurna Labs team at Amazon Web ... principles. Proficiency in debugging, profiling, and implementing best software engineering practices in large-scale systems. PREFERRED QUALIFICATIONS…
- Apple Inc. (San Francisco, CA)
- Senior Software Engineer, Model Inference ... or related field (or equivalent experience). 5+ years in software engineering focused on ML inference ... deliver measurable results at global scale. Description: As a Software Engineer on the Apple Maps team, ... will lead the design and implementation of large-scale, high-performance inference services that support a wide range of models…
- Databricks Inc. (San Francisco, CA)
- Staff Software Engineer - GenAI inference P-1285. About This Role: As a staff software engineer for GenAI inference, you will lead the ... architecture, development, and optimization of the inference engine that powers Databricks Foundation Model API. You'll ... BS/MS/PhD in Computer Science or a related field. Strong software engineering background (6+ years or equivalent)…
- Menlo Ventures (San Francisco, CA)
- About This Role: As a software engineer for GenAI inference, you will help design, develop, and optimize the inference engine that powers Databricks' ... and efficient. Your work will touch the full GenAI inference stack, from kernels and runtimes to orchestration ... BS/MS/PhD in Computer Science or a related field. Strong software engineering background (3+ years or equivalent)…
- OpenAI (San Francisco, CA)
- …missing to get the job done. Have at least 5 years of professional software engineering experience. Have or can quickly gain familiarity with PyTorch, NVIDIA ... About the Team: Our Inference team brings OpenAI's most capable research and ... About the Role: We are looking for an engineer who wants to take the world's largest and…
- Menlo Ventures (San Francisco, CA)
- …and contribute to our innovative projects. Position Overview: We are looking for a Software Engineer to work at the forefront of deploying our cutting-edge AI ... of our embodied systems. You will be responsible for optimizing AI inference processes from lightweight to billion-parameter models, ensuring our robots operate with…
- Amazon (San Francisco, CA)
- Software Development Engineer, AI/ML, AWS Neuron, Model Inference. The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software ... computing principles. Proficiency in debugging, profiling, and implementing best software engineering practices in large‑scale systems. Preferred Qualifications…
- Pulse (San Francisco, CA)
- …experience is a plus. About the Role: Specialize in low-latency, high-throughput inference for OCR and multimodal models. Own profiling, batching, and autoscaling ... across single-tenant and multi-tenant environments. Responsibilities: build inference services with smart batching and caching; optimize kernels, tokenization, and…
- Menlo Ventures (San Francisco, CA)
- …cloud platforms. You may be a good fit if you: have significant software engineering experience, particularly with distributed systems; are results-oriented, with ... to build beneficial AI systems. About the role: Our Inference team is responsible for building and maintaining the ... by serving our models via the industry's largest compute-agnostic inference deployments. We are responsible for the entire stack…
- Databricks Inc. (San Francisco, CA)
- A leading AI-focused technology company in San Francisco is seeking a Software Engineer for GenAI inference. In this role, you'll design, develop, and ... optimize the inference engine powering the Foundation Model API. You will ... focusing on large-scale LLM applications. A strong background in software engineering, distributed systems, and machine learning…
- Capital One (San Francisco, CA)
- …developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, ... engineering and mathematics, and your expertise in hardware, software, and AI enable you to see and exploit ... developing AI and ML algorithms or technologies (e.g., LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python,…
- Mvp VC (San Francisco, CA)
- A cutting-edge aerospace company in San Francisco is seeking a skilled software engineer to optimize and integrate the Ultimate Edge SDK for embedded platforms. ... NVIDIA hardware. Required qualifications include a Master's in Computer Engineering, expertise in C++/Python, and familiarity with containerization technologies.…
- OpenAI (San Francisco, CA)
- …high-volume production environments. The ideal candidate has over 5 years of software engineering experience, strong familiarity with ML architectures, and ... experience with distributed systems. This role involves collaboration with researchers and a focus on performance optimization. Compensation ranges from $325K to $490K.
- F. Hoffmann-La Roche AG (South San Francisco, CA)
- Senior/Principal Software Engineer, AI Enablement (Full stack) ... our stakeholders, power data-driven science and accelerate decision-making. The Engineering - AI Enablement group within DDC is accountable ... to meet the scientific needs. The Opportunity: As a software engineer in AI Enablement with a…
- Virtue AI (San Francisco, CA)
- …for passionate builders to join our core team. Are you a high‑performing, motivated engineer ready to make a significant impact in the AI security space? Virtue AI ... is seeking a talented AI Infrastructure Engineer (MLOps) to join us. We are a fast‑paced, ... You will combine cutting‑edge machine learning techniques with strong engineering practices to design and deploy scalable, effective solutions…
- OpenAI (San Francisco, CA)
- …in this role if you: Possess a minimum of 5 years of professional software engineering experience, with added experience in payments, billing, or monetization ... for GPT-4, GPT-3, embeddings, and fine-tuning. Our team also manages large-scale inference infrastructure. With much more on the horizon, our impact continues to…
- OpenAI (San Francisco, CA)
- Software Engineer, Financial Engineering. Applied AI Engineering - San Francisco. About the team: The Applied team at OpenAI safely brings cutting‑edge ... GPT‑3, embeddings, and fine‑tuning. Our team also manages large‑scale inference infrastructure. With much more on the horizon, our ... you: Possess a minimum of 5 years of professional software engineering experience, with added experience in…
- Crusoe Energy Systems LLC (San Francisco, CA)
- …This Role: The Crusoe Cloud Managed AI team seeks an ambitious and experienced Senior Software Engineer to join their team. You'll have a pivotal role in shaping ... the architecture and scalability of our next-generation AI inference platform. You will lead the design and implementation of core systems for our AI services,…
- Crusoe Energy Systems LLC (San Francisco, CA)
- …AI (Large Language Models, Multimodal). Familiarity with AI infrastructure, including training, inference, and ETL pipelines. Software Engineering Skills: ... About This Role: As a Senior Staff Software Engineer on the Managed AI ... shaping the architecture and scalability of our next-generation AI inference platform. You will lead the design and implementation…