- OpenAI (San Francisco, CA)
- …Inference team. This role involves designing and implementing infrastructure for large- scale multimodal models, focusing on high-performance delivery of audio ... and image inputs. You'll collaborate closely with researchers and product teams to push the boundaries of AI technology, ensuring reliable production services. If you thrive in fast-paced environments and enjoy tackling complex challenges, this opportunity… more
- Tether Operations Limited (San Francisco, CA)
- …integrating text, visual, and audio modalities. Engineer scalable training and inference pipelines optimized for large‑ scale multimodal datasets and ... the full development pipeline from data processing & data loading to training, inference , and optimization. Experience working with large‑ scale text data, or… more
- Eloquent AI (San Francisco, CA)
- …AI At Eloquent AI, we're building the next generation of AI Operators- multimodal , autonomous systems that execute complex workflows across fragmented tools with ... future of financial services. Your Role As an AI Engineer at Eloquent AI, you will be at the...problem-solving skills, with the ability to customize, optimize, and scale AI models for real-world enterprise applications. If you're… more
- OpenAI (San Francisco, CA)
- …in ways far beyond text. In this role, you will: Design and implement inference infrastructure for large- scale multimodal models. Optimize systems for ... boundaries of what AI can do. We're expanding into multimodal inference , building the infrastructure needed to...software engineer to help us serve OpenAI's multimodal models at scale . You'll be part… more
- Virtue AI (San Francisco, CA)
- …we're looking for passionate builders to join our core team. What You'll Do As an Inference Engineer , you will own how models are served in production. Your job ... in AI security, its AI-native architecture unifies automated red‑teaming, real‑time multimodal guardrails, and systematic governance for enterprise apps and agents.… more
- Eloquent AI, Inc. (San Francisco, CA)
- …At Eloquent AI, we're building the next generation of AI Operators- multimodal , autonomous systems that execute complex workflows across fragmented tools with ... future of financial services. Your Role As an AI Engineer at Eloquent AI, you will be at the...problem‑solving skills, with the ability to customize, optimize, and scale AI models for real-world enterprise applications. If you're… more
- Pulse (San Francisco, CA)
- …founding experience is a plus About the Role Specialize in low-latency, high-throughput inference for OCR and multimodal models. Own profiling, batching, and ... data infrastructure: extracting accurate, structured information from complex documents at scale . We have a breakthrough approach to document understanding that… more
- Menlo Ventures (San Francisco, CA)
- …leaders working together to build beneficial AI systems. About the role Our Inference team is responsible for building and maintaining the critical systems that ... to life by serving our models via the industry's largest compute-agnostic inference deployments. We are responsible for the entire stack from intelligent request… more
- Liquid AI (San Francisco, CA)
- …Spun out of MIT, our mission is to build efficient AI systems at every scale . Our Liquid Foundation Models (LFMs) operate where others can't: on-device, at the edge, ... You If: You have experience with machine learning at scale You have worked with audio models and understand...frameworks like DeepSpeed, FSDP, or Megatron-LM You've worked with multimodal data (eg audio, text, image, video) You've contributed… more
- Crusoe Energy Systems LLC (San Francisco, CA)
- …The Crusoe Cloud Managed AI team seeks an ambitious and experienced Senior Software Engineer to join their team. You'll have a pivotal role in shaping the ... architecture and scalability of our next-generation AI inference platform. You will lead the design and implementation...This role gives you the opportunity to build and scale infrastructure capable of handling millions of API requests… more
- Crusoe Energy Systems LLC (San Francisco, CA)
- …etc. AI/ML Expertise: Experience in Generative AI (Large Language Models, Multimodal ). Familiarity with AI infrastructure, including training, inference , and ... people can create ambitiously with AI - without sacrificing scale , speed, or sustainability. Be a part of the...cloud infrastructure. About This Role: As a Principal Software Engineer on the Managed AI team at Crusoe, you'll… more
- Menlo Ventures (San Francisco, CA)
- …data pipelines to handle advanced language model training requirements Optimize large scale training and inference pipelines for stable and efficient ... together to build beneficial AI systems. About the role As a Staff Infrastructure Engineer on our team you will work end to end, identifying and addressing key… more
- OpenAI (San Francisco, CA)
- …training and inference infrastructure that powers frontier models at massive scale . Our systems unify how researchers train and serve models, abstracting away ... focus on advancing model capabilities while we handle the scale , efficiency, and reliability required to bring those models...life. About the Role We are looking for an engineer to design and implement the dataset infrastructure that… more
- Virtue AI (San Francisco, CA)
- …Helm‑based enterprise installs Preferred Qualifications Experience deploying ML / LLM inference systems at scale Familiarity with vLLM, sglang, Triton, ... in AI security, its AI-native architecture unifies automated red‑teaming, real‑time multimodal guardrails, and systematic governance for enterprise apps and agents.… more
- Amazon (San Francisco, CA)
- …Robotics team, where you'll contribute to breakthrough foundation models run at production scale . As a Software Development Engineer embedded in our science ... Software Development Engineer , Frontier AI & Robotics Job ID: 2914306...your expertise in CUDA and TensorRT to achieve unprecedented inference efficiency at Amazon scale . In this… more
- Icon Ventures (San Francisco, CA)
- …Experience building on a modern MLOps stack (feature mgmt, orchestration, streaming, online inference at scale ) Compensation, Benefits & Perks Quizlet is an ... us to design and deliver AI-powered learning tools that scale across the world and unlock human potential. About...as best‑in‑class. About the Role As an Applied AI Engineer , you will be working at the forefront of… more
- LanceDB Inc. (San Francisco, CA)
- About LanceDB LanceDB is a developer-friendly, open-source data lake for multimodal AI. From hyper-scalable vector search to advanced retrieval for RAG, from ... streaming training data to interactive exploration of large- scale AI datasets, LanceDB is the best foundation for...the Role We are looking for a Senior Solutions Engineer who blends deep technical understanding of AI/ML infrastructure… more
- Liquid AI (San Francisco, CA)
- …Spun out of MIT, our mission is to build efficient AI systems at every scale . Our Liquid Foundation Models (LFMs) operate where others can't: on-device, at the edge, ... Role Is For You If: You have experience with machine learning at scale You're proficient in PyTorch, and familiar with distributed training frameworks like… more
- Eloquent AI (San Francisco, CA)
- …AI At Eloquent AI, we're building the next generation of AI Operators- multimodal , autonomous systems that execute complex workflows across fragmented tools with ... of financial services. Your Role As a Senior Software Engineer , AIOps & Infrastructure at Eloquent AI, you will...LLMs efficiently while ensuring stability, observability, and performance at scale . You'll play a key role in automating LLMOps… more
- Scale AI, Inc. (San Francisco, CA)
- …excitement about system optimization Experience with multi-node LLM training and inference Experience with developing large- scale distributed ML systems Strong ... language models including instruction tuning, RLHF, tool use, reasoning, agents, and multimodal , etc. Compensation packages at Scale for eligible roles include… more