- MongoDB (Palo Alto, CA)
- **About the Role** We're looking for a Senior Engineer to help build the next-generation inference platform that supports embedding models used for semantic ... Atlas and designed for developer-first experiences. As a Senior Engineer, you'll focus on building core systems and services...You'll Do** + Design and build components of a multi-tenant inference platform integrated directly with MongoDB…
- NVIDIA (Santa Clara, CA)
- …and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You'll architect and implement ... workloads across multi-GPU, multi-node, and multi-cloud environments. You'll collaborate across inference, compiler,...way to integrate research ideas and prototypes into NVIDIA's software products. What we need to see: + Bachelor's…
- NVIDIA (Santa Clara, CA)
- NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, and ... NVIDIA's inference libraries, vLLM and SGLang, FlashInfer and LLM software solutions. + Work with cross-collaborative teams across frameworks, NVIDIA libraries…
- NVIDIA (Santa Clara, CA)
- NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, and ... NVIDIA's inference libraries, vLLM and SGLang, FlashInfer and LLM software solutions. + Work with cross-collaborative teams across frameworks, NVIDIA libraries…
- MongoDB (Palo Alto, CA)
- We're looking for a Lead Engineer, Inference Platform to join our team building the inference platform for embedding models that power semantic search, ... Atlas and optimized for developer experience. As a Lead Engineer, Inference Platform, you'll be hands-on with... platform + Design and build components of a multi-tenant inference service that integrates with Atlas…
- General Motors (Sunnyvale, CA)
- …job is eligible for relocation assistance.** **About the Team:** The ML Inference Platform is part of the AI Compute Platforms organization within Infrastructure ... of state-of-the-art (SOTA) machine learning models for experimental and bulk inference, with a focus on performance, availability, concurrency, and scalability.…
- Amazon (Sunnyvale, CA)
- …Context for inference efficiency. Key job responsibilities * Develop high-performance inference software for a diverse set of neural models, typically in ... is a group of innovative developers working on groundbreaking multi-modal inference solutions that revolutionize how AI...new and existing systems experience - 1+ years of software development engineer or related occupational experience…
- NVIDIA (Santa Clara, CA)
- …enthusiastic about building the next generation of scalable AI systems. As a Principal Software Engineer on the Dynamo project, you will address some of the ... inference challenges, such as context window scaling and multi-model agentic workflows. With highly competitive salaries and a...are growing fast. If you're a creative and autonomous engineer with a genuine passion for technology, we want…
- Microsoft Corporation (Redmond, WA)
- …problems on the intersection of AI and Cloud. We're looking for a **Principal Software Engineer - Azure AI Inferencing** to drive the design, optimization, ... solve real world inference problems for state-of-the-art large language (LLM) and multi-modal Gen AI models from OpenAI and other model providers? We are already…
- Google (Sunnyvale, CA)
- Senior Software Engineer, Machine Learning, Kernel. Google. Sunnyvale, CA, USA; Kirkland, WA, USA. **Mid** Experience driving progress, ... PyTorch or JAX. + 3 years of experience in software development for machine learning model inference...goes on and is growing every day. As a software engineer, you will work on a…
- Walmart (Sunnyvale, CA)
- **Position Summary** **What you'll do** As a **Senior Software Engineer - Machine Learning**, you are a technical leader working at the intersection of ... machine learning and software engineering. You have expertise in both areas and...models + TensorFlow Serving for TF models + Triton Inference Server for multi-framework support + BentoML…
- Red Hat (Raleigh, NC)
- …and deliver innovative apps. The OpenShift AI team seeks a Software Engineer with Kubernetes and Model Inference Runtimes experience to join our rapidly ... **What You Will Do** + Develop and maintain a high-quality, high-performing ML inference runtime platform for multi-modal and distributed model serving. +…
- Walmart (Sunnyvale, CA)
- …personalized results for our customers and drive business KPIs. As a Staff Software Engineer, you'll spend your days translating requirements into solutions, ... Summary** **What you'll do** International Recommendations platform is a multi-tenant framework that powers the personalization experiences for Mexico, Canada, and…
- Walmart (Sunnyvale, CA)
- …platform standards and engineering playbooks. + Drive experimentation (A/B testing, multi-armed bandits, causal inference) and champion innovation. **Product ... automated remediation. + Develop and optimize LLM-based agents for multi-step reasoning, knowledge grounding, and decision-making. + Architect scalable, distributed…
- LinkedIn (Mountain View, CA)
- …to optimize their models and deliver the best performance possible. As a Senior Software Engineer, you will have first-hand opportunities to advance one of the ... optimize performance across algorithms, AI frameworks, data infra, compute software, and hardware to harness the power of our...billions of user queries. Model Training Infrastructure: As an engineer on the AI Training Infra team, you will…
- LinkedIn (Mountain View, CA)
- …engineers to optimize their models and deliver the best performance possible. As a Software Engineer, you will have first-hand opportunities to advance one of ... optimize performance across algorithms, AI frameworks, data infra, compute software, and hardware to harness the power of our...billions of user queries Model Training Infrastructure: As an engineer on the AI Training Infra team, you will…
- NVIDIA (Santa Clara, CA)
- NVIDIA Dynamo is a high-throughput, low-latency inference framework for serving generative AI and reasoning models across multi-node distributed environments. ... resilient deployment of cutting-edge LLM workloads. We are seeking a Principal Systems Engineer to define the vision and roadmap for memory management of large-scale…
- SURVICE Engineering (Dayton, OH)
- …career with a leading organization, come see what we can offer you! Position Software Engineer + Location: Dayton, Ohio + Security Clearance: Active TS/SCI ... spending, tuition reimbursement. Position Summary SURVICE Engineering is currently seeking a Software Engineer to support several DoD survivability programs. You…
- ServiceNow, Inc. (San Diego, CA)
- …reliable, and data-aware automation experiences for our customers. The Opportunity: Principal Software Engineer We are looking for a Principal Software ... It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today -…
- NVIDIA (Santa Clara, CA)
- …upon which every new AI-powered application is built. We are seeking a senior engineer to design and build factory automation for NVIDIA Inference Microservices ... all the way through deployment in heterogeneous hardware and software environments. You will influence and drive technical advances...+ Excellent interpersonal skills and the ability to lead multi-functional efforts + BS or MS in Computer Science,…