Senior Inference Platform Engineer Jobs

141 jobs (page 1)

Categories

All Categories

Engineering (50)

Software/IT (11)

Management (5)

Senior Inference Platform…

Hamilton Barnes Associates Limited (San Francisco, CA)

…of the most advanced AI workloads worldwide. They're now building a serverless inference platform , beginning with cost-efficient batch inference and ... want to miss this opportunity! Key Responsibilities Take ownership of the inference platform architecture, from batch to low-latency workloads. Design, build,… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior AI Inference Platform…

Etched.ai, Inc. (San Jose, CA)

A tech company specializing in AI inference seeks a technical team member to build installation guides, SDK flows, and reproducible performance metrics. This role ... requires deep knowledge of ML inference infrastructure, software engineering, and the ability to work collaboratively across teams. Candidates should possess… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior AI Engineer (Gen AI…

Capital One (Washington, DC)

Senior AI Engineer (Gen AI Platform Services, Agentic Systems) Overview: At Capital One, we are creating responsible and reliable AI systems, changing ... agreed upon number of hours to be regularly worked. Cambridge, MA: $158,600 - $181,000 for Senior AI Engineer McLean, VA: $158,600 - $181,000 for Senior AI … more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Senior Software Engineer , Model…

Apple Inc. (San Francisco, CA)

Senior Software Engineer , Model Inference San...best map in the world. In this role on ML Platform , you will help bring advanced deep learning and large ... measurable results at global scale. Description As a Software Engineer on the Apple Maps team, you will lead...will lead the design and implementation of large-scale, high-performance inference services that support a wide range of models… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior Principal Software Engineer…

Akamai Technologies GmbH (Cambridge, MA)

Senior Principal Software Engineer - Akamai Inference Cloud (Remote) United States (Remote) Job Description Do you thrive on defining the future of AI ... requiring a longer-term view and deep understanding of business objectives. As a Senior Principal Software Engineer , you will be responsible for: Defining the… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior Principal MLOps Engineer , AI…

Red Hat (Boston, MA)

…bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI ... techniques for model compression, our team provides a stable platform for enterprises to build, optimize, and scale LLM...scale LLM deployments.We are seeking an experienced ML Ops engineer to work closely with our product and research… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior Principal MLOps Engineer , AI…

Red Hat, Inc. (Boston, MA)

…bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI ... techniques for model compression, our team provides a stable platform for enterprises to build, optimize, and scale LLM...LLM deployments. We are seeking an experienced ML Ops engineer to work closely with our product and research… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
AI Inference Engineer

quadric.io, Inc (Burlingame, CA)

…executes both NN graph code and conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the key bridge between the world of AI/LLM ... models and Quadric unique platforms. The AI Inference Engineer at Quadric will [1] port...platform ; [2] optimize the model deployment for efficient inference ; [3] profile and benchmark the model performance. This… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior ML Inference Engineer…

Comfy (San Francisco, CA)

A leading AI platform company in San Francisco is seeking a talented individual to optimize model inference for their advanced visual AI product. The ideal ... candidate will engage in building efficient AI models and tackling complex challenges. The role requires a strong background in PyTorch and a passion for pushing performance limits. Join a dynamic team focused on creating innovative AI solutions and shaping… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior Engineer -AI Inference

The Association of Technology, Management and Applied… (Morgan Hill, CA)

…innovation in AI. We are building the next generation of Gen AI platform , empowering new AI initiatives across Consumer, Small Business, Global Banking, and Wealth ... organizations. This is a unique opportunity to contribute to a critical platform that will enable secure, scalable, and high-performance AI capabilities across the… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Senior Backend Engineer…

Neara (Palo Alto, CA)

Job type: Full Time Department: Backend Engineer Work type: On-Site About A rchetype AI Archetype AI is developing the world's first AI platform to bring AI into ... jobsarchetypeaiio. About the Role Were looking for a highly motivated backend engineer with a passion for building performant, scalable, and resilient distributed… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Senior Frontend Engineer…

Inference (San Francisco, CA)

A technology company in San Francisco is seeking a Senior Full-Stack Engineer to develop frontend features for its AI platform . Responsibilities include ... optimizing performance, mentoring team members, and creating user-centric applications. Candidates should have 5+ years of experience with React, Tailwind, and Typescript. A competitive salary of $120,000 - $180,000 plus equity and benefits is offered.… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Fullstack Engineer - Frontend Focus

Inference (San Francisco, CA)

Inference .net is hiring a Senior Full-Stack (Frontend-Focused) Engineer Help us build beautiful, performant web experiences that give users super-powers over ... our globally distributed LLM inference platform . If you love shipping React apps that feel snappy at planet-scale, we'd love to meet you. About Inference .net… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Senior Software Engineer , Backend…

harvey.ai (San Francisco, CA)

…being written today - and we're just getting started. Role Overview As a Backend Platform Engineer at Harvey, you will help build and operate the cohesive ... incrementally, but end-to-end. By combining frontier agentic AI, an enterprise-grade platform , and deep domain expertise, we're reshaping how critical knowledge work… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior AI/ML Engineer (AI…

Whoop, Inc. (Boston, MA)

…personalized guidance that members can act on every day. WHOOP is hiring a Senior AI/ML Engineer to help scale the intelligence layer behind WHOOP's AI-powered ... continuous physiological data into clear insights and actionable recommendations. Our AI platform is central to this mission, turning raw physiological signals into… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior Data Platform Engineer

Ellipsis Health (San Francisco, CA)

…of the most preeminent venture capital teams. We are currently looking for an experienced Senior Data Platform Engineer , at the Staff or Principal level, ... Responsibilities Design, develop, and operation of a scalable and secure data platform to support analytics, ML Ops, and business intelligence Collaborate closely… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior AI Platform Engineer

Qualcomm (San Diego, CA)

…Group, Engineering Group > Software Engineering General Summary We are seeking a Senior AI Platforms Engineer to design, build, and operate the infrastructure ... Optimize inference performance for throughput, latency, and cost efficiency. Platform Engineering Build and maintain Kubernetes clusters for AI workloads with… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Senior AI Platform Engineer

Synagi (San Francisco, CA)

…ML tasks. Nice-to-Haves Experience with DeepSpeed or vLLM for efficient inference serving. Familiarity with LangChain or LlamaIndex for rapid agent prototyping. ... in decentralised or edge deployments (eg, WASM at the edge) for ultra-low-latency inference . Applying Send your resume-plus a short note (3-5 sentences) describing a… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior AI Engineer (Gen AI…

Capital One (San Francisco, CA)

…services* Experience developing AI and ML algorithms or technologies (eg LLM Inference , Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, ... developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior Systems Engineer…

Lambda Inc. (San Francisco, CA)

Lambda, The Superintelligence Cloud, builds Gigawatt-scale AI Factories for Training and Inference . Lambda's mission is to make compute as ubiquitous as electricity ... securely and accurately. Our scope includes enterprise applications, integrations, data platform and analytics, compliance automation, and all things IT. What You'll… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source

"Juju

Recent Searches

Recent Jobs

Account Login

Sign Up

Forgot your password?

Advanced Search