- Hamilton Barnes Associates Limited (San Francisco, CA)
- …of the most advanced AI workloads worldwide. They're now building a serverless inference platform , beginning with cost-efficient batch inference and ... want to miss this opportunity! Key Responsibilities Take ownership of the inference platform architecture, from batch to low-latency workloads. Design, build,… more
- Etched.ai, Inc. (San Jose, CA)
- A tech company specializing in AI inference seeks a technical team member to build installation guides, SDK flows, and reproducible performance metrics. This role ... requires deep knowledge of ML inference infrastructure, software engineering, and the ability to work collaboratively across teams. Candidates should possess… more
- Capital One (Washington, DC)
- Senior AI Engineer (Gen AI Platform Services, Agentic Systems) Overview: At Capital One, we are creating responsible and reliable AI systems, changing ... agreed upon number of hours to be regularly worked. Cambridge, MA: $158,600 - $181,000 for Senior AI Engineer McLean, VA: $158,600 - $181,000 for Senior AI … more
- Apple Inc. (San Francisco, CA)
- Senior Software Engineer , Model Inference San...best map in the world. In this role on ML Platform , you will help bring advanced deep learning and large ... measurable results at global scale. Description As a Software Engineer on the Apple Maps team, you will lead...will lead the design and implementation of large-scale, high-performance inference services that support a wide range of models… more
- Akamai Technologies GmbH (Cambridge, MA)
- Senior Principal Software Engineer - Akamai Inference Cloud (Remote) United States (Remote) Job Description Do you thrive on defining the future of AI ... requiring a longer-term view and deep understanding of business objectives. As a Senior Principal Software Engineer , you will be responsible for: Defining the… more
- Red Hat (Boston, MA)
- …bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI ... techniques for model compression, our team provides a stable platform for enterprises to build, optimize, and scale LLM...scale LLM deployments.We are seeking an experienced ML Ops engineer to work closely with our product and research… more
- Red Hat, Inc. (Boston, MA)
- …bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI ... techniques for model compression, our team provides a stable platform for enterprises to build, optimize, and scale LLM...LLM deployments. We are seeking an experienced ML Ops engineer to work closely with our product and research… more
- quadric.io, Inc (Burlingame, CA)
- …executes both NN graph code and conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the key bridge between the world of AI/LLM ... models and Quadric unique platforms. The AI Inference Engineer at Quadric will [1] port...platform ; [2] optimize the model deployment for efficient inference ; [3] profile and benchmark the model performance. This… more
- Comfy (San Francisco, CA)
- A leading AI platform company in San Francisco is seeking a talented individual to optimize model inference for their advanced visual AI product. The ideal ... candidate will engage in building efficient AI models and tackling complex challenges. The role requires a strong background in PyTorch and a passion for pushing performance limits. Join a dynamic team focused on creating innovative AI solutions and shaping… more
- The Association of Technology, Management and Applied… (Morgan Hill, CA)
- …innovation in AI. We are building the next generation of Gen AI platform , empowering new AI initiatives across Consumer, Small Business, Global Banking, and Wealth ... organizations. This is a unique opportunity to contribute to a critical platform that will enable secure, scalable, and high-performance AI capabilities across the… more
- Neara (Palo Alto, CA)
- Job type: Full Time Department: Backend Engineer Work type: On-Site About A rchetype AI Archetype AI is developing the world's first AI platform to bring AI into ... jobsarchetypeaiio. About the Role Were looking for a highly motivated backend engineer with a passion for building performant, scalable, and resilient distributed… more
- Inference (San Francisco, CA)
- A technology company in San Francisco is seeking a Senior Full-Stack Engineer to develop frontend features for its AI platform . Responsibilities include ... optimizing performance, mentoring team members, and creating user-centric applications. Candidates should have 5+ years of experience with React, Tailwind, and Typescript. A competitive salary of $120,000 - $180,000 plus equity and benefits is offered.… more
- Inference (San Francisco, CA)
- Inference .net is hiring a Senior Full-Stack (Frontend-Focused) Engineer Help us build beautiful, performant web experiences that give users super-powers over ... our globally distributed LLM inference platform . If you love shipping React apps that feel snappy at planet-scale, we'd love to meet you. About Inference .net… more
- harvey.ai (San Francisco, CA)
- …being written today - and we're just getting started. Role Overview As a Backend Platform Engineer at Harvey, you will help build and operate the cohesive ... incrementally, but end-to-end. By combining frontier agentic AI, an enterprise-grade platform , and deep domain expertise, we're reshaping how critical knowledge work… more
- Whoop, Inc. (Boston, MA)
- …personalized guidance that members can act on every day. WHOOP is hiring a Senior AI/ML Engineer to help scale the intelligence layer behind WHOOP's AI-powered ... continuous physiological data into clear insights and actionable recommendations. Our AI platform is central to this mission, turning raw physiological signals into… more
- Ellipsis Health (San Francisco, CA)
- …of the most preeminent venture capital teams. We are currently looking for an experienced Senior Data Platform Engineer , at the Staff or Principal level, ... Responsibilities Design, develop, and operation of a scalable and secure data platform to support analytics, ML Ops, and business intelligence Collaborate closely… more
- Qualcomm (San Diego, CA)
- …Group, Engineering Group > Software Engineering General Summary We are seeking a Senior AI Platforms Engineer to design, build, and operate the infrastructure ... Optimize inference performance for throughput, latency, and cost efficiency. Platform Engineering Build and maintain Kubernetes clusters for AI workloads with… more
- Synagi (San Francisco, CA)
- …ML tasks. Nice-to-Haves Experience with DeepSpeed or vLLM for efficient inference serving. Familiarity with LangChain or LlamaIndex for rapid agent prototyping. ... in decentralised or edge deployments (eg, WASM at the edge) for ultra-low-latency inference . Applying Send your resume-plus a short note (3-5 sentences) describing a… more
- Capital One (San Francisco, CA)
- …services* Experience developing AI and ML algorithms or technologies (eg LLM Inference , Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, ... developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and… more
- Lambda Inc. (San Francisco, CA)
- Lambda, The Superintelligence Cloud, builds Gigawatt-scale AI Factories for Training and Inference . Lambda's mission is to make compute as ubiquitous as electricity ... securely and accurately. Our scope includes enterprise applications, integrations, data platform and analytics, compliance automation, and all things IT. What You'll… more