• Hamilton Barnes Associates Limited (San Francisco, CA)
    …of the most advanced AI workloads worldwide. They're now building a serverless inference platform, beginning with cost-efficient batch inference and ... want to miss this opportunity! Key Responsibilities Take ownership of the inference platform architecture, from batch to low-latency workloads. Design, build,… more
    job goal (01/13/26)
  • Etched.ai, Inc. (San Jose, CA)
    A tech company specializing in AI inference seeks a technical team member to build installation guides, SDK flows, and reproducible performance metrics. This role ... requires deep knowledge of ML inference infrastructure, software engineering, and the ability to work collaboratively across teams. Candidates should possess… more
    job goal (01/13/26)
  • Apple Inc. (San Francisco, CA)
    Senior Software Engineer, Model Inference San...best map in the world. In this role on ML Platform, you will help bring advanced deep learning and large ... measurable results at global scale. Description As a Software Engineer on the Apple Maps team, you will lead...will lead the design and implementation of large-scale, high-performance inference services that support a wide range of models… more
    job goal (01/13/26)
  • Akamai Technologies GmbH (Cambridge, MA)
    Senior Principal Software Engineer - Akamai Inference Cloud (Remote) United States (Remote) Job Description Do you thrive on defining the future of AI ... requiring a longer-term view and deep understanding of business objectives. As a Senior Principal Software Engineer, you will be responsible for: Defining the… more
    job goal (01/13/26)
  • Red Hat (Boston, MA)
    …bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI ... techniques for model compression, our team provides a stable platform for enterprises to build, optimize, and scale LLM...scale LLM deployments. We are seeking an experienced ML Ops engineer to work closely with our product and research… more
    job goal (01/13/26)
  • Red Hat, Inc. (Boston, MA)
    …bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI ... techniques for model compression, our team provides a stable platform for enterprises to build, optimize, and scale LLM...LLM deployments. We are seeking an experienced ML Ops engineer to work closely with our product and research… more
    job goal (01/13/26)
  • The Association of Technology, Management and Applied… (Morgan Hill, CA)
    …innovation in AI. We are building the next generation of Gen AI platform, empowering new AI initiatives across Consumer, Small Business, Global Banking, and Wealth ... organizations. This is a unique opportunity to contribute to a critical platform that will enable secure, scalable, and high-performance AI capabilities across the… more
    job goal (01/12/26)
  • quadric.io, Inc (Burlingame, CA)
    …executes both NN graph code and conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the key bridge between the world of AI/LLM ... models and Quadric unique platforms. The AI Inference Engineer at Quadric will [1] port...platform; [2] optimize the model deployment for efficient inference; [3] profile and benchmark the model performance. This… more
    job goal (01/13/26)
  • Neara (Palo Alto, CA)
    Job type: Full Time Department: Backend Engineer Work type: On-Site About Archetype AI Archetype AI is developing the world's first AI platform to bring AI into ... jobsarchetypeaiio. About the Role We're looking for a highly motivated backend engineer with a passion for building performant, scalable, and resilient distributed… more
    job goal (01/12/26)
  • Comfy (San Francisco, CA)
    A leading AI platform company in San Francisco is seeking a talented individual to optimize model inference for their advanced visual AI product. The ideal ... candidate will engage in building efficient AI models and tackling complex challenges. The role requires a strong background in PyTorch and a passion for pushing performance limits. Join a dynamic team focused on creating innovative AI solutions and shaping… more
    job goal (01/13/26)
  • Inference (San Francisco, CA)
    A technology company in San Francisco is seeking a Senior Full-Stack Engineer to develop frontend features for its AI platform. Responsibilities include ... optimizing performance, mentoring team members, and creating user-centric applications. Candidates should have 5+ years of experience with React, Tailwind, and Typescript. A competitive salary of $120,000 - $180,000 plus equity and benefits is offered.… more
    job goal (01/12/26)
  • Inference (San Francisco, CA)
    Inference.net is hiring a Senior Full-Stack (Frontend-Focused) Engineer Help us build beautiful, performant web experiences that give users super-powers over ... our globally distributed LLM inference platform. If you love shipping React apps that feel snappy at planet-scale, we'd love to meet you. About Inference.net… more
    job goal (01/12/26)
  • Capital One (Washington, DC)
    Senior AI Engineer (Gen AI Platform Services, Agentic Systems) Overview: At Capital One, we are creating responsible and reliable AI systems, changing ... agreed upon number of hours to be regularly worked. Cambridge, MA: $158,600 - $181,000 for Senior AI Engineer McLean, VA: $158,600 - $181,000 for Senior AI … more
    job goal (01/12/26)
  • harvey.ai (San Francisco, CA)
    …being written today - and we're just getting started. Role Overview As a Backend Platform Engineer at Harvey, you will help build and operate the cohesive ... incrementally, but end-to-end. By combining frontier agentic AI, an enterprise-grade platform, and deep domain expertise, we're reshaping how critical knowledge work… more
    job goal (01/13/26)
  • Qualcomm (San Diego, CA)
    …Group, Engineering Group > Software Engineering General Summary We are seeking a Senior AI Platforms Engineer to design, build, and operate the infrastructure ... Optimize inference performance for throughput, latency, and cost efficiency. Platform Engineering Build and maintain Kubernetes clusters for AI workloads with… more
    job goal (01/12/26)
  • Whoop, Inc. (Boston, MA)
    …personalized guidance that members can act on every day. WHOOP is hiring a Senior AI/ML Engineer to help scale the intelligence layer behind WHOOP's AI-powered ... continuous physiological data into clear insights and actionable recommendations. Our AI platform is central to this mission, turning raw physiological signals into… more
    job goal (01/13/26)
  • Ellipsis Health (San Francisco, CA)
    …of the most preeminent venture capital teams. We are currently looking for an experienced Senior Data Platform Engineer, at the Staff or Principal level, ... Responsibilities Design, develop, and operate a scalable and secure data platform to support analytics, ML Ops, and business intelligence Collaborate closely… more
    job goal (01/13/26)
  • Synagi (San Francisco, CA)
    …ML tasks. Nice-to-Haves Experience with DeepSpeed or vLLM for efficient inference serving. Familiarity with LangChain or LlamaIndex for rapid agent prototyping. ... in decentralised or edge deployments (e.g., WASM at the edge) for ultra-low-latency inference. Applying Send your resume plus a short note (3-5 sentences) describing a… more
    job goal (01/13/26)
  • Capital One (San Francisco, CA)
    …services* Experience developing AI and ML algorithms or technologies (e.g., LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, ... developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and… more
    job goal (01/13/26)
  • Lambda Inc. (San Francisco, CA)
    Lambda, The Superintelligence Cloud, builds Gigawatt-scale AI Factories for Training and Inference. Lambda's mission is to make compute as ubiquitous as electricity ... securely and accurately. Our scope includes enterprise applications, integrations, data platform and analytics, compliance automation, and all things IT. What You'll… more
    job goal (01/13/26)