• Databricks Inc. (San Francisco, CA)
    …data and AI company in San Francisco seeks a Staff Software Engineer for GenAI inference to lead its architecture and optimization efforts. Candidates should ... with at least 6 years of experience and an understanding of ML inference internals. Key tasks include collaborating on model features, optimizing the inference more
    job goal (01/13/26)
  • NVIDIA Corporation (Santa Clara, CA)
    Senior Solutions Architect, Autonomous Driving - GenAI. Locations: US, CA, ... and we are looking for an expert AV and GenAI Solutions Architect to help assist customers...VLMs, DiT, etc. Experience in deploying LLM models at scale on mainstream cloud providers (e.g., AWS, Azure, GCP).… more
    job goal (01/12/26)
  • Databricks Inc. (San Francisco, CA)
    Staff Software Engineer - GenAI inference P-1285 About This Role As a staff software engineer for GenAI inference , you will lead the architecture, ... low latency, and robust scaling. Your work will encompass the full GenAI inference stack: kernels, runtimes, orchestration, memory, and integration with… more
    job goal (01/13/26)
  • Amazon (San Francisco, CA)
    Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference Job ID: 3067759 | Amazon.com Services LLC The Annapurna Labs team at Amazon Web Services ... Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium.… more
    job goal (01/13/26)
  • Amazon (San Francisco, CA)
    Software Development Engineer, AI/ML, AWS Neuron, Model Inference The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development ... kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia...ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement… more
    job goal (01/12/26)
  • Rubrik, Inc. (Palo Alto, CA)
    …- streaming telemetry, audit trails, and behavioral analytics across thousands of agents. Architect and scale systems that handle millions of agent decisions per ... possible for organizations to operate production‑grade AI agents at scale . As a member of the Rubrik Agent Cloud...- including model gateways (like LiteLLM or MCP), fine‑tuning, inference optimization, or policy enforcement in AI workloads. Strong… more
    job goal (01/13/26)
  • Icon Ventures (San Francisco, CA)
    …Experience building on a modern MLOps stack (feature mgmt, orchestration, streaming, online inference at scale ) Compensation, Benefits & Perks Quizlet is an ... us to design and deliver AI-powered learning tools that scale across the world and unlock human potential. About...that boost learner outcomes and creator productivity. You will architect and ship a variety of models and modeling… more
    job goal (01/12/26)
  • Amazon (Cupertino, CA)
    …Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. ... The AWS Machine Learning accelerators (Inferentia/Trainium) offer unparalleled ML inference and training performances. They are enabled through a state-of-the-art… more
    job goal (01/12/26)
  • Anzen Technologies, Inc. (San Francisco, CA)
    …evaluating results and rolling them out with the appropriate infrastructure. Architect production‑grade systems that marry real‑time inference with ... best investors in the world, and are continuing to scale our team in Mexico, Canada, and the United...Broad ML toolbox, from classic ML (especially NLP) to GenAI . Strong backend fundamentals: distributed systems, streaming data pipelines.… more
    job goal (01/13/26)
  • Amazon (San Francisco, CA)
    …Proof Of Concepts advancing the state of the art in AI & ML for GenAI . Collaborate with cross-functional teams to architect and execute technically rigorous AI ... core capabilities, ensuring a seamless and intuitive user experience. Develop new inference and training techniques to improve the performance of Large Language… more
    job goal (01/13/26)
  • Senior Solutions Architect, Autonomous…

    NVIDIA (Santa Clara, CA)
    …the world's leading AI company, and we are looking for an expert AV and GenAI Solutions Architect to help assist customers with adoption of NVIDIA's full-stack ... and ride sharing algorithms among other things. A Solutions Architect is the first line of technical expertise between...hands-on technical mentorship to partners and customers on NVIDIA GenAI stack. Guide customers to develop and deploy Agentic… more
    NVIDIA (10/23/25)
  • Senior Software Development Engineer, AI/ML, AWS…

    Amazon (Cupertino, CA)
    …Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. ... Labs team at AWS, is the backbone for accelerating deep learning and GenAI workloads on Amazon's Inferentia and Trainium ML accelerators. This comprehensive toolkit… more
    Amazon (01/06/26)
  • Senior Solutions Architect, Agentic AI

    NVIDIA (Santa Clara, CA)
    …crowd: + Demonstrate expertise in building applications and systems using NeMo Framework, inference at scale , NIMs, AI Blueprints. + Take end-to-end ownership of ... is seeking an outstanding Senior AI Engineer or Solutions Architect to join our growing team focused on partner...category defining systems and production grade AI solutions at scale . What you will be doing: + Building an… more
    NVIDIA (11/12/25)
  • (USA) Principal, Software Engineer

    Walmart (Sunnyvale, CA)
    …leader driving the next phase of Walmart's Performance and Resiliency Engineering. Architect , build, and scale intelligent agentic AI/ML systems that proactively ... Infrastructure & platforms is vital to success at the scale of Walmart. Our team builds and maintains the...agents for multi-step reasoning, knowledge grounding, and decision-making. + Architect scalable, distributed AI systems with a focus on… more
    Walmart (12/24/25)