• AI / ML Model Runtime

    Broadcom (Palo Alto, CA)
    …team of talented and enthusiastic engineers. This role will be a member of the Private AI Services' Model Runtime team, which is a Kubernetes based control ... a Systems Engineer to join VMware Cloud Foundation's (VCF) AI and Advanced Services team. This position is key...plane that operates ML inferencing services. The successful candidate must have experience… more
    Broadcom (10/24/25)
    - Save Job - Related Jobs - Block Source
  • Technical Lead, ML Frameworks,…

    Google (Mountain View, CA)
    Technical Lead, ML Frameworks, Runtime , Devices, Numerical Acceleration _corporate_fare_ Google _place_ Mountain View, CA, USA **Advanced** Experience owning ... project strategy, ML design, and optimizing industry-scale ML infrastructure (eg, model deployment, model...be responsible for providing expertise over the compiler and runtime boundary and provide technical strategy and guidance to… more
    Google (10/04/25)
    - Save Job - Related Jobs - Block Source
  • Software Engineer, Systems ML - Frameworks…

    Meta (Menlo Park, CA)
    …strategy that delivers a highly flexible platform to train & serve new DL/ ML model architectures, combined with auto-tuned high performance for production ... be working on one of the core areas such as PyTorch framework components, AI compiler and runtime , high-performance kernels and tooling to accelerate machine… more
    Meta (09/06/25)
    - Save Job - Related Jobs - Block Source
  • Research Scientist, AI & Systems Co-design…

    Meta (Menlo Park, CA)
    …via concurrent design and optimization of many aspects of the system from models and runtime all the way to the AI hardware, optimizing across compute, network ... sustained scaling and hardware efficiency during training and inference. 3. Benchmark, analyze, model and project the performance of AI workloads against a wide… more
    Meta (08/08/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Software Development Engineer, FAR (Frontier…

    Amazon (San Francisco, CA)
    … stack (cuDNN, CUDA Graph, etc.) - Experience with ML compilers (ONNX Runtime , TVM, etc.) - Experience with transformer model optimization - Background in ... Join the next revolution in robotics at Amazon's Frontier AI & Robotics team, where you'll work alongside world-renowned...- Explore and evaluate emerging optimization techniques including ONNX Runtime and other ML compilers - Maintain… more
    Amazon (09/02/25)
    - Save Job - Related Jobs - Block Source
  • Principal AI Architect

    Microsoft Corporation (Mountain View, CA)
    …+ Proven track record of cross-disciplinary collaboration between hardware, software, and ML model teams. + Experience profiling and optimizing large-scale ... AI Architect** to join our team! **Responsibilities** + Model Bring-Up & Characterization + Lead the bring-up and... Co-Design + Partner with silicon and system architects, compiler/ runtime engineers, and model researchers to define… more
    Microsoft Corporation (10/30/25)
    - Save Job - Related Jobs - Block Source
  • Azure AI Security Senior Consultant

    Deloitte (San Francisco, CA)
    …experience with Azure Machine Learning and Azure OpenAI + Proven experience with AI / ML model evaluation, adversarial testing, and a deep understanding ... models, focusing on encryption, access control, data integrity, model scanning, and overall AI model... registry security, secure model deployment, and runtime security monitoring for AI models +… more
    Deloitte (10/23/25)
    - Save Job - Related Jobs - Block Source
  • Principal AI Engineer (GenAI) - Molecular…

    Bristol Myers Squibb (Brisbane, CA)
    …. **Summary:** Own the strategy and delivery of Gen AI - native applications, predictive- model workflows, and insight-driven analytics ... uncover deeper insights, and make better decisions. **Molecular Discovery ML Enablement:** + Champion predictive- model use-cases across..., or on-prem containers . + Knowledge of GPU runtime tuning or Triton-based multi- model serving. +… more
    Bristol Myers Squibb (09/04/25)
    - Save Job - Related Jobs - Block Source
  • Kubernetes Platform Engineer - Private AI

    Broadcom (Palo Alto, CA)
    …control plane that automates the lifecycle of AI Services: Model Gallery, Model Runtime and ML API Gateway, Data Indexing and Retrieval, and Agent ... Kubernetes Platform Engineer to join VMware Cloud Foundation's (VCF) AI and Advanced Services team. This position is key...key to building a best in class private cloud AI platform. You will have a high impact by… more
    Broadcom (10/08/25)
    - Save Job - Related Jobs - Block Source
  • Software Development Engineer, Frontier AI

    Amazon (San Francisco, CA)
    …serving at scale - Explore and evaluate emerging optimization techniques including ONNX Runtime and other ML compilers - Maintain high engineering standards ... Description Join the next revolution in robotics at Amazon's Frontier AI & Robotics team, where you'll contribute to breakthrough foundation models run at production… more
    Amazon (09/02/25)
    - Save Job - Related Jobs - Block Source
  • Lead Engineer, Inference Platform

    MongoDB (Palo Alto, CA)
    …routing, and model health monitoring + Collaborate with peers across ML , infra, and product teams to define architectural patterns and operational practices that ... model serving architecture using tools like vLLM, ONNX Runtime , and container orchestration in Kubernetes + Provide technical...world's most popular developer data platform + Collaborate with ML experts from Voyage. ai to bring cutting-edge… more
    MongoDB (09/27/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer, TPU Performance

    Google (Mountain View, CA)
    …experience with software design and architecture. + 3 years of experience with ML infrastructure (eg, model deployment, model evaluation, optimization, data ... model architecture and optimize the performance of these ML models on TPU systems for both JAX and...for Gemini and OSS ML models. The ML , Systems, & Cloud AI (MSCA) organization… more
    Google (10/23/25)
    - Save Job - Related Jobs - Block Source
  • Technical Artist

    Meta (Burlingame, CA)
    …and ML concepts at a core level. (data collection, model training, deployment, runtime performance) **Preferred Qualifications:** Preferred Qualifications: ... new capabilities in partnership with cross functional teams 2. Implement runtime logic, tooling within a complex real-time environment with performance limitations… more
    Meta (10/20/25)
    - Save Job - Related Jobs - Block Source