  • AI / ML Model Runtime

    Broadcom (Palo Alto, CA)
    …team of talented and enthusiastic engineers. This role will be a member of the Private AI Services' Model Runtime team, which is a Kubernetes based control ... a Systems Engineer to join VMware Cloud Foundation's (VCF) AI and Advanced Services team. This position is key...plane that operates ML inferencing services. The successful candidate must have experience… more
    Broadcom (07/25/25)
  • Sr. Product Manager - Runtime Infra,…

    Amazon (Cupertino, CA)
    …an experienced Technical Product Manager to define and drive product strategy for Neuron Runtime and ML Infrastructure integration. You will be part of the AWS ... ML performance in the cloud. You will lead runtime and infrastructure requirements working backward from customer needs,...to and experience with Amazon's growing suite of generative AI services and other cloud computing offerings across the… more
    Amazon (09/02/25)
  • Technical Lead, ML Frameworks,…

    Google (Mountain View, CA)
    Technical Lead, ML Frameworks, Runtime, Devices, Numerical Acceleration Google Mountain View, CA, USA **Advanced** Experience owning ... project strategy, ML design, and optimizing industry-scale ML infrastructure (eg, model deployment, model...into priorities and projects for the broader group. The ML, Systems, & Cloud AI (MSCA) organization… more
    Google (10/04/25)
  • Machine Learning Engineer, ML

    pony.ai (Fremont, CA)
    …evaluation, optimization, deployment, and monitoring. As a Machine Learning Engineer in ML Runtime & Optimization, you will be developing technologies to ... went public at NASDAQ in Nov. 2024. Responsibility The ML Infrastructure team at Pony.ai provides a...compute platform architecture design and software infrastructure. + Apply model optimization and efficient deep learning techniques to models… more
    pony.ai (08/02/25)
  • Senior Software Development Engineer, AI

    Amazon (Cupertino, CA)
    …machine learning systems, you'll bring expertise in low-level optimization, system architecture, and ML model acceleration. In this role, you will: * would with ... ML accelerators. This comprehensive toolkit includes an ML compiler, runtime, and application framework that...of the most interesting and impactful infrastructure challenges in AI / ML today. Basic Qualifications - 5+ years… more
    Amazon (10/08/25)
  • Senior Software Development Engineer, AI

    Amazon (Cupertino, CA)
    …machine learning systems, you'll bring expertise in low-level optimization, system architecture, and ML model acceleration. In this role, you will: * Design, ... ML accelerators. This comprehensive toolkit includes an ML compiler, runtime, and application framework that...of the most interesting and impactful infrastructure challenges in AI / ML today. Basic Qualifications - Bachelor's degree… more
    Amazon (08/29/25)
  • Senior Software Development Engineer, AI

    Amazon (Cupertino, CA)
    …use them. This role is for a software engineer in the Machine Learning Inference Model Enablement team for AWS Neuron at Annapurna Labs. This role is responsible for ... and performance tuning of a wide variety of LLM model families, including massive scale large language models like...team works side by side with compiler engineers and runtime engineers to create, build and tune distributed inference… more
    Amazon (09/13/25)
  • Software Engineer- AI / ML , AWS…

    Amazon (Cupertino, CA)
    …responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like GPT2, ... and F1 EC2 Instances, AWS Neuron, Inferentia and Trainium ML Accelerators, and in storage with scalable NVMe, are...side by side with chip architects, compiler engineers and runtime engineers to create, build and tune distributed… more
    Amazon (10/13/25)
  • Sr. Software Engineer- AI / ML , AWS…

    Amazon (Cupertino, CA)
    …responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like GPT2, ... a senior software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This role is...side by side with chip architects, compiler engineers and runtime engineers to create, build and tune distributed… more
    Amazon (10/17/25)
  • Sr. Software Engineer- AI / ML , AWS…

    Amazon (Cupertino, CA)
    …responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive-scale Large Language Models (LLM) such ... Stable Diffusion, Vision Transformers (ViT) and many more. The ML Distributed Training team works side by side with...side by side with chip architects, compiler engineers and runtime engineers to create, build and tune distributed training… more
    Amazon (09/19/25)
  • ML Acceleration / Framework Engineer…

    Amazon (Cupertino, CA)
    …to this and extending all of this for the Neuron based system is key. - ML Frameworks partners with compiler, runtime, and research experts to make AWS Trainium ... side by side with chip architects, compiler engineers and runtime engineers to create, build and tune distributed training...with vLLM, Triton, and TensorRT-turning breakthrough ideas into production-ready AI for millions of customers. - The ML… more
    Amazon (10/13/25)
  • Software Engineering Manager, ML Kernel…

    Amazon (Cupertino, CA)
    …machine learning systems, you'll bring expertise in low-level optimization, system architecture, and ML model acceleration. In this role, you will: * Design and ... expertise to push the boundaries of what's possible in AI acceleration. The AWS Neuron SDK, developed by the... ML accelerators. This comprehensive toolkit includes an ML compiler, runtime, and application framework that… more
    Amazon (09/04/25)
  • Sr. ML Kernel Performance Engineer, AWS…

    Amazon (Cupertino, CA)
    …machine learning systems, you'll bring expertise in low-level optimization, system architecture, and ML model acceleration. In this role, you will: * Design and ... expertise to push the boundaries of what's possible in AI acceleration. The AWS Neuron SDK, developed by the... ML accelerators. This comprehensive toolkit includes an ML compiler, runtime, and application framework that… more
    Amazon (08/15/25)
  • Software Engineer, Systems ML - Frameworks…

    Meta (Menlo Park, CA)
    …strategy that delivers a highly flexible platform to train & serve new DL/ML model architectures, combined with auto-tuned high performance for production ... be working on one of the core areas such as PyTorch framework components, AI compiler and runtime, high-performance kernels and tooling to accelerate machine… more
    Meta (09/06/25)
  • Research Scientist, AI & Systems Co-design…

    Meta (Menlo Park, CA)
    …via concurrent design and optimization of many aspects of the system from models and runtime all the way to the AI hardware, optimizing across compute, network ... sustained scaling and hardware efficiency during training and inference. 3. Benchmark, analyze, model and project the performance of AI workloads against a wide… more
    Meta (08/08/25)
  • Sr. Software Development Engineer, FAR (Frontier…

    Amazon (San Francisco, CA)
    …stack (cuDNN, CUDA Graph, etc.) - Experience with ML compilers (ONNX Runtime, TVM, etc.) - Experience with transformer model optimization - Background in ... Join the next revolution in robotics at Amazon's Frontier AI & Robotics team, where you'll work alongside world-renowned...- Explore and evaluate emerging optimization techniques including ONNX Runtime and other ML compilers - Maintain… more
    Amazon (09/02/25)
  • Azure AI Security Senior Consultant

    Deloitte (San Francisco, CA)
    …experience with Azure Machine Learning and Azure OpenAI + Proven experience with AI / ML model evaluation, adversarial testing, and a deep understanding ... models, focusing on encryption, access control, data integrity, model scanning, and overall AI model... registry security, secure model deployment, and runtime security monitoring for AI models +… more
    Deloitte (09/25/25)
  • Principal AI Engineer (GenAI) - Molecular…

    Bristol Myers Squibb (Brisbane, CA)
    …. **Summary:** Own the strategy and delivery of Gen AI-native applications, predictive-model workflows, and insight-driven analytics ... uncover deeper insights, and make better decisions. **Molecular Discovery ML Enablement:** + Champion predictive-model use-cases across..., or on-prem containers. + Knowledge of GPU runtime tuning or Triton-based multi-model serving. +… more
    Bristol Myers Squibb (09/04/25)
  • Principal AI Infrastructure Abstraction…

    Cisco (San Jose, CA)
    …You will bridge the gap between raw compute resources and AI / ML frameworks, allowing infrastructure teams and model developers to consume shared ... with a focus in **multi-tenant environments**. + Experience integrating with **AI / ML platforms or pipelines** (eg, PyTorch, TensorFlow, Triton Inference Server,… more
    Cisco (10/18/25)
  • Senior Software Engineering Manager, Cloud…

    Google (Sunnyvale, CA)
    …experience leading technical project strategy, ML design, and optimizing industry-scale ML infrastructure (eg, model deployment, model evaluation, data ... Senior Software Engineering Manager, Cloud AI, Agents Google Sunnyvale, CA, USA...for the team designing and building our core agentic runtime. This includes solving first-principle challenges in Large Language… more
    Google (10/01/25)