• Senior Deep Learning

    NVIDIA (Durham, NC)
    We are now looking for a Senior Deep Learning Inference Performance Architect! NVIDIA is seeking a Senior Performance Architect - a creative engineer ... who loves to squeeze out every cycle of performance from deep learning software. The Inference Architecture team does groundbreaking hardware-software… more
    NVIDIA (10/29/25)
    - Save Job - Related Jobs - Block Source
  • Senior Deep Learning

    NVIDIA (Santa Clara, CA)
    We are now looking for a Senior Deep Learning Architect for LLM Inference ! NVIDIA is at the forefront of the generative AI revolution. Our Inference ... 6+ years of relevant industry experience + Detailed knowledge of deep learning inference serving, PyTorch programming, profiling, and compiler optimizations.… more
    NVIDIA (09/24/25)
    - Save Job - Related Jobs - Block Source
  • Senior Deep Learning Software…

    NVIDIA (Santa Clara, CA)
    NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, ... at the forefront of efficient large-scale model serving and inference . You will play a central role in improving...of groundbreaking language models. You'll work closely with the deep learning community to implement the latest… more
    NVIDIA (09/07/25)
    - Save Job - Related Jobs - Block Source
  • Senior Deep Learning Software…

    NVIDIA (Santa Clara, CA)
    NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, ... Our team is responsible for developing and maintaining high-performance deep learning frameworks, including SGLang and vLLM,...at the forefront of efficient large-scale model serving and inference . You will play a central role in improving… more
    NVIDIA (09/05/25)
    - Save Job - Related Jobs - Block Source
  • Senior Deep Learning Software…

    NVIDIA (Santa Clara, CA)
    …and engineering teams alike developing best-in-class AI models. We are now looking for a Senior Deep Learning Software Engineer to develop and scale up our ... large language models (LLM) and diffusion models for maximal inference efficiency using techniques ranging from neural architecture search...3+ years of relevant work or research experience in Deep Learning . + Excellent software design skills,… more
    NVIDIA (10/16/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer, Deep

    NVIDIA (Santa Clara, CA)
    Are you passionate about driving innovation in deep learning and eager to work on cutting-edge AI technology? Join NVIDIA's TensorRT team as a Senior ... best practices with C++11 and C++14. + Familiarity with deep learning concepts and frameworks. + A...models (such as Large Language Models) & frameworks for inference . + Background with C++17. NVIDIA is widely considered… more
    NVIDIA (10/02/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Development Engineer…

    Amazon (Seattle, WA)
    …Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning ... Labs team at AWS, is the backbone for accelerating deep learning and GenAI workloads on Amazon's...new and existing systems experience - Fundamentals of Machine learning and LLMs, their architecture, training and inference more
    Amazon (09/26/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Development Engineer,…

    Amazon (Seattle, WA)
    …Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning ... Labs team at AWS, is the backbone for accelerating deep learning and GenAI workloads on Amazon's...new and existing systems experience - Fundamentals of Machine learning and LLMs, their architecture, training and inference more
    Amazon (10/08/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer - Distributed…

    NVIDIA (CA)
    …Dynamo Inference Server! NVIDIA is hiring software engineers for its GPU-accelerated deep learning software team, and we are a remote friendly work ... on the world. We are now looking for a Senior System Software Engineer to work on user facing...world are using GPUs to power a revolution in deep learning , enabling breakthroughs in problems from… more
    NVIDIA (08/27/25)
    - Save Job - Related Jobs - Block Source
  • Senior GenAI Algorithms Engineer - Model…

    NVIDIA (Santa Clara, CA)
    …and streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative ... cache optimization, parallelism strategies). + Drive continuous innovation in deep learning inference performance to strengthen NVIDIA platform integration… more
    NVIDIA (09/23/25)
    - Save Job - Related Jobs - Block Source
  • Senior Inference Technical Product…

    NVIDIA (Santa Clara, CA)
    Deep understanding of modern data center architectures, accelerated computing, distributed inference , deep learning frameworks (PyTorch, TensorFlow, JAX), ... We are looking for a Senior Technical Product Marketing Manager. This role will...rapidly growing data center business and pivotal in our inference marketing. You will be focused on working with… more
    NVIDIA (09/25/25)
    - Save Job - Related Jobs - Block Source
  • Senior DL Algorithms Engineer…

    NVIDIA (Santa Clara, CA)
    …experience. + 3+ years of experience. + Strong background in deep learning and neural networks, in particular inference . + Experience with performance ... We are now looking for a Senior DL Algorithms Engineer! NVIDIA is seeking ...help us squeeze every last clock cycle out of Deep Learning workloads. If you are unafraid… more
    NVIDIA (08/08/25)
    - Save Job - Related Jobs - Block Source
  • Senior Product Manager - Inference

    NVIDIA (Santa Clara, CA)
    …one of the industry's most desirable employers. NVIDIA is at the center of Deep Learning , Artificial Intelligence, and Autonomous Vehicles. If you're looking for ... be doing: + Serve as a Subject Matter Expert on AI Inference : Maintain a deep understanding of the entire inference stack, including performance, scaling… more
    NVIDIA (09/07/25)
    - Save Job - Related Jobs - Block Source
  • Senior AI Software Engineer, LLM…

    NVIDIA (Santa Clara, CA)
    …If you're passionate about system-level performance, compiler IR, and GPU kernel optimization for deep learning inference , we'd love to consider you for our ... Senior AI Software Engineer, in our LLM Inference Performance Analysis and Optimization team! NVIDIA leads the...Contribute to a core team at the forefront of deep learning and LLM inference more
    NVIDIA (11/01/25)
    - Save Job - Related Jobs - Block Source
  • Senior Technical Marketing Engineer - AI…

    NVIDIA (Santa Clara, CA)
    Deep understanding of modern data center architectures, accelerated computing, distributed inference , deep learning frameworks (PyTorch, TensorFlow, JAX), ... power AI at scale. We are looking for a Senior Technical Marketing Engineer to join our growing accelerated...high-impact go-to-market strategy. This role will focus on AI inference at scale, ensuring that customers and partners understand… more
    NVIDIA (08/08/25)
    - Save Job - Related Jobs - Block Source
  • Senior System Software Engineer - Dynamo…

    NVIDIA (CA)
    …improving performance of AI inference systems. + Background with deep learning algorithms and frameworks. Especially experience Large Language Models ... We are now looking for a Senior System Software Engineer to work on Dynamo...Server! NVIDIA is hiring software engineers for its GPU-accelerated deep learning software team. Academic and commercial… more
    NVIDIA (09/05/25)
    - Save Job - Related Jobs - Block Source
  • Senior Solutions Architect, Generative AI…

    NVIDIA (Santa Clara, CA)
    …different teams and enjoy helping customers with the latest Accelerated Computing and Deep Learning software and hardware platforms. We're looking to grow our ... and work on exciting projects and proof-of-concepts focused on inference for Generative AI and Large Language Models (LLMs)....equivalent experience) + 8+ years of hands-on experience with Deep Learning frameworks such as PyTorch and… more
    NVIDIA (09/26/25)
    - Save Job - Related Jobs - Block Source
  • Senior MLOps Engineer, vLLM…

    Red Hat (Raleigh, NC)
    …ML Ops engineer to work closely with our product and research teams to scale SOTA deep learning products and software. As an ML Ops engineer, you will work ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings...to solving challenging technical problems at the forefront of deep learning , this is the role for… more
    Red Hat (10/09/25)
    - Save Job - Related Jobs - Block Source
  • Software Development Manager, LLM Inference

    Amazon (Cupertino, CA)
    …AWS Neuron, the complete software stack for Trainium, Amazon's custom cloud-scale machine learning accelerators. Come optimize LLMs such as Llama and GPT-OSS to run ... fast on Trainium. As the SDM for the LLM Inference Model Enablement team, you will lead a team...day in the life You will work with your senior management and technical leaders to define the model… more
    Amazon (09/06/25)
    - Save Job - Related Jobs - Block Source
  • Software Development Manager, AI Inference

    Amazon (Seattle, WA)
    …AWS Neuron, the complete software stack for Trainium, Amazon's custom cloudscale machine learning accelerators. Come optimize LLMs such as Llama and GPT OSS to run ... fast on Trainium. As the SDM for the Neuron Inference Technology building blocks team, you will guide your...day in the life You will work with your senior management and technical leaders to define the building… more
    Amazon (08/15/25)
    - Save Job - Related Jobs - Block Source