• Senior High - Performance

    NVIDIA (Santa Clara, CA)
    We are now looking for a Senior High - Performance LLM Training Engineer! NVIDIA is seeking experienced engineers specializing in performance analysis ... world's most advanced computing systems. This position focuses on optimizing NVIDIA's high - performance LLM software stack in frameworks like PyTorch and JAX… more
    NVIDIA (10/08/25)
    - Save Job - Related Jobs - Block Source
  • Senior Deep Learning Software Engineer,…

    NVIDIA (Santa Clara, CA)
    We are now looking for a Senior Deep Learning Software Engineer, LLM Performance ! NVIDIA is seeking an experienced Deep Learning Engineer passionate about ... analyzing and improving the performance of LLM inference! NVIDIA is rapidly... modeling, profiling, debug, and code optimization of a DL/HPC/ high - performance application + Architectural knowledge of CPU… more
    NVIDIA (07/29/25)
    - Save Job - Related Jobs - Block Source
  • Senior AI Engineer ( LLM Core)

    Capital One (San Francisco, CA)
    Senior AI Engineer ( LLM Core) **Overview:** At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital ... deliver our industry leading capabilities with breakthrough product experiences and scalable, high - performance AI infrastructure. At Capital One, you will help… more
    Capital One (08/20/25)
    - Save Job - Related Jobs - Block Source
  • Director, Data Science - Quality & LLM

    Walmart (Sunnyvale, CA)
    …how customers shop through conversation. As **Director, Data Science - Quality & LLM Judging Systems for Conversational Commerce** , you will lead a critical pillar ... under the Senior Director of Data Science - Agentic AI for...evaluations. This includes combining traditional human-labeled approaches with advanced " LLM -as-a-judge" techniques. You will design prompt-based evaluation tasks, identify… more
    Walmart (08/08/25)
    - Save Job - Related Jobs - Block Source
  • Software Development Manager, LLM Inference…

    Amazon (Cupertino, CA)
    …and automation. The ideal candidate will have a strong background in LLM model architectures, model performance optimizations, and inference techniques, such ... really fast on Trainium. As the SDM for the LLM Inference Model Enablement team, you will lead a...as delivering high - performance models using distributed inference libraries. You should be… more
    Amazon (09/06/25)
    - Save Job - Related Jobs - Block Source
  • Senior HPC and AI Networking…

    NVIDIA (Santa Clara, CA)
    …sounds like a fit for you, we'd love to hear from you! NVIDIA is seeking a Senior High Performance Computing (HPC) and AI Networking Performance Research ... LLM training on NVIDIA supercomputers and distributed systems focusing on high - performance networking and Nvidia Collective Communications Library (NCCL). +… more
    NVIDIA (09/03/25)
    - Save Job - Related Jobs - Block Source
  • Senior Machine Learning Engineer

    SAP (Palo Alto, CA)
    …to grow and succeed. We are seeking a highly skilled and driven ** Senior Machine Learning Engineer** to design and deliver intelligent, distributed systems that ... power large-scale AI and large language model ( LLM ) capabilities. In this role, you will shape cutting-edge...framework + Mentor and guide engineers from junior to senior levels, fostering technical excellence + Partner with cross-functional… more
    SAP (09/05/25)
    - Save Job - Related Jobs - Block Source
  • Senior DGX Cloud Performance

    NVIDIA (Santa Clara, CA)
    …seeking highly skilled Parallel and Distributed Systems engineers to drive the performance analysis, optimization, and modeling to define the architecture and design ... understanding of the methodology to conduct end to end performance analysis of critical AI applications running on large...best practices + Work with a diverse set of LLM workloads and their application areas such as health… more
    NVIDIA (08/22/25)
    - Save Job - Related Jobs - Block Source
  • Senior Manager, Machine Learning Engineer…

    Cisco (San Jose, CA)
    …systems. Key Responsibilities Team Leadership & Management Lead and grow a high -performing engineering team focused on LLM applications and infrastructure. ... Senior Manager, Machine Learning Engineer - ML Ops...evaluation. Optimize latency, accuracy, and context window handling for high -traffic LLM services. Architecture & Scalability Own… more
    Cisco (09/18/25)
    - Save Job - Related Jobs - Block Source
  • Senior Deep Learning Performance

    NVIDIA (Santa Clara, CA)
    We are now seeking a Senior Deep Learning Performance Architect! NVIDIA is looking for outstanding Performance Architects with a background in performance ... the next generation of architectures that accelerate AI and high - performance computing applications. What you'll be doing:...+ Familiarity with advanced optimizations and SW/HW co-design in LLM training and inference + Exposure to using AI… more
    NVIDIA (09/04/25)
    - Save Job - Related Jobs - Block Source
  • Senior Deep Learning Performance

    NVIDIA (Santa Clara, CA)
    We are now looking for a Senior Deep Learning Performance Architect! NVIDIA is seeking outstanding Performance Analysis Architects with a background in ... and develop the next generation of architectures that accelerate AI and high - performance computing applications. What you'll be doing: + Develop innovative… more
    NVIDIA (07/18/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Development Engineer,…

    Amazon (Cupertino, CA)
    …boundary, our engineers build systematic infrastructure, innovate new methods and create high - performance kernels for ML functions, ensuring every compute unit ... a unique opportunity to work at the intersection of machine learning, high - performance computing, and distributed architectures, where you'll help shape the… more
    Amazon (10/08/25)
    - Save Job - Related Jobs - Block Source
  • Senior Deep Learning Software Engineer,…

    NVIDIA (Santa Clara, CA)
    high -level frameworks like PyTorch and HuggingFace to developing and improving high - performance kernel implementations in CUDA, TRT- LLM , and Triton. This ... from arbitrary torch models for our automated deployment solution. + Develop high - performance optimization techniques for inference, such as automated model… more
    NVIDIA (08/23/25)
    - Save Job - Related Jobs - Block Source
  • Senior Deep Learning Software Engineer,…

    NVIDIA (Santa Clara, CA)
    …sophisticated AI applications. Our team is responsible for developing and maintaining high - performance deep learning frameworks, including SGLang and vLLM, which ... as other DL frameworks. Your work will focus on identifying and driving performance improvements for state-of-the-art LLM and Generative AI models across NVIDIA… more
    NVIDIA (09/05/25)
    - Save Job - Related Jobs - Block Source
  • Senior GenAI Algorithms Engineer - Model…

    NVIDIA (Santa Clara, CA)
    …Familiarity with NVIDIA's deep learning SDKs (eg, TensorRT). + Experience developing high - performance GPU kernels for machine learning workloads using CUDA, ... focuses on optimizing generative AI models such as large language models ( LLM ) and diffusion models for maximal inference efficiency using techniques ranging from… more
    NVIDIA (09/23/25)
    - Save Job - Related Jobs - Block Source
  • Senior Machine Learning Engineer, Customer…

    Amazon (Santa Clara, CA)
    …compression techniques, quantization methods, and efficient serving strategies for high - performance conversational AI applications. - Demonstrated ability to ... solutions. The CET team leads AI and Large Language Models ( LLM )-driven customer experience transformation using task-oriented dialogue systems. We develop… more
    Amazon (09/06/25)
    - Save Job - Related Jobs - Block Source
  • Senior Deep Learning Software Engineer,…

    NVIDIA (Santa Clara, CA)
    …sophisticated AI applications. Our team is responsible for developing and maintaining high - performance open-source frameworks, which are at the forefront of ... NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference...frameworks. Your work will focus on identifying and driving performance improvements for state-of-the-art LLM and Generative… more
    NVIDIA (09/07/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer - NIM Factory…

    NVIDIA (Santa Clara, CA)
    …upon which every new AI-powered application is built. We are seeking a Senior Software Engineer focused on container and cloud infrastructure. You will help design ... tooling for container build, packaging, and deployment. You will help improve reliability, performance , and scale across thousands of GPUs. There is much more to… more
    NVIDIA (09/19/25)
    - Save Job - Related Jobs - Block Source
  • Senior Inference Technical Product…

    NVIDIA (Santa Clara, CA)
    …solutions in some of the world's most exciting areas including artificial intelligence and high performance computing. Come grow your career to new heights at ... We are looking for a Senior Technical Product Marketing Manager. This role will...the latest AI models and NVIDIA's platform to maximize performance and minimize TCO + Develop crisp clear positioning,… more
    NVIDIA (09/25/25)
    - Save Job - Related Jobs - Block Source
  • AI Senior Staff Systems Engineer

    Cadence Design Systems, Inc. (San Jose, CA)
    …in techniques such as model quantization, distillation, and using high - performance serving frameworks (eg, vLLM, TGI, TensorRT- LLM ) to maximize inference ... Engineer to join our team. This is a hands-on, senior individual contributor role that will be pivotal in...lifecycle of our AI systems, from architecting and building high - performance GPU clusters to deploying and optimizing… more
    Cadence Design Systems, Inc. (09/30/25)
    - Save Job - Related Jobs - Block Source