- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior High - Performance LLM Training Engineer! NVIDIA is seeking experienced engineers specializing in performance analysis ... world's most advanced computing systems. This position focuses on optimizing NVIDIA's high - performance LLM software stack in frameworks like PyTorch and JAX… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Deep Learning Software Engineer, LLM Performance ! NVIDIA is seeking an experienced Deep Learning Engineer passionate about ... analyzing and improving the performance of LLM inference! NVIDIA is rapidly... modeling, profiling, debug, and code optimization of a DL/HPC/ high - performance application + Architectural knowledge of CPU… more
- JPMorgan Chase (Palo Alto, CA)
- …dynamic team and make a meaningful impact by supporting the delivery of high -quality products that resonate with clients. As a Product Associate in Global Banking ... best-in-class enterprise Search capability incorporating the latest technologies (including AI/ LLM ) and applying best practices. You are expected to support… more
- Red Hat (Boston, MA)
- …provides a stable platform for enterprises to build, optimize, and scale LLM deployments. As a Machine Learning Engineer focused on model optimization algorithms, ... testing of various inference optimization algorithms in the vLLM LLM -compressor (https://github.com/vllm-project/ llm -compressor) project + Create and manage… more
- S&P Global (Charlottesville, VA)
- …Role:** **Grade Level (for internal use):** 10 **The** Role: Sr Data Scientist- NLP, LLM and GenAI S&P is a leader in risk management solutions leveraging automation ... AI/ML. This role is a unique opportunity for hands-on ML scientists and NLP/Gen AI/ LLM scientists to grow into the next step in their career journey and apply her… more
- NVIDIA (Santa Clara, CA)
- …see how you can make a lasting impact on the world. We are looking for a Senior LLM Systems Engineer to help build our NeMo Microservice Platform. Our team is ... service stability, observability and reliability + Relentlessly pursue speed of light performance under high load What we need to see: + BS, Masters, or… more
- JPMorgan Chase (New York, NY)
- …to join our dynamic AIML Data Platforms Team. As an **Applied AI ML Senior Associate** within our **Corporate Sector,** you will play a pivotal role in developing ... of the art models. Join our dynamic team dedicated to building a high -impact solution using generative technologies. We are focused on developing a multi-agent… more
- NVIDIA (Santa Clara, CA)
- …is leading the way in groundbreaking developments in Artificial Intelligence, High - Performance Computing, and Visualization. The GPU, our invention, serves ... for customers as possible. You will own ideating what high -quality training means and looks like across various projects,...team and working with them under ambiguity at a high velocity and can make things happen. What you'll… more
- Qualcomm (San Diego, CA)
- …in executing, analyzing, and optimizing neural networks + Experience in writing high performance software for multicore systems + Experience with Python ... innovative engineers with experience in software system design, compiler technology, performance modeling, and bottleneck analysis. Job activities span the whole… more
- Amazon (Seattle, WA)
- …experiment and deliver scaled runtime solutions based on experiments and research based on LLM agents. As a Senior Machine Learning Engineer on the team, you ... Description We work on state of the art agentic systems built around Large Language Models ( LLM ). This is a fast paced dynamic environment to rapidly… more
- Ankura (DE)
- …Join us in shaping the future of AI beyond chatbots, where LLM -powered Agents solve complex problems, streamline workflows, and optimize analytics-driven insights. ... ideally be working US eastern time hours. As a Senior Software Engineer (Python), you will be a core...Agentic AI Platform, designing and scaling microservices that power LLM -driven AI Agents. You will collaborate with AI researchers,… more
- NVIDIA (Santa Clara, CA)
- …, SGLang). + Experience with GPU resource scheduling, cache management, or high - performance networking. + Understanding of LLM -specific inference challenges, ... architecture, GPU resource management, and intelligent request handling, Dynamo achieves high - performance AI inference for demanding applications. Our team is… more
- NVIDIA (Santa Clara, CA)
- …seeking highly skilled Parallel and Distributed Systems engineers to drive the performance analysis, optimization, and modeling to define the architecture and design ... understanding of the methodology to conduct end to end performance analysis of critical AI applications running on large...best practices + Work with a diverse set of LLM workloads and their application areas such as health… more
- NVIDIA (Santa Clara, CA)
- We are now seeking a Senior Deep Learning Performance Architect! NVIDIA is looking for outstanding Performance Architects with a background in performance ... the next generation of architectures that accelerate AI and high - performance computing applications. What you'll be doing:...+ Familiarity with advanced optimizations and SW/HW co-design in LLM training and inference + Exposure to using AI… more
- Amazon (Arlington, VA)
- …that have significant global benefit. The Brand Protection team designs and builds high performance AI systems using machine learning that identify and prevent ... in Natural Language Processing (NLP) and Large Language Models ( LLM )? Are you interested in building Generative AI solutions...build of our vision for Brand Protection. As a senior applied scientist on the team, you will use… more
- Amazon (Seattle, WA)
- …years of technology/software sales experience - Experience designing, developing, and optimizing high -quality prompts and templates that guide LLM behavior. - 3+ ... of thought leadership and innovation around Machine Learning, Comfortable presenting to senior data and AI leaders, and have demonstrated ability to think… more
- JPMorgan Chase (New York, NY)
- …our global team. Your responsibilities will entail hands on development of high -impact business products through full-stack engineering of LLM -powered solutions, ... expert who is excited about the opportunity. As a senior ML and GenAI engineer (VP), you will will...solutions + Solid Python programming skills required; with other high - performance language such as Go a big… more
- NVIDIA (Santa Clara, CA)
- … high -level frameworks like PyTorch and HuggingFace to developing and improving high - performance kernel implementations in CUDA, TRT- LLM , and Triton. This ... from arbitrary torch models for our automated deployment solution. + Develop high - performance optimization techniques for inference, such as automated model… more
- Walmart (Bentonville, AR)
- …to serve 10M+ smart devices in real time. You'll design and implement high - performance , memory-efficient components that sit at the intersection of real-time ... ** **What you'll do ** We are hiring a Senior Software Engineer to pioneer the development of a...serve billions of ads requests every month with our high - performance ad servers. There are millions of… more
- Amazon (Seattle, WA)
- …compression techniques, quantization methods, and efficient serving strategies for high - performance conversational AI applications. - Demonstrated ability to ... The CET Science team leads AI and Large Language Models ( LLM )-driven customer experience transformation using task-oriented dialogue systems. We develop multi-modal,… more