- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior High-Performance LLM Training Engineer! NVIDIA is seeking experienced engineers specializing in performance analysis ... world's most advanced computing systems. This position focuses on optimizing NVIDIA's high-performance LLM software stack in frameworks like PyTorch and JAX…
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Deep Learning Software Engineer, LLM Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about ... analyzing and improving the performance of LLM inference! NVIDIA is rapidly... modeling, profiling, debug, and code optimization of a DL/HPC/high-performance application + Architectural knowledge of CPU…
- JPMorgan Chase (Palo Alto, CA)
- …dynamic team and make a meaningful impact by supporting the delivery of high-quality products that resonate with clients. As a Product Associate in Global Banking ... best-in-class enterprise Search capability incorporating the latest technologies (including AI/LLM) and applying best practices. You are expected to support…
- NVIDIA (Santa Clara, CA)
- …see how you can make a lasting impact on the world. We are looking for a Senior LLM Systems Engineer to help build our NeMo Microservice Platform. Our team is ... service stability, observability and reliability + Relentlessly pursue speed of light performance under high load What we need to see: + BS, Masters, or…
- NVIDIA (Santa Clara, CA)
- …is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing, and Visualization. The GPU, our invention, serves ... for customers as possible. You will own ideating what high-quality training means and looks like across various projects,...team and working with them under ambiguity at a high velocity and can make things happen. What you'll…
- NVIDIA (Santa Clara, CA)
- …, SGLang). + Experience with GPU resource scheduling, cache management, or high-performance networking. + Understanding of LLM-specific inference challenges, ... architecture, GPU resource management, and intelligent request handling, Dynamo achieves high-performance AI inference for demanding applications. Our team is…
- NVIDIA (Santa Clara, CA)
- …seeking highly skilled Parallel and Distributed Systems engineers to drive the performance analysis, optimization, and modeling to define the architecture and design ... understanding of the methodology to conduct end-to-end performance analysis of critical AI applications running on large...best practices + Work with a diverse set of LLM workloads and their application areas such as health…
- NVIDIA (Santa Clara, CA)
- We are now seeking a Senior Deep Learning Performance Architect! NVIDIA is looking for outstanding Performance Architects with a background in performance ... the next generation of architectures that accelerate AI and high-performance computing applications. What you'll be doing:...+ Familiarity with advanced optimizations and SW/HW co-design in LLM training and inference + Exposure to using AI…
- Amazon (San Francisco, CA)
- …years of technology/software sales experience - Experience designing, developing, and optimizing high-quality prompts and templates that guide LLM behavior. - 3+ ... of thought leadership and innovation around Machine Learning, Comfortable presenting to senior data and AI leaders, and have demonstrated ability to think…
- NVIDIA (Santa Clara, CA)
- …high-level frameworks like PyTorch and HuggingFace to developing and improving high-performance kernel implementations in CUDA, TRT-LLM, and Triton. This ... from arbitrary torch models for our automated deployment solution. + Develop high-performance optimization techniques for inference, such as automated model…
- Amazon (Santa Clara, CA)
- …compression techniques, quantization methods, and efficient serving strategies for high-performance conversational AI applications. - Demonstrated ability to ... solutions. The CET team leads AI and Large Language Models (LLM)-driven customer experience transformation using task-oriented dialogue systems. We develop…
- NVIDIA (Santa Clara, CA)
- …engines from NVIDIA and the community, including NVIDIA TensorRT and TensorRT-LLM, NIM microservices optimize response latency and throughput for each combination ... NIMs for building multimodal extraction, re-ranking, and embedding pipelines with high accuracy and maximum data privacy. It delivers quick, context-aware responses…
- NVIDIA (Santa Clara, CA)
- …sophisticated AI applications. Our team is responsible for developing and maintaining high-performance deep learning frameworks, including SGLang and vLLM, which ... NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference...frameworks. Your work will focus on identifying and driving performance improvements for state-of-the-art LLM and Generative…
- NVIDIA (Santa Clara, CA)
- …solutions in some of the world's most exciting areas including artificial intelligence and high-performance computing. Come grow your career to new heights at ... We are looking for a Senior Technical Product Marketing Manager. This role will...the latest AI models and NVIDIA's platform to maximize performance and minimize TCO + Develop crisp clear positioning,…
- NVIDIA (Santa Clara, CA)
- …solutions in some of the world's most exciting areas including artificial intelligence and high-performance computing. Come grow your career to new heights at ... We are looking for a Senior Manager, Inference Platform Technical Product Marketing. This...the latest AI models and NVIDIA's platform to maximize performance and minimize TCO + Develop crisp clear positioning,…
- Capital One (San Francisco, CA)
- …deliver our industry leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. At Capital One, you will help ... Senior AI Engineer **Overview:** At Capital One, we...Guardrails, PyTorch, and more. + Invent and introduce state-of-the-art LLM optimization techniques to improve the performance…
- Amazon (Santa Clara, CA)
- …and implement strategies for data collection, annotation, and model training to ensure high-quality and robust performance of the chatbots. - Conduct experiments ... generation. Key job responsibilities - Research and development of LLM-based chatbots and conversational AI systems for customer service...and evaluations to measure the performance of the developed models and systems, and identify…
- Walmart (Sunnyvale, CA)
- …is NOT available for this role**. Walmart is seeking a **Senior Data Scientist** with a **strong educational background in Computer Science, Mathematics, ... develop innovative **AI products like Q&A assistants, Text to SQL analytics, and LLM reasoning**, while applying causal learning and anomaly detection in the…
- NVIDIA (Santa Clara, CA)
- …LLM frameworks like NeMo, Megatron, DeepSpeed, or similar. + Experience with high-performance system design: knowledge of GPUs, CPUs, DPUs, NVLink, NVSwitch, ... to power AI at scale! We are seeking a highly technical and creative Senior Technical Marketing Engineer to join our team to showcase the innovations that power…
- Deloitte (San Jose, CA)
- UKG HRCO WFM Senior Manager Are you passionate about helping clients realize their ROI related to technology transformations? Do you love finding the right support ... with the application's requirements, based on factors such as performance, scalability, and maintainability. Required Qualifications: + Bachelor's Degree preferably…