- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior High-Performance LLM Training Engineer! NVIDIA is seeking experienced engineers specializing in performance analysis ... world's most advanced computing systems. This position focuses on optimizing NVIDIA's high-performance LLM software stack in frameworks like PyTorch and JAX…
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Deep Learning Software Engineer, LLM Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about ... analyzing and improving the performance of LLM inference! NVIDIA is rapidly... modeling, profiling, debug, and code optimization of a DL/HPC/high-performance application + Architectural knowledge of CPU…
- NVIDIA (Santa Clara, CA)
- …with performance modeling, profiling, debug, and code optimization of a DL/HPC/high-performance application + Architectural knowledge of CPU and GPU + GPU ... We are now looking for a TensorRT-LLM Software Development Engineer! NVIDIA is hiring software...can be scaled to multiple platforms for functionality and performance + Perform benchmarking, profiling, and system-level programming for…
- Capital One (San Jose, CA)
- Senior AI Engineer (AI Foundations, LLM Core and Agentic AI) **Overview:** At Capital One, we are creating responsible and reliable AI systems, changing banking ... deliver our industry leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. At Capital One, you will help…
- NVIDIA (Santa Clara, CA)
- …equivalent experience + 15+ years of experience building large-scale distributed systems, high-performance storage, or ML systems infrastructure in C/C++ and ... NVIDIA Dynamo is a high-throughput, low-latency inference framework for serving generative AI...models across multi-node distributed environments. Built in Rust for performance and Python for extensibility, Dynamo orchestrates GPU shards,…
- NVIDIA (Santa Clara, CA)
- …sounds like a fit for you, we'd love to hear from you! NVIDIA is seeking a Senior High Performance Computing (HPC) and AI Networking Performance Research ... LLM training on NVIDIA supercomputers and distributed systems focusing on high-performance networking and NVIDIA Collective Communications Library (NCCL). + …
- NVIDIA (Santa Clara, CA)
- We are now seeking a Senior Deep Learning Performance Architect! NVIDIA is looking for outstanding Performance Architects with a background in performance ... the next generation of architectures that accelerate AI and high-performance computing applications. What you'll be doing:...+ Familiarity with advanced optimizations and SW/HW co-design in LLM training and inference + Exposure to using AI…
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Deep Learning Performance Architect! NVIDIA is seeking outstanding Performance Analysis Architects with a background in ... and develop the next generation of architectures that accelerate AI and high-performance computing applications. What you'll be doing: + Develop innovative…
- NVIDIA (Santa Clara, CA)
- …seeking highly skilled Parallel and Distributed Systems engineers to drive the performance analysis, optimization, and modeling to define the architecture and design ... have a deep understanding of the methodology to conduct end to end performance analysis of critical AI applications running on large scale parallel and distributed…
- NVIDIA (Santa Clara, CA)
- …LLM infrastructure accordingly. + Optimize the infrastructure for performance, scalability, and reliability, ensuring secure and efficient management of data. ... to improve LLM infrastructure. + Lead with purpose and maintain high-quality engineering practices that inspire others to achieve excellence. What we need to…
- LinkedIn (Mountain View, CA)
- …the company. Our team works on a wide range of cutting-edge ML: LLM fine tuning, text generation, LLM-as-a-judge, prompt engineering, embedding-based retrieval, ... be based in Sunnyvale, CA. **Key Responsibilities** As a senior AI Manager, you will lead a team of...applied scientists and engineers to design and deliver scalable LLM and matching solutions that improve the relevance and…
- Walmart (Sunnyvale, CA)
- …with DevOps, cloud, and observability platforms. We are looking for a **Senior Software Engineer** to design, build, and scale intelligent systems and automations ... techniques such as anomaly detection, classification, recommendation systems, and LLM-based reasoning to deliver actionable insights and automate decision-making. …
- Palo Alto Networks (Santa Clara, CA)
- …a security-sensitive environment. + Own the end-to-end lifecycle of ML and LLM components, from problem formulation and model development to production deployment, ... monitoring, and iterative improvement. + Integrate ML and LLM-based services with backend systems and data pipelines, ensuring...and build data analysis tools to continuously improve model performance as data and threats evolve. + Partner closely…
- NVIDIA (Santa Clara, CA)
- …sophisticated AI applications. Our team is responsible for developing and maintaining high-performance deep learning frameworks, including SGLang and vLLM, which ... as other DL frameworks. Your work will focus on identifying and driving performance improvements for state-of-the-art LLM and Generative AI models across NVIDIA…
- NVIDIA (Santa Clara, CA)
- …Familiarity with NVIDIA's deep learning SDKs (e.g., TensorRT). + Experience developing high-performance GPU kernels for machine learning workloads using CUDA, ... focuses on optimizing generative AI models such as large language models (LLM) and diffusion models for maximal inference efficiency using techniques ranging from…
- Cadence Design Systems, Inc. (San Jose, CA)
- …in techniques such as model quantization, distillation, and using high-performance serving frameworks (e.g., vLLM, TGI, TensorRT-LLM) to maximize inference ... Engineer to join our team. This is a hands-on, senior individual contributor role that will be pivotal in...lifecycle of our AI systems, from architecting and building high-performance GPU clusters to deploying and optimizing…
- NVIDIA (Santa Clara, CA)
- …sophisticated AI applications. Our team is responsible for developing and maintaining high-performance open-source frameworks, which are at the forefront of ... NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference...frameworks. Your work will focus on identifying and driving performance improvements for state-of-the-art LLM and Generative…
- FocusKPI Inc. (Mountain View, CA)
- …clients, a high-tech SaaS company. Team is looking for a Senior Offensive Security Engineer to proactively identify, exploit, and help eliminate security ... FocusKPI is seeking a Senior Offensive Security Engineer (Web & AI systems)...contract with potential to convert depending on the candidate's performance Pay Range: $85 - 100/hr **No C2C resumes…
- NVIDIA (Santa Clara, CA)
- …solutions in some of the world's most exciting areas including artificial intelligence and high performance computing. Come grow your career to new heights at ... We are looking for a Senior Technical Product Marketing Manager. This role will...the latest AI models and NVIDIA's platform to maximize performance and minimize TCO + Develop crisp clear positioning,…
- NVIDIA (Santa Clara, CA)
- …engines from NVIDIA and the community, including NVIDIA TensorRT and TensorRT-LLM, NIM microservices optimize response latency and throughput for each combination ... NIMs for building multimodal extraction, re-ranking, and embedding pipelines with high accuracy and maximum data privacy. It delivers quick, context-aware responses…