- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior High - Performance LLM Training Engineer! NVIDIA is seeking experienced engineers specializing in performance analysis ... world's most advanced computing systems. This position focuses on optimizing NVIDIA's high - performance LLM software stack in frameworks like PyTorch and JAX… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Deep Learning Software Engineer, LLM Performance ! NVIDIA is seeking an experienced Deep Learning Engineer passionate about ... analyzing and improving the performance of LLM inference! NVIDIA is rapidly... modeling, profiling, debug, and code optimization of a DL/HPC/ high - performance application + Architectural knowledge of CPU… more
- Capital One (San Jose, CA)
- Senior AI Engineer (AI Foundations, LLM Core and Agentic AI) **Overview:** At Capital One, we are creating responsible and reliable AI systems, changing banking ... deliver our industry leading capabilities with breakthrough product experiences and scalable, high - performance AI infrastructure. At Capital One, you will help… more
- NVIDIA (Santa Clara, CA)
- …sounds like a fit for you, we'd love to hear from you! NVIDIA is seeking a Senior High Performance Computing (HPC) and AI Networking Performance Research ... LLM training on NVIDIA supercomputers and distributed systems focusing on high - performance networking and Nvidia Collective Communications Library (NCCL). +… more
- NVIDIA (Santa Clara, CA)
- We are now seeking a Senior Deep Learning Performance Architect! NVIDIA is looking for outstanding Performance Architects with a background in performance ... the next generation of architectures that accelerate AI and high - performance computing applications. What you'll be doing:...+ Familiarity with advanced optimizations and SW/HW co-design in LLM training and inference + Exposure to using AI… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Deep Learning Performance Architect! NVIDIA is seeking outstanding Performance Analysis Architects with a background in ... and develop the next generation of architectures that accelerate AI and high - performance computing applications. What you'll be doing: + Develop innovative… more
- Palo Alto Networks (Santa Clara, CA)
- …a security-sensitive environment. + Own the end-to-end lifecycle of ML and LLM components, from problem formulation and model development to production deployment, ... monitoring, and iterative improvement. + Integrate ML and LLM -based services with backend systems and data pipelines, ensuring...and build data analysis tools to continuously improve model performance as data and threats evolve. + Partner closely… more
- NVIDIA (Santa Clara, CA)
- …sophisticated AI applications. Our team is responsible for developing and maintaining high - performance deep learning frameworks, including SGLang and vLLM, which ... as other DL frameworks. Your work will focus on identifying and driving performance improvements for state-of-the-art LLM and Generative AI models across NVIDIA… more
- Cadence Design Systems, Inc. (San Jose, CA)
- …in techniques such as model quantization, distillation, and using high - performance serving frameworks (eg, vLLM, TGI, TensorRT- LLM ) to maximize inference ... Engineer to join our team. This is a hands-on, senior individual contributor role that will be pivotal in...lifecycle of our AI systems, from architecting and building high - performance GPU clusters to deploying and optimizing… more
- NVIDIA (Santa Clara, CA)
- …sophisticated AI applications. Our team is responsible for developing and maintaining high - performance open-source frameworks, which are at the forefront of ... NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference...frameworks. Your work will focus on identifying and driving performance improvements for state-of-the-art LLM and Generative… more
- NVIDIA (Santa Clara, CA)
- …solutions in some of the world's most exciting areas including artificial intelligence and high performance computing. Come grow your career to new heights at ... We are looking for a Senior Technical Product Marketing Manager. This role will...the latest AI models and NVIDIA's platform to maximize performance and minimize TCO + Develop crisp clear positioning,… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Deep Learning Software Engineer, FlashInfer. NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for ... building things like new abstractions, efficient attention kernel implementations, new LLM inference runtimes components, and kernel code generators to accelerate… more
- Deloitte (San Jose, CA)
- …subject matter knowledge to bring solutions to clients with a focus on achieving a high level of performance and quality through delivery of both agile and ... critical to businesses. Your contributions can help clients improve financial performance , accelerate new digital ventures, and fuel growth through innovation. AI… more
- Capital One (San Jose, CA)
- …deliver our industry leading capabilities with breakthrough product experiences and scalable, high - performance AI infrastructure. At Capital One, you will help ... Senior Manager, AI Engineering (People Leader) **Overview** :...Guardrails, PyTorch, and more. + Invent and introduce state-of-the-art LLM optimization techniques to improve the performance … more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior DL Algorithms Engineer! We are seeking a highly skilled Deep Learning Algorithms Engineer with hands-on experience optimizing and ... What you will be doing: + Optimize deep learning models for low-latency, high -throughput inference, with a focus on LLMs, VLMs, diffusion models, and World… more
- NVIDIA (Santa Clara, CA)
- …management tools like Kubernetes. + Strong programming skills in Python and a high - performance language such as C++ for efficient system development. + Strong ... NVIDIA is searching for a senior or principal engineer who specializes in building...monitoring and debugging tools to ensure the reliability and performance of training workflows on large GPU clusters. +… more
- ServiceNow, Inc. (Santa Clara, CA)
- …for designing and governing the implementation of scalable, secure, and high -performing solutions on the ServiceNow platform. This role works closely with ... They will have demonstrated the ability to become a trusted advisor to senior executives and facilitate customer success from strategic or annual planning functions… more
- NVIDIA (Santa Clara, CA)
- …inference. + Convert and deploy models using frameworks such as TensorRT and TensorRT- LLM + Understand, analyze, profile, and optimize performance of deep ... We are now looking for a Senior DL Algorithms Engineer! We are seeking a...with model optimization and serving frameworks, such as: TensorRT, TensorRT- LLM , vLLM, SGLang. As NVIDIA makes inroads into the… more
- NVIDIA (Santa Clara, CA)
- …algorithms and software stacks as they vigilantly seek out opportunities for performance optimization and continuously deliver high quality software. Does the ... NVIDIA is looking for a dedicated and motivated senior build and continuous integration (CI/CD) engineer for...growing team to release software more frequently while maintaining high -quality and maximum performance . + Work with… more
- NVIDIA (Santa Clara, CA)
- …training and inference workloads. + Experience in evaluating, analyzing, and optimizing LLM training and inference performance of state-of-the-art models on ... systems with hundreds of thousands of nodes. + Optimizing communication performance : Identify and eliminate bottlenecks in data transfer and synchronization during… more