- Qualcomm (San Diego, CA)
- …**Ideal candidates for this position will demonstrate the following:** + Experience in serving frameworks, like vLLM + Strong development skills in PyTorch + Strong ... understanding of LLMs, Multi-modal and reasoning models + Experience in executing, analyzing, and optimizing neural networks + Experience in writing high performance software for multicore systems + Experience with Python + Understanding of multi-core… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Deep Learning Software Engineer , LLM Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate ... of NVIDIA accelerators, from datacenter GPUs to edge SoCs. Implement LLM inference, serving and deployment algorithms and optimizations using TensorRT LLM ,… more
- NVIDIA (Santa Clara, CA)
- …engineers enthusiastic about building the next generation of scalable AI systems. As a Senior Applied AI Software Engineer on the Dynamo project, you will ... supporting a variety of LLM frameworks (eg, TensorRT- LLM , vLLM, SGLang). + Disaggregated Serving : Architect...are growing fast. If you're a creative and autonomous engineer with a genuine passion for technology, we want… more
- The Walt Disney Company (Glendale, CA)
- …and entertainment content, across all media platforms. **Job Summary:** We're looking for a Senior Software Engineer to help shape the future of Ad Technology's ... media portfolio to advance the technological foundation and consumer media touch points serving millions of people around the world. Here are a few reasons why… more
- NVIDIA (Santa Clara, CA)
- NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, and ... SGLang and vLLM, which are at the forefront of efficient large-scale model serving and inference. You will play a central role in improving these platforms,… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior DL Algorithms Engineer ! We are seeking a highly skilled Deep Learning Algorithms Engineer with hands-on experience optimizing ... scale in real-world applications. + Hands-on experience with model optimization and serving frameworks, such as: TensorRT, TensorRT- LLM , vLLM, SGLang. As NVIDIA… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior DL Algorithms Engineer ! NVIDIA is seeking senior engineers who are mindful of performance analysis and optimization to help ... new features, fix bugs and deliver production code to TRT- LLM , NVIDIA's open-source inference serving library. + Profile and analyze bottlenecks across the full… more
- Charles Schwab (San Francisco, CA)
- …The AI Incubation and Enablement team is looking for a talented, technical, hands-on Senior Engineer to drive the development of innovative AI solutions. This ... rapid, iterative software development using Large Language Models. The Senior Engineer on the AI Incubation and...your team. + Experience working with LLMs and shipping LLM -powered applications to production is a big plus. +… more
- LinkedIn (Sunnyvale, CA)
- …company. Our team works on a wide range of cutting-edge ML, like LLM fine tuning, text generation, LLM -as-a-judge, prompt engineering, embedding-based retrieval, ... or Ph.D. in Computer Science or related technical discipline * Experience with LLM , ranking, recommender systems * Full stack experience with AI systems, from… more
- Warner Bros. Discovery (San Francisco, CA)
- …you are supported, here you are celebrated, here you can thrive. **Machine Learning Engineer - Services (Video AI Platform)** **Who We Are ** At Warner Bros. ... on supporting applications of AI to video , the **M achine Learning Engineer - Services** group powers infrastructure and backend service s behind production… more
- NVIDIA (Santa Clara, CA)
- …full-stack software ecosystem to power AI at scale. We are looking for a Senior Technical Marketing Engineer to join our growing accelerated computing product ... JAX), and inference-specific frameworks & optimizations (Triton Inference Server, TensorRT- LLM , vLLM, SGLang). + Market Awareness - Experience conducting technical… more
- Ankura (CA)
- …and test cutting edge solutions in each of our respective areas. Role Overview: As a Senior AI Engineer and Data Scientist (Director) in the Data Analytics & AI ... advances to the quality and breadth of ML and LLM solutions. This role offers the chance to work...The Ankura team consists of more than 2000 professionals serving 3,000+ clients across 55 countries who are leaders… more
- NVIDIA (Santa Clara, CA)
- …and inference workloads. + Experience in evaluating, analyzing, and optimizing LLM training and inference performance of state-of-the-art models on cutting-edge ... Tensor Parallelism, Expert Parallelism, and FSDP. + Understanding of the emerging serving architectures like Disaggregated Serving and inference servers like… more
- Deloitte (Los Angeles, CA)
- …trade management operations. + Supervising assignments by the Global Trade professionals serving as Consultants, Senior Consultants, and Managers. + Developing ... world in our world! What you'll do As a Senior Manager on our Global Trade Advisory team, you...+ Certified SAFe Architect + Certified SAFe Agile Software Engineer + Certified SAFe Product Owner / Product Manager… more