- Qualcomm (Santa Clara, CA)
- …design, through interactions with graphics driver, architecture, and game development. Responsibilities for GPU compiler performance engineer : + Profile ... for talented engineers to create world class GPU compiler products to enable high performance graphics and compute with low power consumption. This position… more
- Qualcomm (Santa Clara, CA)
- …transformation to help create a smarter, connected future for all. As a Qualcomm GPU Engineer , you may architect, design, implement, verify, and/or optimize the ... performance and power of GPU cores. Qualcomm Engineers...of compiler experience, or 10+ years of compiler + GPU related experience (eg, driver or… more
- Qualcomm (Santa Clara, CA)
- …transformation to help create a smarter, connected future for all. As a Qualcomm GPU Engineer , you may architect, design, implement, verify, and/or optimize the ... performance and power of GPU cores. Qualcomm...with needs and goals. * Develops critical driver and compiler software to support GPU products. *… more
- Meta (Menlo Park, CA)
- …leading. **Required Skills:** Software Engineer , Systems ML - PyTorch Compiler , PyTorch Framework, PyTorch Performance Responsibilities: 1. Develop the PT2 ... industry experience in developing compilers, ML systems, ML accelerators, GPU performance , and similar. **Preferred Qualifications:** Preferred Qualifications:… more
- NVIDIA (Santa Clara, CA)
- We are searching for an extraordinary Sr. Compiler Engineer for an exciting and fun role in our Deep Learning Accelerator (DLA/NPU) team. Our team is responsible ... for the DLA compiler toolchain stack as well as the end-to-end DLA...leading the way in groundbreaking developments in Artificial Intelligence, High- Performance Computing and Visualization. The GPU , our… more
- Meta (Menlo Park, CA)
- …such as XLA, TVM, ONNX, Halide, and Triton. 13. Experience in vertical performance optimization, GPU performance , quantization, or distributed training. 14. ... We seek an Engineer Manager to drive subcomponents of the PyTorch Compiler development and holistic areas across teams, such as vertical performance ,… more
- Google (Sunnyvale, CA)
- …Experience with compiler optimization, code generation, and runtime systems for GPU architectures (eg, OpenXLA, MLIR, Triton, etc.). + Knowledge of low-level ... GPU programming (eg, CUDA, OpenCL, etc.) and performance...on and is growing every day. As a software engineer , you will work on a specific project critical to… more
- Google (Sunnyvale, CA)
- …Experience in low-level GPU programming (eg, CUDA, OpenCL, etc.) and performance tuning techniques. + Experience with compiler optimization, code generation, ... ML models to leverage GPUs. + Knowledge of modern GPU architectures (eg, NVIDIA, AMD, etc.), memory hierarchies, and... architectures (eg, NVIDIA, AMD, etc.), memory hierarchies, and performance bottlenecks. + Ability to develop and utilize … more
- Meta (Menlo Park, CA)
- …on one of the core areas such as PyTorch framework components, AI compiler and runtime, high- performance kernels and tooling to accelerate machine learning ... will also partner with hardware design teams to develop compiler optimizations for high performance . You will...codesign for AI domain specific problems. **Required Skills:** Software Engineer , Systems ML - Frameworks / Compilers / Kernels… more
- Google (Mountain View, CA)
- …programming models (eg CUDA, OpenCL) + Experience in leveraging custom kernels and compiler infrastructure to improve performance on hardware + Experience with ... Snapshot We are seeking a software engineer to define, drive, and critically contribute to...+ Being responsible for Pre-Training efficiency and optimising the performance of the latest models on Google's fleet of… more
- Google (Sunnyvale, CA)
- …model evaluation, data processing, debugging, fine tuning). + Experience with performance analysis and GPU programming. Preferred qualifications: + Master's ... maintain LLM training and serving benchmarks, and use them to identify performance opportunities and drive Accelerated Linear Algebra (XLA): GPU /Triton … more
- Meta (Menlo Park, CA)
- …of SW stack with one of the following core focus areas: AI frameworks, compiler stack, high performance kernel development and acceleration onto next generation ... on all kinds of hardware as well, including CPU, GPU , and custom Silicon. We build the model publishing...performance . 3. Analyze neural networks, develop & implement compiler optimization algorithms. 4. Accelerate the next generation of… more
- NVIDIA (Santa Clara, CA)
- … analysis for training/inference workload + Knowledge of Linux device drivers and/or compiler implementation + Knowledge of GPU and/or CPU architecture and ... AI researchers and SW/HW teams running AI workload in GPU cluster. As a member of the software development...debugging tricky failures and issues to help improve the performance and efficiency of the system. What you'll be… more
- Amazon (Sunnyvale, CA)
- …Echo Show line of products. We are looking for a talented and passionate software engineer to be part of an exciting technology creation team at Amazon. You will ... you will be work along side partner science teams to develop the compiler infrastructure and lower deep learning workloads to heterogeneous device backends. You will… more
- NVIDIA (Santa Clara, CA)
- NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More ... recently, GPU deep learning ignited modern AI - the next...supports many key products spanning the gamut of high performance computing, scientific computing, data analytics, deep learning, and… more
- NVIDIA (Santa Clara, CA)
- …scikit-learn) and deep learning (eg TensorFlow, PyTorch) + Experience with low-level GPU performance optimization + Experience building, debugging, profiling and ... improve the performance of developed APIs on various CPU and GPU architectures, especially as a part of customer-critical end-to-end workflows + prototype… more
- NVIDIA (Santa Clara, CA)
- …learning framework. + Partnering with NVIDIA's hardware and software teams to improve GPU performance in PyTorch. + Design, build and support production AI ... software stack all the way from users to the CUDA compiler , to the Lightning-Thunder Graph Compiler (https://github.com/Lightning-AI/lightning-thunder) , as… more
- NVIDIA (Santa Clara, CA)
- …profiling and tuning of HPC/AI workloads + Experience with CUDA programming and GPU performance optimization + Background with tasking or asynchronous runtimes, ... compiler optimizations and parallelization heuristics to improve the performance of AI models at extreme scales + Develop...runtime systems that underlay the foundation of all distributed GPU computing at NVIDIA What We Need To See:… more
- Meta (Menlo Park, CA)
- …and collaborate with the open-source community. **Required Skills:** Server Efficiency Performance Engineer Responsibilities: 1. Develop and optimize C/C++ ... by advancing system software.We seek software engineers for roles focusing on performance engineering to support the majority of Meta's server-side software. At Meta… more
- NVIDIA (Santa Clara, CA)
- …NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High- Performance Computing and Visualization. The GPU , our invention, serves ... We are searching for an extraordinary Sr. System SW Engineer for an exciting and fun role in our...drivers, and the firmware, as well as the DLA compiler toolchain stack. DLA supports a growing range of… more