- NVIDIA (Santa Clara, CA)
- …but not required. + Experience with the following technologies is a huge plus: XLA, TVM, MLIR, LLVM, OpenAI Triton, deep learning models and algorithms, and deep ... CUDA or with GPUs in general. + Experience with open-source compilers such as XLA, LLVM, MLIR or TVM. With highly competitive salaries and a comprehensive benefits…
- NVIDIA (Santa Clara, CA)
- …teams to integrate resiliency features into AI frameworks like PyTorch and JAX/XLA. + Testing & Automation: Develop and implement tests to ensure robustness, ... computing environments. + Familiarity with AI frameworks such as PyTorch, JAX/XLA, TensorFlow, or similar. + Experience with debugging and profiling tools…
- NVIDIA (Santa Clara, CA)
- …training of frontier models on industry-leading frameworks like PyTorch and JAX/XLA, with near-zero downtime. Your optimizations will span from algorithmic ... with using and contributing to modern AI frameworks like PyTorch and JAX/XLA, specifically for large-scale training workloads. + A strong passion for designing…
- Amazon (Cupertino, CA)
- …a motivation to achieve results. Experience with technologies and tools such as XLA, vLLM or Hugging Face transformers is highly valued. *Utility Computing (UC)* AWS ... help lead the integration of AWS Neuron's capabilities into JAX, PJRT, PyTorch and PyTorch/XLA to ensure a seamless user experience with each new JAX and PyTorch…
- Meta (Burlingame, CA)
- …13. Experience in developing ML compilers (e.g., PyTorch Compiler, Triton, MLIR, JAX, XLA) or ML frameworks (e.g., JAX, vLLM, ONNX, TensorRT). 14. Good understanding of ... the fast-moving Generative AI space 15. Experience in building OSS communities and extensive social media presence in the ML Sys domain. **Public Compensation:** $70.67/hour to $208,000/year + bonus + equity + benefits **Industry:** Internet **Equal…
- Meta (Menlo Park, CA)
- …parallelization, hardware specific optimizations such as SIMD. Experience with MLIR, LLVM, IREE, XLA, TVM, Halide is a plus 12. OR AI frameworks: Experience in ... developing training and inference framework components. Experience in system performance optimizations such as runtime analysis of latency, memory bandwidth, I/O access, compute utilization analysis and associated tooling development **Public Compensation:**…
- Meta (Sunnyvale, CA)
- …Qualifications:** Preferred Qualifications: 9. Experience in developing PyTorch/PT2, Triton, MLIR, JAX, XLA, TVM is a huge plus 10. Knowledge in GPU architecture, ML ... accelerator performance, and developing high-performance kernels. 11. Experience in building OSS communities and extensive social media presence in the ML Sys domain. 12. Experience with training models, end-to-end model optimizations, or applying ML to…
- Amazon (Cupertino, CA)
- …help lead the efforts building distributed training support into PyTorch and JAX using XLA and the Neuron compiler and runtime stacks. This role will help tune these ... models to ensure highest performance and maximize the efficiency of them running on the customer AWS Trainium. Strong software development and ML knowledge are both critical to this role. About the team About Us Inclusive Team Culture Here at AWS, we embrace…
- Amazon (Sunnyvale, CA)
- …For system researchers, familiarity with deep learning compilers, auto-parallelization, and XLA/MLIR ecosystems. Amazon is an equal opportunity employer and does not ... discriminate on the basis of protected veteran status, disability, or other legally protected status. Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to…
- Amazon (Seattle, WA)
- …allow the profiler to support multiple frameworks, such as PyTorch, TensorFlow, and XLA. A successful candidate will have an established background in building AI/ML ... and performance analysis tools. Experience with ML-specific profiler tools (like PyTorch Profiler or TensorFlow Profiler) is highly desirable, along with direct customer-facing experience and a strong motivation to achieve results. A day in the life…
- Amazon (Cupertino, CA)
- …efforts building distributed training and inference support into PyTorch, TensorFlow, JAX using XLA and the Neuron compiler and runtime stacks. This role will help ... tune these models to ensure highest performance and maximize the efficiency of them running on the customer AWS Trainium and Inferentia silicon and the Trn1, Inf1 servers. Strong software development and ML knowledge are both critical to this role. About the…
- Amazon (Seattle, WA)
- …efforts building distributed training and inference support into PyTorch, TensorFlow, JAX using XLA and the Neuron compiler and runtime stacks. This role will help ... tune these models to ensure highest performance and maximize the efficiency of them running on the customer AWS Trainium and Inferentia silicon and the Trn1, Inf1 servers. Strong software development and ML knowledge are both critical to this role. About the…
- CSL Behring (King Of Prussia, PA)
- …to ensure service level management (SLA) and Experience Level management (XLA) metrics are consistently met. **Position Qualifications and Experience Requirements:** ... Education + Undergraduate degree in Information Technology, Computer Science, or a related field preferred. Related certifications, and advanced graduate studies desirable. Experience + 15+ years' experience in the pharmaceutical/biotechnology industry.…
- Amazon (Cupertino, CA)
- …help lead the efforts building distributed training support into PyTorch, TensorFlow using XLA and the Neuron compiler and runtime stacks. This role will help tune ... these models to ensure highest performance and maximize the efficiency of them running on the customer AWS Trainium and Inferentia silicon and the Trn1, Inf1 servers. Strong software development and ML knowledge are both critical to this role. About the team…
- Amazon (Seattle, WA)
- …will lead efforts to build distributed training support into PyTorch and JAX using XLA, the Neuron compiler, and runtime stacks. You will optimize models to achieve ... peak performance and maximize efficiency on AWS custom silicon, including Trainium and Inferentia, as well as Trn2, Trn1, Inf1, and Inf2 servers. Strong software development skills, the ability to deep dive, work effectively within cross-functional teams, and…
- Amazon (Seattle, WA)
- …help lead the efforts building distributed inference support into PyTorch, TensorFlow using XLA and the Neuron compiler and runtime stacks. This role will help tune ... these models to ensure highest performance and maximize the efficiency of them running on the customer AWS Trainium and Inferentia silicon and the Trn1, Inf1 servers. Strong software development using C++/Python and ML knowledge are both critical to this…
- Google (Mountain View, CA)
- …Software stacks (e.g., networking, storage, etc.), ML infrastructure, Compilers (e.g., XLA), or specialization in another ML field. Preferred qualifications: + Master's ... degree or PhD in Engineering, Computer Science, or a related technical field. + 3 years of experience working in a matrixed organization involving cross-functional, or cross-business projects. + 3 years of experience in a technical leadership role leading…
- Amazon (Seattle, WA)
- …(LLVM, GCC) and code generation techniques for new hardware - Experience with XLA, TVM, MLIR, LLVM, deep learning models and algorithms, and deep learning framework ... design. - Interactions with open-source communities, in either a leadership or code contributor role Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status. Los…
- NVIDIA (Santa Clara, CA)
- …experience with Deep Learning frameworks (PyTorch, JAX, etc.), compilers (Triton, XLA, etc.), and NVIDIA libraries (TRTLLM, TensorRT, Nemo, NCCL, RAPIDS, etc.). ... + Familiarity with deep learning architectures and the latest LLM developments. + Background with NVIDIA hardware and software, performance tuning, and error diagnostics. + Hands-on experience with GPU systems in general including but not limited to…
- NVIDIA (Santa Clara, CA)
- …OpenCL programming experience. + Experience with the following technologies: MLIR, LLVM, XLA, TVM, deep learning models and algorithms, and deep learning framework ... design. With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology industry's most desirable employers. We have some of the most brilliant and hardworking people in the world working with us…