- Amazon (Cupertino, CA)
- …Acceleration Kernel Library team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, ... our engineers craft high- performance kernels for ML functions, ensuring every...Neuron architecture and programming models * Analyze and optimize kernel -level performance across multiple generations of Neuron… more
- Amazon (Cupertino, CA)
- …Acceleration Kernel Library team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, ... our engineers craft high- performance kernels for ML functions, ensuring every...Neuron architecture and programming models * Analyze and optimize kernel -level performance across multiple generations of Neuron… more
- Amazon (Cupertino, CA)
- …are at the forefront of AWS innovation. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium will ... deliver the best-in-class ML training performance with the most teraflops...AWS Neuron Software Development Kit (SDK), which includes an ML compiler, Neuron Kernel Interface (NKI) compiler,… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is seeking a Senior Software Engineer to join our CSP Engagements team, focusing on system software for Datacenter products such as GB200. This role combines ... deep technical expertise in embedded firmware, Linux kernel development, and middleware development, with customer-facing responsibilities to enable cloud service… more
- Meta (Sunnyvale, CA)
- …experience 5. 6+ experience in either silicon architecture, silicon modeling, performance architecture, kernel development, or building tools for silicon ... **Summary:** Meta is seeking an ASIC Engineer , Architecture to join our Infrastructure organization. Our...efficiently. You will have an opportunity to work with AI/ ML and video codec experts in the company, help… more
- Google (Sunnyvale, CA)
- Staff Software Engineer , Borglet ML , Offloads _corporate_fare_ Google _place_ Sunnyvale, CA, USA; Kirkland, WA, USA; +2 more; +1 more **Advanced** Experience ... Drivers, and GPU Programming. + Experience with the Linux kernel interface and containers. + Understanding of key concepts...interface and containers. + Understanding of key concepts of performance analysis and tuning. + Ability to be reliable,… more
- Meta (Menlo Park, CA)
- …one of the following core focus areas: AI frameworks, compiler stack, high performance kernel development and acceleration onto next generation of hardware ... strategy that delivers a highly flexible platform to train & serve new DL/ ML model architectures, combined with auto-tuned high performance for production… more
- Amazon (Cupertino, CA)
- …with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance . The Inference Enablement and Acceleration team ... a wide range of models and supporting novel architecture alongside maximizing their performance for AWS's custom ML accelerators. Working across the stack from… more
- Walmart (Sunnyvale, CA)
- …Performance , Debugging & Troubleshooting** + Identify, diagnose, and resolve performance bottlenecks across Ceph/Scale-Out storage solution, Linux kernel , ... large-scale data replication, backup, and disaster recovery strategies. + Exposure to AI/ ML workloads on Scale-Out storage and performance optimization for GPU… more
- Broadcom (San Jose, CA)
- …designed for high performance computing and networking applications including AI and ML . This is driven by the growing need for high server bandwidth, highest ... of the next generation of Ethernet NIC solutions for AI/ ML and High performance computing applications. We...join the NIC product development team. As a Software Engineer , you will be responsible for designing and development… more
- NVIDIA (Santa Clara, CA)
- …non- ML computer vision + Strong fundamentals with system-level performance : multi-threaded, multi-process and distributed software development. + Grounding in ... pre- and post-processing. + Improve the efficiency of VLM models themselves: kernel optimization in CUDA + Upstream improvements to SDKs and libraries across… more
- NVIDIA (Santa Clara, CA)
- …learning compilers (eg Apache TVM, MLIR) + Strong experience in GPU kernel development and performance optimizations (especially using CUDA C/C++, cuTile, ... We are now looking for a Senior Deep Learning Software Engineer , FlashInfer. NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for… more
- FocusKPI Inc. (Mountain View, CA)
- FocusKPI is looking for an SDET Android System Engineer or Android System Software Development Engineer in Test (SDET) to join one of our clients, a high-tech ... and execute comprehensive test strategies, including functional, integration, regression, and performance testing, with a focus on core Android internals, APIs,… more
- Insight Global (Palo Alto, CA)
- Job Description Insight Global is looking to hire a Senior Performance Engineer for a client in the quantum computing space. This is a fully remote contract ... machine learning models on GPU clusters. - Fine-tune GPU kernels for performance optimization. - Collaborate closely with scientists to support computational needs.… more
- Capital One (San Jose, CA)
- Distinguished AI Engineer (Agentic AI Platform Infrastructure) At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For ... charges to answering their questions in real time, our applications of AI & ML are bringing humanity and simplicity to banking. We are committed to continuing to… more
- Broadcom (San Jose, CA)
- …designed for high performance computing and networking applications including AI and ML . This is driven by the growing need for high server bandwidth, highest ... of the next generation of Ethernet NIC solutions for AI/ ML and High performance computing applications. We...join the NIC product development team.** As a Software Engineer , you will be responsible for designing and development… more
- NVIDIA (Santa Clara, CA)
- …experienced engineer to triage customers' hardware platform issues and AI/ ML workloads in huge datacenters of rack-scale platforms, solve customer problems, and ... NVIDIA is looking for an engineer who wants the excitement of direct customer...programming skills, and experience with multi-GPU platforms. Expertise analyzing performance of distributed GPU-accelerated workloads is a plus. What… more
- Palo Alto Networks (Santa Clara, CA)
- …contributions in these areas are a significant plus. + Experience with low-level performance optimization, such as custom CUDA kernel development or using Triton ... development through runtime. As a Principal Machine Learning Inference Engineer , you will serve as a technical authority and...design and long-term strategy of our AI platform - ML inference. Beyond individual contribution, you will lead complex… more
- Broadcom (San Jose, CA)
- …designed for high performance computing and networking applications including AI and ML . This is driven by the growing need for high server bandwidth, highest ... of the next generation of Ethernet NIC solutions for AI/ ML and High performance computing applications. We...join the NIC product team. As a Software QA Engineer , you will be responsible for validation of the… more
- Walmart (Sunnyvale, CA)
- …management + Knowledge of low-latency serving architectures + Familiarity with ML -specific security requirements + Background in performance profiling and ... **Position Summary ** As a Senior Machine Learning Engineer , you are a technical leader working at...validation + Develop monitoring, logging, and alerting systems for ML services + Create infrastructure for A/B testing and… more