- Amazon (Cupertino, CA)
- …Acceleration Kernel Library team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, ... our engineers craft high- performance kernels for ML functions, ensuring every...Neuron architecture and programming models * Analyze and optimize kernel -level performance across multiple generations of Neuron… more
- Amazon (Cupertino, CA)
- …stack for Trainium and Inferentia, the AWS Machine Learning chips, delivering best-in-class ML performance in the cloud. You will lead NKI requirements working ... to define and drive product strategy for the Neuron Kernel Interface (NKI), a compiler library enabling custom ...Inferentia, the AWS Machine Learning chips. Inferentia delivers best-in-class ML inference performance at the lowest cost… more
- Amazon (Cupertino, CA)
- …stack for Trainium and Inferentia, the AWS Machine Learning chips, delivering best-in-class ML performance in the cloud. You will lead framework integration ... of Trainium and Inferentia, the AWS Machine Learning chips. Inferentia delivers best-in-class ML inference performance at the lowest cost in the cloud to… more
- LinkedIn (Mountain View, CA)
- …models, large language models, to computer vision models. We optimize performance across algorithms, AI frameworks, data infra, compute software, and hardware. ... in our current CPU and GPU fleet, and develop high- performance and power-efficient FPGA solutions for the large scale...solutions for the large scale AI models. As a Sr . Staff Engineer on the AI Infra team, you… more
- Amazon (Cupertino, CA)
- …troubleshooting bottlenecks and improving architectures and algorithms on Graviton. - Contribute performance improvements and bug fixes to Linux kernel and other ... Graviton Software team is seeking Software Engineers to drive performance optimization of open source projects, internal services and...down the stack - from working on the Linux kernel to debugging and optimizing C++ or Rust applications… more
- Amazon (Cupertino, CA)
- …Seattle. The AWS Graviton Software team is seeking Software Engineers to optimize performance for AWS Graviton. Graviton delivers the best price/ performance in ... is used by over 90% of our largest customers. You'll drive performance optimization across open source projects, internal services, and customer applications,… more
- Broadcom (San Jose, CA)
- …controllers/Network Interface Cards targeted towards Enterprise, Server & Storage and AI/ ML markets. Key responsibilities are to assist customers in qualifying and ... on driver bring up in customer environment, optimizing drivers for optimal performance , helping customers bring up features like RDMA, RoCEv2, end-to-end congestion… more
- Amazon (Cupertino, CA)
- …team. These components are used by all of AWS server platform teams, eg AI/ ML servers, storage servers, compute servers, etc. Given the sheer number of programs that ... products on a timely fashion and consistently enhance the quality and performance of customers' products, fostering long-term partnerships. You will lead and develop… more
- NVIDIA (Santa Clara, CA)
- …in high-level frameworks like PyTorch and HuggingFace to developing and improving high- performance kernel implementations in CUDA, TRT-LLM, and Triton. This is ... best-in-class AI models. We are now looking for a Senior Deep Learning Software Engineer to develop and scale...our automated deployment solution. + Analyze and profile GPU kernel -level performance to identify hardware and software… more
- Walmart (Sunnyvale, CA)
- **Position Summary ** Job Description Summary Senior , Software Engineer - TV SDK You will joing a team who will be designing, building, and maintaining native / ... profiling-memory, GPU bandwidth, GC pauses-and guide OEM partners on kernel or firmware fixes. + Mentor & document: publish...Nielsen DAR, Conviva). + Exposure to Generative AI / ML inference on device (ONNX Runtime, TensorRT) for upscaling,… more
- Google (Sunnyvale, CA)
- …involving cross-functional, or cross-business projects. + Experience in computer architecture, high- performance computing. + Experience in Linux Kernel , Linux ... problems across the full-stack as we continue to push technology forward. The ML , Systems, & Cloud AI (MSCA) organization at Google designs, implements, and manages… more
- Amazon (Cupertino, CA)
- …tackle challenges that seemed insurmountable just yesterday, delivering breakthrough performance in cloud computing and AI infrastructure. We're looking for ... on custom-designed servers and hardware that form the foundation of modern ML infrastructure - Help shape the future of AWS's machine learning capabilities… more