- Amazon (Cupertino, CA)
- …Acceleration Kernel Library team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, ... our engineers craft high- performance kernels for ML functions, ensuring every...Neuron architecture and programming models * Analyze and optimize kernel -level performance across multiple generations of Neuron… more
- Amazon (Cupertino, CA)
- …Acceleration Kernel Library team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, ... our engineers craft high- performance kernels for ML functions, ensuring every...Neuron architecture and programming models * Analyze and optimize kernel -level performance across multiple generations of Neuron… more
- Amazon (Cupertino, CA)
- …stack for Trainium and Inferentia, the AWS Machine Learning chips, delivering best-in-class ML performance in the cloud. You will lead NKI requirements working ... to define and drive product strategy for the Neuron Kernel Interface (NKI), a compiler library enabling custom ...Inferentia, the AWS Machine Learning chips. Inferentia delivers best-in-class ML inference performance at the lowest cost… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is seeking a Senior Software Engineer to join our CSP Engagements team, focusing on system software for Datacenter products such as GB200. This role combines ... deep technical expertise in embedded firmware, Linux kernel development, and middleware development, with customer-facing responsibilities to enable cloud service… more
- Amazon (Cupertino, CA)
- …are at the forefront of AWS innovation. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium will ... deliver the best-in-class ML training performance with the most teraflops...AWS Neuron Software Development Kit (SDK), which includes an ML compiler, Neuron Kernel Interface (NKI) compiler,… more
- Deloitte (San Jose, CA)
- …familiarity with Go or Rust a plus. + Strong understanding of AI/ ML frameworks (PyTorch, TensorFlow, ONNX) and performance /model optimization. + Familiarity ... Sr . Java Full Stack Developer - Project Delivery...do/Responsibilities Join our AI and Systems Co-Design team, pioneering high- performance software and hardware technologies for AI and next-generation… more
- Amazon (Cupertino, CA)
- …with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance . The Inference Enablement and Acceleration team ... a wide range of models and supporting novel architecture alongside maximizing their performance for AWS's custom ML accelerators. Working across the stack from… more
- quadric.io, Inc (Burlingame, CA)
- …and endpoint devices, ranging from battery operated smart-sensor systems to high- performance automotive or autonomous vehicle systems. Unlike other NPUs or neural ... Domain Knowledge: Demonstrated ability to drive complex technical projects in the AI/ ML and embedded processing domain. Highly Desired Skills and Experience (Pluses)… more
- NVIDIA (Santa Clara, CA)
- …non- ML computer vision + Strong fundamentals with system-level performance : multi-threaded, multi-process and distributed software development. + Grounding in ... pre- and post-processing. + Improve the efficiency of VLM models themselves: kernel optimization in CUDA + Upstream improvements to SDKs and libraries across… more
- NVIDIA (Santa Clara, CA)
- …learning compilers (eg Apache TVM, MLIR) + Strong experience in GPU kernel development and performance optimizations (especially using CUDA C/C++, cuTile, ... We are now looking for a Senior Deep Learning Software Engineer, FlashInfer. NVIDIA has...the team, you'll develop libraries, code generators, and GPU kernel technologies for NVIDIA's hardware architecture. This means designing… more
- Insight Global (Palo Alto, CA)
- Job Description Insight Global is looking to hire a Senior Performance Engineer for a client in the quantum computing space. This is a fully remote contract ... machine learning models on GPU clusters. - Fine-tune GPU kernels for performance optimization. - Collaborate closely with scientists to support computational needs.… more
- NVIDIA (Santa Clara, CA)
- …solutions. Your problem-solving prowess will be crucial for AON's success with ML workloads. + Architect and Build High- Performance Platforms: Transform user ... our Developer Tools Always-On Profiling (AON) team as a Senior Software Architect, where you'll be pivotal in designing,...CUDA APIs, runtime, streams, kernels, and GPU architecture. + ML Ecosystem & Performance Analysis: Familiarity with… more
- NVIDIA (Santa Clara, CA)
- …solutions. Your problem-solving prowess will be crucial for AON's success with ML workloads. + Architect and Build High- Performance Platforms : Transform ... our Developer Tools Always-On Profiling (AON) team as a Senior Software Architect, where you'll be pivotal in designing,...CUDA APIs, runtime, streams, kernels, and GPU architecture. + ML Ecosystem & Performance Analysis: Familiarity with… more
- Walmart (Sunnyvale, CA)
- …management + Knowledge of low-latency serving architectures + Familiarity with ML -specific security requirements + Background in performance profiling and ... **Position Summary ** As a Senior Machine Learning Engineer, you are a technical...validation + Develop monitoring, logging, and alerting systems for ML services + Create infrastructure for A/B testing and… more
- NVIDIA (Santa Clara, CA)
- …for customer use cases, ensuring optimal end-to-end workflows and balanced accuracy- performance trade-offs. + Conduct deep GPU kernel -level profiling to ... and streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative AI… more
- NVIDIA (Santa Clara, CA)
- …for an experienced engineer to triage customers' hardware platform issues and AI/ ML workloads in huge datacenters of rack-scale platforms, solve customer problems, ... solid programming skills, and experience with multi-GPU platforms. Expertise analyzing performance of distributed GPU-accelerated workloads is a plus. What you'll be… more
- NVIDIA (Santa Clara, CA)
- NVIDIA networking designs and manufactures high- performance networking equipment that enable the most powerful super computers in the largest data centers in the ... or RoCE (RDMA over Converged Ethernet) we make powerful ML /AI platforms possible. We believe in our people and...in large clusters even more performant. As a networking Sr . Solutions Architect at NVIDIA you will have agency… more
- Deloitte (Sacramento, CA)
- …critical to businesses. Your contributions can help clients improve financial performance , accelerate new digital ventures, and fuel growth through innovation. AI ... and business applications - delivering production-grade reliability, scalability, and performance . + Engineer core solution components directly, including data… more
- Cisco (San Jose, CA)
- …towards optimal results with regard to reusability, security, reliability, and performance . * Collaborate with various stakeholders to design complex systems ... comprised of multiple ML and non- ML services which meet the... services which meet the highest levels of security, performance , reliability, and scalability while satisfying requirements from within… more
- NVIDIA (Santa Clara, CA)
- …and streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative LLMs, VLMs, ... of use, compute and memory efficiency, and achieving the best accuracy- performance tradeoffs through software-hardware co-design. Your work will span multiple layers… more