- Broadcom (Palo Alto, CA)
- …team of talented and enthusiastic engineers. This role will be a member of the Private AI Services' Model Runtime team, which is a Kubernetes-based control ... a Systems Engineer to join VMware Cloud Foundation's (VCF) AI and Advanced Services team. This position is key...plane that operates ML inferencing services. The successful candidate must have experience… more
- Amazon (Seattle, WA)
- …an experienced Technical Product Manager to define and drive product strategy for Neuron Runtime and ML Infrastructure integration. You will be part of the AWS ... ML performance in the cloud. You will lead runtime and infrastructure requirements working backward from customer needs,...to and experience with Amazon's growing suite of generative AI services and other cloud computing offerings across the… more
- Google (Sunnyvale, CA)
- Senior Software Engineer, AI/ML, Runtime Engines. Google, Sunnyvale, CA, USA. **Mid**. Experience driving progress, solving problems, ... ML field. + 3 years of experience with ML infrastructure (e.g., model deployment, model...will partner with first-party and third-party Cloud customers. The ML, Systems, & Cloud AI (MSCA) organization… more
- pony.ai (Fremont, CA)
- …evaluation, optimization, deployment, and monitoring. As a Machine Learning Engineer in ML Runtime & Optimization, you will be developing technologies to ... went public on NASDAQ in Nov. 2024. Responsibility: The ML Infrastructure team at Pony.ai provides a...compute platform architecture design and software infrastructure. + Apply model optimization and efficient deep learning techniques to models… more
- Palo Alto Networks (Santa Clara, CA)
- …collaborate with internal teams to tackle challenging security problems. As a senior AI/ML-powered cloud security product manager, you will be instrumental in ... office full time, with flexibility when it's needed. This model supports real-time problem-solving, stronger relationships, and the kind...abreast of the latest trends in cloud security and AI/ML + Define product requirements and manage… more
- Amazon (Seattle, WA)
- …machine learning systems, you'll bring expertise in low-level optimization, system architecture, and ML model acceleration. In this role, you will: * Work with ... ML accelerators. This comprehensive toolkit includes an ML compiler, runtime, and application framework that...of the most interesting and impactful infrastructure challenges in AI/ML today. Basic Qualifications - 5+ years… more
- Amazon (Cupertino, CA)
- …machine learning systems, you'll bring expertise in low-level optimization, system architecture, and ML model acceleration. In this role, you will: * Design, ... ML accelerators. This comprehensive toolkit includes an ML compiler, runtime, and application framework that...of the most interesting and impactful infrastructure challenges in AI/ML today. Basic Qualifications - Bachelor's degree… more
- Amazon (Seattle, WA)
- …for development, enablement and performance tuning of a wide variety of ML model families, including state-of-the-art Gen-AI models and massive scale large ... a senior software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This role is...debug and resolve accuracy issues arising from migration of model to AI accelerators. The team develops… more
- Amazon (Cupertino, CA)
- …use them. This role is for a software engineer in the Machine Learning Inference Model Enablement team for AWS Neuron at Annapurna Labs. This role is responsible for ... and performance tuning of a wide variety of LLM model families, including massive scale large language models like...team works side by side with compiler engineers and runtime engineers to create, build and tune distributed inference… more
- Amazon (Seattle, WA)
- …responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like GPT2, ... to and experience with Amazon's growing suite of generative AI services and other cutting-edge cloud computing offerings across...a senior software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This role is… more
- TP-Link North America, Inc. (Irvine, CA)
- We are seeking a Staff AI/ML Computer Vision Engineer to design and develop cutting-edge AI-powered features for our next-generation smart home ... project execution, mentoring engineers, setting standards for deploying efficient, real-time AI at the edge, and ensuring seamless integration with cloud… more
- Amazon (Cupertino, CA)
- …responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like GPT2, ... and F1 EC2 Instances, AWS Neuron, Inferentia and Trainium ML Accelerators, and in storage with scalable NVMe, are...side by side with chip architects, compiler engineers and runtime engineers to create, build and tune distributed… more
- Amazon (Seattle, WA)
- …responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like GPT2, ... a senior software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This role is...side by side with chip architects, compiler engineers and runtime engineers to create, build and tune distributed… more
- Amazon (Seattle, WA)
- …responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like GPT2, ... for a software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This role is...side by side with chip architects, compiler engineers and runtime engineers to create, build and tune distributed… more
- Amazon (Seattle, WA)
- …responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like Llama2, ... for a software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This role is...team works side by side with compiler engineers and runtime engineers to create, build and tune distributed inference… more
- Frontier Technology Inc. (Dayton, OH)
- …complex data relationships. + Develop data services that feed analytics pipelines or integrate AI/ML outputs into runtime systems. + Work with serialization ... + Design and implement APIs, data pipelines, and simulation runtime logic that connect and enable mission applications. +...Kafka or Redis. + Engineer data flow between analytic, AI, and simulation components to support real-time mission use… more
- Amazon (Seattle, WA)
- …responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive-scale Large Language Models (LLM) such ... Stable Diffusion, Vision Transformers (ViT) and many more. The ML Distributed Training team works side by side with...side by side with chip architects, compiler engineers and runtime engineers to create, build and tune distributed training… more
- Amazon (Cupertino, CA)
- …responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive-scale Large Language Models (LLM) such ... Stable Diffusion, Vision Transformers (ViT) and many more. The ML Distributed Training team works side by side with...side by side with chip architects, compiler engineers and runtime engineers to create, build and tune distributed training… more
- Amazon (Cupertino, CA)
- …machine learning systems, you'll bring expertise in low-level optimization, system architecture, and ML model acceleration. In this role, you will: * Design and ... expertise to push the boundaries of what's possible in AI acceleration. The AWS Neuron SDK, developed by the...ML accelerators. This comprehensive toolkit includes an ML compiler, runtime, and application framework that… more
- Amazon (Cupertino, CA)
- …machine learning systems, you'll bring expertise in low-level optimization, system architecture, and ML model acceleration. In this role, you will: * Design and ... expertise to push the boundaries of what's possible in AI acceleration. The AWS Neuron SDK, developed by the...ML accelerators. This comprehensive toolkit includes an ML compiler, runtime, and application framework that… more