  • Staff ML Systems

    General Motors (Mountain View, CA)
    …another relevant field; or equivalent real-world experience + Experience building PB scale ML data management systems (datalake, data warehouse) from the ground ... if more than 3 days]. **The Role** As an engineer on this team, you will be responsible for...in the cloud and providing powerful foundations for GM ML Data Platform tools, frameworks, and services. Responsibilities include… more
    General Motors (05/31/25)
  • Sr Staff Engineer, ML

    LinkedIn (Mountain View, CA)
    …in LinkedIn's Sunnyvale, CA campus. About the Role We are seeking a Senior Staff Engineer to design, build, and maintain our large-scale GPU infrastructure for ... machine learning (ML) and AI workloads. In this role, you will...design, and integrate scalable storage solutions (e.g., parallel file systems, object storage, NVMe over Fabric) to meet performance… more
    LinkedIn (04/18/25)
  • Staff Software Engineer, ML

    Snap Inc. (Palo Alto, CA)
    …ranking and recommendation systems more efficient and impactful. We're looking for a Staff Software Engineer, ML Inference to join Snap Inc! What you'll ... privacy at the forefront. You'll play a critical role in scaling our ML Infrastructure, optimizing AI training and inference systems, and driving innovations… more
    Snap Inc. (04/12/25)
  • Staff AI/ML Engineer

    General Motors (Mountain View, CA)
    …Onboard Embodied AI team is at the forefront of developing groundbreaking onboard ML systems powering fully autonomous vehicles. We leverage modern end-to-end ... field. + 8-10+ years of extensive experience developing and deploying advanced ML systems, particularly in end-to-end real-time onboard applications. + Proven… more
    General Motors (05/03/25)
  • Staff Software Engineer, ML

    Google (Sunnyvale, CA)
    …experience working in a complex, matrixed organization. + Experience with machine learning systems (e.g., background theory, TensorFlow, or other ML tools). + ... on and is growing every day. As a software engineer, you will work on a specific project critical...full-stack as we continue to push technology forward. The ML, Systems, & Cloud AI (MSCA) organization… more
    Google (04/28/25)
  • Staff Software Engineer, Borglet…

    Google (Sunnyvale, CA)
    …team. Our stack is mostly C++ with some Go used for cluster services. The ML, Systems, & Cloud AI (MSCA) organization at Google designs, implements, and manages ... Go. Preferred qualifications: + Experience with system software, distributed systems, and complex multi-component software systems. +...on and is growing every day. As a software engineer, you will work on a specific project critical… more
    Google (05/31/25)
  • AI/ML Staff Software Development…

    IBM (San Jose, CA)
    …most challenging problems? If so, let's talk. **Your role and responsibilities** As a Staff AI/MLOps Development Engineer at Apptio, you will work closely with ... in hybrid cloud environments. You will help design and engineer efficient and resilient MLOps platforms and software products...for a sizable team, in terms of project impact, ML system design, and ML excellence. *… more
    IBM (05/15/25)
  • Staff Software Engineer, AI/…

    Google (Mountain View, CA)
    …distributed team of engineers. + Lead the design and implementation of recommendation systems, optimize ML infrastructure, and guide the development of model ... development. + 5 years of experience building and deploying recommendation systems models (retrieval, prediction, ranking, embedding) in production and experience… more
    Google (05/23/25)
  • Staff Software Engineer, AI/…

    Google (Mountain View, CA)
    …distributed team of engineers. + Lead the design and implementation of recommendation systems, optimize ML infrastructure, and guide the development of model ... development. + 5 years of experience building and deploying recommendation systems models (retrieval, prediction, ranking, embedding) in production and experience… more
    Google (04/05/25)
  • ML Acceleration / Framework Engineer

    Amazon (Cupertino, CA)
    …services. We seek candidates with strong programming skills, eagerness to learn complex systems, and basic ML knowledge. This role offers growth opportunities in ... Learning accelerators. This role is for a Machine Learning Engineer on one of our AWS Neuron teams: -...ML infrastructure, bridging the gap between frameworks, distributed systems, and hardware acceleration. About the team Annapurna Labs… more
    Amazon (04/16/25)
  • Staff Software Engineer, ML

    Snap Inc. (Palo Alto, CA)
    …precision, and always execute with privacy at the forefront. We're looking for a Staff Software Engineer, Machine Learning Infrastructure to join the AI Training ... Google Kubernetes Engine (GKE) and SageMaker + Build comprehensive data management systems for scalable data collection, labeling, processing, and evaluation + Work… more
    Snap Inc. (04/24/25)
  • [PhD] ML Infrastructure Engineer

    Amazon (Cupertino, CA)
    …Machine Learning accelerators. This role is for a Senior Machine Learning Engineer in the Distributed Training team for AWS Neuron, responsible for development, ... enablement and performance tuning of a wide variety of ML model families, including massive-scale Large Language Models (LLM) such as GPT and Llama, as well as… more
    Amazon (04/16/25)
  • Sr. Software Development Engineer

    Amazon (Cupertino, CA)
    …Key job responsibilities AWS Networking is looking for a Sr. Software Development Engineer to drive innovation in Machine Learning (ML) Network Performance. You ... and performance benchmarking systems to deeply understand the performance of the ML network and you will develop and deliver innovative solutions that drive ever… more
    Amazon (05/30/25)
  • [PhD] ML Compiler Engineer

    Amazon (Cupertino, CA)
    …support the development and scaling of a compiler to enable the world's largest ML workloads to run performantly on these custom Annapurna systems. The Product: ... advancing AI capabilities. The Inferentia/Trainium chips specifically offer unparalleled ML inference and training performances. They are enabled through… more
    Amazon (04/16/25)
  • Sr. ML Kernel Performance Engineer

    Amazon (Cupertino, CA)
    …team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, our engineers craft high-performance ... kernels for ML functions, ensuring every FLOP counts in delivering optimal...at the intersection of software, hardware, and machine learning systems, you'll bring expertise in low-level optimization, system architecture,… more
    Amazon (05/17/25)
  • Sr ML Compiler Engineer, Annapurna…

    Amazon (Cupertino, CA)
    …used for building Generative AI on AWS. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium will deliver the ... best-in-class ML training performance with the most teraflops (TFLOPS) of...in performance. You: As a Sr. Machine Learning Compiler Engineer III on the AWS Neuron team, you will… more
    Amazon (05/14/25)
  • Sr. Software Development Engineer, HPC/…

    Amazon (Cupertino, CA)
    Description We are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the ... kernels, and performant code is important. Experience with embedded systems is valued, and experience with high-speed networking or...solving hard problems, want to work with HPC and ML customers, iterate fast and deliver meaningful solutions at… more
    Amazon (05/14/25)
  • Software Engineer - AI/ML, AWS…

    Amazon (Cupertino, CA)
    …the Trn1 and Inf1 servers that use them. This role is for a senior software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This ... Web Services (AWS) is looking for a Software Development Engineer II to build, deliver, and maintain complex products...customers and raise our performance bar. You'll design fault-tolerant systems that run at massive scale as we continue… more
    Amazon (05/06/25)
  • Software Engineer - ML

    Amazon (Santa Clara, CA)
    …the platform. If you have prior experience in building & scaling large scale systems, ML pipelines, and enjoy extracting maximum performance at every layer of ... changes in the industry. * Create solutions to run predictions on distributed systems with exposure to innovative technologies at incredible scale and speed. * Build… more
    Amazon (05/17/25)
  • Sr. Software Engineer - AI/ML, AWS…

    Amazon (Cupertino, CA)
    …the Trn1 and Inf1 servers that use them. This role is for a senior software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This ... enablement and performance tuning of a wide variety of ML model families, including massive scale large language models...(design patterns, reliability and scaling) of new and existing systems experience - - 5+ years of full software… more
    Amazon (05/11/25)