• TikTok (San Jose, CA)
    …cost-effective. You'll also have the opportunity to design, build and deliver all kinds of systems as a software engineer. Responsibilities: - Engage in and ... Preferred Qualifications: - Expertise in designing, analyzing, and troubleshooting large-scale distributed systems. - Ability to debug, optimize code, and… more
    Upward (07/01/25)
  • SailPoint Technologies Holdings, Inc. (San Jose, CA)
    …latency, hallucination, permissions, and oversight. Proficiency with cloud-native platforms, distributed systems, and infrastructure design for scalability and ... the role We're seeking a hands-on, deeply technical Principal Engineer to lead innovation in the Agentic AI space..... This role sits at the intersection of deep systems engineering, modern AI architectures, and the… more
    Upward (07/07/25)
  • Enfabrica (Mountain View, CA)
    …Join an ambitious and highly experienced team of silicon and hyperscale data center systems experts as a Physical Design Engineer. Our team is motivated by ... track record on early-stage investments. We are a diverse team of expert chip/software/systems architects and developers who excel in hardware/software… more
    Upward (07/17/25)
  • X Corp. (Palo Alto, CA)
    …such as CDN operations, containerization, incident management, traffic routing, and distributed systems. Proficiency in scripting and automation (Python, Perl, ... crucial role in ensuring the high performance, reliability, and security of our systems. Each team focuses on different aspects of our infrastructure. Teams: CDN… more
    Upward (06/27/25)
  • Replit (San Mateo, CA)
    …used for automation (Python, Go, or similar) Deep understanding of distributed systems Experience with container orchestration platforms (Kubernetes) and ... Replit is the fastest way to turn ideas into software. With our powerful AI-powered Agent and Assistant, anyone...serves millions of developers worldwide. As a Site Reliability Engineer, you will bridge the gap between development and… more
    Upward (07/01/25)
  • Circle (San Francisco, CA)
    …collaboration Observability, troubleshooting, and performance optimization skills in complex, distributed systems Experience with: Kubernetes clusters at scale, ... Strong observability, problem-solving, and performance optimization skills in complex, distributed systems Hands-on experience with Blue-Green, Canary, and… more
    Upward (07/15/25)
  • Air Apps (San Francisco, CA)
    …and system monitoring. Knowledge of load balancing, failover strategies, and distributed systems. Understanding of security best practices, access control, ... along the way. The Role As a Site Reliability Engineer (SRE) at Air Apps, you will be responsible...systems. You will work at the intersection of software development and operations, implementing automation, monitoring, and performance… more
    Upward (07/08/25)
  • Software Engineer

    Rubrik (Palo Alto, CA)
    …services for system monitoring, detecting faults, and automatically self-healing the distributed systems + Design, develop, and operationalize high-performance, ... Computer Science or related field + 2+ years of software development experience on Linux, preferably in Platform/Systems...domain + Strong fundamentals in data structures, algorithms, and distributed systems design + Strong background in… more
    Rubrik (05/08/25)
  • Senior Software Engineer

    NVIDIA (Santa Clara, CA)
    …from the crowd: + Technical competency in managing and automating large-scale distributed systems independent of cloud providers. Advanced hands-on experience ... + 5+ years in similar role and experience on large-scale production systems. Experience with common software engineering principles, tools and techniques.… more
    NVIDIA (07/02/25)
  • Senior Applied AI Software Engineer

    NVIDIA (Santa Clara, CA)
    …building the next generation of scalable AI systems. As a Senior Applied AI Software Engineer on the Dynamo project, you will address some of the most ... Go for Kubernetes controllers and operators development. + Deep understanding of distributed systems, parallel computing, and GPU architectures. + Experience… more
    NVIDIA (06/11/25)
  • Software Dev Engineer III,…

    Amazon (East Palo Alto, CA)
    …base. You'll bring a passion for innovation, data, search, analytics, and distributed systems. You'll also: Solve challenging technical problems, often ones ... about transforming business challenges into technological breakthroughs? Join Amazon as a Software Development Engineer (SDE) and help shape the future of… more
    Amazon (07/02/25)
  • Senior DGX Cloud Software Engineer

    NVIDIA (Santa Clara, CA)
    …and fleet management engineering. + Experience with infrastructure automation and distributed systems design developing tools for running large scale ... We are seeking Software Engineers with previous experience building and running...more of the following: Linux, Slurm, Kubernetes, Local and Distributed Storage, and Systems Networking. Ways to… more
    NVIDIA (06/27/25)
  • Software Engineer - AI/ML, AWS…

    Amazon (Cupertino, CA)
    …- Bachelor's degree in computer science or equivalent - Preferred previous software engineer expertise with Pytorch/Jax/Tensorflow, Distributed libraries and ... customers and raise our performance bar. You'll design fault-tolerant systems that run at massive scale as we continue...that use them. This role is for a senior software engineer in the Machine Learning Applications… more
    Amazon (07/15/25)
  • Sr. Software Engineer - AI/ML, AWS…

    Amazon (Cupertino, CA)
    …design or architecture (design patterns, reliability and scaling) of new and existing systems experience - 5+ years of full software development life cycle, ... science or equivalent - Experience in computer architecture - Previous software engineering expertise with Pytorch/Jax/Tensorflow, Distributed libraries and… more
    Amazon (07/02/25)
  • Sr. Software Engineer - AI/ML, AWS…

    Amazon (Cupertino, CA)
    …design or architecture (design patterns, reliability and scaling) of new and existing systems experience - 5+ years of full software development life cycle, ... science or equivalent - Experience in computer architecture - Previous software engineering expertise with Pytorch/Jax/Tensorflow, Distributed libraries and… more
    Amazon (06/11/25)
  • Sr. Software Engineer - AI/ML, AWS…

    Amazon (Cupertino, CA)
    …and the Trn1 and Inf1 servers that use them. This role is for a senior software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This ... and runtime engineers to create, build and tune distributed training solutions with Trn1. Experience training these large...systems experience - 5+ years of full software development life cycle, including coding standards, code reviews,… more
    Amazon (07/18/25)
  • Software Engineer III, Google…

    Google (Sunnyvale, CA)
    …academic or industry setting. + Experience building and supporting large scale distributed systems and infrastructure. + Familiarity with Kubernetes development, ... of experience with an advanced degree. + Experience in distributed computing or machine learning infrastructure. Preferred qualifications: +...goes on and is growing every day. As a software engineer, you will work on a… more
    Google (07/12/25)
  • Senior Software Engineer, Google…

    Google (Sunnyvale, CA)
    …design and architecture. + 3 years of experience developing large-scale infrastructure, distributed systems or networks, or experience with compute technologies, ... bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage,...goes on and is growing every day. As a software engineer, you will work on a… more
    Google (07/15/25)
  • ML Acceleration / Framework Engineer

    Amazon (Cupertino, CA)
    …offers growth opportunities in ML infrastructure, bridging the gap between frameworks, distributed systems, and hardware acceleration. About the team Annapurna ... Learning accelerators. This role is for a Machine Learning Engineer on one of our AWS Neuron teams: -...The ML Inference team collaborates closely with hardware designers, software optimization experts, and systems engineers to… more
    Amazon (07/15/25)
  • Senior Technical Marketing Engineer

    NVIDIA (Santa Clara, CA)
    …Our data center platforms integrate CPUs, GPUs, DPUs, networking, and a full-stack software ecosystem to power AI at scale! We are seeking a highly technical ... and creative Senior Technical Marketing Engineer to join our team to showcase the innovations...world's largest AI models. This role will focus on distributed AI model training, ensuring that customers and partners… more
    NVIDIA (07/17/25)