- Aequor (Waltham, MA)
- Cloud Infrastructure/AI/ML Engineer Position Overview: We are seeking an experienced Cloud Infrastructure/AI/ML/Data Engineer to support a variety of projects ... across Genomic Medicine Unit (GMU) research and platform work. This contractor role focuses on providing infrastructure solutions to enable AI/ML model development…
- Labelbox (San Francisco, CA)
- …some Java & Kotlin; APIs: GraphQL; Cloud & Infrastructure: Google Cloud Platform (GCP), Kubernetes; Databases: MySQL, Spanner, PostgreSQL; Queueing / Streaming: ... company offering three integrated solutions for frontier AI development: Enterprise Platform & Tools: Advanced annotation tools, workflow automation, and quality…
- OpenAI (San Francisco, CA)
- …backing ChatGPT and the API. The systems we support include inference Kubernetes clusters, GPU health, InfiniBand performance, node lifecycle, and more. We seek to ... GPU clusters at scale. Have experience operating orchestration systems such as Kubernetes at scale. Take pride in building and operating scalable, reliable, secure…
- Deutsche Bank (Cary, NC)
- Job Description: Job Title: Senior UI/UX Engineer. Corporate Title: Assistant Vice President. Location: Cary, NC. Overview: Our Corporate Trust team in Cary under Trust ... Proficient in designing, developing, and maintaining complex applications on a Java-based platform. Experience with JavaScript frameworks such as React and Angular is a must. Strong…
- Palantir Technologies (New York, NY)
- …Maintaining availability of physical Linux servers that power the Palantir platform in air-gapped production environments. Design, deploy, and operate infrastructure ... in secure facilities. Experience with containers (Docker/Podman) and orchestration (OpenShift/Kubernetes) at scale is a plus. Preferred Certifications: DoD 8570…
- DigitalOcean (Seattle, WA)
- …the dreamers and builders in the world. We are looking for a Senior Engineer I who is passionate about building scalable, intuitive, and reliable billing solutions. ... As a Senior Engineer I on the Billing Engineering Team at DigitalOcean, ... Finance, Product, and Engineering to ensure that our billing platform remains accurate, transparent, and highly available. The ideal…
- JPMorgan Chase & Co. (Chicago, IL)
- …a realm tailored for top achievers in site reliability. As a Lead Site Reliability Engineer at JPMorgan Chase within the CIB - Global Banking, you hold a leadership ... best practices with the ability to implement these practices within an application or platform. Fluency in at least one programming language (e.g., Python, Java…
- TikTok (San Jose, CA)
- …US users safe. Our focus is on providing oversight and protection of the TikTok platform and US user data, so millions of Americans can continue turning to TikTok to ... containers for multiple services and deploying, managing, and monitoring them in a Kubernetes cluster; developing CI/CD tools in Python and writing automation Bash…
- Celonis (Redwood City, CA)
- …critical role in ensuring the health, performance, and resilience of our platform. The team applies advanced software engineering and Site Reliability Engineering ... enhances the availability, scalability, and efficiency of our services. Partner with platform and application development teams to learn from incidents and improve…
- Saviance (Boston, MA)
- Job Title: Site Reliability Engineer. Location: Remote with quarterly visits to Chennai, Tamil Nadu, India. Duration: Full-Time. About BigRio: BigRio is a remote-based, ... software solutions. About the Job: We are looking for a Site Reliability Engineer (SRE) to join our small but dynamic team. You will collaborate closely with…
- Cynet Systems (Sunnyvale, CA)
- …experience working with container orchestration frameworks, including on-prem and Rancher Kubernetes, and good knowledge of Kubernetes objects. Experience working ... SQL and NoSQL databases. Experience building CI/CD pipelines (preferred). Cloud platform knowledge (specifically AWS) is required. Incident handling and problem management.
- Firsthand (New York, NY)
- About Firsthand: Firsthand has built the first AI-powered Brand Agent platform, transforming the way marketers and publishers engage consumers through their own AI ... and advertising focus on back-office automation, the Firsthand Brand Agent Platform (TM) powers front-line consumer engagement. Operating across both owned…
- TikTok (New York, NY)
- …at any time. Responsibilities: - Work with infrastructure, product and platform engineering teams on operating and deploying software platforms, capacity planning ... Azure, GCP. - Familiarity with infrastructure and provisioning tools like Kubernetes, Terraform, Ansible, and SaltStack. - Secure infrastructure in a distributed…
- TriNet (Atlanta, GA)
- …reliability, performance, high availability, observability, and overall stability of the platform by leveraging the key SRE foundational principles such as ... Memcached. Experience working with IaC tools like Terraform and Ansible, and managing Kubernetes services, including Helm. Good knowledge of REST APIs, OAuth, OpenID…
- Glean (Palo Alto, CA)
- …Glean: Founded in 2019, Glean is an innovative AI-powered knowledge management platform designed to help organizations quickly find, organize, and share information ... build a better way - an AI-powered enterprise search platform that helps people quickly and intuitively access the ... are seeking a skilled and motivated Senior Site Reliability Engineer (SRE) to become a valuable addition to our…
- GoodLeap (Irvine, CA)
- …customer communication, deeper business intelligence, and streamlined payment and operations. Our platform has led to more than $30 billion in financing for ... people across Africa, Asia, and South America. Position Summary: The Site Reliability Engineer (SRE) role is a hybrid position that combines elements of software…
- Replit (San Mateo, CA)
- …infrastructure that serves millions of developers worldwide. As a Site Reliability Engineer, you will bridge the gap between development and operations, implementing ... automation and establishing best practices that enable our platform to scale efficiently while maintaining high availability. We are seeking SREs who are passionate…
- Okta (San Francisco, CA)
- …bring to the role 9+ years of experience as a site reliability or platform engineer, preferably in a fast-scaling environment. 3+ years of experience building ... device or app. Our flexible and neutral products, Okta Platform and Auth0 Platform, provide secure access, ... and operating workloads orchestrated by Kubernetes. Have familiarity with large-scale containerised deployments, both…
- SEI Investments (Boston, MA)
- We have a challenging career opportunity for a DevOps/SRE Engineer. You will have the ability to automate tasks that cause friction for both our customers and ... processes. Ensure organization compliance requirements are met. Ensure security of platform. Plan, deploy, and maintain critical business applications in prod/non-prod…
- TikTok (San Jose, CA)
- …models and systems to identify and defend against internet abuse and fraud on our platform. Our mission is to protect billions of users and publishers across the globe ... trust and safety system using the tremendous amount of data generated on the platform. With the continuous efforts from our team, TikTok is able to provide the…