- Cisco (Milpitas, CA)
- Senior Full Stack Engineer - Cloud-Native Observability Platform We are the Catalyst Center Platforms and Capabilities team, responsible for delivering ... innovation. One of our key initiatives is a cloud-native observability platform purpose-built for Cisco Catalyst Center...What You'll Do We're looking for a Senior Software Engineer to take ownership of building and shaping the… more
- NVIDIA (Santa Clara, CA)
- …be doing: + Design, implement and support operational and reliability aspects of large scale Observability & Telemetry collection platform with a focus on ... at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and availability using the combination of… more
- NVIDIA (Santa Clara, CA)
- …HPC Observability Engineer to design and build the next-generation observability platform for large- scale AI workloads, GPU clusters, and ... high-throughput data stores (eg, TSDBs, columnar databases, OLAP systems) for large- scale observability data. + Drive self-service analytics capabilities through… more
- NVIDIA (Santa Clara, CA)
- …a Senior/Staff Engineer to compose and build the next-generation, multi-region observability platform . This platform powers our rapidly expanding AI, ... and Observability ecosystem, operating at an immense scale : trillions of metrics, hundreds of terabytes of logs,...including GPU Compute, Distributed Systems, Networking, ML Infra, AI Platform , and Cloud Services to ensure engineers have deep… more
- Cisco (San Jose, CA)
- …management, logging standards, and ensuring what's most useful is monitored. + Operate the Observability platform components, such as our Splunk Platform and ... as AI Defense, AI Canvas, and AI Assistants. Within the DevOps team, our Cloud Platform and Observability group provides all the necessary insights to power our… more
- NVIDIA (Santa Clara, CA)
- …scale on-prem Infrastructure and Networking. + Hands-on experience with managing large scale Observability Platforms with LLMs & ML Models and building custom ... We are looking for a highly skilled Principal Software Engineer to design and develop AIOps & Observability...from wide range of assets. + Developed unified cloud observability platform to monitor Network, Compute, Power,… more
- Nutanix (San Jose, CA)
- …you a passionate engineer with a strong background in Performance and Observability , experience of building & scaling distributed systems, and a desire to build ... India and is focused on building an enterprise-grade data platform and delivering exceptional multi cloud observability ...that is distributed, resilient, and highly performant at a large- scale deployment. + Develop a robust design and write… more
- Palo Alto Networks (Santa Clara, CA)
- …designing, building, and maintaining the infrastructure and automation that powers our large- scale cloud platform . You will work closely with engineering teams ... engineer who is passionate about automation, cloud infrastructure, observability , and continuous integration/deployment. You will contribute to the evolution of… more
- Palo Alto Networks (Santa Clara, CA)
- …Exact Data Matching (EDM), Document Fingerprinting, and advanced ML/AI classifiers. **High- Scale Platform Engineering & Performance** Ensure the platform ... Kubernetes, observability , and SLO management is applied across the platform . **Mentorship & Operational Excellence** The role requires leadership in engineering… more
- NVIDIA (Santa Clara, CA)
- …as well as advanced switching and routing concepts + Experience collaborating with platform security experts to define tradeoffs between security and ease of use. + ... to influence and achieve results without direct authority in large- scale , collaborative environments. Demonstrable experience in implementing left shift strategy… more
- Google (Sunnyvale, CA)
- Senior Engineering Manager, ML Optimization Tools and Observability _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Advanced** Experience owning outcomes and ... of the following: ML performance, debugging, optimization, profiling, or observability . + 5 years of experience in a people...Like Google's own ambitions, the work of a Software Engineer goes beyond just Search. Software Engineering Managers have… more
- MongoDB (Palo Alto, CA)
- We're looking for a Lead Engineer , Inference Platform to join our team building the inference platform for embedding models that power semantic search, ... deeply integrated into Atlas and optimized for developer experience. As a Lead Engineer , Inference Platform , you'll be hands-on with design and implementation,… more
- LinkedIn (Mountain View, CA)
- …engineering and serving with hundreds of billions of parameters models and large scale feature engineering infra for all AI use cases from recommendation models, ... performance optimizations across billions of user queries. Model Training Infrastructure: As an engineer on the AI Training Infra team, you will play a crucial role… more
- Cisco (San Jose, CA)
- **Meet the Team** The Splunk Observability Application Platform Team is a dynamic group of engineers responsible for the core platform powering Splunk ... Our platform is the foundation for advanced observability capabilities that enable our customers to thrive. We...teams, driving significant impact and working with microservices at scale . Our mission is to optimize platform … more
- LinkedIn (Mountain View, CA)
- …engineering and serving with hundreds of billions of parameters models and large scale feature engineering infra for all AI use cases from recommendation models, ... performance optimizations across billions of user queries Model Training Infrastructure: As an engineer on the AI Training Infra team, you will play a crucial role… more
- IBM (San Jose, CA)
- …any image for any cloud. * [4] Waypoint makes infrastructure easily accessible at scale , enabling platform teams to deliver golden patterns and workflows with an ... security, and scalability in their cloud journey. HashiCorp Cloud Platform (HCP) is the backbone of product delivery for...critical part of HCP with a mission to provide observability data to the customers. Observability data… more
- MongoDB (Palo Alto, CA)
- …for a Senior Engineer to help build the next-generation inference platform that supports embedding models used for semantic search, retrieval, and AI-native ... Atlas and designed for developer-first experiences. As a Senior Engineer , you'll focus on building core systems and services...product, infrastructure, and ML teams to ensure the inference platform meets the scale , reliability, and latency… more
- Zscaler (San Jose, CA)
- …to Zscaler and help shape the future of cybersecurity. We are looking for a Sr. Staff Platform Engineer to join our team. This is a Hybrid role based in San ... the Zero Trust Exchange department. As a Sr. Staff Platform Engineer , you'll lead our multi-tenant Kubernetes...that helps teams ship quickly and confidently at global scale . **What you'll do (Role Expectations)** + Own architecture… more
- General Motors (Mountain View, CA)
- …Software Engineer , you will architect and build the core platform services including the API gateway, scheduler, lifecycle orchestration, and developer tooling ... to shape a platform that transforms automotive software development at GM scale . **What You'll Do** + Design and implement core platform services including… more
- General Motors (Mountain View, CA)
- …reliability and cost efficiency. **About the Role:** We are seeking a Staff ML Engineer to help build and scale robust compute platforms for ML workflows. ... large scale initiatives + Experience working with Google Cloud Platform , Microsoft Azure, or Amazon Web Services **Preferred Qualifications** + Hands-on… more