- Wells Fargo (Iselin, NJ)
- …LLMs and leveraging **Python** for integration and ML model development + Exposure to ** AIOps ** , ** SRE principles** , and **infrastructure as code** + Strong ... communication skills and ability to present technical concepts to leadership + Experience with **NoSQL databases** , **SQL optimization** , and **data modeling** **Job Expectations:** + Hybrid work environment with occasional travel (up to 5%) + Flexibility to… more
- Kyndryl (New York, NY)
- …delivery (eg, agile, cloud native, DevOps) and operations (eg, observability, automated response, SRE , AIOps , etc.) including the ability to articulate a path ... toward a target operating model (people, process, and tools/technology) + Desire and demonstrated ability to stay abreast of emerging technologies that are priorities for utilities, such as artificial intelligence, machine learning, computer vision, and GenAI,… more
- Steampunk (Mclean, VA)
- …accounts, and more. **Contributions** As a **Sr. Steampunk Site Reliability Engineer ( SRE ),** you will be responsible for working with program development teams, ... for the reliability of an organizations' applications and infrastructure. As an SRE , your primary responsibility is to combine aspects of software engineering with… more
- SOS International LLC (Redstone Arsenal, AL)
- …Secret Security Clearance with SCI eligibility Experience as a Site Reliability Engineer ( SRE ) Experience with AIOps and FinOps Experience with Petabyte scale ... data sets Experience with large-scale, multi-INT analytics BS or MS in Computer Science, Statistics, Mathematics, Physics or a quantitative field Work Environment Working conditions are normal for an office environment. Fast paced, deadline-oriented… more
- EPAM Systems (Newtown, PA)
- …client accounts. You'll architect next-generation managed services that integrate DevOps, SRE principles, and intelligent automation to transform how we deliver ... advisor to clients, designing future-ready service landscapes that leverage GenAI, AIOps , and advanced observability across cloud and on-premise environments while… more
- JPMorgan Chase (Jersey City, NJ)
- …building innovative platforms, automating infrastructure operations, and enabling Agentic-based AIOps platforms. Our mission is to enhance scalability, security, and ... you will be tasked with the design, construction, and maintenance of our AIOps solution. This role demands a profound knowledge of AI/ML technologies, IT… more
- TEKsystems (Overland Park, KS)
- …orchestration systems, OSS/BSS interfaces, and service assurance platforms * Work closely with SRE and NOC teams to identify and automate high-impact Day 2 incident ... SMF, IMS, PCF, Linux, openshift, Prometheus, Grafana, Kibana, ServiceNow AIOps , Red Hat Top Skills Details devops,Cloud Operations,Telcom,Application Support,AMF,SMF,IMS,PCF,Linux,openshift,Prometheus,Grafana,Kibana,ServiceNow… more
- CVS Health (Trenton, NJ)
- …to stay abreast of the latest trends and advancements in AI/ML, IT Operations, SRE , and Cybersecurity. + Advocate for the adoption of AI/ML solutions across the ... + 15+ years leading Enterprise Machine Learning, Infrastructure, Data Science, and/or SRE practices + 5+ years applying Machine Learning to optimize technology… more
- TEKsystems (Overland Park, KS)
- …KPIs (eg, session count, call drops, throughput) * Collaborate with NOC and SRE teams to design runbooks and remediation playbooks for common issues * Identify ... SMF, IMS, PCF, Linux, openshift, Prometheus, Grafana, Kibana, ServiceNow AIOps , Red Hat Top Skills Details devops,Cloud Operations,Telcom,Application Support,AMF,SMF,IMS,PCF,Linux,openshift,Prometheus,Grafana,Kibana,ServiceNow… more
- The Walt Disney Company (Glendale, CA)
- …role with strategic reach. You'll collaborate closely with security, DevOps, SRE , and application teams to deliver platform capabilities that improve developer ... Terraform, enabling reusable and composable module patterns. + Lead AIOps initiatives including anomaly detection, automated remediation, and intelligent alerting… more
- Cisco (AZ)
- …building anomaly detection systems. You should be comfortable working with MLOps, AIOps PyTorch, TensorFlow, MLflow and ONNX, and apply innovative ML capabilities to ... engineering, design, TPM, and documentation. + Collaborate with QA, SRE , and release teams to plan, drive and implement...environments. + Actively maintain a view of how the AIOps market, competition and technologies are evolving, and factor… more
- PagerDuty (San Francisco, CA)
- …at translating complex technical workflows into intuitive experiences for engineering, SRE , and business users. + Excellent communication and collaboration skills, ... with the PagerDuty Operations Cloud. The PagerDuty Operations Cloud combines AIOps , Automation, Customer Service Operations and Incident Management with a powerful… more
- Citigroup (Jacksonville, FL)
- …have good knowledge of the Technology Landscape Project Management methodology, SRE , Service Management, Operational Risk Management, Change Management & Capacity ... on Risk - and manage all IA 7. Drive automation and data driven AIOPS adoption in DEVOPS and Capacity Management. **Qualifications:** + Bachelor's degree or higher +… more
- Amazon (San Francisco, CA)
- …customers to achieve their observability goals, from basic monitoring to advanced AIOps implementations - Execute and optimize POCs in customer environments - ... containers, serverless) - Experience with DevOps practices and tools - Knowledge of SRE principles and practices About the team Why AWS? Amazon Web Services (AWS)… more
- Wolters Kluwer (Wichita, KS)
- …* Stay current on emerging trends and technologies in observability and AIOps to inform future enhancements. **Skills and Qualifications:** * Leadership: Proven ... * Incident Response: Experience working within or alongside incident management and SRE practices. * Tool Proficiency: Hands-on experience with tools such as… more