  • Lead Software Engineer

    Wells Fargo (Iselin, NJ)
    …LLMs and leveraging **Python** for integration and ML model development + Exposure to **AIOps**, **SRE principles**, and **infrastructure as code** + Strong ... communication skills and ability to present technical concepts to leadership + Experience with **NoSQL databases**, **SQL optimization**, and **data modeling** **Job Expectations:** + Hybrid work environment with occasional travel (up to 5%) + Flexibility to… more
    Wells Fargo (07/25/25)
  • Director, Consult Partner - Utilities / Enterprise…

    Kyndryl (New York, NY)
    …delivery (e.g., agile, cloud native, DevOps) and operations (e.g., observability, automated response, SRE, AIOps, etc.) including the ability to articulate a path ... toward a target operating model (people, process, and tools/technology) + Desire and demonstrated ability to stay abreast of emerging technologies that are priorities for utilities, such as artificial intelligence, machine learning, computer vision, and GenAI,… more
    Kyndryl (07/27/25)
  • Sr. Site Reliability Engineer (SRE)

    Steampunk (McLean, VA)
    …accounts, and more. **Contributions** As a **Sr. Steampunk Site Reliability Engineer (SRE),** you will be responsible for working with program development teams, ... for the reliability of an organization's applications and infrastructure. As an SRE, your primary responsibility is to combine aspects of software engineering with… more
    Steampunk (07/04/25)
  • Data Engineer

    SOS International LLC (Redstone Arsenal, AL)
    …Secret Security Clearance with SCI eligibility Experience as a Site Reliability Engineer (SRE) Experience with AIOps and FinOps Experience with Petabyte scale ... data sets Experience with large-scale, multi-INT analytics BS or MS in Computer Science, Statistics, Mathematics, Physics or a quantitative field Work Environment Working conditions are normal for an office environment. Fast paced, deadline-oriented… more
    SOS International LLC (06/03/25)
  • Director, Next-Generation Managed Services

    EPAM Systems (Newtown, PA)
    …client accounts. You'll architect next-generation managed services that integrate DevOps, SRE principles, and intelligent automation to transform how we deliver ... advisor to clients, designing future-ready service landscapes that leverage GenAI, AIOps, and advanced observability across cloud and on-premise environments while… more
    EPAM Systems (07/18/25)
  • Machine Learning Engineer - Vice President

    JPMorgan Chase (Jersey City, NJ)
    …building innovative platforms, automating infrastructure operations, and enabling Agentic-based AIOps platforms. Our mission is to enhance scalability, security, and ... you will be tasked with the design, construction, and maintenance of our AIOps solution. This role demands a profound knowledge of AI/ML technologies, IT… more
    JPMorgan Chase (07/17/25)
  • Network Automation Engineer - Architect

    TEKsystems (Overland Park, KS)
    …orchestration systems, OSS/BSS interfaces, and service assurance platforms * Work closely with SRE and NOC teams to identify and automate high-impact Day 2 incident ... SMF, IMS, PCF, Linux, openshift, Prometheus, Grafana, Kibana, ServiceNow AIOps, Red Hat Top Skills Details devops,Cloud Operations,Telcom,Application Support,AMF,SMF,IMS,PCF,Linux,openshift,Prometheus,Grafana,Kibana,ServiceNow… more
    TEKsystems (07/29/25)
  • Executive Director - AI and Machine Learning

    CVS Health (Trenton, NJ)
    …to stay abreast of the latest trends and advancements in AI/ML, IT Operations, SRE, and Cybersecurity. + Advocate for the adoption of AI/ML solutions across the ... + 15+ years leading Enterprise Machine Learning, Infrastructure, Data Science, and/or SRE practices + 5+ years applying Machine Learning to optimize technology… more
    CVS Health (07/15/25)
  • Network Automation Engineer - Lead

    TEKsystems (Overland Park, KS)
    …KPIs (e.g., session count, call drops, throughput) * Collaborate with NOC and SRE teams to design runbooks and remediation playbooks for common issues * Identify ... SMF, IMS, PCF, Linux, openshift, Prometheus, Grafana, Kibana, ServiceNow AIOps, Red Hat Top Skills Details devops,Cloud Operations,Telcom,Application Support,AMF,SMF,IMS,PCF,Linux,openshift,Prometheus,Grafana,Kibana,ServiceNow… more
    TEKsystems (07/29/25)
  • Principal Platform Engineer, Infrastructure…

    The Walt Disney Company (Glendale, CA)
    …role with strategic reach. You'll collaborate closely with security, DevOps, SRE, and application teams to deliver platform capabilities that improve developer ... Terraform, enabling reusable and composable module patterns. + Lead AIOps initiatives including anomaly detection, automated remediation, and intelligent alerting… more
    The Walt Disney Company (07/17/25)
  • Principal Product Manager - Artificial…

    Cisco (AZ)
    …building anomaly detection systems. You should be comfortable working with MLOps, AIOps, PyTorch, TensorFlow, MLflow and ONNX, and apply innovative ML capabilities to ... engineering, design, TPM, and documentation. + Collaborate with QA, SRE, and release teams to plan, drive and implement ... environments. + Actively maintain a view of how the AIOps market, competition and technologies are evolving, and factor… more
    Cisco (06/14/25)
  • Principal Product Manager - Incident Management

    PagerDuty (San Francisco, CA)
    …at translating complex technical workflows into intuitive experiences for engineering, SRE, and business users. + Excellent communication and collaboration skills, ... with the PagerDuty Operations Cloud. The PagerDuty Operations Cloud combines AIOps, Automation, Customer Service Operations and Incident Management with a powerful… more
    PagerDuty (07/23/25)
  • VP, DevOps and Capacity Management Process Lead…

    Citigroup (Jacksonville, FL)
    …have good knowledge of the Technology Landscape Project Management methodology, SRE, Service Management, Operational Risk Management, Change Management & Capacity ... on Risk - and manage all IA 7. Drive automation and data-driven AIOps adoption in DevOps and Capacity Management. **Qualifications:** + Bachelor's degree or higher +… more
    Citigroup (07/22/25)
  • Observability Customer Success Specialist, AWS…

    Amazon (San Francisco, CA)
    …customers to achieve their observability goals, from basic monitoring to advanced AIOps implementations - Execute and optimize POCs in customer environments - ... containers, serverless) - Experience with DevOps practices and tools - Knowledge of SRE principles and practices About the team Why AWS? Amazon Web Services (AWS)… more
    Amazon (07/15/25)
  • Manager, IT Operations

    Wolters Kluwer (Wichita, KS)
    …* Stay current on emerging trends and technologies in observability and AIOps to inform future enhancements. **Skills and Qualifications:** * Leadership: Proven ... * Incident Response: Experience working within or alongside incident management and SRE practices. * Tool Proficiency: Hands-on experience with tools such as… more
    Wolters Kluwer (06/18/25)