Back to Jobs
B
VP

Senior Vice President, Site Reliability Engineer

BNY Mellon·Posted 1 month ago

Location

All India, Chennai

Experience

12–16 years

Required Skills

KubernetesAppDynamicsSplunkPythonGoJavaTerraformCICDPrometheusGrafanaSRE principles

About the Role

As a Senior Vice President, Site Reliability Engineer at BNY, you will play a crucial role in driving reliability and performance on the Wealth Services Platform team in Pune. Your responsibilities will include:

  • • Define SLOs/SLIs, enhance observability, and proactively identify and address system bottlenecks in cloud environments.

  • • Automate infrastructure and operations using tools like Terraform, Kubernetes, and CI/CD for scalable, fault-tolerant deployments.

  • • Collaborate with product, infrastructure, and DevOps teams to enhance services' resilience and ensure architectural clarity.

  • • Lead incident management through on-call rotations, postmortems, and automated recovery to minimize downtime.

  • • Establish and maintain monitoring systems using tools such as Prometheus, Grafana, AppDynamics, and Splunk for real-time alerting and root cause analysis.

  • • Develop platform tooling and pipelines for container orchestration, third-party integrations, and cloud-native operations to enhance system efficiency and reliability.

  • • Mentor engineers, promote SRE best practices, and instill a reliability-first culture across engineering teams.
  • To excel in this role, you should possess:

  • • 12+ years of experience in cloud infrastructure (Azure, AWS, GCP), containerization (Docker, Kubernetes), and Infrastructure as Code (Terraform, Helm).

  • • Proficiency in observability and monitoring tools like Prometheus, Grafana, AppDynamics, Datadog, Splunk, with incident response and on-call support experience.

  • • Strong programming and scripting skills in languages such as Python, Go, or Java, focusing on automation, tooling, and system integration.

  • • Deep understanding of SRE principles, including SLAs, SLOs, error budgets, postmortems, and reliability-focused system design.

  • • Excellent collaboration and communication skills, with Agile environment experience and a background in partnering with cross-functional engineering, product, and operations teams.
  • Join BNY and be part of a company that values innovation and technological advancement in the finance industry. Enjoy competitive compensation, benefits, and wellbeing programs designed to support your personal and professional growth. As an Equal Employment Opportunity/Affirmative Action Employer, BNY promotes diversity and inclusivity in its workforce. As a Senior Vice President, Site Reliability Engineer at BNY, you will play a crucial role in driving reliability and performance on the Wealth Services Platform team in Pune. Your responsibilities will include:

  • • Define SLOs/SLIs, enhance observability, and proactively identify and address system bottlenecks in cloud environments.

  • • Automate infrastructure and operations using tools like Terraform, Kubernetes, and CI/CD for scalable, fault-tolerant deployments.

  • • Collaborate with product, infrastructure, and DevOps teams to enhance services' resilience and ensure architectural clarity.

  • • Lead incident management through on-call rotations, postmortems, and automated recovery to minimize downtime.

  • • Establish and maintain monitoring systems using tools such as Prometheus, Grafana, AppDynamics, and Splunk for real-time alerting and root cause analysis.

  • • Develop platform tooling and pipelines for container orchestration, third-party integrations, and cloud-native operations to enhance system efficiency and reliability.

  • • Mentor engineers, promote SRE best practices, and instill a reliability-first culture across engineering teams.
  • To excel in this role, you should possess:

  • • 12+ years of experience in cloud infrastructure (Azure, AWS, GCP), containerization (Docker, Kubernetes), and Infrastructure as Code (Terraform, Helm).

  • • Proficiency in observability and monitoring tools like Prometheus, Grafana, AppDynamics, Datadog, Splunk, with incident response and on-call support experience.

  • • Strong programming and scripting skills in languages such as Python, Go, or Java, focusing on automation, tooling, and system integration.

  • • Deep understanding of SRE principles, including SLAs, SLOs, error budgets, postmortems, and reliability-focused system design.

  • • Excellent collaboration and communication skills, with Agile environment experience and a background in partnering with cross-functional engineering, product, and operations teams.
  • Join BNY and be part of a company that values innovation and technological advancement in the finance industry. Enjoy competitive compensation, benefits, and wellbeing programs designed to support your personal and professional growth. As an Equal Employment Opportunity/Affirmative Action Employer, BNY promotes diversity and inclusivity in its workforce.

    Land this role fasterProfessional
    🎙️

    SAGE

    Mock interview coach

    Rehearse the 5 most-likely questions for this role with live AI feedback.

    📄

    SPAR

    Resume tailoring

    Rewrite your resume to lead with what this hiring panel cares about.

    🤝

    REACH

    Warm intro outreach

    Find the hiring manager + 2nd-degree intros and draft the messages.

    More Information Technology Roles

    View all

    90% of leadership roles never appear on job boards

    Join HireIQ to access confidential opportunities, AI-powered matching, and direct connections to hiring decision-makers.

    Join the Talent Network