Engineering-L2-Hyderabad-Vice President-Software Engineering-Bengaluru/Hyderabad
Location
Bengaluru, Karnataka, India
Required Skills
About the Role
Job Description
Site Reliability Engineer - Vice President
Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run scalable, massively distributed, fault-tolerant systems. At Goldman Sachs, SRE is responsible for improving the availability and reliability of the firm’s most critical platform services and for ensuring they meet the requirements of our internal and external users. It is also responsible for the firmwide policies and standards focused on the firm’s digital resilience. We are looking for engineers who are motivated to collaborate with our businesses to build and run sustainable production systems that can evolve and adapt to changes in our fast-paced, global business environment.
The SRE team develops and maintains platforms and tools that help other Engineering teams at Goldman Sachs build and operate reliable and resilient systems. These systems span on-premises datacenters and multiple public cloud environments. The platforms we offer include central logging, monitoring, agents, and alerting. We also provide tools that drive adoption of, and improvements to, capacity planning, operational readiness assessments, production incident postmortems, SLIs / SLOs, and deployment automation, including canary releases.
The products and services we provide to our internal customers are used by thousands of engineers every day. We believe that reliability is the most important feature of any system, and we are devoted to giving our engineers the platforms and tools they need to build and operate reliable products.
Role Overview
As a Site Reliability Engineer (SRE) at Goldman Sachs, you will be a pivotal leader in ensuring the availability, reliability, and scalability of the firm's most critical platform applications and services. You will combine deep software and systems engineering expertise to architect, build, and run large-scale, massively distributed, fault-tolerant systems. This role involves providing technical leadership, mentoring senior engineers, and collaborating closely with internal teams and executive stakeholders to build and operate sustainable production systems that can adapt to our dynamic global business environment. You will drive a culture of continuous improvement, championing the adoption of advanced SRE principles and best practices across the organization.
Responsibilities
Qualifications
+ Exceptional programming skills in one or more major languages such as Java, Python, or Go, with a focus on building robust, scalable software.
+ Extensive hands-on experience with cloud platforms (e.g., AWS, GCP) and deep expertise in containerization and orchestration technologies (e.g., Docker, Kubernetes).
+ Mastery of Infrastructure as Code (IaC) tools (e.g., Terraform, CloudFormation) and configuration management tools (e.g., Puppet, Chef, Ansible).
+ Advanced proficiency in Prompt Engineering and Retrieval-Augmented Generation (RAG) architectures to automate complex SRE workflows, such as the generation of Infrastructure as Code (IaC), dynamic runbooks, and incident response summaries.
+ Profound understanding of distributed systems and advanced system performance tuning.
+ Expertise in designing and implementing comprehensive monitoring, alerting, logging, and tracing solutions (e.g., Prometheus, Grafana, ELK stack, Datadog, PagerDuty).
+ Deep experience with CI/CD tools and practices (e.g., Jenkins, GitLab, Maven).
+ Strong foundation in databases and distributed systems.
+ Exceptional problem-solving abilities and analytical skills, with a track record of resolving complex technical challenges.
+ Experience with distributed databases such as Elasticsearch.
+ Experience working with GCP BigQuery.
+ Experience with messaging systems such as Kafka.
About Goldman Sachs
Goldman Sachs is a leading global financial institution delivering a broad range of financial services. India is one of its largest engineering and operations hubs globally.
Glassdoor: 4.0 rating · 16,000 reviews · 78% recommend · 68% CEO approval