Site Reliability Engineer - Director - Risk & Resiliency Management at Morgan Stanley
Bengaluru, Karnataka, India
Full Time


Start Date

Immediate

Expiry Date

25 Feb, 26

Salary

0.0

Posted On

27 Nov, 25

Experience

10 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Site Reliability Engineering, Production Support, DevOps, Agile, Scrum, ITIL, Incident Management, Problem Management, Automation, Monitoring Tools, Stakeholder Management, Linux, Unix, Python, SQL, Load Balancing

Industry

Financial Services

Description
You should have 10+ years of working experience in production support and 4+ years as an SRE. Familiarity with DevOps concepts is good to have. You should be comfortable with DevOps, Agile, Scrum, and ITIL concepts and with SRE principles (observability, reliability, toil reduction, etc.), and have extensive knowledge of Incident and Problem Management.

Manage critical incidents and ensure all key management and business stakeholders are kept up to date. Ensure Production Management is closely aligned/embedded. Incorporate Site Reliability Engineering and DevOps practices into the day-to-day role by developing automated solutions to long-standing problems, ensuring minimal downtime and manual effort. You should be able to identify opportunities for automation and be capable of delivering the solution. Your code must meet production standards. Exposure to monitoring tools such as Grafana is expected; you should be able to set up and optimize monitoring.

Identify and prioritise technical debt and operational inefficiencies. Analyse business processes to identify automation opportunities. Integrate automation solutions with existing systems and infrastructure. Reduce the cost of support (hours of effort) by eliminating operational issues, optimizing and automating tasks, developing operational tools, and driving client self-service to minimize constraints.

Manage and align stakeholders, and work in a healthy environment with other development teams. Build the extensive business and application knowledge required for supporting client-facing applications. Certificate renewals, hygiene clearance, upgrades, disaster recovery (DR), and similar tasks are primary responsibilities of this role. You should be able to manage audit requirements, including finding details and clarifying queries from internal auditors. Interface with clients and other technology teams to provide governance and control around the production environment.
The key requirements of this role are the ability to manage an incident call and coordinate multiple teams towards the common goal of resolving a business-impacting outage, willingness to work in shifts and provide support on weekends and holidays, and being a team player who can take charge as lead when needed.

You should apply for this requisition if you have, at minimum, the following profile:
Bachelor's degree in computer science or a related field.
Minimum 10 years of experience in either application development (Python, HTML, shell scripting, JavaScript) covering production support, or relevant production support or SRE experience.
Strong knowledge of DevOps and SRE principles, with a grasp of the tools and approaches used to apply them.
Strong infrastructure knowledge in Linux/Unix administration, storage, networking, and web technologies.
Advanced Unix shell and Python scripting experience.
Advanced knowledge of the SQL query language (Sybase and/or DB2 preferred).
System knowledge of Unix/Linux and of infrastructure setup such as load balancing.

Our values - putting clients first, doing the right thing, leading with exceptional ideas, committing to diversity and inclusion, and giving back - aren't just beliefs, they guide the decisions we make every day to do what's best for our clients, communities and more than 80,000 employees in 1,200 offices across 42 countries. Our teams are relentless collaborators and creative thinkers, fueled by their diverse backgrounds and experiences. We are proud to support our employees and their families at every point along their work-life journey, offering some of the most attractive and comprehensive employee benefits and perks in the industry. There's also ample opportunity to move about the business for those who show passion and grit in their work.
To learn more about our offices across the globe, please copy and paste https://www.morganstanley.com/about-us/global-offices into your browser. We work to provide a supportive and inclusive environment where all individuals can maximize their full potential. Our skilled and creative workforce is composed of individuals drawn from a broad cross section of the global communities in which we operate and who reflect a variety of backgrounds, talents, perspectives, and experiences. Our strong commitment to a culture of inclusion is evident through our constant focus on recruiting, developing, and advancing individuals based on their skills and talents.
Responsibilities
Manage critical incidents and ensure all key management and business stakeholders are kept up to date. Develop automated solutions to long-standing problems to ensure minimal downtime and manual effort.