Site Reliability Engineer - Director - Software Production Management & Reli at Morgan Stanley
Bengaluru, Karnataka, India
Full Time


Start Date

Immediate

Expiry Date

24 Feb, 26

Salary

0.0

Posted On

26 Nov, 25

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Linux Troubleshooting, Database Administration, Performance Optimization, Task Automation, Communication, Collaboration, Technical Emergency Response, Data Transfer Technologies, Software Engineering, Data Engineering, Operational Escalation

Industry

Financial Services

Description
A commitment to understanding the range of products in our ecosystem, with a view to specializing in at least one and optimizing the end-to-end workflow
Maximizing the availability and performance of supported systems through optimized and automated plant management, enhanced observability, ongoing problem management and architecture reviews with peers
Identification and prioritization of technical debt that is impacting system reliability, performance or squad efficiency, through the elimination of operational issues, optimization and automation of tasks, development of operational tools and driving client self-service to minimize human dependency for support or maintenance
Complex troubleshooting in a Linux environment, with a focus on collaborating with others to identify the underlying cause of issues and agreeing on lasting improvements that can be made
Exploring and delivering improved observability, including performance metrics, actionable logging, tracing and meaningful alerting that can define and measure the target reliability of a product
Being sensitive to clients' needs (i.e. the Firm's community of internal developers) to help maximize their productivity, including troubleshooting their issues and developing “self-healing” solutions
Minimizing the issue escalation rate to ensure the squad has the greatest possible flow of feature delivery
Being dependable and operationally responsive during agreed hours, including sharing on-call rotation with the rest of the global team (with a time-off in lieu system)

The successful candidate will have around 6 years of experience.
Strong Linux troubleshooting skills
Strong experience of database administration, engineering or troubleshooting (ideally including performance optimization)
Development skills in any programming language (ideally Python) for task automation
Excellent oral and written communication
Ability to establish effective relationships with colleagues and clients to collaborate on successful delivery and/or troubleshooting
Ability to respond appropriately during occasional technical emergencies, such as outages
Experience with data transfer technologies such as ETL (e.g. Talend, Informatica), Kafka, MQ, etc.
Software engineering or data engineering experience
Experience of being an operational point of escalation

Our values - putting clients first, doing the right thing, leading with exceptional ideas, committing to diversity and inclusion, and giving back - aren't just beliefs, they guide the decisions we make every day to do what's best for our clients, communities and more than 80,000 employees in 1,200 offices across 42 countries. Our teams are relentless collaborators and creative thinkers, fueled by their diverse backgrounds and experiences. We are proud to support our employees and their families at every point along their work-life journey, offering some of the most attractive and comprehensive employee benefits and perks in the industry. There's also ample opportunity to move about the business for those who show passion and grit in their work.

To learn more about our offices across the globe, please copy and paste https://www.morganstanley.com/about-us/global-offices into your browser.

We work to provide a supportive and inclusive environment where all individuals can maximize their full potential. Our skilled and creative workforce is comprised of individuals drawn from a broad cross section of the global communities in which we operate and who reflect a variety of backgrounds, talents, perspectives, and experiences.
Our strong commitment to a culture of inclusion is evident through our constant focus on recruiting, developing, and advancing individuals based on their skills and talents.
Responsibilities
Maximize the availability and performance of supported systems through optimized management and enhanced observability. Identify and prioritize technical debt impacting system reliability and efficiency while collaborating with internal developers to troubleshoot issues.