Director, Site Reliability Engineering - Abbott Diabetes Care (Mississauga)

at Abbott Laboratories

Mississauga, ON, Canada

Start Date: Immediate
Expiry Date: 13 Jul, 2024
Salary: Not Specified
Posted On: 14 Apr, 2024
Experience: 10 year(s) or above
Skills: Infrastructure, DevOps, Code, Software Systems, Containerization
Telecommute: No
Sponsor Visa: No

Description:

About Abbott
Abbott is a global healthcare leader, creating breakthrough science to improve people’s health. We’re always looking towards the future, anticipating changes in medical science and technology.
Working at Abbott

At Abbott, you can do work that matters, grow and learn, care for yourself and your family, be your true self, and live a full life. You will have access to:

  • Career development with an international company where you can grow the career you dream of.
  • A company recognized as a great place to work in dozens of countries around the world and named one of the most admired companies in the world by Fortune.
  • A company that is recognized as one of the best big companies to work for as well as a best place to work for diversity, working mothers, female executives, and scientists.

The Opportunity
This position works out of our Mississauga location in the Abbott Diabetes Care (ADC) Division. In ADC, we’re focused on helping people with diabetes manage their health with life-changing products that provide accurate data to drive better-informed decisions. We’re revolutionizing the way people monitor their glucose levels with our new sensing technology.
As Director of Site Reliability Engineering, you will play a critical role in establishing and executing a site reliability strategy for the ADC Medical Device Mobile and Cloud Digital Software portfolio (both Class II and Class III). You will lead a team of engineers that partners with and influences our Architecture and Engineering teams in delivering highly resilient software solutions for our customers.
As a servant leader, you will be responsible for identifying SRE improvement opportunities and influencing change within the organization. You will need a strong software engineering background in a highly secure environment, along with DevOps or formal SRE experience. You will need extensive technical knowledge in the development, delivery, and implementation of highly complex and critical software systems. Expertise in the value and principles of SRE (SLI/SLO, Error Budgets, Toil, Observability, Release Engineering) is critical for success in this role. You will have demonstrated the ability to develop, communicate, and execute a vision that results in the adoption of practices and tooling that will help strengthen our position as the premier leader in the Diabetes Care business.

What You’ll Do

  • Be a thought leader and mentor in SRE. Develop and enable a culture of SRE in our software development, delivery and operational practices;
  • Establish a comprehensive SRE strategy and roadmap in partnership with the ADC Digital team;
  • Grow and lead a team to execute on the SRE roadmap;
  • Help software engineering teams and business stakeholders establish and evolve reliability goals and measure progress against those goals using SLIs/SLOs;
  • Create and enforce site reliability standards;
  • Work with software engineering teams to adopt those standards and to continuously improve production stability and resilience;
  • Evaluate the current tiers of service of our applications, reliability standards and practices to define steps to continuously improve on them;
  • Participate in blameless postmortems on critical incidents and help teams use their learnings to better predict, detect and prevent future issues;
  • Establish and lead an SRE community of practice and foster a culture of continuous improvement of system site performance and reliability. Share knowledge and lessons learned across the Digital organization;
  • Partner with the customer support team for rapid resolution of issues;
  • Evaluate and monitor social signals for site reliability. Monitor app store comments and other social channels;
  • Build a practice of rapid detection and root cause determination while keeping stakeholders informed.

Required Qualifications

  • Master’s degree;
  • Minimum of ten (10) years of experience managing large-scale digital software systems;
  • Exposure to cloud development and deployment technologies, including containerization, infrastructure as code, and multi-cloud configurations;
  • Deep understanding of DevOps and SRE best practices.

Follow your career aspirations to Abbott for diverse opportunities with a company that can help you build your future and live your best life. Abbott is an Equal Opportunity Employer, committed to employee diversity.
Connect with us at www.abbott.com, on Facebook at www.facebook.com/Abbott, and on Twitter @AbbottNews and @AbbottGlobal.

REQUIREMENT SUMMARY

Min: 10.0, Max: 15.0 year(s)

Information Technology/IT

IT Software - Application Programming / Maintenance

Software Engineering

Graduate

Proficient

1

Mississauga, ON, Canada