Associate Site Reliability Engineer

at HCA Healthcare

Nashville, TN 37203, USA

Start Date: Immediate
Expiry Date: 30 Nov, 2024
Salary: Not Specified
Posted On: 02 Sep, 2024
Experience: 2 year(s) or above
Skills: Kotlin, Technical Documentation, Reliability, Root Cause, Design Patterns, Performance Tuning, Dashboards, Software Development, Platforms, Engineers, Writing, Client Server Technologies, Code, Leadership, Resiliency, Operational Excellence, Swift
Telecommute: No
Sponsor Visa: No

Description:

INTRODUCTION

Do you want to join an organization that invests in you as an Associate Site Reliability Engineer? At HCA Healthcare, you come first. HCA Healthcare has committed up to $300 million in programs to support our incredible team members over the course of three years.

NOTE: ELIGIBILITY FOR BENEFITS MAY VARY BY LOCATION.

You contribute to our success. Every role has an impact on our patients’ lives and you have the opportunity to make a difference. We are looking for a dedicated Associate Site Reliability Engineer like you to be a part of our team.

JOB SUMMARY AND QUALIFICATIONS

Position Summary
What makes HCA Healthcare Information Technology Group (ITG) unique as a technology company is that our solutions ultimately impact the care of patients. Although our skills are needed in many industries, we in ITG apply them specifically to the noble cause of healthcare. We are “Healthcare Inspired.” This guiding vision pervades and positively influences every level of our organization. It shapes our mission, defines our values, and brings our leaders and employees together in a shared enthusiasm for their work, setting ITG apart as a uniquely purpose-driven company in the IT industry. As a part of that, we exist to raise the bar, unlock possibilities, and care like family.
As an Associate Site Reliability Engineer (SRE), you will apply SRE best practices to mission-critical applications across the enterprise. When these applications fail, you’ll have the skills and decision-making capabilities to quickly restore services, investigate the root cause, and develop a plan that mitigates future failures. You will spend time analyzing system performance and identifying ways to enhance the reliability of our environments by developing dashboards, performing configuration changes, building robust monitoring systems, and learning how to leverage automation to drive efficiencies. You will help drive uptime and reliability across the enterprise.
We are on a mission to change the face of the healthcare industry through value-driven products. These products will create innovation for all healthcare users across HCA’s nationwide ecosystem. To do this, we are building teams that are both curious and quick to adapt to new technologies.

Major Responsibilities:

  • Practice and adhere to the “Code of Conduct” philosophy and “Mission and Value Statement.”
  • Promote a collaborative team environment and work closely with colleagues to achieve business objectives.
  • Collaborate with stakeholders (e.g., business stakeholders, product owners, project managers, and end users) to understand functional and non-functional requirements.
  • Lead investigations into development and design problems and propose solutions.
  • Participate with team members in scope of work estimation and forecasting.
  • Improve performance of existing software by diagnosing and resolving critical issues.
  • Prepare technical documentation, including software & architectural design evaluation plans, data flow diagrams, test results, and technical manuals.
  • Adhere to and influence established development practices and processes.
  • Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding.
  • Review technology, infrastructure, and code on an ongoing basis to enhance the applications and build resiliency into them.
  • Create sustainable systems and services through automation and uplifts.
  • Balance feature development & deployments with speed, reliability, and well-defined service-level objectives.
  • Partner with development teams and vendors of 3rd party applications to improve services through rigorous testing and release procedures.
  • Build automations to “self-heal” applications and reduce the toil of manual operational tasks, in pursuit of operational excellence, uptime, and reliability of our applications.
  • Participate in, lead, and drive postmortem analyses of why services broke or degraded, including recommendations for long-term fixes. This may require working across multiple teams and organizations within the enterprise. Determine the root cause of all production-level incidents and write corresponding high-quality RCA reports.
  • Collaborate and build relationships across business and technology organizations, providing sound analysis and thought leadership.
  • Support system upgrades, architecture design, implementations, and deployments.
  • Work in a complex organization, navigate multiple verticals of expertise, and negotiate with, guide, direct, and influence your peers to provide real solutions.
  • Maintain industry knowledge in software development, architecture, and development products, such as databases, security, and automation products.

Education & Experience:
Bachelor’s degree in Computer Science or related field preferred
2+ years of relevant work experience required

Knowledge, Skills, Abilities, Behaviors:

  • Knowledge of infrastructure, frameworks, and software/cloud design patterns for implementing applications in the cloud preferred
  • Experience in the use and implementation of relevant tools and platforms (e.g., cloud platforms (IaaS and PaaS), web technologies, client-server technologies, continuous integration, and deployment) preferred
  • Experience with version control (Git) and open-source practices preferred
  • Experience in one or more coding languages (JavaScript/TypeScript, C#, Python, Java, Swift, or Kotlin) preferred
  • Experience with automation of CI/CD pipelines preferred
  • Experience with IaC such as Terraform preferred
  • A proactive approach to spotting problems, areas for improvement, and performance bottlenecks required
  • Be a creative thinker, not bound by “the way things have always been done”. What you know is less important than how well you learn and innovate. We don’t need engineers who know all the answers; we need engineers who can invent the answers no one has thought of yet, to the questions yet to be asked required
  • Experience helping define SLIs, SLOs, and SLAs, and building observability to report on performance against those objectives required
  • Strong ability to communicate complex technical information in a condensed manner to various stakeholders verbally and in writing required
  • Ability to build and maintain strong cross-functional partnerships at all levels of the organization required
  • Strong aptitude for learning and for teaching team members and others external to the team preferred
  • Ability to work, make aligned decisions, plan, and accomplish goals without explicit direction/guidance from leadership required
  • Experience with system architectures and with how software systems interact and integrate required
  • Ability to evaluate new technologies to help senior leadership align them with the HCA Healthcare strategic roadmap required
  • Strong understanding of SRE practices and implementations required
  • Expertise in Linux and Windows systems administration and in managing systems through code required
  • Ability to determine best practices and articulate authoritative direction required
  • Ability to help establish and grow the SRE principles with the team required
  • Growth mindset and a willingness to learn new skills, technologies, and frameworks required

HCA Healthcare has been recognized as one of the World’s Most Ethical Companies® by the Ethisphere Institute more than ten times. In recent years, HCA Healthcare spent an estimated $3.7 billion in cost for the delivery of charitable care, uninsured discounts, and other uncompensated expenses.
“Good people beget good people.” - Dr. Thomas Frist, Sr.
HCA Healthcare Co-Founder
We are a family of 270,000 dedicated professionals! Our Talent Acquisition team is reviewing applications for our Associate Site Reliability Engineer opening. Qualified candidates will be contacted for interviews. Submit your resume today to join our community of caring!
We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

REQUIREMENT SUMMARY

Min: 2.0, Max: 7.0 year(s)

Information Technology/IT

IT Software - Application Programming / Maintenance

Software Engineering

Graduate

Computer Science

Proficient

1

Nashville, TN 37203, USA