Site Reliability Engineer (L2) - (Google Data Platform Services)

at CVS Health

Hartford, Connecticut, USA

Start Date: Immediate
Expiry Date: 05 Jul, 2024
Salary: USD 97335 Annual
Posted On: 05 Apr, 2024
Experience: 3 year(s) or above
Skills: Automation Tools, Cloud Storage, Proactive Monitoring, Code, Google Cloud Platform, Computer Science, Infrastructure
Telecommute: No
Sponsor Visa: No

Description:

POSITION SUMMARY

We are looking for a motivated Site Reliability Engineer L2 with a focus on Google Data Platform Services to join our production data support team. The ideal candidate will be instrumental in maintaining and improving the reliability and performance of our data platforms, specifically those hosted on GCP. You will monitor system health, respond to incidents, and contribute to initiatives that improve the stability and scalability of our data services.
Key Responsibilities:
Monitor the health and performance of data services on GCP, including BigQuery, Dataproc, Cloud Storage, and other related services.
Respond to and resolve L2 incidents efficiently while escalating more complex issues to senior team members.
Contribute to the development and implementation of automation scripts and tools for routine maintenance tasks and alerts.
Assist in the configuration and maintenance of monitoring and alerting systems, leveraging GCP’s operations suite (formerly Stackdriver) and other tools like Prometheus and Grafana.
Support continuous improvement efforts by participating in post-incident reviews and implementing recommended changes to prevent recurrence.
Collaborate with cross-functional teams to ensure that data reliability standards are integrated into the development and deployment of all data services.
Maintain up-to-date documentation on system configurations, incident response protocols, and operational best practices.

REQUIRED QUALIFICATIONS

3+ years of experience with Google Cloud Platform services, especially data-related services like BigQuery, Dataproc, Cloud Composer, and Cloud Storage.
3+ years of experience with scripting and automation tools (e.g., Python, Bash).
3+ years of experience with DevOps and SRE principles, including CI/CD pipelines, infrastructure as code, and proactive monitoring.

PREFERRED QUALIFICATIONS

Certification in Google Cloud Platform or related cloud technologies.
Experience with monitoring and alerting tools (e.g., Prometheus, OpenTelemetry, Grafana).
Exposure to incident management and ITIL processes.
Strong analytical and problem-solving skills, with a keen attention to detail.
Effective communication and teamwork abilities.
Willingness to learn and adapt in a fast-paced environment.

EDUCATION

Bachelor’s degree in Computer Science, Engineering, or a related technical field.

Responsibilities:

Please refer to the job description for details.


REQUIREMENT SUMMARY

Min: 3.0, Max: 8.0 year(s)

Information Technology/IT

Software Engineering

Graduate

Computer science engineering or a related technical field

Proficient

1

Hartford, CT, USA