Site Reliability Engineer (L2) - (Google Data Platform Services)
at CVS Health
Hartford, Connecticut, USA -
Start Date | Expiry Date | Salary | Posted On | Experience | Skills | Telecommute | Sponsor Visa |
---|---|---|---|---|---|---|---|
Immediate | 05 Jul, 2024 | USD 97335 Annual | 05 Apr, 2024 | 3 year(s) or above | Automation Tools,Cloud Storage,Proactive Monitoring,Code,Google Cloud Platform,Computer Science,Infrastructure | No | No |
Required Visa Status:
Citizen | GC |
US Citizen | Student Visa |
H1B | CPT |
OPT | H4 Spouse of H1B |
GC Green Card |
Employment Type:
Full Time | Part Time |
Permanent | Independent - 1099 |
Contract – W2 | C2H Independent |
C2H W2 | Contract – Corp 2 Corp |
Contract to Hire – Corp 2 Corp |
Description:
POSITION SUMMARY
We are looking for a motivated Site Reliability Engineer L2 with a focus on Google Data Platform Services to join our dynamic Production data support team. The ideal candidate will be instrumental in maintaining and improving the reliability and performance of our data platforms, specifically those hosted on GCP. You will be involved in monitoring system health, responding to incidents, and contributing to initiatives that enhance the stability and scalability of our data services.
Key Responsibilities:
Monitor the health and performance of data services on GCP, including BigQuery, DataProc, Cloud Storage, and other related services.
Respond to and resolve L2 incidents efficiently while escalating more complex issues to senior team members.
Contribute to the development and implementation of automation scripts and tools for routine maintenance tasks and alerts.
Assist in the configuration and maintenance of monitoring and alerting systems, leveraging GCP’s operations suite (formerly Stackdriver) and other tools like Prometheus and Grafana.
Support continuous improvement efforts by participating in post-incident reviews and implementing recommended changes to prevent recurrence.
Collaborate with cross-functional teams to ensure that data reliability standards are integrated into the development and deployment of all data services.
Maintain up-to-date documentation on system configurations, incident response protocols, and operational best practices.
REQUIRED QUALIFICATIONS
3+ years of experience with Google Cloud Platform services, especially data-related services like BigQuery, DataProc, composer and Cloud Storage.
3+ years of experience with scripting and automation tools (e.g., Python, Bash).
3+ years of experience with DevOps and SRE principles, including CI/CD pipelines, infrastructure as code, and proactive monitoring.
PREFERRED QUALIFICATIONS
Certification in Google Cloud Platform or related cloud technologies.
Experience with monitoring and alerting tools (e.g., Prometheus, OTEL, Grafana).
Exposure to incident management and ITIL processes.
Strong analytical and problem-solving skills, with a keen attention to detail.
Effective communication and teamwork abilities.
Willingness to learn and adapt in a fast-paced environment.
EDUCATION
Bachelor’s degree in Computer Science, Engineering, or a related technical field.
Responsibilities:
Please refer the Job description for details
REQUIREMENT SUMMARY
Min:3.0Max:8.0 year(s)
Information Technology/IT
Software Engineering
Graduate
Computer science engineering or a related technical field
Proficient
1
Hartford, CT, USA