JOB SUMMARY -
For this role, we are looking for a Sr. SRE / DevOps Engineer at Sunnyvale, California location.
As Site Reliability Engineer, the individual will work closely with multi-functional teams, automate operations, optimize infrastructure, implement security and solve issues in an exciting, fast-paced environment. The individual will play a vital role in ensuring that the systems are reliable, scalable, and high performing.
Experience Required: 8+ years of experience on DevOps and Site Reliability Engineering.
MUST HAVE/REQUIRED EXPERIENCE AND SKILLS:
- Hands-on with containerization and orchestration: Docker, Kubernetes/EKS.
- Proficiency in infrastructure as code tools: Terraform, Ansible, or CloudFormation.
- Experience setting up and managing services running on Kubernetes.
- In-depth understanding of SRE principals including monitoring, ing, error budgets, fault analysis, and automation.
- In-depth knowledge of monitoring and observability tools: Apache Splunk
- Knowledge of Linux operating system principles, networking fundamentals, and systems management
- Demonstrable fluency in at least one of the following languages: Java or Python
- Ability to identify and communicate technical and architectural problems, while working with partners and their team to iteratively find solutions.
- Building and managing CI/CD pipeline - gatekeeping production deployments, develop and implement GIT branching strategies, branch protection rules, network policies, scale up/ scale down the load on AWS.
- Strong problem-solving and analytical skills
- Solve performance issues and scalability issues in the system.
TECHNICAL SKILLS:
- DevOps and SRE
- AWS Kubernetes/EKS, Docker
- Terraform, Ansible, or CloudFormation
- Apache Splunk, Apache Flink
- Programming/Scripting using Java or Python
- CI/CD
- Database - Vertica, Snowflake.
Behavioral Skills:
- Excellent Communication skills and collaboration skills
- Ability to propose and implement improvements in the system
- Ability to work with cross-functional stakeholders
- Adaptability and a willingness to learn new technologies and techniques.
- Proactive approach to issues, ability to provide prompt resolution/work around.
Job Type: Full-time
Pay: $110,000.00 - $137,000.00 per year
Work Location: In perso