Site Reliability Engineer at CloudRaft
India
Full Time


Start Date

Immediate

Expiry Date

23 Feb, 26

Salary

0.0

Posted On

25 Nov, 25

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Kubernetes, CI/CD, Python, Golang, Observability, Terraform, Cloud Platforms, Problem-Solving, Communication, Open Source

Industry

IT Services and IT Consulting

Description
About CloudRaft

CloudRaft is a dynamic and forward-thinking company specializing in cutting-edge AI and cloud-native solutions. We thrive on creativity, collaboration, and innovation, empowering our team to solve complex challenges and deliver impactful results. Join us to be part of a team that values growth, excellence, and a passion for technology.

Job Description

We are looking for a talented and experienced SRE to join our team. You'll help scale our operations, design and maintain robust infrastructure, and implement best practices for reliability and efficiency in our cloud-native environment.

Responsibilities

Manage and maintain Kubernetes clusters, both on-prem and in the cloud (OpenShift, EKS, AKS, and GKE).
Implement and manage CI/CD pipelines using tools like Jenkins, GitHub Actions, Argo CD, or GitLab.
Design and maintain observability with tools such as Prometheus, Grafana, Loki, and OpenTelemetry.
Optimize system performance and troubleshoot production issues.
Implement SRE concepts, including SLIs and SLOs, to ensure system reliability.
Automate infrastructure and operational tasks using programming languages like Golang or Python and IaC tools like Terraform or Pulumi.
Stay updated on emerging trends like AI, MLOps, and Edge Computing.
Share knowledge via technical writing and speaking engagements.

Qualifications

Bachelor’s degree in Computer Science, IT, or a related field.
4-8 years of experience in SRE, Platform Engineering, or DevOps roles.
Strong experience with Kubernetes, cloud-native technologies, and cloud platforms (AWS, Azure, GCP).
Proficiency in programming (Python, Golang, or Node.js).
Familiarity with CI/CD tools and modern deployment strategies.
Knowledge of observability tools and infrastructure as code.
Excellent problem-solving and communication skills.
Inclination towards open source is a plus.

Why CloudRaft?

Medical insurance for you and your family from Plum Insurance.
Flexibility and focus on work-life balance.
Challenging problems and a strong peer group.
Become a founding member and participate in building the organization.