Principal Site Reliability Engineer at PTC

Pune, Maharashtra, India

Full Time

Start Date

Immediate

Expiry Date

08 Apr, 26

Salary

0.0

Posted On

08 Jan, 26

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

Skills

DevOps, SRE, CI/CD, Cloud-Native Technologies, Azure, AWS, Containers, Kubernetes, Python, Shell, Groovy, PostgreSQL, GitHub, Jenkins, Ansible, Terraform, Monitoring Tools

Industry

Software Development

Description

Collaborate with global teams to implement containerized DevOps infrastructure and cloud-native solutions. Build and maintain automation for deployment, monitoring, reporting, and analysis. Manage CI/CD pipelines to optimize efficiency and reliability. Apply industry best practices for system hardening and configuration management. Secure, scale, and manage Linux-based virtual environments. Develop and maintain solutions for system administration, backups, disaster recovery, and performance/security monitoring. Design and implement secure automation for development, testing, and production environments. Continuously evaluate and improve existing systems, ensuring compliance with industry standards and best practices. Promote knowledge sharing across cross-functional teams, including IT and Engineering. Communicate clearly about technical decisions, trade-offs, and their impact on the broader system.

Minimum 7 years of hands-on experience in DevOps and SRE roles. Proven expertise in continuous delivery and deployment of SaaS products. Deep understanding of cloud-native technologies, especially on Azure and AWS, including IaaS, PaaS, backup, rollback, and HA/DR strategies. Advanced knowledge of containers and Kubernetes (both managed and OS-level), including ingress controllers like Emissary and nginx. Proficiency in scripting languages such as Python, Shell, and Groovy; familiarity with YAML. Experience with PostgreSQL or other RDBMS. Strong version control skills using GitHub or similar SCM tools. Hands-on experience with Jenkins for CI/CD pipeline management. Familiarity with configuration and deployment tools like Ansible, Helm, Helmfile/Helmsman, Terraform, and FluxCD. Experience with monitoring and logging tools such as Prometheus, Grafana, and Azure Monitor. CKA (Certified Kubernetes Administrator) certification. Solid understanding of Linux system administration. Familiarity with Agile methodologies, frameworks, and metrics. Experience working with large, interoperable product suites.

Responsibilities

The Principal Site Reliability Engineer will collaborate with global teams to implement containerized DevOps infrastructure and cloud-native solutions while managing CI/CD pipelines. They will also ensure system security, scalability, and compliance with industry standards.