Lead Cloud Platform Support Engineer at EPAM Systems Inc
Desde casa, Cauca, Colombia -
Full Time


Start Date

Immediate

Expiry Date

06 May, 25

Salary

200.0

Posted On

07 Feb, 25

Experience

2 year(s) or above

Remote Job

No

Telecommute

No

Sponsor Visa

No

Skills

Powershell, Python, Cloud Applications, Aws, Information Technology, Linux System Administration, Bash, Red Hat Linux, Centos, Ansible, Computer Science, Azure

Industry

Information Technology/IT

Description

We are looking for a highly skilled and knowledgeable Lead Cloud Platform Support Engineer to guide our cloud support team.
This pivotal role involves leading the management and support of our cloud-based platforms, resolving complex issues, optimizing system performance, and directing collaborative efforts across departments to foster operational excellence and innovation. The successful candidate will have deep technical expertise in cloud infrastructure, demonstrable leadership experience, and the ability to mentor team members.
We accept CVs in English only.

REQUIREMENTS

  • Azure Certified Solutions Architect or Sys Ops Administrator
  • 10+ years of IT industry experience, including in-depth knowledge of AWS, Azure, REST APIs, and a Bachelor’s degree in Computer Science or Information Technology
  • 5+ years of experience in a primary domain with expertise in cloud-based development platforms, change management procedures, and troubleshooting complex production incidents in cloud applications
  • 3+ years of advanced experience in creating sophisticated build scripts for release management using tools like Terraform, Ansible, and PowerShell, coupled with programming proficiency in Python and Bash
  • 2+ years of experience in configuring and leading efforts on Kubernetes clusters, EKS, AKS, including tools like Helm and Prometheus
  • Over 5 years of Linux system administration with advanced knowledge in Red Hat Linux or CentOS
  • At least 1 year of leadership experience in a similar role with a proven ability to mentor and lead a technical team
  • Flexible to lead and participate in a 24x7 operations support environment on a rotational shift basis
  • Demonstrated capability to research, absorb, and organize strategic data into actionable information for business and technical enhancements
Responsibilities
  • Oversee Azure systems deployment, lifecycle maintenance, capacity planning, and collaboration with product teams
  • Lead the triage and resolution of service management system incidents and provide strategic action plans
  • Monitor applications, manage advanced data manipulation for widgets, generate critical reports, and oversee comprehensive problem identification and resolution processes
  • Optimize system data manipulation by fine-tuning agents and collectors
  • Provide expert customer support and lead consultations with internal and external stakeholders
  • Manage and enhance Azure and AWS infrastructure, including policy oversight, advanced configuration, troubleshooting, and maintenance
  • Develop and standardize scripts for automation and report generation utilizing Terraform, Ansible, GIT, and PowerShell across the team
  • Maintain and upgrade applications within EKS, AKS, Dockers, and Docker Registry to ensure operational efficiency
  • Administer intricate networking protocols and robust network security measures in cloud environments
  • Implement and manage advanced Cloudflare products and comparable tools
  • Lead continuous monitoring initiatives and manage critical cloud components such as Virtual Machines, Load Balancer, S3, and Azure Backup
Loading...