JOB DESCRIPTION
The Manager, DevOps and Infrastructure is responsible for leading and managing the company’s multi-cloud infrastructure, DevOps practices, and system administration. This role ensures high availability, performance, security, and scalability of mission-critical systems while enabling efficient, automated, and defect-free software deployment processes. The manager will oversee a team responsible for CI/CD pipelines, infrastructure as code (IaC), automation, monitoring, configuration management, and system reliability. This position also participates in a 24x7 on-call rotation to support critical infrastructure incidents and ensure operational continuity.
QUALIFICATIONS / REQUIREMENTS
- To perform this job successfully, an individual must be able to perform each essential duty satisfactorily. The requirements listed below represent the knowledge, skill, and/or ability required.
EDUCATION AND EXPERIENCE
- Bachelor’s degree in Computer Science, Information Systems, Engineering, or a related field required; Master’s degree preferred.
- 7+ years of progressive experience in DevOps and infrastructure roles, including 2+ years in a management or technical leadership capacity.
- Proven experience operating full-cycle SDLC environments including source control, issue tracking, build systems, testing, and production control.
TECHNICAL SKILLS
- Expert-level understanding of the Software Development Lifecycle (SDLC), including source control (e.g., Git), defect tracking, build automation, testing, deployment, and production control.
- Strong programming and scripting skills in TypeScript, Python, PowerShell, C#, and Bash.
- Hands-on experience with Infrastructure-as-Code tools such as Terraform, CloudFormation, Azure Resource Manager templates, and Ansible.
- Experience managing modern CI/CD tools and workflows, including Jenkins, GitHub Actions, and GitLab CI, and deployment strategies such as blue/green deployments.
- Strong experience with containerization and orchestration tools such as Docker, Kubernetes, and LXC.
- Exposure to serverless technologies.
- Proficiency with AWS cloud services such as EC2, RDS, S3, CloudFront, Route 53, ELB, AMIs, and Directory Services.
- Familiarity with Linux and Windows system administration, networking protocols, firewalls, ACLs, DNS, load balancing, proxies, and SOC compliance requirements.
- Understanding of SQL and database architecture for troubleshooting performance issues.
OTHER SKILLS AND ABILITIES
- Strong leadership, communication, and collaboration skills.
- Demonstrated ability to manage multiple priorities in a dynamic and fast-paced environment.
- Analytical mindset with strong troubleshooting and root cause analysis skills.
- Willingness to participate in and coordinate a 24x7 on-call support rotation, including after-hours and weekend availability for critical incidents.
- Commitment to continuous improvement, innovation, and fostering professional growth within the team.
- Long-term thinker who drives excellence, accountability, and shared success across technical and business functions.