Senior Infrastructure Engineer I (Amex ID: 24015268) at Haystack
London, England, United Kingdom
Full Time


Start Date

Immediate

Expiry Date

25 Oct, 25

Salary

0.0

Posted On

25 Jul, 25

Experience

0 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Network Security, Python, Network Monitoring Tools, Cloud Storage, Orchestration, Storage Solutions, Virtualization, Code, Docker, Switching, Disaster Recovery, Business Continuity Planning, Nginx, Capacity Planning, Firewalls, Pipeline Management, Hyper-V, Automation

Industry

Information Technology/IT

Description

Amex ID: 24015268
Hybrid requirements: This role has flexible working patterns.
We are seeking a versatile and highly skilled Full Stack Infrastructure Engineer with expertise in Compute, Storage, Network, and Cloud technologies. The ideal candidate will design, implement, and manage robust infrastructure solutions, ensuring reliability, scalability, and performance.

REQUIRED SKILLS AND EXPERIENCE

Proven experience managing and optimizing a diverse infrastructure stack.
Extensive knowledge of cloud platforms (AWS, Azure, GCP) and infrastructure as code (Terraform, CloudFormation).
Familiarity with service mesh technologies (Istio, Linkerd).
Solid understanding of virtualization (VMware, Hyper-V), containerization (Docker, Kubernetes), and orchestration.
Understanding of storage solutions (SAN, NAS, cloud storage) and backup systems.
Strong understanding of network protocols, routing, switching, and firewalls.
Experience with load balancers (F5, HAProxy, Nginx) and network monitoring tools.
Experience in DNS management and troubleshooting.
Experience in network security best practices.
Proficiency in monitoring and observability tools (Prometheus, Grafana, Splunk).
Proficiency in at least one scripting language (Python, Bash) for automation.
Experience with CI/CD pipeline management and DevOps practices.
Strong understanding of disaster recovery and business continuity planning.
Experience with performance tuning and capacity planning.
Understanding of chaos engineering principles and practices.
Skills in cost optimization for cloud infrastructure.

Responsibilities

Ensure the reliability, availability, and performance of the entire infrastructure stack including compute, storage, network, and cloud components.
Lead incident response efforts across the infrastructure stack, coordinating with Application Support, SRE, and Engineering teams to minimize MTTD and MTTR.
Perform root cause analysis for infrastructure-related incidents and implement corrective actions.
Develop and maintain automation tools for managing infrastructure resources.
Collaborate with Engineering teams to plan and execute system upgrades and maintenance.
Conduct capacity planning and resource management for all infrastructure components.
Participate in on-call rotations to provide 24x7 support for all critical infrastructure issues.
Design and implement disaster recovery plans and business continuity strategies.
Implement best practices for monitoring, logging, and alerting across the infrastructure.
Foster a culture of continuous improvement and operational excellence.
Analyze complex infrastructure problems, design scalable and resilient solutions, and lead the implementation of these solutions.
Collaborate with architects and other engineers to design and enhance the architecture of infrastructure systems, ensuring alignment with business needs and technology standards.
