SRE Site Reliability Engineering Manager

at CapitalCom

London, England, United Kingdom

Start Date: Immediate
Expiry Date: 01 Feb, 2025
Salary: Not Specified
Posted On: 03 Nov, 2024
Experience: 1 year(s) or above
Skills: Python, Programming Languages, Budget Constraints, Security, Software, Teams, Scripting, Ownership, Interpersonal Skills
Telecommute: No
Sponsor Visa: No

Description:

We are a leading trading platform that is ambitiously expanding to the four corners of the globe. Our top-rated products have won prestigious industry awards for their cutting-edge technology and seamless client experience. We deliver only the best, so we are always in search of the best people to join our ever-growing talent team.
As SRE Engineering Manager, you will lead and manage a team of SRE and DevOps engineers to build, maintain, and optimize scalable, reliable, and secure infrastructure.
You will be responsible for ensuring the availability and performance of mission-critical systems, fostering a culture of operational excellence, and driving initiatives that enhance automation and incident response. Your role will involve collaborating with cross-functional teams to design resilient systems, improve deployment processes, and mitigate risks.

REQUIREMENTS:

  • Proven experience in a leadership role managing SRE or Infrastructure/DevOps teams.
  • Strong sense of ownership for systems and service reliability, including responsibility for support, monitoring, and addressing technical issues.
  • Automation advocate: you believe in removing operational load and preventing problem recurrence through software.
  • Strong technical background in systems architecture, networking, and cloud infrastructure.
  • Knowledge of scripting and programming languages (Python, Go, Bash).
  • Excellent problem-solving skills and ability to navigate complex, high-stakes environments.
  • Ability to manage multiple projects simultaneously, set priorities, allocate resources, and ensure timely completion within budget constraints.
  • Ability to encourage and foster a culture of visibility and transparency across teams.
  • Familiarity with security and compliance best practices in cloud environments.
  • Strong communication and interpersonal skills, with the ability to collaborate with both technical and non-technical stakeholders.
  • Russian at B1 level or above would be a plus.
  • Familiarity with fintech regulations and industry standards (e.g., GDPR, PCI DSS) would be a plus.

Responsibilities:

  • Lead, mentor, and grow a team of SREs and DevOps engineers, fostering a diverse, high-performing, and respectful working environment with a culture of collaboration, continuous learning, and innovation.
  • Be directly responsible for uptime: own availability and performance, and build automation to prevent problem recurrence.
  • Lead the delivery and management of automation initiatives that improve operational efficiency, reduce toil, and enhance system reliability, balancing feature development velocity against the risk to reliability.
  • Forecast and plan for system capacity needs, implement cost-optimization strategies, and continuously evaluate the cost and performance trade-offs of the system.
  • Proactively identify potential issues in the system, initiating actions to resolve or mitigate them before they impact users.
  • Collaborate with cross-functional teams to build reliable and secure platforms that meet business needs. Act as a liaison between the SRE team and other stakeholders, facilitating communication and prioritizing work.
  • Ensure system security, data integrity, and compliance with regulatory requirements.


REQUIREMENT SUMMARY

Min: 1.0 Max: 6.0 year(s)

Information Technology/IT

IT Software - Other

Other

Graduate

Proficient

1

London, United Kingdom