Senior Site Reliability Engineer
at OnBuy
Bournemouth, England, United Kingdom
Start Date | Expiry Date | Salary | Posted On | Experience | Skills | Telecommute | Sponsor Visa |
---|---|---|---|---|---|---|---|
Immediate | 07 Feb, 2025 | GBP 80000 Annual | 07 Nov, 2024 | N/A | Java, Python, Architecture, Google Cloud, New Relic, Interpersonal Skills, Kubernetes, Distributed Systems, Logging, Collaborative Environment, Service Providers, Azure, Microservices, AWS, Programming Languages | No | No |
Description:
WHO ARE ONBUY?
OnBuy are an online marketplace on a mission to be the best choice for every customer, everywhere.
We have recently been named one of the UK’s fastest-growing tech companies in Deloitte’s Technology Fast 50 for the third year in a row (as well as ‘Fastest-Growing Tech Business in the South West’).
We are very proud of these achievements, but we don’t let them go to our heads. We are all laser-focused on our mission and understand the huge joint effort needed to succeed.
REQUIREMENTS
Essential
- Proven experience as a Senior Site Reliability Engineer or in a similar role.
- Strong proficiency in programming languages such as Python, Go, or Java.
- Experience with cloud service providers (AWS, Azure, Google Cloud) and with containerisation and orchestration tools (Docker, Kubernetes).
- Solid understanding of networking, distributed systems, and microservices architecture.
- Familiarity with monitoring and logging tools (New Relic, Prometheus, Grafana, ELK stack, GCP logging).
- A strong determination and work ethic to find the best solution to any problem.
- Excellent problem-solving skills and the ability to work effectively in a fast-paced, collaborative team environment.
- Strong communication and interpersonal skills, with the ability to effectively collaborate with cross-functional teams.
RESPONSIBILITIES
- Design and implement scalable systems to ensure high availability and performance.
- Develop automated solutions for monitoring, scaling, and system health management.
- Collaborate with software development teams to identify and resolve reliability issues.
- Create and maintain documentation related to system architecture, processes, and configurations.
- Perform incident response and postmortem analysis to improve site reliability and performance.
- Monitor system performance and make necessary adjustments to ensure optimal functionality.
- Implement and manage infrastructure as code using tools like Terraform or Ansible.
- This role requires out-of-hours support (via a rota) to address urgent DevOps issues, ensuring the reliability and availability of critical systems. Payment for this support is made under the company’s ‘out of hours working’ policy.
REQUIREMENT SUMMARY
Min: N/A, Max: 5.0 year(s)
Information Technology/IT
IT Software - Application Programming / Maintenance
Software Engineering
Graduate
Proficient
1
Bournemouth, United Kingdom