Site Reliability Engineer at NetNut.io

Israel

Full Time

Start Date

Immediate

Expiry Date

08 Jul, 26

Salary

0.0

Posted On

09 Apr, 26

Experience

2 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

Skills

Linux, TCP/IP, HTTP/HTTPS, DNS, Proxy Technologies, Bash, Python, Prometheus, Grafana, ELK, Docker, Kubernetes, CI/CD, Jenkins, Network Troubleshooting, Automation

Industry

Information Technology & Services

Description

About the job

We are looking for a hands-on Senior SRE – Network & Proxy Infrastructure to take ownership of a large-scale, global proxy infrastructure. Our system consists of hundreds of Linux servers distributed worldwide, handling high-volume, real-time traffic.

This role sits at the intersection of systems, networking, and production reliability. You will be responsible for ensuring the stability, performance, and scalability of the platform, from deep packet-level troubleshooting to building monitoring, automation, and operational workflows. This is not a traditional IT or networking role; it is a production engineering position focused on real-world traffic, performance, and reliability at scale.

Responsibilities:

· Configure, install, and manage proxy systems to optimize network and data flow performance.
· Investigate and resolve production issues involving latency, connectivity, and routing behavior.
· Monitor system and network health using tools like Prometheus, Grafana, ELK, or similar observability stacks.
· Implement alerts and dashboards to proactively detect and mitigate service degradation.
· Write and modify scripts/code to automate network and system operations.
· Troubleshoot technical issues related to proxies, networks, and server performance.
· Analyze network traffic (TCP/IP, HTTP/S) to identify bottlenecks and anomalies.
· Work with DevOps tools to manage and maintain server infrastructure.
· Analyze network traffic data and identify patterns to enhance overall system efficiency.
· Collaborate with R&D and IT teams to improve system architecture and performance.

Requirements:

· 3+ years of hands-on experience in production systems / SRE / network-oriented roles.
· Strong understanding of:
    · TCP/IP fundamentals
    · HTTP/HTTPS behavior
    · DNS resolution and networking basics (firewalls, load balancers, and proxy technologies)
· Strong Linux skills (processes, networking tools, system performance).
· Proficiency in scripting languages (Bash, Python); experience with automation and CI/CD workflows (e.g., Jenkins).
· Hands-on experience with DevOps tools (e.g., Linux, Docker, Kubernetes, cloud platforms).
· Background in customer service/support is an advantage.

Why This Role Is Interesting

· Work on a globally distributed system at a massive scale.
· Solve real network problems, not just dashboards.
· High ownership and impact on core product performance.
· Close collaboration with engineering on architecture decisions.

Responsibilities

You will manage and optimize global proxy infrastructure to ensure high performance, scalability, and reliability of real-time traffic. This involves troubleshooting network issues, automating operational workflows, and monitoring system health using observability tools.