Site Reliability Engineer at Coralogix

Boston, Massachusetts, United States -

Full Time

Start Date

Immediate

Expiry Date

23 May, 26

Salary

0.0

Posted On

22 Feb, 26

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

Skills

Kubernetes, AWS, Kafka, Prometheus, Thanos, Git, Argo CD, Istio, Terraform, Crossplane, Golang, FedRAMP Compliance, Vulnerability Management, Incident Management, Infrastructure as Code, Networking

Industry

Software Development

Description

Coralogix is a modern, full-stack observability platform transforming how businesses process and understand their data. Our unique architecture powers in-stream analytics without reliance on expensive indexing or hot storage. We specialize in comprehensive monitoring of logs, metrics, trace and security events with features such as APM, RUM, SIEM, Kubernetes monitoring and more, all enhancing operational efficiency and reducing observability spend by up to 70%. We are looking for a Site Reliability Engineer to work as part of our Cloud Infrastructure Team. Focusing on Enterprise FedRal Cloud Infrastructure. In This Role, You Will: Work in high scale environments - Coralogix data pipeline processes 55Tb of data each day Adopt cutting edge technologies with end-to-end responsibility Building internal tools to expand our platform capabilities Collaborate with R&D to improve stability & reliability of the system Lead the product roadmap - our product is designed for engineers. Therefore, our engineers promote, enhance, and take a crucial part in influencing the product roadmap. Perform operational duties for FedRAMP cloud products, including deployments, on-call support, and incident management. Our Tech Stack Is Unique And In Constant Growth Kubernetes, Kops, AWS, Kafka, Prometheus, Thanos, Coralogix, Git, Argo CD, Istio, and many more! Requirements At least 5 years of experience as a DevOps Engineer/ SRE in production environments In-depth experience with Kubernetes - operating & monitoring are key parts At least 2 years of experience Experience with FedRAMP compliance (High/Moderate levels), vulnerability management, and continuous monitoring, including scanning, patching, and reporting. High familiarity with monitoring tools such as Coralogix, Grafana, Prometheus Experience in AWS or other cloud providers Experience with infrastructure as a code (Terraform, Crossplane, etc.) Understanding of networking - from networking layers to different networking protocols (http, grpc, ssl) Some software engineering experience, preferably in Golang. An advantage - operating data pipelines An advantage - familiarity with Apache Kafka Cultural Fit We’re seeking candidates who are hungry, humble, and smart. Coralogix fosters a culture of innovation and continuous learning, where team members are encouraged to challenge the status quo and contribute to our shared mission. If you thrive in dynamic environments and are eager to shape the future of observability solutions, we’d love to hear from you. Coralogix is an equal opportunity employer and encourages applicants from all backgrounds to apply.

Responsibilities

The Site Reliability Engineer will focus on the Enterprise Federal Cloud Infrastructure, working in high-scale environments processing significant daily data volumes and adopting cutting-edge technologies with end-to-end responsibility. Duties include building internal tools, collaborating with R&D for system stability, leading the product roadmap influence, and performing operational duties for FedRAMP cloud products like deployments and on-call support.