Cloud and Observability Engineer (Remote Role, 6pm - 3am) at Coralogix
New Delhi, Delhi, India
Full Time


Start Date

Immediate

Expiry Date

19 Jul, 26

Salary

0.0

Posted On

20 Apr, 26

Experience

2 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Observability, DevOps, Cloud Computing, Kubernetes, GCP, Azure, AWS, PromQL, Prometheus, CI/CD, Shell Scripting, Python, Regex, Monitoring, Alerting, Data Analysis

Industry

Software Development

Description
Coralogix is a modern, full-stack observability platform transforming how businesses process and understand their data. Our unique architecture powers in-stream analytics without reliance on expensive indexing or hot storage. We specialize in comprehensive monitoring of logs, metrics, traces and security events, with features such as APM, RUM, SIEM, Kubernetes monitoring and more, all enhancing operational efficiency and reducing observability spend by up to 70%.

Coralogix is rebuilding the path to observability using a real-time streaming analytics pipeline that provides monitoring, visualization, and alerting capabilities without the burden of indexing. By enabling users to define different data pipelines per use case, we provide deep observability and security insights, at infinite scale, for less than half the cost.

About the Position

Job Summary:
As a Cloud and Observability Engineer, you will play a critical role in ensuring a smooth transition of customers’ monitoring and observability infrastructure. Your expertise in other observability tools, coupled with a strong understanding of DevOps, will be essential to migrating alerts and dashboards by creating extension packages and to enhancing the customer's monitoring capabilities. You will collaborate with cross-functional teams, understand their requirements, design migration and extension strategies, execute the migration process, and provide training and support throughout the engagement.

Responsibilities:

Extension Delivery: Build and enhance quality extension packages for alerts, dashboards and parsing rules in the Coralogix platform to improve the monitoring experience for key services using our platform. This entails:
1. Researching how to build world-class extensions, including for container technology and services from cloud service providers.
2. Building the related alerts and dashboards in Coralogix, validating their accuracy and consistency, and creating detailed overviews and documentation for them.
3. Configuring parsing rules in Coralogix using regex to structure the data as required.
4. Building packages according to Coralogix methodology and standards, and automating ongoing processes using shell or Python scripting.
5. Supporting internal stakeholders and customers with queries, issues and feedback on deployed extensions.

Migration Delivery: Help migrate customer alerts, dashboards and parsing rules from leading competitive observability and security platforms to Coralogix.

Knowledge Management:
1. Build, maintain and evolve documentation covering all aspects of extensions and migration.
2. Conduct training sessions for internal stakeholders and customers on all aspects of platform functionality (alerts, dashboards, parsing, querying, etc.), migration processes and techniques, and extension content.
3. Collaborate closely with internal stakeholders and customers to understand their specific monitoring needs, gather requirements, and ensure alignment during the extension-building process.

Requirements

Professional Experience: 2+ years of experience as an SRE, Systems Engineer, DevOps Engineer, or in a similar role, with a focus on monitoring, alerting, and observability solutions.

Cloud Technology Experience: 2+ years of hands-on experience with, and understanding of, cloud and container technologies (GCP/Azure/AWS + K8s/EKS/GKE/AKS). Cloud service provider DevOps certifications would be a plus.

Observability Expertise: Good knowledge of and hands-on experience with two or more observability platforms, including alert creation, dashboard creation, and infrastructure monitoring. Researching the latest industry trends is part of the scope.

Deployments & Automation: Good understanding of CI/CD with at least one deployment tool and one version control tool. Engineers will need to package alerts and dashboards as extension packs on an ongoing basis.

Grafana & PromQL Proficiency: Basic understanding of and practical experience with PromQL, Prometheus's query language, for querying metrics and creating custom dashboards. You will also need to learn DataPrime and Lucene syntax on the job.

Troubleshooting Skills: Excellent problem-solving and debugging skills to diagnose issues, identify root causes, and propose effective solutions.

Communication Skills: Strong verbal and written English communication skills to collaborate with the customer's cross-functional teams, deliver training sessions, and create clear technical documentation.

Analytical Thinking: Ability to analyze complex systems, identify inefficiencies or gaps, and propose optimized monitoring solutions.

Availability: Working the night shift (6 PM to 3 AM) from India is mandatory. Candidates should be willing to visit the office in Gurugram for important meetings at least once a year, or as customer needs require.
Responsibilities
The Cloud and Observability Engineer will manage the migration of customer monitoring infrastructure and create extension packages for alerts and dashboards. They will also provide technical support, conduct training sessions, and collaborate with cross-functional teams to optimize monitoring solutions.