Senior Site Reliability Engineer at IFS SOLUTIONS ASIA PACIFIC PTE LTD

Colombo, Western Province, Sri Lanka

Full Time

Start Date

Immediate

Expiry Date

14 Jul, 26

Salary

0.0

Posted On

15 Apr, 26

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

Skills

Azure, Site Reliability Engineering, PowerShell, IIS, Sitecore, WordPress, Observability, Incident Management, Automation, Azure Monitor, Log Analytics, Application Insights, Azure Front Door, Vulnerability Management, DNS, SSL/TLS

Industry

Software Development

Description

Company Description

IFS is a billion-dollar revenue company with 7000+ employees on all continents. Our leading AI technology is the backbone of our award-winning enterprise software solutions, enabling our customers to be their best when it really matters–at the Moment of Service™. Our commitment to internal AI adoption has allowed us to stay at the forefront of technological advancements, ensuring our colleagues can unlock their creativity and productivity, and our solutions are always cutting-edge.

At IFS, we’re flexible, we’re innovative, and we’re focused not only on how we can engage with our customers but on how we can make a real change and have a worldwide impact. We help solve some of society’s greatest challenges, fostering a better future through our agility, collaboration, and trust.

We celebrate diversity and understand our responsibility to reflect the diverse world we work in. We are committed to promoting an inclusive workforce that fully represents the many different cultures, backgrounds, and viewpoints of our customers, our partners, and our communities. As a truly international company serving people from around the globe, we realize that our success is tantamount to the respect we have for those different points of view.

By joining our team, you will have the opportunity to be part of a global, diverse environment; you will be joining a winning team with a commitment to sustainability; and a company where we get things done so that you can make a positive impact on the world.

We’re looking for innovative and original thinkers to work in an environment where you can #MakeYourMoment so that we can help others make theirs. With the power of our AI-driven solutions, we empower our team to change the status quo and make a real difference. If you want to change the status quo, we’ll help you make your moment. Join Team Purple. Join IFS.

Job Description

As a Senior Site Reliability Engineer (SRE) within the Web Center of Excellence (Web COE), you will be responsible for ensuring the reliability, security, scalability, and performance of enterprise web platforms. You will support and optimize web applications built on Sitecore, WordPress, and IIS-based solutions, while actively driving proactive monitoring, anomaly detection, and vulnerability remediation.

This role blends hands-on engineering, operational excellence, and forward-looking innovation, including participation in AI-driven observability and automation initiatives. You will work closely with developers, QA, solution architects, and business stakeholders to ensure highly available, secure, and resilient web services.

Key Responsibilities

Reliability & Operations

Own the availability, performance, and stability of web applications hosted on Azure, including PaaS and IaaS workloads.
Proactively monitor systems to detect anomalies, performance degradation, and reliability risks, and take preventive actions before customer impact occurs.
Lead incident response, root cause analysis (RCA), and post-incident reviews, ensuring long-term corrective actions are implemented.

Azure & Microsoft Ecosystem

Design, operate, and optimize solutions using Azure services such as App Services, Azure VMs, Azure Monitor, Log Analytics, Application Insights, Azure Front Door, and Azure Networking.
Automate operational tasks using PowerShell and Azure-native automation capabilities.
Ensure adherence to Microsoft security and compliance best practices.

Web Platform Support

Support hosting, deployment, and operational health of Sitecore, WordPress, and legacy IIS-based applications.
Collaborate with development teams to ensure applications are production-ready, scalable, and operationally sound.
Guide teams on web hosting architecture, DNS governance, SSL/TLS, and traffic management.

Security & Vulnerability Management

Proactively identify security vulnerabilities, misconfigurations, and exposure risks across infrastructure and applications.
Partner with security teams to implement remediation plans, patching strategies, and hardening standards.
Ensure secure-by-design principles are embedded into web hosting and operational processes.

Observability, Monitoring & AI Initiatives

Build and enhance monitoring, alerting, and observability across the web ecosystem.
Leverage data, logs, and metrics to identify trends and systemic risks.
Contribute to AI-driven initiatives such as intelligent alerting, anomaly detection, predictive reliability, and automated remediation.
Continuously improve operational maturity through tooling, dashboards, and insights.

Collaboration & Leadership

Work closely with Software Engineers, QA, and Architects to deliver reliable web services.
Provide technical mentorship to junior SREs and engineers, setting operational best practices.

Qualifications

Required

Bachelor’s degree in Computer Science, Information Technology, or equivalent professional experience.
Strong experience as a Site Reliability Engineer, Systems Engineer, or Cloud Engineer supporting production web systems.
Hands-on expertise with Microsoft Azure and the broader Microsoft ecosystem.
Strong scripting and automation skills using PowerShell.
Solid understanding of web technologies, IIS, HTTP/S, DNS, SSL, and load balancing.
Experience with monitoring, alerting, and incident management in production environments.
Proven ability to identify anomalies and proactively prevent failures and vulnerabilities.

Preferred / Added Advantage

Experience supporting Sitecore and WordPress in enterprise environments.
Exposure to AI/ML-based observability, AIOps, or automation initiatives.
Familiarity with ITIL practices, especially Incident, Problem, and Continual Service Improvement.
Experience working in agile, cross-functional teams.

Additional Information

We embrace flexibility and hybrid work opportunities to support diverse needs and lifestyles, while also valuing inclusive workplace experiences. By fostering a sense of community, we drive innovation, strengthen connections, and nurture belonging. Our commitment ensures you can work in a way that suits you best, while also engaging with colleagues to share ideas and build meaningful relationships.

IFS Referral Bonus Code: SH
Job Location: On site

How To Apply:

In case you would like to apply for this job directly from the source, please click here

Responsibilities

The Senior Site Reliability Engineer will ensure the reliability, security, and performance of enterprise web platforms hosted on Azure. The role involves proactive monitoring, incident response, and contributing to AI-driven observability and automation initiatives.