Operations Support Engineer - A26064 at Activate Interactive Pte Ltd
Singapore, Singapore
Full Time


Start Date

Immediate

Expiry Date

24 May, 26

Salary

0.0

Posted On

23 Feb, 26

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Cloud Infrastructure Management, Monitoring, Observability, Compliance, Security, Automation, Infrastructure as Code, Site Reliability Engineering, Virtualisation Platforms, Linux Administration, Windows Server Administration, Container Technologies, Networking, Scripting, Disaster Recovery, CI/CD

Industry

IT Services and IT Consulting

Description
Activate Interactive Pte Ltd (“Activate”) is a leading technology consultancy headquartered in Singapore with a presence in Malaysia and Indonesia. Our clients are empowered with quality, cost-effective, and impactful end-to-end application development, such as mobile and web applications, and cloud technology that remove technology roadblocks and increase their business efficiency.

We believe in positively impacting the lives of people around us and the environment we live in through the use of technology. Hence, we are committed to providing a conducive environment for all employees to realise their full potential and, in turn, continuously drive innovation.

We are searching for our next team members to join our growing team. If you love the idea of being part of a growing company with exciting prospects in mobile and web technologies that create a positive impact on people’s lives, then we would love to hear from you.

Co-Development Business Unit is hiring for Operations Support Engineer
Internal Code: A26064
You will work in Singapore Government Agencies.
This is a fixed-term contract role. The engagement is 1 year, with an option to extend by 1 more year.

What will you do?

Infrastructure Management:
- Design, build, and maintain critical cloud infrastructure platforms encompassing compute, storage, networking, containerisation, virtualisation, DNS, monitoring, and supporting systems across development, staging, and production environments.
- Monitor and manage comprehensive cloud services including CloudWatch logs, alarms, synthetic monitoring, and integrated third-party solutions.

Monitoring and Observability:
- Implement and maintain robust monitoring and observability frameworks for all platform components utilising modern tooling including AWS CloudWatch Canaries, StackOps, Prometheus, Grafana, and ELK stack implementations.
- Establish comprehensive observability practices to support proactive problem diagnosis and provide actionable insights into system health and performance metrics.

Compliance and Security:
- Maintain adherence to Whole-of-Government platform standards, compliance frameworks, and security requirements through continuous monitoring using government-approved security and monitoring solutions.
- Implement security controls including access management, security hardening, and compliance monitoring with tools such as CyberArk.

Automation and Infrastructure as Code:
- Develop and maintain infrastructure using Infrastructure as Code (IaC) methodologies with tools including Terraform, Ansible, and AWS CloudFormation to ensure repeatable, automated, and version-controlled deployments.
- Follow platform standards whilst executing infrastructure automation and modern operational practices to enhance efficiency and reliability.

Site Reliability Engineering:
- Identify and eliminate repetitive operational tasks to improve Developer and Infrastructure Engineer efficiency whilst enhancing overall system reliability through systematic toil elimination and error budget management.
- Define, track, and report on SRE metrics including Service Level Objectives (SLO), Service Level Indicators (SLI), and error budgets.

Platform Operations:
- Manage virtualisation platforms including VMware vSphere and Hyper-V, encompassing capacity monitoring, performance optimisation, and lifecycle management.
- Administer AWS Cloud services including EC2, ECS, S3, RDS (PostgreSQL and MS SQL), Docker/Kubernetes, Lambda, CloudFormation, CloudWatch, IAM, and VPC configurations alongside physical server infrastructure.

Network and System Administration:
- Demonstrate proficiency with local networking technologies including TCP/IP, DNS, DHCP, VPN configurations, and routing protocols.
- Execute comprehensive platform patching strategies leveraging automation to maintain security and stability whilst minimising service disruption.

Business Continuity:
- Maintain backup, disaster recovery, and high availability solutions for critical platform components including AWS Fault Injection Simulator (FIS) testing and multi-availability zone configurations.
- Support containerisation initiatives and maintain container orchestration platforms for traditional workloads.

Collaboration and Documentation:
- Collaborate effectively with application teams to support platform stability, performance, and scalability requirements.
- Create and maintain comprehensive platform documentation, operational runbooks, and standard operating procedures.
- Support team development through knowledge sharing and mentoring on platform operations and modern infrastructure practices.

What are we looking for?

- Advanced experience with enterprise virtualisation platforms (VMware vSphere, Hyper-V)
- Proficiency in Linux and Windows Server administration
- Expertise in server monitoring tool installation and regular patching of virtual and physical servers
- Comprehensive health check capabilities for servers, storage, and virtualisation platforms
- Strong experience with infrastructure automation tools (Ansible, Puppet, Chef)
- Proficiency with container technologies (ECS, Docker, Kubernetes)
- Experience with monitoring and observability platforms
- Infrastructure as Code expertise (Terraform, AWS CloudFormation, Ansible)
- Solid understanding of networking concepts and technologies
- Scripting capabilities in Python, PowerShell, Bash, and Node.js
- Experience with high-availability and disaster recovery solutions including AWS FIS
- Proficiency with GitHub tools and CI/CD pipeline setup and workflow management

- Bachelor’s degree in Computer Science, Information Technology, or a related technical discipline with demonstrated experience in infrastructure operations and engineering.
- Strong understanding of enterprise infrastructure components with proven experience supporting infrastructure modernisation initiatives.
- Excellent analytical and problem-solving capabilities with strong documentation skills and effective communication abilities for both technical and non-technical stakeholders.

- VMware Certified Professional (VCP) or Windows vSphere
- Microsoft Certified: Windows Server
- Red Hat Certified Engineer (RHCE)
- AWS Certified Solutions Architect or AWS Certified SysOps Administrator
- Additional certifications in networking, security, or government IT standards.
- Previous experience in government or highly regulated environments is strongly preferred.

SENIOR OPERATIONS SUPPORT ENGINEER - ADDITIONAL REQUIREMENTS

Leadership and Management:
- Lead infrastructure engineering teams to deliver comprehensive managed services for entire IT infrastructure environments.
- Direct desktop engineering teams to provide first-level support and technical problem resolution for end-user communities.

Strategic Operations:
- Oversee and direct daily IT infrastructure operations, ensuring reliable and secure system, service, and application performance.
- Monitor and manage incident response for business-critical systems with a focus on timely resolution to prevent operational delays and service outages.

Organisational Engagement:
- Demonstrate capability to engage effectively with organisational management whilst establishing guidelines, policies, and procedures with strong execution oversight.
- Manage multiple concurrent deadlines as a self-directed professional with appropriate prioritisation skills.

Operational Excellence:
- Monitor and respond to data centre issues and incidents whilst performing routine operational checks on servers, network devices, storage, and environmental systems.
- Track IT asset inventory, ensuring comprehensive equipment accountability and end-of-life management.

Incident and Change Management:
- Respond promptly to system alerts, alarms, and incidents with appropriate escalation to support teams following defined procedures.
- Support incident troubleshooting and recovery activities whilst managing planned maintenance, change requests, and scheduled outages.
- Coordinate hardware installation, replacement, and decommissioning activities alongside media handling and secure storage management.

What do we offer in return?

- Fun working environment
- Employee Wellness Program
- The opportunity to work on Singapore Government Agencies projects
- A structured development framework and growth opportunities. (We are a “SHRI 2025 Gold winner” in “Learning & Development; Coaching & Mentoring”)

Why you'll love working with us?

If you are looking for opportunities to collaborate with leading industry experts and be surrounded by highly motivated and talented peers, we welcome you to join us. We provide all employees with equal opportunities to grow and develop with us. We believe your success is our success.

Activate Interactive Singapore is an equal opportunity employer. Employment decisions will be based on merit, qualifications and abilities. Activate Interactive Pte Ltd does not discriminate in employment opportunities or practices on the basis of race, colour, religion, gender, sexuality, national origin, age, disability, marital status or any other characteristics protected by law.

Protecting your privacy and the security of your data are longstanding top priorities for Activate Interactive Pte Ltd. Your personal data will be processed for the purposes of managing Activate Interactive Pte Ltd’s recruitment-related activities, which include setting up and conducting interviews and tests for applicants, evaluating and assessing the results, and as is otherwise needed in the recruitment and hiring processes.

Please consult our Privacy Notice (https://www.activate.sg/privacy-policy) to know more about how we collect, use, and transfer the personal data of our candidates. There you can find out how to request access to, correction of, and/or withdrawal of your Personal Data.
Responsibilities
The role involves designing, building, and maintaining critical cloud infrastructure platforms across development, staging, and production environments, including compute, storage, networking, and containerisation. Responsibilities also include implementing robust monitoring frameworks, adhering to government compliance standards, and driving automation using Infrastructure as Code methodologies.