Operations Manager - Infrastructure at Two95 International Inc
Kuala Lumpur, Kuala Lumpur, Malaysia -
Full Time


Start Date

Immediate

Expiry Date

14 Sep, 26

Salary

0.0

Posted On

16 Jun, 26

Experience

10 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Infrastructure Operations, Incident Management, Stakeholder Management, Vendor Governance, ITIL, ServiceNow, AWS, Virtualization, Data Center Services, Team Leadership, SLA Management, Root Cause Analysis, PowerShell, Python, SolarWinds, Dynatrace

Industry

technology;Information and Media

Description
We are seeking a highly skilled and proactive Operations Manager – Infrastructure to lead and manage infrastructure operations in a dynamic, 24x7 environment. The ideal candidate will have strong technical acumen, proven experience in managing critical incidents (P1/P2), driving Post-Incident Reviews (PIRs), and delivering exceptional customer service. This role demands leadership in managing remote teams and working within a managed services framework. Key Responsibilities Oversee day-to-day infrastructure operations ensuring high availability and performance. Ensure adherence to SLAs, KPIs, and compliance standards. Lead and drive P1/P2 incident calls with cross-functional teams. Ensure timely resolution and communication to stakeholders. Conduct and review Post-Incident Reports (PIRs) with root cause analysis and preventive actions. Act as the primary point of contact for customer escalations. Build strong relationships with clients and internal stakeholders. Ensure customer satisfaction through proactive communication and service excellence. Manage and mentor remote and distributed teams across geographies. Foster a culture of accountability, collaboration, and continuous improvement. Work closely with service providers to ensure delivery quality and contractual compliance. Monitor vendor performance and drive service improvements. Communication both Oral and written must be @ a good standard Strong Co-ordination and influencing ability to drive Incidents or BAU tasks (I.e. Updates, upgrades, etc..) to closure Has experience managing/leading cross functional team Has experience or involved in user experience improvements, identifying trends and risks. Ability to communicate clearly with both technical and non-technical people. Strong technical skill set in Data Center (DC) Services, with prior hands-on working knowledge Knowledge application support(optional) to understand the issues pertaining to application side Experience in handling Major Incidents along with effective stakeholder management Experience managing or working with offshore and remote teams Knowledge of the CPG environment (optional) Good understanding of ITSM processes and related governance Preferred Technical Knowledge Strong understanding of IT Infrastructure domains: Servers, Storage, Networking, Cloud (AWS), Virtualization, End user computing and Service Desk Familiarity with ITIL processes and tools – ServiceNow. Experience with monitoring tools (e.g., SolarWinds, , Dynatrace). Knowledge of automation and scripting (e.g., PowerShell, Python) is a plus. Experience 10+ years of experience in IT Infrastructure Operations, with at least 5 years in managerial role. Proven experience in managing 24x7 operations and critical incident handling. Experience in working with remote teams and global delivery models. Exposure to managed services environments and vendor governance. Soft Skills Excellent communication and interpersonal skills. Strong analytical and problem-solving abilities. Ability to work under pressure and manage multiple priorities. Leadership and team-building capabilities.
Responsibilities
Lead and manage 24x7 infrastructure operations, ensuring high availability and adherence to SLAs and KPIs. Drive the resolution of critical P1/P2 incidents and conduct Post-Incident Reviews to prevent recurrence.
Loading...