Critical Environment Operations Manager at Microsoft
Atlanta, Georgia, United States -
Full Time


Start Date

Immediate

Expiry Date

03 Mar, 26

Salary

0.0

Posted On

03 Dec, 25

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Mechanical Systems, Fire Life Safety, HVAC, Data Center Operations, Preventative Maintenance, Energy Optimization, Risk Mitigation, Incident Reduction, Root Cause Analysis, Budget Management, Collaboration, Commissioning, Retrofitting, Capacity Upgrades, Sustainability, Data Analytics

Industry

Software Development

Description
Empower a culture of safety, security, and compliance in all aspects of datacenter operations, with a strong focus on mechanical and fire life safety systems. Oversee day-to-day operations, maintenance, and reliability of mechanical infrastructure — including chilled water systems, air handlers, Computer Room Air Conditioning(CRAC)/Computer Room Air Handler(CRAH) units, pumps, and cooling towers — as well as associated fire suppression, detection, and life safety systems. Partner with electrical peers to maximize datacenter uptime and critical environment (CE) availability, ensuring optimal system redundancy and operational efficiency. Drive continuous improvement in preventative maintenance, energy optimization, and risk mitigation for mechanical and fire systems. Lead incident reduction and root cause analysis efforts for mechanical-related events, driving a culture of accountability and learning. Deliver on cost and energy efficiency initiatives, ensuring alignment with sustainability and operational goals. Oversee routine reporting and data analytics related to power, temperature, humidity, and mechanical system performance to ensure SLA adherence. Collaborate closely with Engineering Groups (EGs) and regional peers to influence design improvements and operational best practices across the datacenter fleet. Establish strong partnerships with vendors and service providers, ensuring contract adherence, safety performance, and timely delivery of mechanical services. High School Qualification or equivalent AND 6+ years experience of mission-critical service management (e.g., providing IT services, manufacturing, warehouse, retail, military, or managing physical operations in an IT and/or critical environment infrastructure) OR equivalent experience 1+ year(s) of people management experience. These requirements include, but are not limited to the following specialized security screenings: Bachelor's Degree or Technical College certification in Mechanical Engineering, Facilities Engineering, or Building Services (or related field). 6+ years' experience in Critical Environment infrastructure management, with a focus on mechanical systems, HVAC, and fire life safety. Demonstrated experience managing large-scale mechanical operations, including chilled water systems, economizers, air handling, and heat rejection systems. Familiarity with electrical infrastructure (UPS, generators, switchgear) and the ability to collaborate across technical domains. Experience leading commissioning, retrofits, or capacity upgrades of mechanical or fire life safety systems in a live datacenter or mission-critical environment. Budget management experience ($1M+ OPEX or CAPEX programs). Applicable certifications: HVAC, NFPA, PMP, CDCP, LEED, ITIL, or related leadership/technical certifications. Experience working on large-scale CE or building services projects and collaborating with global, virtual, or cross-functional teams. Ability to support a 24x7 datacenter environment, including participation in an on-call rotation and availability during non-standard business hours (evenings, nights, weekends, or holidays) as operational needs require.
Responsibilities
Oversee day-to-day operations and maintenance of mechanical infrastructure in datacenters, focusing on safety and compliance. Drive continuous improvement initiatives for mechanical and fire systems to enhance operational efficiency and system reliability.
Loading...