Principal Engineer, AI at StarHub Ltd
Singapore
Full Time


Start Date

Immediate

Expiry Date

15 Aug, 26

Salary

0.0

Posted On

17 May, 26

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Python, Ansible, IP Networking, Broadband Architecture, Data Center Infrastructure, API Integration, System Orchestration, AIOps, CI/CD, Network Telemetry, Zero-Touch Provisioning, DCIM Tools, Cloud Environments, TR-069/TR-369, Data Pipelines, Network Automation

Industry

Telecommunications

Description
Job Description

a) IP & Broadband (Core + Access + Transmission)
• Design and implement automation frameworks across IP Core (BNG, routing), Broadband Access (OLT/ONT), and transmission and transport networks.
• Enable automated provisioning, configuration management, and lifecycle operations.
• Develop standard APIs and integration interfaces for network actions.
• Implement configuration compliance and drift management mechanisms.
• Support high availability and resilience readiness (failover, rerouting support).
• Integrate network telemetry into centralized platforms to support AI-driven diagnostics and closed-loop automation workflows.
• Ensure all automation workflows are secure, auditable, and compliant.

b) Data Centre Operations (Infrastructure Automation + Energy & Space Optimization)
• Automate compute, storage, and network provisioning across data center environments.
• Develop runbook automation for operational tasks (restart, failover, scaling).
• Automate patching, upgrades, and lifecycle management.
• Enable infrastructure data pipelines and integrations to support AI-based anomaly detection and predictive maintenance systems.
• Enable event-driven execution workflows from monitoring systems.
• Maintain automation pipelines (CI/CD) for infrastructure operations.
• Ensure robust execution frameworks with rollback and validation mechanisms.
• Enable centralized monitoring of power consumption (UPS, PDU), cooling systems (HVAC, CRAC), and environmental metrics (temperature, airflow, humidity).
• Enable automation readiness for cooling optimization and airflow balancing, and for power utilization tracking (PUE and efficiency metrics).
• Enable execution of optimization actions to improve energy efficiency and reduce costs.
• Support data collection and execution readiness for AI-driven energy optimization (cooling efficiency, power balancing).

c) Business Innovation & Strategic Projects
• Embed “automation-first” principles into transformation programs (e.g., iBNG, SiX AntiDDoS, IP-Optical SRv6 Network).
• Enable zero-touch provisioning (ZTP) for new deployments.
• Develop API-driven and programmable interfaces for new systems.

d) Monitoring, Visibility & Dashboard Enablement
• Implement centralized monitoring frameworks across network and data center domains.
• Enable real-time visibility of performance, utilization, and environmental metrics.
• Integrate telemetry into DCIM and monitoring platforms.
• Support development of operational dashboards (NOC / CXOps) and executive dashboards (capacity, utilization, risk).
• Enable alarm ingestion and visualization (without owning RCA logic).
• Enable dashboards that incorporate AI-driven insights (e.g., anomaly indicators, predictive alerts).
• Ensure telemetry pipelines support AI/ML consumption (real-time, structured, high-quality data feeds).

e) Automation of Operations
• Develop automation for provisioning and configuration management, infrastructure and network lifecycle operations, and runbooks for repetitive tasks.
• Enable execution of network actions (configuration updates, resets) and infrastructure actions (restart, scaling).
• Provide secure, standardized APIs for execution.

Qualifications
• Bachelor’s Degree in Engineering, Computer Science, or a related field.
• 8 to 10 years of experience in network or data center operations and automation / system integration.
• Strong understanding of IP networking and broadband architecture, and of data center infrastructure (power, cooling, monitoring).
• Experience with automation tools (Python, Ansible, scripting), API integrations and system orchestration, monitoring and observability platforms, and AI/ML-enabled operations (AIOps) concepts and data pipelines.
• Preferred skills: DCIM tools, cloud environments (AWS / GCP / Azure), TR-069 / TR-369 (device management), and familiarity with data platforms (e.g., telemetry systems, data lakes).
Responsibilities
Design and implement automation frameworks for IP core, broadband access, and data center infrastructure to enable automated provisioning and lifecycle operations. Develop AI-driven diagnostics, telemetry pipelines, and energy optimization systems to improve network resilience and efficiency.