Azure Cloud System Support at NTT DATA
Kuala Lumpur, Kuala Lumpur, Malaysia -
Full Time


Start Date

Immediate

Expiry Date

18 Mar, 26

Salary

0.0

Posted On

18 Dec, 25

Experience

2 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Azure Cloud, Release Management, Linux Administration, Windows Administration, CICD Pipeline, Monitoring Systems, Incident Management, PaaS, IaaS, Kubernetes, PowerShell, Python, NoSQL, SQL, Log Analytics, Documentation

Industry

IT Services and IT Consulting

Description
Release Management of new software via Tools Understand release management SOP = QA -> Load Test -> Stage Environment -> PROD Create/Manage monitoring and alerting systems and as needed to meet SLA's Comfortable with both Linux and Windows administration Working in agile teams, build, test and maintain aspects of CICD Pipeline Manage UI visual of license consumption & performance Evangelize with Engineering, Security, and cross functions on Ops Best Practices Firmware release - OTA (over the air) Launch new the mobile app / release new version of the existing mobile app - Appstore / Play store Participate in RCCAs when needed Maintain documentation & best practices (Wiki & Runbooks) Work with teams to set up standard alerts that can be placed in ARMs & CICD Support product NPI onboarding Participate in early phases of NPI's sprints when Arch tech runways are defined Support continuous delivery of programs in which patches, new versions, and bug fixes are more frequently deployed to end users without sacrificing stability or reliability Support on-call during off-hours crisis Responsible for Tier 1.5-2 support that includes end-2-end ownership of incidents from the time they enter the service line through closure for connected devices Responsible for 24X7 Major Incident Management support Implement corrective actions needed to mitigate security risks Ensure all tickets requiring follow-up work and/or calls are resolved. Ensure all the components are within MON purview 3-5 years' experience in Microsoft Azure cloud System Support. Understand the Microsoft Azure cloud - ideally Azure Fundamentals certified OR Computer Science/Information Systems Management degree Perform L1.5 activities such as monitoring, deployment, rollback. Monitor the efficiency of the Azure cloud systems to prevent outages, and initiate an Incident Management bridge in case of an outage. Troubleshoot Azure resources, escalate to Level 3 (soft dev team) Familiar with PaaS and IaaS - VMs, Storage, EventHub, Service Fabric Cluster (SFC), Azure Kubernetes Service (AKS), Cosmos DB, SQL Server, IoT Hub, Databricks, Key Vault, Data Lake Understand the concept of Internet of Things (IoT) - telemetry, ingestion, processing, data storage, reporting Understand the concept tools - Octopus, Bamboo, Terraform, Azure DevOps, Jenkins, GitHub, Ansible Understand the concept of container orchestration platforms (e.g. Kubernetes) Understand the concept of scripts: PowerShell, Python Understand the difference between NoSQL and SQL databases, and how to maintain them Understand monitoring and logging systems (Log Analytics, Splunk, ELK, Prometheus, Nagios, Zabbix, etc) Independent thinker - why does it break, what can I proactively do to fix it Required Strong English communication (written and oral) skills
Responsibilities
The role involves managing the release of new software and ensuring the stability and reliability of Azure cloud systems. Responsibilities include monitoring, incident management, and supporting continuous delivery of programs.
Loading...