Lead 1, Site Reliability Engineer

at  SP Global

Pasig, Pasig, Philippines -

Start DateExpiry DateSalaryPosted OnExperienceSkillsTelecommuteSponsor Visa
Immediate09 Nov, 2024Not Specified10 Aug, 20245 year(s) or aboveOwnership,Technology Solutions,Drive,Communication Skills,Automation,Production Systems,Leadership Skills,It,Completion,LearningNoNo
Add to Wishlist Apply All Jobs
Required Visa Status:
CitizenGC
US CitizenStudent Visa
H1BCPT
OPTH4 Spouse of H1B
GC Green Card
Employment Type:
Full TimePart Time
PermanentIndependent - 1099
Contract – W2C2H Independent
C2H W2Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:

PREFERRED QUALIFICATIONS:

AWS Certified Solution Architect – Professional
Great attitude to learning, respect for fellow employees, thinking out of the box and respectfully challenging ideas & hungry for innovation
Good Leadership skills capable of leading a team
Good communication skills and a sense of ownership and drive
Have a software-centric mindset and can understand the full software stack – and beyond
Embrace automation over manual effort
Experience debugging complex problems and view problems as an opportunity to improve
Experience designing, building, and operating large-scale production systems
Experience working in enterprise-scale internal or customer-centric projects to completion, architecting technical solutions, and working closely with development & engineering teams
Provide attention to detail to design, problems, KPI’s, demonstrate the ability to stay focused during critical production events, and champion resolutions
Be able to gel in with the companies’ culture and effectively collaborate with other technology & business stakeholders
About S&P Global Market Intelligence
At S&P Global Market Intelligence, a division of S&P Global we understand the importance of accurate, deep and insightful information. Our team of experts delivers unrivaled insights and leading data and technology solutions, partnering with customers to expand their perspective, operate with confidence, and make decisions with conviction.
For more information, visit www.spglobal.com/marketintelligence .
What’s In It For You?

OUR PEOPLE:

We’re more than 35,000 strong worldwide—so we’re able to understand nuances while having a broad perspective. Our team is driven by curiosity and a shared belief that Essential Intelligence can help build a more prosperous future for us all.
From finding new ways to measure sustainability to analyzing energy transition across the supply chain to building workflow solutions that make it easy to tap into insight and apply it. We are changing the way people see things and empowering them to make an impact on the world we live in. We’re committed to a more equitable future and to helping our customers find new, sustainable ways of doing business. We’re constantly seeking new solutions that have progress in mind. Join us and help create the critical insights that truly make a difference.

Responsibilities:

RESPONSIBILITIES:

Overseeing the transfer of data from traditional to cloud-based services
Experience with understanding business proposals and transforming them into technical solutions
Participate in the design of information and operational support systems
Creating production and migration schedules for large projects with timelines/milestones
Develop and leverage AWS tools and services to manage and automate key operations capabilities. This includes AWS Systems Manager, Patch Manager, Cloud Formation, and custom scripting to extend the AWS services
Proactively ensure the highest levels of systems and infrastructure availability
Monitor and test application performance for potential bottlenecks, identify possible solutions and work with developers to implement those fixes
Write and maintain custom scripts to increase system efficiency and reduce human intervention time on tasks
Maintain security, backup, and redundancy strategies
Provide 3rd-level support for AWS infrastructure
Increase alerting & monitoring quality, Reduce Alarm noise, and Increase Observability Gaps
Optimize Cloud Costing and analyze Capacity Planning
Reduce Operations exposure, Increase the pace of incidents recovery, and Implement Resiliency and remediation plans
Identifying and correcting problems stemming from audit and compliance
Liaise with vendors and other IT personnel for problem resolution
What we’re looking for:Basic Qualifications:
Bachelor’s/Master’s Degree in Computer Science, Information Systems, or equivalent
7+ years of proven work experience as an SRE or DevOps role within companies that provide critical, high-availability services
7+ years of system & solutions engineering, software development, or system operations background with 5+ years of proven work experience as an SRE or DevOps role within companies that provide critical, high-availability services
Experience automating infrastructure, testing, and deployments using tools like Terraform, CFT with Ansible, Rundeck, Autosys, Jenkins, Datadog & other industry-recognized tools to deliver Infrastructure as Code
Experience working with the Rundeck tool (Design, Setup, Deployment, Automation & Integration)
Experience in Configuring AutoSys Workload Automation and managing Job Scheduler, Event Server, and Application Server
5+ years of experience in AWS cloud infrastructure, storage, platforms, and data
5+ years of solid scripting skills on any 2 of the following cloud formation (Must), shell scripts, JavaScript, PowerShell, Python, Bash, SQL, .NET, Java, R, etc.
Experience in Implementing technical roadmaps, project plans, requirements, and designs in AWS technologies: VPC, EC2, EKS, ELB, RDS, Lambda, SES, SNS, Containers, etc.
Good conceptual understanding & knowledge of virtualization & container-based technologies such as AWS, Docker, Kubernetes, VMware, and Virtual Box
Working knowledge of Load balancers such as F5, ALB, ELB, etc.
Experience with any identity management systems such as (SAML/OAuth/OIDC), MFA, etc.
Knowledge of cloud security controls including tenant isolation, key management, encryption, vulnerability assessments, application firewalls, SIEM, etc.
Experience with one or more RDBMS Knowledge SQL Server is a plus
CI/CD delivery using code and configuration management automation tools such as GitHub, VSTS, Ansible, DSC, Puppet, Ambari, Chef, Salt, Jenkins, Maven, etc.
Delivery using modern methodologies especially SAFE Agile, Lean, etc.
Experience with monitoring solutions, such as AWS CloudWatch, ELK, etc.
Experience working with large-scale IT projects related to design, deployment, and configuration
Experience with containers, such as with Kubernetes, Container, Docker, or any OCI runtimes
Certified Cloud Professional (AWS Solutions Architects, DevOps, System)
Experience with scalable networking technologies, including Linux, software-defined networking, network virtualization, open protocols, App acceleration, Load Balancers, DNS, virtual private networks and their application in PaaS and IaaS technologies
Experience with Unix/Linux operating systems internals (e.g., filesystems, system calls), and with networking (e.g., routing, ESDN) or cloud systems
Experience in Performance tuning OS (Linux & Wintel), JVM
Experience with monitoring and observability such as with Datadog, AppDynamics, SolarWinds, Splunk, and Nagios
Experience using source control (Git, GitHub) and feature branching strategies
Experience Developing a keen understanding of the systems environment, dependencies, data flows, databases, and how the products are produced
Second-line on-call support, researching and analyzing issues, figuring out corrective actions, and coordinating and communicating with involved parties
Experience Working with and coordinating activities of IT development, Data Operations, Technical Engineering, and Product Management groups on day-to-day activities to ensure the integrity of the production systems
Ability to profile, analyze, and identify problem areas of a production system
Work with the above-mentioned groups to ensure efficient smooth implementations and ongoing support of new production systems
Development of defect corrections, production efficiencies, and product enhancements
Develop and Maintain SRE runbooks
Strong understanding of SLO, SLI, Error Budgets, and their implementation into SRE areas
Weekend, night, and holiday second-level on-call work scheduled on a rotating basis
Strong critical thinking and problem-solving skills
Ability to set direction and work individually, while working as part of a team towards a common goal
Excellent written and oral communication skills

OUR PURPOSE:

Progress is not a self-starter. It requires a catalyst to be set in motion. Information, imagination, people, technology–the right combination can unlock possibility and change the world.
Our world is in transition and getting more complex by the day. We push past expected observations and seek out new levels of understanding so that we can help companies, governments and individuals make an impact on tomorrow. At S&P Global we transform data into Essential Intelligence®, pinpointing risks and opening possibilities. We Accelerate Progress.


REQUIREMENT SUMMARY

Min:5.0Max:7.0 year(s)

Information Technology/IT

IT Software - System Programming

Software Engineering

Graduate

Computer Science, Information Systems

Proficient

1

Pasig, Philippines