Remote - Edge Site Reliability Engineer (SRE)/ Colombia
at GSB SOLUTIONS
Colombia, Huila, Colombia -
Start Date | Expiry Date | Salary | Posted On | Experience | Skills | Telecommute | Sponsor Visa |
---|---|---|---|---|---|---|---|
Immediate | 17 Dec, 2024 | Not Specified | 20 Sep, 2024 | N/A | Postgresql,Access,Cloud,Capacity Planning,Meraki,Availability,Reliability,Ansible,Relational Databases,Aws,Powershell,Identity Federation,Information Technology,Operations,Infrastructure,Sql Server,Azure,Cisco,Cisco Routers,Iaas,Virtualization,Docker | No | No |
Required Visa Status:
Citizen | GC |
US Citizen | Student Visa |
H1B | CPT |
OPT | H4 Spouse of H1B |
GC Green Card |
Employment Type:
Full Time | Part Time |
Permanent | Independent - 1099 |
Contract – W2 | C2H Independent |
C2H W2 | Contract – Corp 2 Corp |
Contract to Hire – Corp 2 Corp |
Description:
LEVEL OF EDUCATION/QUALIFICATIONS NORMALLY REQUIRED:
- Bachelor’s Degree in Information Technology or related discipline.
- Distinctive qualifications relating to his/her area of expertise.
- Preferred AWS solution architect certification and/other public cloud service
providers.
SPECIFIC WORK EXPERIENCE:
- Experience working in devops team
- Previous experience in this role is desirable
- Proven experience collaborating in technical designs oriented to availability and
reliability.
- Experience in using and integrating cloud solutions.
- Experience designing for scalability, capacity planning and resource management
- At least 3 years or more of experience in Cloud and Devops teams,
- At least 7 years experience in Applications, Infrastructure, Storage, Platforms.
REQUIRED TECHNICAL / FUNCTIONAL SKILLS:
- Well versed and proficient on Automation Tools for IaaS / PaaS services such as:
- Infrastructure as a Code (i.e Cloud Formation,Terraform, Azure RM … etc)
- Cloud most used mark-up languages (YAML, JSON)
- Configuration Management Tools (i.e AWS System Manager, Ansible, Chef ,
Puppet … etc)
- Scripting for Operations (i.e Bash, PowerShell, Python.. etc).
- Source Control Management (Git, bitbucket, gitlab, github)
- CI/CD Orchestration Tools (i.e Bitbucket pipelines, Jenkins, CircleCI, Github
Actions, AWS Code Deploy, Azure DevOps….etc).
- Proficient in Operation IaaS services on at least one cloud service provider (AWS -
preferred, Azure or GCP)
- Preferred technical/functional skills:
- Knowledge in Network and Security technologies ( SDWAN( Meraki, velocloud),
MPLS, Cisco and Nexus switches, Cisco Routers, Cisco firewalls ( ASA, FPR) and
Load Balancers i.e F5/Netscaler).
- Understanding of Converged Infrastructure(i.e VMware+Cisco UCS+EMC Storage)
and HyperConverged (i.e Nutanix)
- Knowledge of Citrix solutions (namely XenApp)
- Understanding of VOIP and Call centre technologies / architectures / operations.
- Identity Access Management:. Understanding of Identity Lifecycle, access
management, Identity Federation, provisioning, certification, governance,
Active/Google Directory, MFA, Anti-virus and security, SAP-GRC, Sailpoints, Okta,
Ping, etc.)
- General Distributed Systems Understanding (i.e DBaaS, Hadoop Based Systems,
Kafka … etc)
- Knowledge in relational databases (Oracle, MS SQL Server, PostgreSQL, MySQL)
and non relational databases (MongoDB, Redshift, Coachbase….).
- Virtualization and Containerization Technologies (i.e Kubernetes, Docker, Tunzu,
Mware on AWS … etc)
- SAP Systems (BASIS administrators)
- Disaster recovery tools (Druva, CPM, etc.)
- End-to-End Monitoring tools (appdynamic, Dynatrace).
LEADERSHIP AND MANAGERIAL ABILITIES:
- Ability to be an effective member of a multicultural virtual team of both internal and
external subject specialists.
- Ability to drive transformation and change management.
- Ability to build trust relationships with internal personnel and external providers.
- Demonstrated ability to network in a complex matrix type organization structure.
- Ability to build good working relationships with providers.
- Capable of mentoring interns and intermediate SREs in all areas and clothes SREs
in their area of expertise
LINGUISTIC SKILLS:
- Fluency in English, both verbal and written.
- Other local languages are an asset.
Responsibilities:
- Guarantee the general system uptime, focus on availability to comply with the defined SLA,
SLO and SLI.
- Define metrics. As applications evolve over time, edge SRE is responsible for adapting the
right SLI and SLO and identifying significant projects that result in substantial cost savings
or revenues.
- Spend <=50% of their time spent on hands-on Operational run activities (toil). The
remaining 50% should be focused on reliability, performance and efficiency improvements
for Products
- Supports the Problem Management process and Root Cause Analysis following P1
incidents by promoting:
- Error budget control.
- Post mortem culture. Let’s learn from the errors.
- React under security breach and promote an incident protocol.
- A strong relationship with the security and operation team to support continuous
improvement of security assessments regarding
- Patching.
- Vulnerabilities.
- Secrets/Keys/Certifications.
- Compliance (Agents/clients installed)
- Release strategy. Defining the involved parts, creating guidelines for version control and
name conventions, recommended testing phases and releases.
- Contribute to new demand assessment by providing technical validation of the demand and
is in charge of the reliability engineering component of the demand.
- Continuous improvement functions as eliminating toil , learning through Chaos engineering
testing, creating and collaborating on improvement plans. Relation with business continuity,
helping with the assessment if it is required, doing or participating in the DR design and
reviewing the runbooks. Helping to prepare for Chaos Engineering tests.
- Participate in communication strategies, showing zone technical trends, reports of his/her
function and helping to prepare the training path for a new edge SRE with recommended
readings, practices and training if it is required..
- Maintain and review technology solutions catalog.
- Providing early engagement consulting to discuss specific architectures and design choices
in detail, and to help validate assumptions with the help of targeted prototypes
- To assist in ensuring that the Infrastructure & Operations practices & processes are aligned
with:
- Lafarge business objectives and priorities (Health & Safety, Communication,
Distribution Model, Innovation, …)
- Lafarge IT infrastructure strategy
- Lafarge Identity Management Systems
- Lafarge Business Systems
- Lafarge IT Security Policies and Directives
- Lafarge Demand, Project Portfolio and Finance Management Policies and standards.
Job dimensions:
- Sales, number of people, budget, volumes etc.:
- Global infrastructure to support the overall group with a turnover of ~35 bn EUR
- ~60 countries, ~4 000 sites, ~70 000 IT users and ~80 000 employees
- 3 zones to coordinate (Americas, EMEA, APAC)
- Infrastructure services provided on a 24x7x365 basis
- Number of servers: ~10 000
- CAPEX + OPEX management (budget to be defined).
REQUIREMENT SUMMARY
Min:N/AMax:5.0 year(s)
Information Technology/IT
IT Software - Network Administration / Security
Software Engineering
Graduate
Information technology or related discipline
Proficient
1
Colombia, Huila, Colombia