Remote - Edge Site Reliability Engineer (SRE)/ Colombia

at GSB SOLUTIONS

Colombia, Huila, Colombia

Start Date: Immediate
Expiry Date: 17 Dec, 2024
Salary: Not Specified
Posted On: 20 Sep, 2024
Experience: N/A
Skills: PostgreSQL, Access, Cloud, Capacity Planning, Meraki, Availability, Reliability, Ansible, Relational Databases, AWS, PowerShell, Identity Federation, Information Technology, Operations, Infrastructure, SQL Server, Azure, Cisco, Cisco Routers, IaaS, Virtualization, Docker
Telecommute: No
Sponsor Visa: No

Description:

LEVEL OF EDUCATION/QUALIFICATIONS NORMALLY REQUIRED:

  • Bachelor’s Degree in Information Technology or related discipline.
  • Distinctive qualifications relating to his/her area of expertise.
  • AWS Solutions Architect certification and/or certifications from other public cloud
    service providers preferred.

SPECIFIC WORK EXPERIENCE:

  • Experience working in a DevOps team.
  • Previous experience in this role is desirable.
  • Proven experience collaborating on technical designs oriented to availability and
    reliability.
  • Experience in using and integrating cloud solutions.
  • Experience designing for scalability, capacity planning and resource management.
  • At least 3 years of experience in Cloud and DevOps teams.
  • At least 7 years of experience in applications, infrastructure, storage and platforms.

REQUIRED TECHNICAL / FUNCTIONAL SKILLS:

  • Well versed and proficient in automation tools for IaaS / PaaS services, such as:
    • Infrastructure as Code (e.g. CloudFormation, Terraform, Azure RM, etc.)
    • The markup languages most used in the cloud (YAML, JSON)
    • Configuration Management Tools (e.g. AWS Systems Manager, Ansible, Chef,
      Puppet, etc.)
    • Scripting for Operations (e.g. Bash, PowerShell, Python, etc.)
    • Source Control Management (Git, Bitbucket, GitLab, GitHub)
    • CI/CD Orchestration Tools (e.g. Bitbucket Pipelines, Jenkins, CircleCI, GitHub
      Actions, AWS CodeDeploy, Azure DevOps, etc.)
  • Proficient in operating IaaS services on at least one cloud service provider (AWS
    preferred, Azure or GCP)

PREFERRED TECHNICAL / FUNCTIONAL SKILLS:

  • Knowledge of network and security technologies (SD-WAN (Meraki, VeloCloud),
    MPLS, Cisco and Nexus switches, Cisco routers, Cisco firewalls (ASA, FPR) and
    load balancers, e.g. F5/NetScaler).
  • Understanding of converged infrastructure (e.g. VMware + Cisco UCS + EMC storage)
    and hyperconverged infrastructure (e.g. Nutanix).
  • Knowledge of Citrix solutions (namely XenApp).
  • Understanding of VoIP and call centre technologies / architectures / operations.
  • Identity and Access Management: understanding of identity lifecycle, access
    management, identity federation, provisioning, certification, governance,
    Active Directory / Google Directory, MFA, antivirus and security, SAP GRC, SailPoint,
    Okta, Ping, etc.
  • General understanding of distributed systems (e.g. DBaaS, Hadoop-based systems,
    Kafka, etc.)
  • Knowledge of relational databases (Oracle, MS SQL Server, PostgreSQL, MySQL)
    and non-relational databases (MongoDB, Redshift, Couchbase, etc.).
  • Virtualization and containerization technologies (e.g. Kubernetes, Docker, Tanzu,
    VMware on AWS, etc.)
  • SAP Systems (BASIS administrators)
  • Disaster recovery tools (Druva, CPM, etc.)
  • End-to-end monitoring tools (AppDynamics, Dynatrace).

LEADERSHIP AND MANAGERIAL ABILITIES:

  • Ability to be an effective member of a multicultural virtual team of both internal and
    external subject specialists.
  • Ability to drive transformation and change management.
  • Ability to build trusting relationships with internal personnel and external providers.
  • Demonstrated ability to network in a complex matrix-type organizational structure.
  • Ability to build good working relationships with providers.
  • Capable of mentoring interns and intermediate SREs in all areas, and other SREs
    in their area of expertise.

LINGUISTIC SKILLS:

  • Fluency in English, both verbal and written.
  • Other local languages are an asset.

Responsibilities:

  • Guarantee overall system uptime, focusing on availability, to comply with the defined
    SLAs, SLOs and SLIs.
  • Define metrics. As applications evolve over time, the edge SRE is responsible for
    adapting the right SLIs and SLOs and for identifying significant projects that result in
    substantial cost savings or revenues.
  • Spend no more than 50% of their time on hands-on operational run activities (toil). The
    remaining 50% should be focused on reliability, performance and efficiency improvements
    for products.

  • Support the Problem Management process and Root Cause Analysis following P1
    incidents by promoting:
    • Error budget control.
    • A post-mortem culture: learn from errors.
    • Reacting to security breaches and promoting an incident protocol.
    • A strong relationship with the security and operations teams to support continuous
      improvement of security assessments regarding:
      • Patching.
      • Vulnerabilities.
      • Secrets/Keys/Certifications.
      • Compliance (agents/clients installed).
  • Release strategy: defining the parties involved and creating guidelines for version
    control, naming conventions, recommended testing phases and releases.
  • Contribute to new demand assessment by providing technical validation of the demand,
    and take charge of the reliability engineering component of the demand.
  • Continuous improvement functions such as eliminating toil, learning through chaos
    engineering testing, and creating and collaborating on improvement plans. Liaise with
    business continuity: help with the assessment if required, carry out or participate in
    the DR design, and review the runbooks. Help prepare for chaos engineering tests.
  • Participate in communication strategies, presenting zone technical trends and reports
    on his/her function, and help prepare the training path for a new edge SRE with
    recommended readings, practices and training if required.
  • Maintain and review the technology solutions catalog.
  • Provide early-engagement consulting to discuss specific architectures and design choices
    in detail, and to help validate assumptions with the help of targeted prototypes.

  • Assist in ensuring that the Infrastructure & Operations practices and processes are
    aligned with:
    • Lafarge business objectives and priorities (Health & Safety, Communication,
      Distribution Model, Innovation, …)
    • Lafarge IT infrastructure strategy
    • Lafarge Identity Management Systems
    • Lafarge Business Systems
    • Lafarge IT Security Policies and Directives
    • Lafarge Demand, Project Portfolio and Finance Management Policies and standards.
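
For candidates less familiar with the SLI, SLO and error budget terms used in the responsibilities above, the arithmetic behind them can be sketched in a few lines of Python. This is only an illustrative sketch: the names ErrorBudget and availability_sli, the 99.9% target and the 30-day window are assumptions made for the example and are not taken from this posting or from any Lafarge standard.

```python
from dataclasses import dataclass

MINUTES_PER_DAY = 24 * 60


def availability_sli(good_minutes: float, total_minutes: float) -> float:
    """SLI: fraction of the measurement window in which the service was available."""
    if total_minutes <= 0:
        raise ValueError("total_minutes must be positive")
    return good_minutes / total_minutes


@dataclass(frozen=True)
class ErrorBudget:
    """Hypothetical helper: downtime allowed by an availability SLO over a window."""

    slo_target: float  # assumed example value: 0.999 means 99.9% availability
    window_days: int   # assumed example value: 30-day rolling window

    def __post_init__(self) -> None:
        if not 0.0 < self.slo_target < 1.0:
            raise ValueError("slo_target must be strictly between 0 and 1")
        if self.window_days <= 0:
            raise ValueError("window_days must be positive")

    @property
    def allowed_downtime_minutes(self) -> float:
        # The error budget is the share of the window the SLO does not cover.
        return (1.0 - self.slo_target) * self.window_days * MINUTES_PER_DAY

    def remaining_minutes(self, downtime_minutes: float) -> float:
        # A negative result means the budget is exhausted.
        return self.allowed_downtime_minutes - downtime_minutes

    def is_slo_met(self, downtime_minutes: float) -> bool:
        total = self.window_days * MINUTES_PER_DAY
        return availability_sli(total - downtime_minutes, total) >= self.slo_target


if __name__ == "__main__":
    budget = ErrorBudget(slo_target=0.999, window_days=30)
    print(f"Allowed downtime: {budget.allowed_downtime_minutes:.1f} min")
    print(f"Remaining after a 10 min outage: {budget.remaining_minutes(10):.1f} min")
    print(f"SLO met: {budget.is_slo_met(10)}")
```

With these assumed values, a 99.9% target over 30 days leaves roughly 43 minutes of downtime as the budget that "error budget control" would track.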

Job dimensions:

  • Sales, number of people, budget, volumes, etc.:
    • Global infrastructure to support the overall group with a turnover of ~35 bn EUR
    • ~60 countries, ~4 000 sites, ~70 000 IT users and ~80 000 employees
    • 3 zones to coordinate (Americas, EMEA, APAC)
    • Infrastructure services provided on a 24x7x365 basis
    • Number of servers: ~10 000
    • CAPEX + OPEX management (budget to be defined).


REQUIREMENT SUMMARY

Experience: Min: N/A, Max: 5.0 year(s)

Information Technology/IT

IT Software - Network Administration / Security

Software Engineering

Graduate

Information technology or related discipline

Proficient
