SRE (Site Reliability Engineer)

at  Travellab Africa Group

Gardens, Western Cape, South Africa -

Start DateExpiry DateSalaryPosted OnExperienceSkillsTelecommuteSponsor Visa
Immediate19 Dec, 2024Not Specified24 Sep, 2024N/AAzure,Devops,Scripting Languages,Code,Scalability,Analytical SkillsNoNo
Add to Wishlist Apply All Jobs
Required Visa Status:
CitizenGC
US CitizenStudent Visa
H1BCPT
OPTH4 Spouse of H1B
GC Green Card
Employment Type:
Full TimePart Time
PermanentIndependent - 1099
Contract – W2C2H Independent
C2H W2Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:

Our Travelstart team is seeking an SRE (Site Reliability Engineer) for our Dev Team. This role ensures the reliability, performance, and scalability of the Travelstart systems. This role bridges the gap between software development and system operations, focusing on automating infrastructure and processes to improve reliability and efficiency.

REQUIRED SKILLS AND EXPERIENCE

  • Strong understanding of cloud platforms (e.g., AWS, Azure, GCP) and infrastructure-as-code tools.
  • Proficiency in scripting languages (e.g., Python, Bash).
  • Experience with containerization technologies (e.g., Docker, Kubernetes).
  • Knowledge of networking and security concepts.
  • Experience with monitoring and alerting tools (e.g., Prometheus, Grafana).
  • Problem-solving and analytical skills.
  • Ability to work independently and as part of a team.

Desired Skills and Experience

  • Experience with DevOps methodologies and tools.
  • Knowledge of specific OTA technologies and systems.
  • Experience with chaos engineering and failure testing.
  • Certification in cloud platforms or DevOps.

By effectively fulfilling these responsibilities, the SRE will play a crucial role in ensuring the reliability, performance, and scalability of the Travelstart systems, ultimately enhancing customer satisfaction and business success.

How To Apply:

Incase you would like to apply to this job directly from the source, please click here

Responsibilities:

  • Infrastructure Automation:
  • Design, develop, and maintain automated infrastructure provisioning and management systems (e.g., Terraform, Ansible, CloudFormation).
  • Create and manage configuration management tools (e.g., Puppet, Chef) to ensure consistent environments.
  • Automate routine tasks and processes to reduce manual intervention and errors.
  • System Reliability:
  • Monitor system performance and identify potential issues proactively.
  • Implement incident response procedures and participate in incident investigations.
  • Conduct root cause analysis to prevent recurring problems.
  • Develop and maintain service level agreements (SLAs) and ensure they are met.
  • Performance Optimization:
  • Optimize system performance through tuning, caching, and load balancing.
  • Conduct performance testing and benchmarking.
  • Identify and address bottlenecks in the system.
  • Scalability:
  • Design and implement scalable architectures to handle increasing traffic and data volumes.
  • Ensure the system can accommodate growth and peak loads.
  • Collaboration:
  • Work closely with development teams to ensure that new features and changes are reliable and scalable.
  • Collaborate with operations teams to maintain system stability and availability.
  • Participate in knowledge sharing and training activities.


REQUIREMENT SUMMARY

Min:N/AMax:5.0 year(s)

Information Technology/IT

IT Software - Other

Software Engineering

Graduate

Certification in cloud platforms or devops.

Proficient

1

Gardens, Western Cape, South Africa