DevOps & Cloud Infrastructure startup experience at Travoom

Austin, Texas, United States

Full Time

Start Date

Immediate

Expiry Date

01 May, 26

Salary

0.0

Posted On

01 Feb, 26

Experience

10 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

Skills

DevOps, Cloud Infrastructure, AWS, Kubernetes, CI/CD, Linux, Networking, System Debugging, Infrastructure-as-Code, Incident Response, Security, Financial Systems, Real-Time Systems, AI Systems, Crypto Platforms, Rust

Industry

Travel Arrangements

Description

Company Description

OleOle is building the world’s first football-specific social network and messaging platform for the global game’s 3 billion fans.

Today, football fans are fragmented across general-purpose platforms: Twitter for debate, WhatsApp for chat, TikTok for video, ESPN and Sky Sports for content, and separate apps for live scores, betting, tickets, and merchandise. OleOle brings all of this together into one native platform built specifically for football.

OleOle combines:

- Twitter-style real-time conversations
- TikTok-style short-form video
- WhatsApp-style private and group messaging
- Live scores and match data
- AI-powered real-time translation across 200+ languages
- A football history and knowledge layer built to rival Wikipedia
- Integrated wallets, rewards, and crypto-only gambling
- Mini-programs that allow clubs, brands, creators, and partners to build directly on the platform

The platform is ad-free by design and monetized through transactions, rewards, and platform services rather than attention farming.

Technology Stack & Architecture

OleOle is being built as a native, high-performance platform, not a stitched-together SaaS product.
Core technology decisions include:

- Rust for backend services, chosen for performance, safety, and scalability
- A native cloud architecture (AWS initially) designed for global scale and fault tolerance
- Real-time data pipelines for live scores, messaging, and in-match activity
- AI systems for translation, content discovery, and football history queries
- Integrated crypto wallets used for rewards, commerce, and gambling
- A mini-program framework inspired by WeChat, allowing third parties to build on OleOle without platform tax or app-store fees

The architecture is designed to support:

- Massive traffic spikes during global football events
- Real-time fan interaction across continents
- Secure financial transactions and wallets
- Continuous product expansion without re-architecting the platform every year

Where the company is today

- Product vision, feature set, and core architecture are fully defined
- Design and UX are completed
- Development is underway across backend, mobile, AI, and infrastructure
- OleOle Sport (the hard-to-get ticket and travel business) provides a real revenue engine and traffic driver
- The focus now is execution, reliability, and scale

OleOle is not experimenting with ideas; we are building a platform meant to last for decades.

Job Description

Role: Senior DevOps Engineer / Platform Reliability Lead

OleOle is looking for a senior DevOps leader to own the reliability, scalability, and operational integrity of a global, real-time football platform. This is a hands-on leadership role. The core architecture, technologies, and product direction are already defined. The focus now is execution: building infrastructure that scales cleanly, fails gracefully, and supports millions of users during the world’s biggest sporting moments.

You will be responsible for ensuring that a complex, multi-system platform operates as one reliable, observable, and secure system.

What you will own

- End-to-end ownership of cloud infrastructure and platform reliability
- Design and operation of high-availability, fault-tolerant systems
- Kubernetes-based environments supporting real-time social, messaging, AI, and financial services
- CI/CD pipelines that are safe, repeatable, and trusted by engineers
- Monitoring, logging, alerting, and incident response across the entire platform
- Security, access control, secrets management, and operational best practices
- Production readiness for traffic spikes tied to live matches and global tournaments

This role exists to prevent problems before users ever see them and to restore systems quickly and calmly when issues occur.

What you’ll work on

- Operating and scaling real-time systems for live scores, messaging, and in-match activity
- Supporting AI translation workloads without impacting core platform performance
- Ensuring wallet, rewards, and financial infrastructure remain secure, auditable, and always available
- Managing production-grade MediaWiki infrastructure used for large-scale football history content
- Designing failover strategies so no single system can take down the platform
- Creating clear separation between development, staging, and production environments

What we’re looking for

Required

- 7+ years of experience in DevOps, SRE, or platform engineering roles
- Deep experience with AWS and cloud-native architectures
- Strong Kubernetes and container orchestration experience
- Proven track record running high-traffic, real-time production systems
- Infrastructure-as-Code experience (Terraform preferred)
- Strong understanding of Linux, networking, and system debugging
- Experience designing systems for reliability, not just deployment

Strong plus

- Experience supporting crypto platforms, wallets, or exchanges
- Experience with Rust or high-performance backend systems
- Experience with live data feeds, sports, trading, or messaging platforms
- Prior ownership of incident response and on-call operations

How you work

- You think in systems, not tickets
- You anticipate failure modes instead of reacting to them
- You communicate clearly and directly when something is unsafe or broken
- You are comfortable making decisions and taking ownership
- You focus on stability, clarity, and long-term maintainability

This is not a role for someone who wants to debate architecture endlessly. The decisions are made. This role is about making them work in the real world.

Qualifications

Required Experience

- 7–10+ years of experience in DevOps, SRE, or Platform Engineering
- Prior experience at a startup or high-growth technology company, ideally from early or mid-stage through scale
- Proven ownership of production infrastructure for high-traffic, real-time platforms
- Deep hands-on experience with cloud-native architecture (AWS preferred)
- Strong experience operating Kubernetes in production environments
- Infrastructure-as-Code experience (Terraform or equivalent)
- Demonstrated ability to design systems for reliability, fault tolerance, and scalability
- Experience leading or owning incident response and production operations

Strongly Preferred

- Working knowledge of Rust or experience supporting high-performance backend systems
- Experience with blockchain, crypto wallets, or exchange infrastructure
- Background in fintech, payments, trading, or financial systems
- Experience securing systems that handle transactions, keys, and sensitive data
- Familiarity with real-time data pipelines, messaging systems, or event-driven architectures

What distinguishes a great candidate

- You have seen platforms break under real-world pressure, and fixed them
- You understand how early technical decisions affect long-term scale
- You know when to move fast and when stability matters more
- You think in terms of risk, failure modes, and blast radius
- You are comfortable taking ownership without needing constant direction

What we are not looking for

- Junior or mid-level DevOps engineers
- Candidates without real production ownership
- Purely theoretical or certification-only backgrounds
- Engineers who want to debate decisions instead of executing them

Additional Information

- Solutions, not problems. A creative problem solver who can courageously propose and support new ideas to our organization. Not interested in best practices; let’s build something better!
- Ability to adapt. An ideal candidate will welcome the opportunity to solve a broad range of problems using a wide array of technologies.
- Comfortable with ambiguity, shifting priorities, and the general growing pains of an early-stage technology company
- Exceptional entrepreneurial judgment that fosters independence over micromanagement
- Understanding of football and international sports is a huge plus

OleOle is located in beautiful Austin, Texas; however, this role requires some travel. We are privately held and rapidly growing!

Responsibilities

You will own the end-to-end cloud infrastructure and platform reliability, ensuring that a complex, multi-system platform operates as one reliable, observable, and secure system. Your focus will be on building infrastructure that scales cleanly, fails gracefully, and supports millions of users during major sporting events.