Principal Site Reliability Engineer (SRE) at INFINITE CHOICE LLC
United States, , USA -
Full Time


Start Date

Immediate

Expiry Date

30 Nov, 25

Salary

200000.0

Posted On

01 Sep, 25

Experience

10 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Performance Tuning, Configuration Management, Security, Optimization, New Relic, Reliability Engineering, Containerization, Distributed Systems, Python, Computer Science, Code, Google Cloud Platform, Kubernetes, Docker, Nosql, Infrastructure, Deployment Strategies

Industry

Information Technology/IT

Description

HANDS-ON EXPERIENCE WITH GOOGLE CLOUD PLATFORM IS REQUIRED

  • Expertise with GCP monitoring and observability stack (Cloud Monitoring, Cloud Logging, Cloud Trace)
  • Experience with GKE, Compute Engine, Cloud Functions, and other core GCP services
  • Knowledge of GCP networking, security, and compliance capabilities
  • Understanding of GCP cost optimization and resource management

TECHNICAL SKILLS

  • Strong programming skills in Python, Go, Java, or similar languages
  • Experience with monitoring tools (Prometheus, Grafana, Datadog, New Relic, or similar)
  • Proficiency with containerization (Docker, Kubernetes) and orchestration platforms
  • Knowledge of CI/CD pipelines, automated testing, and deployment strategies
  • Understanding of database performance tuning and optimization (SQL and NoSQL)

EDUCATION

  • Bachelor’s degree in Computer Science, Engineering, or equivalent professional experience
  • Industry certifications (Google Cloud Professional, SRE or related certifications preferred)

How To Apply:

Incase you would like to apply to this job directly from the source, please click here

Responsibilities

ABOUT THE ROLE

We’re seeking an exceptional Principal Site Reliability Engineer to architect, design, and build our SRE foundation from the ground up at InfiniteChoice. This is a rare greenfield opportunity to establish SRE practices, develop custom tooling, and create the reliability culture that will support our platform serving millions of users and billions in transaction volume.
As our Principal SRE, you’ll combine deep technical expertise with strategic vision to build world-class monitoring, observability, and automation systems. You’ll have the autonomy to define our SRE processes, select technologies, and create the framework that ensures our systems are reliable, scalable, and performant.
Location: Remote - US based

Loading...