DevOps Engineer at ThriveCart
Full Time


Start Date

Immediate

Expiry Date

17 Feb, 26

Salary

0.0

Posted On

19 Nov, 25

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Terraform, AWS CloudWatch, Production Operations, Database Operations, Docker, Linux System Administration, Web Servers, E-commerce, PCI Compliance, CI/CD Pipeline

Industry

Software Development

Description
About ThriveCart:
ThriveCart is the leading no-code sales platform for digital course creators, coaches, entrepreneurs, and online businesses looking to boost revenue, drive conversions, and scale audiences. ThriveCart powers over 65,000 businesses and 12 million enrolled students, generating over $2 billion in annual sales. The platform provides all the tools businesses need to create high-converting checkout experiences, manage powerful affiliate campaigns, and deliver seamless student experiences with its built-in learning management system, Learn/Learn+.

Location: Remote in Europe or Asia
You must have full authorization to work in your country of residence, which must be in Europe or Asia.

The Role
We're looking for a capable, hands-on DevOps engineer who loves being in the trenches. You'll spend your days building CloudWatch dashboards, writing Terraform code, investigating production issues, and making our infrastructure more observable and reliable. This is NOT a planning or coordination role - it's about doing the technical work that keeps a revenue-generating platform running smoothly.

What you'll do:
Build monitoring systems - Create CloudWatch dashboards and alarms that catch real problems (payment failures, database issues, API errors) before customers notice
Write infrastructure code - Use Terraform to manage AWS resources (EC2, RDS, CloudWatch, IAM) with code review and automation
Improve deployments - Make releases faster and safer through CI/CD improvements and automated rollbacks
Handle production issues - Investigate anomalies, respond to alerts, find root causes, and prevent future occurrences
Analyze performance - Identify bottlenecks in payment processing, database queries, and API responses

What We're Looking For
Required Experience (3-6 years)
You must have:
⭐ Terraform production experience - managing real infrastructure with IaC
⭐ AWS CloudWatch experience - building dashboards, configuring alarms, writing custom metrics
Production operations - supporting high-traffic web applications where downtime costs money
Database operations - database system monitoring, performance tuning, backups
Docker & containers - Docker Compose, multi-container applications
Linux system administration - comfortable with shell scripting and troubleshooting
Web servers - Nginx or Apache configuration

Nice to Have
E-commerce or payment processing infrastructure experience
PCI compliance knowledge
Multi-database architecture experience
On-call experience for revenue-critical systems
CI/CD pipeline construction

Personal Qualities
Proactive - You spot problems before they become incidents
Systematic - You troubleshoot methodically with data, not guesses
Clear communicator - You explain complex issues in simple terms
Detail-oriented - You catch edge cases and think about failure modes
Calm under pressure - Production incidents don't rattle you
Documentation-focused - You write runbooks because you know future-you will thank present-you

Technical Environment
Infrastructure you'll work with:
AWS CloudWatch
Terraform for infrastructure as code
Docker for development environments
MariaDB 10.11 (multi-database architecture)
PHP & Node/TypeScript application stack
Nginx web server
Memcached caching
AWS: EC2, RDS, Route53, IAM

You'll be responsible for:
Production monitoring and alerting
Infrastructure as code (Terraform)
CI/CD pipeline improvements
Database operations and optimization
Container orchestration
Security and compliance support

What Makes This Role Different
High impact - Your observability work will directly affect how fast we find and fix issues
Real stakes - When payments fail, merchants lose money. Your work matters.
Hands-on - You'll write code and build systems, not just coordinate or plan
Complex system - Multi-database, legacy + modern, payment processing: interesting technical challenges
Visibility - Your work is immediately visible to the entire engineering team

What Success Looks Like
In 30 days:
You've responded to your first production alert
You understand the architecture and data flows
You've identified gaps in our current monitoring

In 90 days:
You've built 2+ dashboards the team relies on
You've implemented 5+ alarms for previous blind spots
You handle production incidents independently

In 6 months:
You've proactively prevented multiple production issues
The team depends on your monitoring systems
You've led an infrastructure improvement project
You're comfortable on the on-call rotation

On-Call
Shared rotation (about 1 week per month)
Focus on critical alerts (payment processing, database issues)
Most scenarios have documented runbooks
Team backup available for complex issues
Compensated with time off or additional pay

Why Join ThriveCart
✅ Direct impact - Your work directly affects revenue and reliability
✅ Modern practices - Terraform, IaC, CloudWatch, Docker
✅ Interesting problems - Complex distributed systems with real-world challenges
✅ Learning opportunities - Diverse tech stack (legacy + modern)
✅ Collaborative team - Work with experienced engineers
✅ Remote flexibility - Fully remote role

Our Values are:
Commit to Excellence
Find a Way
Help Each Other
Keep Growing
Data First

Responsibilities
You will be responsible for building monitoring systems, writing infrastructure code, improving deployments, handling production issues, and analyzing performance. Your work will ensure the reliability and observability of a revenue-generating platform.