Senior Storage Engineer at Verda

Helsinki, Uusimaa, Finland

Full Time

Start Date

Immediate

Expiry Date

08 Sep, 26

Salary

0.0

Posted On

10 Jun, 26

Experience

10 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

Skills

Ceph, Distributed Storage, Linux Systems, Networking, Infrastructure Automation, Capacity Planning, Incident Response, Technical Leadership, Mentoring, Object Storage, Block Storage, File Systems, Observability, Performance Tuning, Ansible, GPU Workloads

Industry

technology;Information and Internet

Description

Imagine a future where everyone has instant, low-cost access to powerful computing resources. We're building a fully featured European cloud computing platform - with everything one needs to run, scale, and deploy applications and workloads. In addition, our infrastructure runs on 100% renewable energy.

We're ambitious, curious, and gutsy doers. We practice a low hierarchy across the company and high morale in our teams. We've already achieved a lot, yet we're only getting started. Now it's your chance to join the ride. We offer more than just the job - we offer a career-defining opportunity to be part of building something big! Join Verda while it's still being built - not once it's finished.

Storage is the foundation our AI cloud sits on. As our Senior Storage Engineer, you'll lead the design and operation of the Ceph clusters that hold customer data at petabyte scale - object, block, and file - and serve them to demanding GPU workloads. This is a senior, hands-on role with clear leadership scope: you'll set the technical direction for the storage area, mentor other engineers, and be the person the company looks to when storage decisions have to be made.

Why Verda

Cash + equity compensation along with various fringe benefits (e.g., healthcare, lunch, wellbeing, etc.).
Profitable operations with rapid, sustained growth.
31 nationalities, with 6 different ones on the management team.
An opportunity to make a clear impact and work alongside world-class engineers, researchers, and partners across the global AI ecosystem.

Practicalities

Location: Helsinki
Hybrid mode: This role requires presence in our Helsinki office 3 days per week. Don't worry - you won't be alone or hangry there. (We keep a stocked fridge.)
Employment type: Full-time and permanent

Your responsibilities

Lead the design, deployment, and operation of large-scale Ceph clusters used for production workloads.
Set the technical direction for the storage area - architecture, standards, operational practices, and roadmap - and align it with the platform's broader goals.
Bring a production-grade managed Object Storage product to market: access keys, bucket and object management, performance and durability SLOs, monitoring, and alerting.
Scale and maintain storage systems operating at petabytes to tens (or hundreds) of petabytes.
Operate Ceph across multiple interfaces and use cases: CephFS for shared file system workloads, RBD for block storage, and RADOS Gateway for object storage.
Mentor and grow the engineers around you - through code review, design review, pairing, and shared on-call - and raise the bar of how the team operates storage.
Improve observability, automation, and operational tooling around storage infrastructure.
Troubleshoot complex performance and reliability issues in distributed storage environments.
Work closely with compute and networking teams to integrate storage with large-scale GPU clusters.
Drive capacity planning, upgrades, and lifecycle management of storage infrastructure.
Participate in production operations and incident response, lead post-mortems, and turn lessons into durable improvements.

Your key competencies

Deep expertise with Ceph: deployment and cluster architecture, day-to-day operations, troubleshooting, and performance tuning.
Track record of operating Ceph at multi-petabyte scale in production.
Demonstrated technical leadership - owning a storage or infrastructure area end-to-end, setting direction, and being accountable for outcomes.
Experience mentoring engineers.
Strong knowledge of Linux systems and internals.
Solid networking knowledge and experience debugging distributed system issues.
Experience running and maintaining mission-critical production infrastructure, including on-call.
Experience building automation for infrastructure operations.
Strong collaboration and communication skills, with the ability to represent the storage area in cross-team and customer-facing conversations.

Nice to have

Experience operating RADOS Gateway (RGW) as a customer-facing managed Object Storage product.
Experience with CephFS in production environments.
Ansible knowledge.
Familiarity with monitoring and observability tools used in large infrastructure environments (Prometheus, Grafana, Loki, or similar).
Background operating storage at a cloud provider, or at a large website/SaaS with media or image storage at the 10-100 PB scale.
Familiarity with GPU/AI workloads and the storage patterns they generate.

What's next

We're building fast and this role needs the right person behind it. There's no artificial deadline, but when we find who we're looking for, we move. If this sounds like your next move, apply now.

Please submit your application through our Careers page. We don't accept applications sent by email.

Responsibilities

Lead the design, deployment, and operation of large-scale Ceph clusters to support AI cloud workloads at petabyte scale. Set the technical direction for storage architecture and mentor other engineers to improve operational standards.