Senior Software Engineer, Distributed Data Systems at Clera-AI
San Francisco, California, United States -
Full Time


Start Date

Immediate

Expiry Date

04 Sep, 26

Salary

0.0

Posted On

06 Jun, 26

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Distributed Systems, Apache Spark, Hadoop, Haskell, TypeScript, OLAP Lakehouse, Query Optimization, Join Optimization, Database Optimization, Algorithms, Data Structures, Backend Engineering, Infrastructure Engineering, Platform Engineering, Data Systems Design, Scalable Infrastructure

Industry

technology;Information and Internet

Description
About the Role Join a startup building an agentic data lakehouse platform. As a Senior Software Engineer, Distributed Data Systems, you'll work on a greenfield project to build scalable data infrastructure that transforms enterprise data into actionable insights at scale. What You'll Do Work on a greenfield OLAP lakehouse project to build the data platform for the agentic era Design and implement distributed data system components, with a focus on join optimization and query performance Collaborate across infrastructure, services, and frontend teams to deliver data platform features Ship reliable, scalable data infrastructure that supports enterprise analytics What We're Looking For 4+ years of experience as a data systems, backend, infrastructure, or platform engineer Experience with big data systems (Apache Spark, Hadoop) Strong background in distributed systems Comfort diving into any part of the system—infrastructure, services, or frontend Proficiency in Haskell and/or TypeScript Track record of shipping products from zero to one Experience with databases and database optimization Strong data focus and understanding of data-driven systems Experience with OLAP lakehouse/data lakehouse architecture and query optimization Strong foundation in algorithms, data structures, and their real-world applications Location New York, NY, United States
Responsibilities
Design and implement distributed data system components for a greenfield OLAP lakehouse platform. Collaborate with cross-functional teams to ship reliable, scalable infrastructure for enterprise analytics.
Loading...