Senior Data Engineer at Genentech
New York, NY 10001, USA
Full Time


Start Date

Immediate

Expiry Date

19 Sep, 25

Salary

262200.0

Posted On

19 Jun, 25

Experience

7 years or more

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Good communication skills

Industry

Information Technology/IT

Description

THE OPPORTUNITY

At Genentech and Roche, we’re at the forefront of a revolutionary transformation in drug discovery powered by AI and machine learning. Our “lab in the loop” strategy processes massive quantities of experimental data to train AI models that accelerate the discovery of new medicines. To enable this vision, we’re seeking an exceptional Senior Data Engineer to join the team building and maintaining our next-generation Therapeutic Molecule Registration (TMR) platform - a foundational component of our AI-driven drug discovery infrastructure, Lab-in-the-Loop (https://www.youtube.com/watch?v=cN1PxxQWoEc).
This platform will serve as the central nervous system for managing and integrating molecular data across our global research organization, handling hundreds of billions of records and enabling unprecedented scale in virtual molecule design and testing. As the volume of AI-generated molecular designs grows exponentially, our TMR platform must evolve into a high-performance, cloud-native system capable of supporting rapid iteration cycles between computational design and experimental validation.
You will be instrumental in consolidating our molecule registration systems into a single, harmonized environment, unlocking the full potential of our data and accelerating the development of life-changing therapies. The ideal candidate has a proven record of standing up, migrating, and scaling databases, along with experience in chemical and/or biological registration systems. You will implement scalable solutions for molecular data management and contribute to the architecture of our cloud-native platform.
You will work closely with Genentech Computational Sciences (gCS) colleagues, including our machine learning for drug development team; Genentech Research & Early Development (gRED) Drug Discovery teams, including the Antibody Engineering division; and other teams across the Roche family of companies to identify, strategize, and productionize high-impact applications from across the drug discovery and development pipeline. Genentech provides a dynamic and challenging environment for cutting-edge, multidisciplinary research in AI and drug discovery, including access to rich sources of data, close links to top academic institutions around the world, and internal Genentech and Roche partners and research units.

Responsibilities
  • Design and implement features of our TMR data model
  • Oversee cloud data migration to TMR and production deployment
  • Contribute to technical design discussions and architecture decisions
  • Write high-quality, testable code for chemical registration workflows
  • Support and mentor junior team members
  • Collaborate with scientists and other engineers to implement business requirements