Data Engineer

at  Cranleigh STEM

Cambridge CB2 1NT, , United Kingdom -

Start DateExpiry DateSalaryPosted OnExperienceSkillsTelecommuteSponsor Visa
Immediate21 Dec, 2024GBP 80000 Annual23 Sep, 2024N/AVersion Control,Relational Databases,Git,Sql,Python,Data Processing,R,Cloud Security,Process Automation,Programming LanguagesNoNo
Add to Wishlist Apply All Jobs
Required Visa Status:
CitizenGC
US CitizenStudent Visa
H1BCPT
OPTH4 Spouse of H1B
GC Green Card
Employment Type:
Full TimePart Time
PermanentIndependent - 1099
Contract – W2C2H Independent
C2H W2Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:

SKILLS & QUALIFICATIONS:

  • Bachelor’s degree in a relevant technical field or equivalent experience
  • Proficiency in Python, R, or other programming languages for data processing
  • Experience managing and configuring cloud infrastructure and resources
  • Knowledge of cloud security best practices
  • Experience integrating APIs for process automation
  • Familiarity with containerization tools (Docker, Singularity)
  • Experience with schedulers such as AWS Batch, GCP Batch, or Slurm
  • Proficiency in Git and version control
  • Ability to critically assess data-handling practices in a commercial R&D setting
  • Understanding of data management best practices

DESIRABLE SKILLS:

  • Associate-level cloud certification
  • Knowledge of SQL and relational databases
  • Understanding of UK GDPR requirements for processing human genomics data

Responsibilities:

ROLE OVERVIEW:

We are seeking a skilled Data Engineer to join our Data Team and manage our data infrastructure. In this role, you will play a crucial part in streamlining data flow across the organization by integrating data between teams. You will be responsible for overseeing data flow management and maintaining cloud infrastructure to support our sequencing projects and downstream data analysis.
At Origin Sciences, we utilize advanced sequencing technologies to analyze mucus-based biospecimens, generating large volumes of data. The Data Engineer will manage the cloud infrastructure that supports these sequencing projects, enabling both clinical analytics and BI reporting.

MAIN DUTIES & RESPONSIBILITIES:

  • Manage and optimize cloud resources to handle large-scale sequencing data
  • Implement infrastructure improvements to enhance usability, performance, and security
  • Automate data collection processes from laboratory instruments
  • Use ETL processes to centralize organizational data
  • Provide ad-hoc engineering support to laboratory and clinical teams


REQUIREMENT SUMMARY

Min:N/AMax:5.0 year(s)

Information Technology/IT

IT Software - DBA / Datawarehousing

Software Engineering

Graduate

A relevant technical field or equivalent experience

Proficient

1

Cambridge CB2 1NT, United Kingdom