Data Engineer
at Cranleigh STEM
Cambridge CB2 1NT, , United Kingdom -
Start Date | Expiry Date | Salary | Posted On | Experience | Skills | Telecommute | Sponsor Visa |
---|---|---|---|---|---|---|---|
Immediate | 21 Dec, 2024 | GBP 80000 Annual | 23 Sep, 2024 | N/A | Version Control,Relational Databases,Git,Sql,Python,Data Processing,R,Cloud Security,Process Automation,Programming Languages | No | No |
Required Visa Status:
Citizen | GC |
US Citizen | Student Visa |
H1B | CPT |
OPT | H4 Spouse of H1B |
GC Green Card |
Employment Type:
Full Time | Part Time |
Permanent | Independent - 1099 |
Contract – W2 | C2H Independent |
C2H W2 | Contract – Corp 2 Corp |
Contract to Hire – Corp 2 Corp |
Description:
SKILLS & QUALIFICATIONS:
- Bachelor’s degree in a relevant technical field or equivalent experience
- Proficiency in Python, R, or other programming languages for data processing
- Experience managing and configuring cloud infrastructure and resources
- Knowledge of cloud security best practices
- Experience integrating APIs for process automation
- Familiarity with containerization tools (Docker, Singularity)
- Experience with schedulers such as AWS Batch, GCP Batch, or Slurm
- Proficiency in Git and version control
- Ability to critically assess data-handling practices in a commercial R&D setting
- Understanding of data management best practices
DESIRABLE SKILLS:
- Associate-level cloud certification
- Knowledge of SQL and relational databases
- Understanding of UK GDPR requirements for processing human genomics data
Responsibilities:
ROLE OVERVIEW:
We are seeking a skilled Data Engineer to join our Data Team and manage our data infrastructure. In this role, you will play a crucial part in streamlining data flow across the organization by integrating data between teams. You will be responsible for overseeing data flow management and maintaining cloud infrastructure to support our sequencing projects and downstream data analysis.
At Origin Sciences, we utilize advanced sequencing technologies to analyze mucus-based biospecimens, generating large volumes of data. The Data Engineer will manage the cloud infrastructure that supports these sequencing projects, enabling both clinical analytics and BI reporting.
MAIN DUTIES & RESPONSIBILITIES:
- Manage and optimize cloud resources to handle large-scale sequencing data
- Implement infrastructure improvements to enhance usability, performance, and security
- Automate data collection processes from laboratory instruments
- Use ETL processes to centralize organizational data
- Provide ad-hoc engineering support to laboratory and clinical teams
REQUIREMENT SUMMARY
Min:N/AMax:5.0 year(s)
Information Technology/IT
IT Software - DBA / Datawarehousing
Software Engineering
Graduate
A relevant technical field or equivalent experience
Proficient
1
Cambridge CB2 1NT, United Kingdom