Bioinformatics Data Scientist (m/f/d) - Virtual Patient Engine (VPE) at BioMed X GmbH

Heidelberg, , Germany -

Full Time

Start Date

Immediate

Expiry Date

19 Sep, 25

Salary

0.0

Posted On

19 Jun, 25

Experience

0 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

Skills

Life Sciences, Visualization, Data Manipulation, Documentation, Interpersonal Skills, Python, Sql, Processing, Version Control, Git, Workflow Management Systems, Unit Testing, Continuous Integration

Industry

Information Technology/IT

Description

THE POSITION

We are looking for a talented and curious Bioinformatics Data Scientist to join our team, bringing fresh perspectives and advanced expertise to fuel innovative thinking and scientific excellence. This position is for you if you are passionate about transforming biomedical data into actionable knowledge within a collaborative environment.

The ideal candidate will have:

PhD or equivalent degree in bioinformatics, computational biology, or computer science.
You will leverage your advanced computational skills and deep biological knowledge to tackle complex challenges in digital twin technologies.
You will be part of a dynamic team driving discovery through cutting-edge data science, including the development and application of artificial intelligence, foundation models, and agentic AI systems.

REQUIRED SKILLS

Strong background in Python for statistical analysis, data manipulation, and visualization.
Experience with developing and implementing scalable, reproducible pipelines for processing and analyzing high-throughput sequencing data (e.g., RNA-seq, scRNA-seq, scATAC-seq, etc.) using workflow management systems such as Nextflow and Snakemake.
Hands-on experience in handling, processing, integrating, and analyzing large-scale, heterogeneous datasets—including multi-omics data and real-world data (RWD) such as electronic health records (EHRs), and clinical trial data—in the context of biomedical research.
Familiarity with biological network analysis, pathway enrichment tools, and knowledge graphs.
Experience in SQL and/or NoSQL databases for managing and querying large-scale datasets.
Working knowledge of foundation models in life sciences (e.g., ESM2, AlphaFold, Cell2Sentence, Geneformer) and practical experience in incorporating these models into computational workflows.
Proficiency in using version control with Git, Docker containers and Anaconda; in using code management best practices such as unit testing, linting, and documentation; orchestrating machine learning experiments using cloud computing environments; and using continuous integration and deployment (CI/CD) frameworks.
Ability to work as part of an interdisciplinary team but also independently.
Strong problem-solving skills and scientific curiosity.
Excellent communication, organizational, and interpersonal skills.

ADDITIONAL SKILLS, GOOD TO HAVE

Experience in applying artificial intelligence methods to biomedical data using AI/ML libraries such as PyTorch and PyTorch Lightning.
Experience in building serverless data pipelines with AWS EMR, and integrating with other AWS services (e.g. S3).
Experience working with or developing agentic AI systems is a plus.

Responsibilities

Please refer the Job description for details