Data Engineer (m/f/d) | Legal AI Tech Start-up | Full Remote at Noxtua AG
10115 Berlin, , Germany -
Full Time


Start Date

Immediate

Expiry Date

04 Dec, 25

Salary

0.0

Posted On

06 Sep, 25

Experience

0 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Data Processing, Data Engineering, Benchmarking, Databases

Industry

Legal Services

Description

REQUIREMENTS:

  • Residence & Work Permit: Eligible to work in Germany or within the EU.
  • Language: English proficiency at C2 level.
  • Experience: in AI development or data engineering with successfully deployed projects
  • Data: Expertise in data processing, filtering, and augmentation
  • Databases: Expertise in vector databases, data embedding, benchmarking and management
  • Programming: Strong Python skills and experience with AI pipelines

ABOUT US

Noxtua is Europe’s leading sovereign Legal AI. The legally compliant and competent AI helps legal professionals to research legal issues and review, and draft legal documents. The GDPR-compliant Legal AI meets the high professional and data protection requirements for lawyers (§ 203 Penal Code, § 43e Federal Lawyers’ Act) and is certified according to ISO 27001, 9001, 27018, 27017, 42001, and BSI C5. The product version Beck-Noxtua is based on the exclusive data of Germany’s leading legal publisher C.H.Beck and Germany’s largest business law firm CMS.
Emerging from a 2017 research project by Dr. Leif-Nissen Lundbæk and Professor Dr. Michael Huth at Oxford University and Imperial College London, the Berlin-based Legal-Tech company, formerly known as Xayn, boasts extensive experience in developing highly efficient GDPR-compliant AI solutions. Strategic partners, including Germany’s leading legal publisher C.H.Beck, the High-Performance Computing specialist Northern Data, Germany’s largest business law firm and co-initiator of the Legal AI Noxtua CMS, as well as the world’s largest law firm Dentons, have invested a total of € 80.7 million in the German startup as part of its Series B funding round.
We explicitly encourage women to apply, as they are currently underrepresented. Our goal is to build a diverse and inclusive work environment that values different perspectives. Of course, we welcome applications from all qualified individuals – regardless of gender, ethnic origin, religion, disability, age, or sexual identity.

How To Apply:

Incase you would like to apply to this job directly from the source, please click here

Responsibilities
  • Build and optimize ETL pipelines to process legal data from multiple jurisdictions, including chunking, embedding and ingesting legal data.
  • Develop and maintain data models that ensure consistency, scalability, and accuracy across diverse datasets and large amounts of data.
  • Coordinate data handover from different sources.
  • Implement metadata enrichment strategies to enhance searchability and usability of legal information.
  • Experiment with embedding strategies and training embedding models, including evaluation
  • Conduct database performance benchmarking and tuning to ensure efficient query execution and scalability.
  • Collaborate with product, AI, and legal domain experts to deliver high-quality, reliable data solutions.
Loading...