Sr Data Engineer at Bosch Group
Bengaluru, karnataka, India -
Full Time


Start Date

Immediate

Expiry Date

04 Jun, 26

Salary

0.0

Posted On

06 Mar, 26

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Python, Java, Docker, Docker Compose, Kubernetes, Knowledge Graphs, RDF, SPARQL, XSLT, Embeddings Pipelines, Azure Storage, Apache Camel, Git, Data Ingestion, Data Transformation

Industry

Software Development

Description
Company Description Bosch Global Software Technologies Private Limited is a 100% owned subsidiary of Robert Bosch GmbH, one of the world's leading global supplier of technology and services, offering end-to-end Engineering, IT and Business Solutions. With over 27,000+ associates, it’s the largest software development center of Bosch, outside Germany, indicating that it is the Technology Powerhouse of Bosch in India with a global footprint and presence in the US, Europe and the Asia Pacific region. Job Description Experience Summary 5–10 years of Data Engineer specialized in building document and knowledge-oriented data pipelines for regulatory/compliance domains, with strong capabilities in structured transformations, knowledge graphs, and containerized platform integration. Core Responsibilities / Focus Build and operate data ingestion and transformation pipelines for legal/regulatory content Normalize and transform heterogeneous source formats (e.g., XML/HTML/structured exports) using tools such as XSLT Implement pipelines for embeddings generation, indexing, and enrichment for downstream AI/RAG systems Design and manage RDF-based knowledge representations and SPARQL-accessible datasets Integrate storage and processing components across containerized/cloud environments Support event-driven or integration-heavy workflows (e.g., via Apache Camel, message brokers) Ensure reproducibility, maintainability, and operational handover of data pipelines Core Skills (Must-Have) Python/ Java Docker / Docker Compose Kubernetes Knowledge Graphs (RDF) SPARQL XSLT Embeddings pipelines / vector preparation Azure Storage (or equivalent cloud storage services) Apache Camel Git Preferred / Nice-to-Have Docling (or similar Document conversion) CloudEvents Kafka (or other message brokers) Event-based systems / event-driven architecture Dev Containers GitOps Documentation practices Domain Advantage Experience processing legal/regulatory source documents and preserving semantic structure / provenance Familiarity with content domains such as EU regulation, privacy, ESG, and compliance frameworks Qualifications Educational qualification: BE/B.Tech or Equivalent Degree Experience : 5-10 Years Mandatory/requires Skills : Strong hands-on expertise in Python/Java, Docker / Docker Compose, Kubernetes, Knowledge Graphs (RDF),SPARQL,XSLT,Embeddings pipelines / vector preparation, Azure Storage (or equivalent cloud storage services), Apache Camel,Git Preferred Skills : Legal Entity: Bosch Global Software Technologies Private Limited
Responsibilities
The role focuses on building and operating data ingestion and transformation pipelines, particularly for legal and regulatory content, including normalizing heterogeneous source formats like XML/HTML. Responsibilities also involve implementing pipelines for embeddings generation, indexing, and designing RDF-based knowledge representations accessible via SPARQL.
Loading...