Data Quality Professional at Capgemini
Dubai, United Arab Emirates
Full Time


Start Date

Immediate

Expiry Date

14 Nov, 25

Salary

0.0

Posted On

14 Aug, 25

Experience

0 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Computer Science, Data Quality, Data Governance, Information Systems, Design, Cloud, Python, PCAP, Technology, Strategy, IT, Communication Skills, Data Processing

Industry

Information Technology/IT

Description

YOUR SKILLS AND EXPERIENCE

  • Bachelor’s or Master’s degree in Computer Science, Data Management, Information Systems, or a closely related field.
  • Good experience in data quality engineering, data management, or related data roles within complex technology environments.
  • Demonstrable expertise in Python, including the development of reusable data quality and validation libraries.
  • Extensive hands-on experience with Azure Databricks, including cloud-native data processing, ETL/ELT orchestration, and distributed computing concepts.
  • Proficiency with the Collibra Data Quality platform or equivalent data governance and stewardship tools.
  • Strong track record working in agile environments, participating in cross-functional teams, and adapting to rapidly evolving project requirements.
  • Excellent analytical, problem-solving, and communication skills, with the ability to convey complex technical topics to both technical and non-technical audiences.

Preferred Certifications (One or More)

  • Databricks Certified Data Engineer Associate or Professional
  • Microsoft Certified: Azure Data Engineer Associate
  • Python Institute Certifications (PCAP, PCPP)

Capgemini is a global business and technology transformation partner, helping organizations to accelerate their dual transition to a digital and sustainable world, while creating tangible impact for enterprises and society. It is a responsible and diverse group of 340,000 team members in more than 50 countries. With a strong heritage of over 55 years, Capgemini is trusted by its clients to unlock the value of technology to address the entire breadth of their business needs. It delivers end-to-end services and solutions leveraging strengths from strategy and design to engineering, all fueled by its market-leading capabilities in AI, cloud and data, combined with its deep industry expertise and partner ecosystem. The Group reported 2023 global revenues of €22.5 billion.

GetTheFutureYouWant | www.capgemini.com

Responsibilities

YOUR ROLE

  1. Development & Integration
  • Design, develop, and implement automated data quality checks using Python scripts and libraries or Collibra Data Quality components.
  • Integrate data quality validation logic within existing ETL/ELT pipelines operating on Azure Databricks, ensuring quality gates are consistently enforced across all data flows.
  • Develop and maintain reusable Python modules that perform anomaly detection, schema validation, and rule-based data quality checks to enable rapid scaling of quality coverage.
  • Collaborate with data engineering teams to embed continuous quality controls throughout the data ingestion, transformation, and consumption lifecycle.
  • Support the deployment and management of Collibra-based data quality solutions to automate governance workflows and stewardship activities.
  2. Data Quality Management
  • Define, measure, and rigorously enforce data quality metrics, thresholds, and Service Level Agreements (SLAs) tailored to business-critical datasets.
  • Utilize Collibra to manage and operationalize data governance workflows, maintain business glossaries, and delineate stewardship responsibilities.
  • Monitor the integrity of data pipelines for completeness, accuracy, timeliness, and consistency across distributed and cloud-native environments.
  • Conduct detailed root cause analyses for complex data quality issues, collaborating with engineers and domain experts to drive permanent remediation and prevention strategies.
  • Implement and continuously refine monitoring frameworks, utilizing dashboards and alerting systems (built using Python and Collibra integrations) for real-time visibility into key data quality indicators.