Data Engineer with Data Formats
at Capgemini
London, England, United Kingdom
Start Date | Expiry Date | Salary | Posted On | Experience | Skills | Telecommute | Sponsor Visa |
---|---|---|---|---|---|---|---|
Immediate | 18 Jan, 2025 | Not Specified | 19 Oct, 2024 | 8 year(s) or above | Strategy, Indexing, Flat Files, Unstructured Data, IT, Data Processing, SQL, Technology, Apache Spark, Python, Solace, Automation, Data Models, Docker, Kafka, Cloud, Git, Design, Data Infrastructure, Jenkins, JavaScript, JSON | No | No |
Description:
Capgemini is a global business and technology transformation partner, helping organizations to accelerate their dual transition to a digital and sustainable world while creating tangible impact for enterprises and society. It is a responsible and diverse group of 350,000 team members in more than 50 countries. With its strong over 55-year heritage, Capgemini is trusted by its clients to unlock the value of technology to address the entire breadth of their business needs. It delivers end-to-end services and solutions leveraging strengths from strategy and design to engineering, all fueled by its market-leading capabilities in AI, cloud, and data, combined with its deep industry expertise and partner ecosystem. The Group reported 2023 global revenues of €22.5 billion.
Get The Future You Want | www.capgemini.co
Responsibilities:
We are seeking a highly skilled and experienced Data Engineer to join our dynamic team. The ideal candidate will have a deep understanding of data engineering principles, data technologies, and a proven track record of designing and building complex data pipelines. This role requires strong expertise in SQL, various data formats, Python, and JavaScript to support our data-driven decision-making processes and enhance our data infrastructure.
- Architect and maintain scalable data pipelines using various programming technologies.
- Use SQL to query, transform, and process data across relational and NoSQL databases.
- Integrate data from APIs, flat files, and streaming sources, ensuring consistency and quality.
- Implement real-time data processing using Kafka or Solace.
- Manage data storage in systems and warehouses, optimizing for performance.
- Design data models and apply techniques like partitioning and indexing for efficiency.
- Handle multiple data formats (CSV, JSON, Parquet) and manage unstructured data.
- Utilize Python and JavaScript (Node.js) for data processing, automation, and ETL development.
- Leverage Microsoft technologies, Apache Spark, and Airflow for distributed computing.
- Implement DevOps tools (Jenkins, Git, Docker) for CI/CD and monitor pipeline performance.
Requirement Summary:
Min: 8.0, Max: 13.0 year(s)
Information Technology/IT
IT Software - DBA / Datawarehousing
Software Engineering
Graduate
Proficient
1
London, United Kingdom