FBS - Elasticsearch Data Engineer (Medallion Architecture) at Capgemini Portugal
Hyderabad, Telangana, India
Full Time


Start Date

Immediate

Expiry Date

25 Aug, 26

Salary

0.0

Posted On

27 May, 26

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Elasticsearch, Spark, Python, DBT, Amazon EMR, Apache Airflow, AWS Lambda, Apache Iceberg, Amazon S3, Kubernetes, Java Spring Boot, IBM ACE, JSON, Jenkins, CloudWatch, Data Modeling

Industry

IT Services and IT Consulting

Description
Our Client is one of the United States' largest insurers, providing a wide range of insurance and financial services products with gross written premiums well over US$25 billion (P&C). They proudly serve more than 10 million U.S. households with more than 19 million individual policies across all 50 states through the efforts of over 48,000 exclusive and independent agents and nearly 18,500 employees. Finally, our Client is part of one of the largest insurance groups in the world.

The Data Engineer will architect, develop, and maintain scalable data pipelines within a medallion architecture (bronze, silver/base vault, business vault, and gold layers). This role is key to enabling high-quality, business-ready datasets by leveraging modern data engineering technologies and orchestration practices.

Role Responsibilities:
• Design, build, and manage end-to-end data pipelines across the medallion architecture, specifically the bronze, silver (base vault with DBT and orchestration tools, business vault), and gold layers.
• Ingest and process raw data using Spark and Amazon EMR for scalable, distributed computation.
• Develop and automate data transformations for the base vault using DBT (Data Build Tool) to standardize and model data efficiently.

Requirements:
• At least 5 years of experience as an Elasticsearch Data Engineer, with expert knowledge of the ELK (Elasticsearch, Logstash, and Kibana) stack
• Java Spring Boot
• IBM ACE programming
• BS in Computer Science, Data Engineering (Big Data, AWS certification), Data Modeling, or similar
• Full English fluency

Skills & Competencies
• Strong understanding of data modeling, governance, and best practices in modern data architectures.
• Excellent analytical, problem-solving, and communication skills.
Software / Tool Skills
• Elasticsearch - cluster optimization, query development, data modeling, performance tuning & administration (4-6 Years) (Must)
• Deep experience with Spark, Python, ETLs, and Amazon EMR (Must)
• Hands-on experience with DBT for data transformation and modeling.
• Apache Airflow, AWS Step Functions, or similar (Must)
• Expert knowledge of Amazon S3 and Apache Iceberg for data storage and management.
• Experience with Kubernetes for container orchestration.
• Experience with Dremio, Looker, or equivalent business view/semantic layer technologies
• AWS Cloud - AWS Lambda (Must), Step Functions, IAM, SNS, API Gateway, VPC, Transit Gateway - Intermediate (3-4 Years)
• JSON - Intermediate (4-6 Years)
• Jenkins - Data Pipeline - Intermediate (4-6 Years) (Plus)
• CloudWatch - Intermediate (4-6 Years) (Plus)

This position comes with a competitive compensation and benefits package:
• Competitive salary and performance-based bonuses
• Comprehensive benefits package
• Career development and training opportunities
• Flexible work arrangements (remote and/or office-based)
• Dynamic and inclusive work culture within a globally renowned group
• Private Health Insurance
• Pension Plan
• Paid Time Off
• Training & Development

About Capgemini
Capgemini is a global leader in partnering with companies to transform and manage their business by harnessing the power of technology. The Group is guided every day by its purpose of unleashing human energy through technology for an inclusive and sustainable future. It is a responsible and diverse organization of over 340,000 team members in more than 50 countries. With its strong 55-year heritage and deep industry expertise, Capgemini is trusted by its clients to address the entire breadth of their business needs, from strategy and design to operations, fueled by the fast-evolving and innovative world of cloud, data, AI, connectivity, software, digital engineering and platforms. The Group reported €22.5 billion in revenues in 2023.
Responsibilities
Architect and maintain scalable data pipelines within a medallion architecture across bronze, silver, and gold layers. Ingest and process raw data using Spark and Amazon EMR while automating transformations with DBT.