Data Engineer (AI/ML) at Capgemini
New York, NY 10003, USA -
Full Time


Start Date

Immediate

Expiry Date

05 Dec, 25

Salary

145000.0

Posted On

06 Sep, 25

Experience

7 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Good communication skills

Industry

Information Technology/IT

Description

Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues around the world, and where you’ll be able to reimagine what’s possible. Join us and help the world’s leading organizations unlock the value of technology and build a more sustainable, more inclusive world.

JOB DESCRIPTION

  • We are seeking a seasoned Data Engineer with 7+ years of experience in developing and deploying machine learning solutions. The ideal candidate should have hands-on expertise in MLOps (at least two end-to-end production deployments), solution architecture, and experience with one major cloud platform (AWS,
  • Azure, or GCP) is an added advantage. Strong skills in Python, SQL, PySpark, Generative AI (GenAI), and NLP are required.Key Responsibilities
  • Develop and deploy scalable ML/AI solutions, including MLOps pipelines for CI/CD, model monitoring, and governance.
  • Lead the design and development of GenAI and NLP solutions for applications such as text summarization, conversational AI, and entity recognition.
  • Build and optimize data pipelines using PySpark and SQL for large-scale data processing.scalable ML development and deployment
  • Collaborate with stakeholders to align AI initiatives with business goals and mentor junior team members.Qualifications & Skills
  • Bachelors degree with exp in Data, AI, ML
  • Programming: Python (Pandas, NumPy, PyTorch, TensorFlow), SQL.
  • MLOps: Experience with tools like MLflow, Kubeflow, Docker, Kubernetes, and CI/CD pipelines.
  • Generative AI & NLP: Expertise in transformer models (e.g., GPT, BERT), Hugging Face, and LangChain.
  • Data Engineering: Proficient in PySpark and distributed data processing.
  • Cloud Platforms: Proven experience with one major cloud platform (AWS, Azure, or GCP) is an added advantage”
Responsibilities

Please refer the Job description for details

Loading...