Machine Learning Engineer for Generative AI Solution Architect

at  Capgemini

Texas, Texas, USA -

Start DateExpiry DateSalaryPosted OnExperienceSkillsTelecommuteSponsor Visa
Immediate31 Jan, 2025Not Specified02 Nov, 20245 year(s) or abovePython,Etl,Data Engineering,Java,Computer Science,Azure,Scala,Data WarehouseNoNo
Add to Wishlist Apply All Jobs
Required Visa Status:
CitizenGC
US CitizenStudent Visa
H1BCPT
OPTH4 Spouse of H1B
GC Green Card
Employment Type:
Full TimePart Time
PermanentIndependent - 1099
Contract – W2C2H Independent
C2H W2Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:

JOB DESCRIPTION

As a Senior GenAI Solutions Machine Learning Engineer, you will lead the development and implementation of advanced data engineering solutions to support the deployment and optimization of Generative AI models. Your role will involve leveraging your extensive experience to design robust, scalable, and innovative data architectures that align with the unique requirements of General Artificial Intelligence (GenAI) applications.

REQUIRED SKILLS

  • Bachelor’s degree in computer science, data engineering, or a related field with 5+ years experience (Master’s preferred).
  • Proven experience in data engineering, MLOps, ETL, and database management, QL and data manipulation languages.
  • Azure, Python, Java, or Scala.
  • data warehousing platforms (e.g., Databricks, Amazon Redshift, Snowflake) and big data technologies (e.g., Hadoop, Spark).
  • highly scalable Data stores, Data Lake, Data Warehouse, Lakehouse, and unstructured datasets

Responsibilities:

  • The Machine Learning Engineer will be responsible for architectural design and planning, advanced data pipelines, model integration and optimization, scalability, performance and research and innovation supporting production generative AI systems.
  • Production level ML workloads for customers using Databricks platform, including end-to-end ML pipelines, training/inference optimization, integration with cloud-native services and MLOps
  • Build and maintain data engineering solutions on cloud platforms using hyperscaler services.
  • Develop production-grade cloud (AWS/Azure/GCP) infrastructure that supports the deployment of ML applications, including drift monitoring
  • Design, develop, and maintain data pipelines to efficiently collect, process, and load data from various sources into data storage systems (e.g., data warehouses, data lakes).
  • Understanding of indexing and vectorization to use with Generative AI prompt engineering.
  • Strong understanding of fundamental data science concepts in NLP, including selection and understanding of embedding models.
  • Use hyperscaler technologies to support data needs for expansion of Machine Learning/Data Science capabilities including generative AI.
  • Design, develop, and implement scalable data pipelines and ETL/ELT processes using Python, PySpark and API integrations.


REQUIREMENT SUMMARY

Min:5.0Max:10.0 year(s)

Information Technology/IT

IT Software - Other

Software Engineering

Graduate

Computer science data engineering or a related field with 5 years experience (master's preferred

Proficient

1

Texas, USA