AWS Pyspark MDM Developer

at  Saama Technologies Inc

San Francisco, CA 94015, USA -

Start DateExpiry DateSalaryPosted OnExperienceSkillsTelecommuteSponsor Visa
Immediate15 Feb, 2025Not Specified19 Nov, 2024N/AOffice Equipment,Recruiting,Life Sciences,Security,Scalability,Cpg,Spark,Glue,Discrimination,Hiring,Data Integration,Genetics,Communication Skills,Color,Optimization,Big Data,Athena,Training,Orchestration,Data Transformation,Aws,Data GovernanceNoNo
Add to Wishlist Apply All Jobs
Required Visa Status:
CitizenGC
US CitizenStudent Visa
H1BCPT
OPTH4 Spouse of H1B
GC Green Card
Employment Type:
Full TimePart Time
PermanentIndependent - 1099
Contract – W2C2H Independent
C2H W2Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:


  • Does solving complex business problems and real world challenges interest you? Do you enjoy seeing the impact your contributions make on a daily basis? Are you passionate about using data analytics to provide game changing solutions to the Global 2000 clients? Do you thrive in a dynamic work environment that constantly pushes you to be the best you can be and more? Are you ready to work with smart colleagues who drive for excellence in everything they do? If you possess a solutions mindset, strong architecting skills, and commitment to be part of a tremendous journey, come join our growing, global team. See what Saama can do for your career and for your journey.

Saama Analytics has been on the forefront of data innovation for the last two decades and continues to offer cutting-edge data analytics solutions powered with big data, cloud, and AI/ML aptitudes for its customers in Life Sciences, Insurance, CPG, and other industries. Saama is committed to finding the best people because the innovations and discoveries that enabled together leads to better technologies, better treatments, and a better future.Responsibilities:

  • Lead the design, implementation, and optimization of scalable data pipelines and architectures utilizing AWS Glue, Elastic MapReduce (EMR), Lambda, Redshift, Athena, DynamoDB, OpenSearch, and S3.
  • Use Spark on AWS for data transformation and processing across large datasets.
  • Develop and maintain efficient data workflows with SQS for task queueing and orchestration.
  • Integrate, transform, and manage data using Mulesoft for seamless data integration.
  • Ensure high-performance data storage, retrieval, and analytics across Redshift, DynamoDB, and Athena.
  • Oversee data consistency, integrity, and compliance through IQVIA MDM solutions.
  • Apply best practices in data governance, security, and scalability within a collaborative and cross-functional team environment.

Qualifications:

  • Proven expertise in AWS data engineering, specifically with Glue, EMR, Lambda, Redshift, Athena, DynamoDB, OpenSearch, and S3.
  • Some experience with data integration (Mulesoft, Talend)
  • Working knowledge of master data management.
  • Demonstrated ability to lead technical projects and mentor data engineering teams.
  • Exceptional analytical and communication skills.WORK ENVIRONMENTThis job operates in a professional remote office environment. This role routinely uses standard office equipment, including but not limited to, computers, phones, and photocopiers.PHYSICAL DEMANDSThis position requires the frequent and repetitive use of a computer, keyboard, and mouse. Hand and finger dexterity is required.OTHER DUTIESPlease note that this job description is not designed to cover or contain a comprehensive listing of activities, duties, or responsibilities required of the employee for this job. Duties, responsibilities, and activities may change at any time, with or without notice.EEOSaama Technologies, Inc. provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws.This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training.

Responsibilities:

  • Lead the design, implementation, and optimization of scalable data pipelines and architectures utilizing AWS Glue, Elastic MapReduce (EMR), Lambda, Redshift, Athena, DynamoDB, OpenSearch, and S3.
  • Use Spark on AWS for data transformation and processing across large datasets.
  • Develop and maintain efficient data workflows with SQS for task queueing and orchestration.
  • Integrate, transform, and manage data using Mulesoft for seamless data integration.
  • Ensure high-performance data storage, retrieval, and analytics across Redshift, DynamoDB, and Athena.
  • Oversee data consistency, integrity, and compliance through IQVIA MDM solutions.
  • Apply best practices in data governance, security, and scalability within a collaborative and cross-functional team environment


REQUIREMENT SUMMARY

Min:N/AMax:5.0 year(s)

Information Technology/IT

IT Software - Other

Software Engineering

Graduate

Proficient

1

San Francisco, CA 94015, USA