Data Engineer
at Edelman
London, England, United Kingdom
Start Date | Expiry Date | Salary | Posted On | Experience | Skills | Telecommute | Sponsor Visa |
---|---|---|---|---|---|---|---|
Immediate | 22 Apr, 2025 | Not Specified | 23 Jan, 2025 | 3 year(s) or above | Data Quality, JSON, Communication Skills, Optimization, English, Continuous Integration, Continuous Delivery, ETL, AWS, Coding Experience, Search Engines | No | No |
Description:
Edelman is a voice synonymous with trust, reimagining a future where the currency of communication is action. Our culture thrives on three promises: boldness is possibility, empathy is progress, and curiosity is momentum.
At Edelman, we understand diversity, equity, inclusion and belonging (DEIB) transform our colleagues, our company, our clients, and our communities. We are in relentless pursuit of an equitable and inspiring workplace that is respectful of all, reflects and represents the world in which we live, and fosters trust, collaboration and belonging.
We are currently seeking a Data Engineer with 3-5 years’ experience. The ideal candidate will be able to work independently within an Agile environment and will have experience with cloud infrastructure and tools such as Apache Airflow, Databricks, and Snowflake. Familiarity with real-time data processing and AI implementation is advantageous.
QUALIFICATIONS:
- Minimum of 3 years’ experience deploying enterprise-level, scalable data engineering solutions.
- Strong examples of independently developed end-to-end data pipelines, from problem formulation and raw data through to implementation, optimization, and results.
- Proven track record of building and managing scalable cloud-based infrastructure on AWS (incl. S3, DynamoDB, EMR).
- Proven track record of implementing and managing the AI model lifecycle in a production environment.
- Experience using Apache Airflow (or equivalent), Snowflake, and Lucene-based search engines.
- Experience with Databricks (Delta format, Unity Catalog).
- Advanced SQL and Python knowledge with associated coding experience.
- Strong experience with DevOps practices for continuous integration and continuous delivery (CI/CD).
- Experience wrangling structured & unstructured file formats (Parquet, CSV, JSON).
- Understanding and implementation of best practices within ETL and ELT processes.
- Data Quality best practice implementation using Great Expectations.
- Real-time data processing experience using Apache Kafka (or equivalent) will be advantageous.
- Able to work independently with minimal supervision.
- Takes initiative and is action-focused.
- Willing to mentor and share knowledge with junior team members.
- Collaborative with a strong ability to work in cross-functional teams.
- Excellent communication skills with the ability to communicate with stakeholders across varying interest groups.
- Fluency in spoken and written English.
We are dedicated to building a diverse, inclusive, and authentic workplace, so if you’re excited about this role but your experience doesn’t perfectly align with every qualification, we encourage you to apply anyway. You may be just the right candidate for this or other roles.
Responsibilities:
- Design, build, and maintain scalable and robust data pipelines to support analytics and machine learning models, ensuring high data quality and reliability for both batch & real-time use cases.
- Design, maintain, and optimize data models and data structures in tooling such as Snowflake and Databricks.
- Leverage Databricks and Cloud-native solutions for big data processing, ensuring efficient management of Spark jobs and seamless integration with other data services.
- Utilize PySpark and/or Ray to build and scale distributed computing tasks, enhancing the performance of machine learning model training and inference processes.
- Monitor, troubleshoot, and resolve issues within data pipelines and infrastructure, implementing best practices for data engineering and continuous improvement.
- Diagrammatically document data engineering workflows.
- Collaborate with other Data Engineers, Product Owners, Software Developers, and Machine Learning Engineers to implement new product features by understanding their needs and delivering on time.
REQUIREMENT SUMMARY
Min: 3.0, Max: 5.0 year(s)
Information Technology/IT
IT Software - DBA / Datawarehousing
Software Engineering
Graduate
Proficient
1
London, United Kingdom