Data Engineer at Apex Operations
Chantilly, Virginia, United States
Full Time


Start Date

Immediate

Expiry Date

05 Sep, 26

Salary

0.0

Posted On

07 Jun, 26

Experience

2 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Python, Java, SQL, Big Data, ETL, Apache Airflow, RESTful APIs, Linux, Data Pipelining, AI/ML, BI Dashboards, Data Profiling, Data Cleansing, Data Transformation, Spring, JDBC

Industry

IT Services and IT Consulting

Description
Closure Technologies is seeking a Data Engineer who will leverage their development skills and experience to support the successful design, ingest, cleansing, transformation, loading, and display of significant amounts of data.

Clearance Requirement: TS/SCI with Polygraph

Key Responsibilities:
Designing, implementing, and optimizing data extraction, cleansing, transformation, loading, replication/distribution, and large-scale ingest systems in a Big Data environment
Optimizing all stages of the data lifecycle, from initial planning, to ingest, through final display and beyond
Developing custom solutions/code to ingest and exploit new and existing data sources
Developing data profiling, deduping logic, and matching logic for analysis
Organizing and maintaining Data Layer documentation so that others are able to understand and use it
Working closely with data scientists to craft data pipelines that serve the development of modern AI/ML workflows
Collaborating with teammates, other service providers, vendors, and users to develop new and more efficient methods
Effectively articulating the risks and constraints associated with software solutions, based on the environment

Required Qualifications:
2+ years of relevant software development/programming experience
Demonstrated data analysis, parsing, and programming language experience (e.g. Python, Java) coupled with significant SQL/database experience
Experience with the full data lifecycle, from ingest through display, in a Big Data environment
Hands-on experience with Java-related technologies, such as JDK, J2EE, EJB, JDBC, and/or Spring, and experience with RESTful APIs
Experience with data pipelining systems (e.g. Apache Airflow) and developing/performing ETL tasks in a Linux environment

Preferred Qualifications:
Experience deploying systems that leverage AI/ML technology
Experience publishing results in BI dashboards
Responsibilities
Design and optimize large-scale data ingest, transformation, and loading systems within a Big Data environment. Collaborate with data scientists to build pipelines supporting AI/ML workflows and maintain comprehensive data layer documentation.