Senior Machine Learning Engineer
at Epam Systems
Desde casa, Cauca, Colombia -
Start Date | Expiry Date | Salary | Posted On | Experience | Skills | Telecommute | Sponsor Visa |
---|---|---|---|---|---|---|---|
Immediate | 29 Jan, 2025 | USD 200 Annual | 31 Oct, 2024 | N/A | Data Engineering,Cassandra,Production Experience,Pandas,Spark,Python,Data Products,Kafka,Vertex,Scipy,Scikit Learn,Databases,Data Science,Numpy,Azure,Keras,Aws | No | No |
Required Visa Status:
Citizen | GC |
US Citizen | Student Visa |
H1B | CPT |
OPT | H4 Spouse of H1B |
GC Green Card |
Employment Type:
Full Time | Part Time |
Permanent | Independent - 1099 |
Contract – W2 | C2H Independent |
C2H W2 | Contract – Corp 2 Corp |
Contract to Hire – Corp 2 Corp |
Description:
We are currently searching for a Senior Machine Learning Engineer to join our remote team. The selected candidate will have a key role in contributing to the design, development, and operational lifecycle of our ML pipeline, adhering to industry best practices.
In this role, your responsibilities will include designing, creating, maintaining, troubleshooting, and optimizing ML pipeline steps. You will also contribute to the design and implementation of ML prediction endpoints. Collaborating with System Engineers to configure the ML lifecycle management environment and promoting better coding practices will be essential.
For those who are passionate about innovation, we encourage you to submit your application and become part of our dynamic team!
We accept CVs in English only.
REQUIREMENTS
- Minimum of 3 years programming language experience, ideally in Python, and strong SQL knowledge
- Robust MLOps experience (Sagemaker, Vertex, or Azure ML)
- Intermediate level in Data Science, Data Engineering, and DevOps Engineering
- Experience with at least one project delivered to production in an MLE role
- Expertise in Engineering Best Practices
- Practical experience in implementing Data Products using the Apache Spark Ecosystem (Spark SQL, MLlib/SparkML) or alternative technologies
- Experience with Big Data technologies (e.g., Hadoop, Spark, Kafka, Cassandra, GCP BigQuery, AWS Redshift, Apache Beam, etc.)
- Proficiency in automated data pipeline and workflow management tools, i.e., Airflow, Argo Workflow, etc
- Experience in different data processing paradigms (batch, micro-batch, streaming)
- Practical experience working with at least one major Cloud Provider such as AWS, GCP, and Azure
- Production experience in integrating ML models into complex data-driven systems
- DS experience with Tensorflow/PyTorch/XGBoost, NumPy, SciPy, Scikit-learn, Pandas, Keras, Spacy, HuggingFace, Transformers
- Experience with different types of databases (Relational, NoSQL, Graph, Document, Columnar, Time Series, etc.)
Responsibilities:
- Contribution to the design, development, and operational lifecycle of the ML pipeline based on best practices
- Design, creation, maintenance, troubleshooting, and optimization of ML pipeline steps
- Ownership and contribution to the design and implementation of ML prediction endpoints
- Collaboration with System Engineers to configure the ML lifecycle management environment
- Writing specifications, documentation, and user guides for developed applications
- Promotion of improved coding practices and repository organization in the science work cycle
- Establishment and configuration of pipelines for projects
- Identification of technical risks and gaps, and devising mitigation strategies
- Collaboration with data scientists to productionalize predictive models, understanding the scope and purpose of the models built by data scientists, and creating scalable data preparation pipelines
REQUIREMENT SUMMARY
Min:N/AMax:5.0 year(s)
Information Technology/IT
IT Software - Application Programming / Maintenance
Software Engineering
Graduate
Proficient
1
Desde casa, Colombia