Senior Machine Learning Engineer
at EPAM Systems Inc
Desde casa, Cauca, Colombia -
Start Date | Expiry Date | Salary | Posted On | Experience | Skills | Telecommute | Sponsor Visa |
---|---|---|---|---|---|---|---|
Immediate | 22 Apr, 2025 | USD 200 Annual | 23 Jan, 2025 | N/A | Databases,Scipy,Spark,Vertex,Numpy,Kafka,Python,Data Engineering,Data Science,Scikit Learn,Azure,Pandas,Aws,Data Products,Keras,Production Experience,Cassandra | No | No |
Required Visa Status:
Citizen | GC |
US Citizen | Student Visa |
H1B | CPT |
OPT | H4 Spouse of H1B |
GC Green Card |
Employment Type:
Full Time | Part Time |
Permanent | Independent - 1099 |
Contract – W2 | C2H Independent |
C2H W2 | Contract – Corp 2 Corp |
Contract to Hire – Corp 2 Corp |
Description:
Our remote team is in need of a Senior Machine Learning Engineer who will actively take part in the ML pipeline design, development, and operation based on the best practices.
In this position, your responsibilities will include designing, creating, maintaining, troubleshooting, and optimizing ML pipeline steps. You will also have ownership and involvement in the ML prediction endpoints design and implementation. Your role will be critical in collaborating with System Engineers to configure the ML lifecycle management environment and aiding in the enhancement of coding practices.
We welcome those who are passionate about innovation to apply and join our team!
We accept CVs in English only.
REQUIREMENTS
- A minimum of 3 years of programming language experience, ideally in Python, with strong SQL knowledge
- Solid MLOps experience (Sagemaker, Vertex, or Azure ML)
- Intermediate level in Data Science, Data Engineering, and DevOps Engineering
- At least one project delivered to production in an MLE role
- Expert level in Engineering Best Practices
- Practical experience in implementing Data Products using the Apache Spark Ecosystem (Spark SQL, MLlib/SparkML) or alternative technologies
- Familiarity with Big Data technologies (e.g., Hadoop, Spark, Kafka, Cassandra, GCP BigQuery, AWS Redshift, Apache Beam, etc.)
- Experience with automated data pipeline and workflow management tools, i.e., Airflow, Argo Workflow, etc
- Experience in different data processing paradigms (batch, micro-batch, streaming)
- Practical experience working with at least one of the major Cloud Providers such as AWS, GCP, and Azure
- Production experience in integrating ML models into complex data-driven systems
- DS experience with Tensorflow/PyTorch/XGBoost, NumPy, SciPy, Scikit-learn, Pandas, Keras, Spacy, HuggingFace, Transformers
- Experience with different types of databases (Relational, NoSQL, Graph, Document, Columnar, Time Series, etc.)
Responsibilities:
- Participation in ML pipeline design, development, and operation based on best practices
- Role in designing, creating, maintaining, troubleshooting, and optimizing ML pipeline steps
- Ownership and involvement in the ML prediction endpoints design and implementation
- Collaboration with System Engineers to configure ML lifecycle management environment
- Specification, documentation, and user guide writing for developed applications
- Support in enhancing coding practices and repository organization in the science work cycle
- Establishment and configuration of pipelines for projects
- Constant identification of technical risks and gaps, and development of mitigation strategies
- Collaboration with data scientists to operationalize predictive models, understanding the scope and purpose of models built by data scientists, and creation of scalable data preparation pipelines
REQUIREMENT SUMMARY
Min:N/AMax:5.0 year(s)
Information Technology/IT
IT Software - Application Programming / Maintenance
Software Engineering
Graduate
Proficient
1
Desde casa, Colombia