Data Scientist
at Pearson
Sydney, New South Wales, Australia -
Start Date | Expiry Date | Salary | Posted On | Experience | Skills | Telecommute | Sponsor Visa |
---|---|---|---|---|---|---|---|
Immediate | 28 Apr, 2025 | Not Specified | 29 Jan, 2025 | 1 year(s) or above | Data Quality,Model Development,Athena,Rdbms,Postgresql,Recommender Systems,Data Analytics,Data Science,Validation,Design Principles,Python,English,Nlp,Sql,Aws,Data Models,Statistics,Time Series Analysis,Data Extraction,Machine Learning,Computer Science | No | No |
Required Visa Status:
Citizen | GC |
US Citizen | Student Visa |
H1B | CPT |
OPT | H4 Spouse of H1B |
GC Green Card |
Employment Type:
Full Time | Part Time |
Permanent | Independent - 1099 |
Contract – W2 | C2H Independent |
C2H W2 | Contract – Corp 2 Corp |
Contract to Hire – Corp 2 Corp |
Description:
THE TEAM:
The Workforce Solutions Data Science team spearheads the creation of innovative products by developing core intellectual property through a scientific, research-driven approach. We seek talented individuals eager to enhance our capabilities and drive significant customer value through the expansion of our data science IP.
The role sits within a cross-functional squad including Data Scientists, AI Engineers, and Ontologists. Each squad is responsible for the complete lifecycle of data science models—from concept and development through to deployment via APIs. This team specialises in building and maintaining models related to Pearson’s core occupation and skill ontologies, including data extraction and normalisation, analytics and recommendation systems. Positioned within Engineering, the squad collaborates closely with the Product team to ensure that the solutions align with strategic objectives and product priorities.
EXPERIENCE OVERVIEW:
- Data Science Experience: 1+ years in data science, with a robust background in machine learning or data analytics. Demonstrated proficiency in data-driven analysis and model development. Holds a formal qualification in Data Science, Computer Science, Software Engineering, Statistics, or a related field.
- Machine Learning and Statistical Models: Strong foundational skills in applied machine learning and statistical models, including NLP, linear/logistic regression, time series analysis, and recommender systems.
- Python Proficiency: Solid experience in Python for developing data processing tasks, conducting analyses, and deploying machine learning models. Familiarity with building and maintaining REST APIs is a plus.
- SQL and Database Management: Competent in performing SQL queries for effective data extraction, manipulation, and analysis, combined with a sound understanding of database design principles. Familiar with Athena and RDBMS such as PostgreSQL.
- AWS and Cloud Technologies: Proven abilities in using AWS to build and optimize data pipelines, with practical experience in deploying data models and managing large datasets in a cloud environment.
- Generative AI: Foundational understanding of generative AI concepts, with the ability to apply these technologies to improve data quality, model development and validation.
- Professional fluency in English is essential
At Pearson we ‘add life to a lifetime of learning’ so everyone can realise the life they imagine. We do this by creating vibrant and enriching learning experiences designed for real-life impact.
Pearson is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.
WHO WE ARE:
At Pearson, our purpose is simple: to help people realize the life they imagine through learning. We believe that every learning opportunity is a chance for a personal breakthrough. We are the world’s lifelong learning company. For us, learning isn’t just what we do. It’s who we are. To learn more: We are Pearson.
Pearson is an Affirmative Action and Equal Opportunity Employer and a member of E-Verify. We want a team that represents a variety of backgrounds, perspectives and skills. The more inclusive we are, the better our work will be. All employment decisions are based on qualifications, merit and business need. All qualified applicants will receive consideration for employment without regard to race, ethnicity, color, religion, sex, sexual orientation, gender identity, gender expression, age, national origin, protected veteran status, disability status or any other group protected by law. We strive for a workforce that reflects the diversity of our communities.
If you are an individual with a disability and are unable or limited in your ability to use or access our career site as a result of your disability, you may request reasonable accommodations by emailing TalentExperienceGlobalTeam@grp.pearson.com.
Note that the information you provide will stay confidential and will be stored securely. It will not be seen by those involved in making decisions as part of the recruitment process.
Job: RESEARCH AND DEVELOPMENT
Organization: Workforce Skills
Schedule: FULL_TIME
Workplace Type: Hybrid
Req ID: 18142
Responsibilities:
THE ROLE:
Reporting to the Lead Data Scientist, you will support the development and delivery of data science models. You will play a key role in maintaining high standards of data quality and model validation. Additionally, you will contribute to extending and refining existing data pipelines and models.
KEY RESPONSIBILITIES:
- Data Manipulation: Utilise SQL to interact with Athena databases, querying and manipulating large datasets.
- Advanced Analytical Implementation: Implement analytical techniques and experimental methods to discover data patterns and develop classification, prediction, and optimisation models, driving business outcomes.
- Model Development Support: Participate in the model development lifecycle from conception through deployment via APIs, focusing on creating scalable, efficient, and purpose-fit models.
- Data Pipeline Maintenance: Collaborate with team members to maintain, track, and evaluate data pipelines and deployed models, with a strong emphasis on quality assurance processes to ensure accurate and reliable outputs that drive continuous improvement.
- Generative AI Application: Leverage knowledge of Language Model Models (LLMs) to apply generative AI, developing innovative solutions that enhance data quality and creativity.
- API Development & Maintenance: Design, develop, and maintain robust REST APIs for model deployment, ensuring efficient and reliable integration with internal and external systems.
- Qualitative Research and Analysis: Perform research, data analysis, and visualisation in the areas of future of work, workforce analytics, skills, and education, contributing to targeted outcomes.
- Collaboration with Product and Customer Teams: Engage with product and customer teams to understand client issues and ensure solutions deliver substantial value.
REQUIREMENT SUMMARY
Min:1.0Max:6.0 year(s)
Information Technology/IT
IT Software - DBA / Datawarehousing
Software Engineering
Graduate
Proficient
1
Sydney NSW, Australia