Senior AI Engineer, Large Language Model at Shopee

Singapore, Southeast, Singapore -

Full Time

Start Date

Immediate

Expiry Date

29 Apr, 25

Salary

0.0

Posted On

30 Jan, 25

Experience

0 year(s) or above

Remote Job

Telecommute

Sponsor Visa

Skills

Computer Science, Information Technology, Problem Analysis

Industry

Information Technology/IT

Description

Department Engineering and Technology
LevelExperienced (Individual Contributor)
LocationSingapore
The Engineering and Technology team is at the core of the Shopee platform development. The team is made up of a group of passionate engineers from all over the world, striving to build the best systems with the most suitable technologies. Our engineers do not merely solve problems at hand; We build foundations for a long-lasting future. We don’t limit ourselves on what we can or can’t do; we take matters into our own hands even if it means drilling down to the bottom layer of the computing platform. Shopee’s hyper-growing business scale has transformed most “innocent” problems into huge technical challenges, and there is no better place to experience it first-hand if you love technologies as much as we do.

ABOUT THE TEAM:

We are seeking a highly skilled AI Engineer specializing in Large Language Model to contribute to the development of a scalable and optimized South-East Asia foundation language model. In this role, you will collaborate with multi-region teams, adopting the advanced AI technologies and applying AI models and strategies to enhance business models. Your primary focus will be on following and crafting the advanced AI algorithms for South-East Asia language in the ecommerce domain.

JOB DESCRIPTION:

Contribute to the research and implementation of pre-training and alignment algorithms include ultra-large-scale multilingual pre-training technology, Mixture-of-Experts model training, Instruction Pretraining, SFT, and RLHF.
Contribute to the explanation and safety improvement of AI, especially in trustworthy Large language models.
Follow the frontier technologies and make comparisons about the advanced technologies to apply in business scenarios.
Conduct experiments to test the performance of different AI models, identifying areas for improvement and exploring new directions for enhancement.
Work collaboratively in a team environment, applying expertise in statistics, scripting, and relevant programming languages.

REQUIREMENTS:

Doctorate degree in Computer Science, Information Technology, Programming & Systems Analysis, or other related disciplines
Excellent coding skills, data structure and basic algorithm skills, proficiency in Python/Pytorch coding.
Minimum 1 year of research experience in basic principles and training methods of industry-leading LLM (such as GPT, LLaMA).
Have research experience in text generation or dialogue systems
Excellent problem analysis and solving skills, able to deeply solve problems in large model training and application.
Good communication and collaboration skills, able to explore new technologies with the team and promote technological progress.

Responsibilities

Please refer the Job description for details