Senior AI Engineer, Large Language Model

at Shopee

Singapore, Southeast, Singapore

Start Date: Immediate
Expiry Date: 29 Apr, 2025
Salary: Not Specified
Posted On: 30 Jan, 2025
Experience: N/A
Skills: Computer Science, Information Technology, Problem Analysis
Telecommute: No
Sponsor Visa: No

Description:

Department: Engineering and Technology
Level: Experienced (Individual Contributor)
Location: Singapore
The Engineering and Technology team is at the core of the Shopee platform development. The team is made up of a group of passionate engineers from all over the world, striving to build the best systems with the most suitable technologies. Our engineers do not merely solve problems at hand; we build foundations for a long-lasting future. We don’t limit ourselves on what we can or can’t do; we take matters into our own hands even if it means drilling down to the bottom layer of the computing platform. Shopee’s hyper-growing business scale has transformed most “innocent” problems into huge technical challenges, and there is no better place to experience it first-hand if you love technologies as much as we do.

ABOUT THE TEAM:

We are seeking a highly skilled AI Engineer specializing in Large Language Models to contribute to the development of a scalable and optimized South-East Asia foundation language model. In this role, you will collaborate with multi-region teams, adopting advanced AI technologies and applying AI models and strategies to enhance business models. Your primary focus will be on following and crafting advanced AI algorithms for South-East Asian languages in the e-commerce domain.

JOB DESCRIPTION:

  • Contribute to the research and implementation of pre-training and alignment algorithms, including ultra-large-scale multilingual pre-training, Mixture-of-Experts model training, instruction pre-training, SFT, and RLHF.
  • Contribute to improving the explainability and safety of AI, especially trustworthy large language models.
  • Follow frontier technologies and compare advanced approaches for application in business scenarios.
  • Conduct experiments to test the performance of different AI models, identifying areas for improvement and exploring new directions for enhancement.
  • Work collaboratively in a team environment, applying expertise in statistics, scripting, and relevant programming languages.

REQUIREMENTS:

  • Doctorate degree in Computer Science, Information Technology, Programming & Systems Analysis, or another related discipline.
  • Excellent coding skills, a solid grasp of data structures and basic algorithms, and proficiency in Python/PyTorch.
  • Minimum 1 year of research experience in the basic principles and training methods of industry-leading LLMs (such as GPT and LLaMA).
  • Research experience in text generation or dialogue systems.
  • Excellent problem analysis and solving skills, with the ability to dig deeply into problems in large model training and application.
  • Good communication and collaboration skills, able to explore new technologies with the team and promote technological progress.

Responsibilities:

Please refer to the job description for details.


REQUIREMENT SUMMARY

Min: N/A, Max: 5.0 year(s)

Information Technology/IT

IT Software - Application Programming / Maintenance

Software Engineering

Graduate

Computer Science, Information Technology, Technology

Proficient

1

Singapore, Singapore