Algorithm Engineer - Audio Understanding

at  BYTEDANCE PTE LTD

Singapore, Southeast, Singapore -

Start DateExpiry DateSalaryPosted OnExperienceSkillsTelecommuteSponsor Visa
Immediate14 Nov, 2024USD 22500 Monthly17 Aug, 2024N/APublications,Algorithms,Computer Science,Creativity,Mathematics,Machine Learning,Speech Recognition,Python,Speech,Deep LearningNoNo
Add to Wishlist Apply All Jobs
Required Visa Status:
CitizenGC
US CitizenStudent Visa
H1BCPT
OPTH4 Spouse of H1B
GC Green Card
Employment Type:
Full TimePart Time
PermanentIndependent - 1099
Contract – W2C2H Independent
C2H W2Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:

ByteDance
Founded in 2012, ByteDance’s mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

ABOUT THE TEAM

The speech team’s mission is to empower content understanding, interaction and creation across TikTok and other products using speech & audio related technologies. We focus on cutting-edge R&D in areas like speech & audio, music processing, natural language understanding and multimodal deep learning. We are looking for top talents to work on these exciting technologies, integrate them into various TikTok and other products and ultimately bring joy to our global user base!

QUALIFICATIONS

  • Master’s or PhD in computer science, mathematics, engineering or related field
  • Experience in one or more areas of machine learning and deep learning, including but not limited to:
  • Automatic Speech Recognition
  • Automatic Speech Translation
  • Speech/audio self-supervised learning and foundation models
  • Speaker recognition and verification
  • Speech emotion recognition
  • Multimodal foundation models
  • Large Language Model pretraining and finetuning

PREFERRED QUALIFICATIONS

  • Publications in top-tier ML/DL venues such as NeurIPS, ICLR, ICML, AAAI and speech venues such as ICASSP, ASRU, Interspeech
  • Deep understanding of Large Language models
  • Familiar with distributed computing and large scale model training
  • Familiar with deep learning frameworks such as Tensorflow and Pytorch.
  • Familiar with engineering principles and best practices.
  • Highly competent in algorithms and programming; Strong coding skills in C/C++ and Python.
  • Ability to work collaboratively in a fast-paced, multi-functional environments
    ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too

Responsibilities:

  • Conduct cutting-edge research and development in speech/audio foundation models
  • Contribute to the advancement of audio understanding, including multilingual speech recognition, speech translation, multimodal understanding and etc.
  • Focus on and drive the practical application of relevant technologies in business scenarios, including but not limited to closed-captions, voice dubbing, video understanding.


REQUIREMENT SUMMARY

Min:N/AMax:5.0 year(s)

Information Technology/IT

IT Software - Other

Software Engineering

Phd

Proficient

1

Singapore, Singapore