Fundamental AI Research Scientist, Language - FAIR at Meta

, , -

Full Time

Start Date

Immediate

Expiry Date

02 Feb, 26

Salary

0.0

Posted On

04 Nov, 25

Experience

2 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

Skills

Artificial Intelligence, Machine Learning, Neural Networks, Audio Understanding, Audio Generation, Multimodality, Speech Recognition, Deep Learning, Python Programming, Research Publications, Data Science, Audio-Visual Learning, Text-to-Speech Synthesis, Image Generation, Video Generation, Benchmarking

Industry

Software Development

Description

Meta is seeking Research Scientists to join its Fundamental AI Research (FAIR) organization, focused on making significant advances in AI. We publish groundbreaking papers and release frameworks/libraries that are widely used in the open-source community. The team is working on the industrial leading research on building foundation models for audio understanding and audio generation. We are also closely working with vision research teams on pushing the frontier of multimodality (audio, video, language) research. Our teams research is focusing on audio and multimodality. Individuals in this role are expected to be recognized experts in identified research areas such as artificial intelligence, speech and audio generation and audio-visual learning. Researchers will drive impact by: (1) publishing state-of-the-art research papers, (2) open sourcing high quality code and reproducible results for the community, and (3) bringing the latest research to Meta products for connecting billions of users. They will work with an interdisciplinary team of scientists, engineers, and cross-functional partners, and will have access to cutting edge technology, resources, and research facilities. Responsibilities Develop algorithms based on state-of-the-art machine learning and neural network methodologies Perform research to advance the science and technology of intelligent machines Conduct research that enables learning the semantics of data across multiple modalities (audio, speech, images, video, text, and other modalities) Work towards long-term ambitious research goals, while identifying intermediate milestones Design and implement models and algorithms Work with large datasets, train / tune / scale the models, create benchmarks to evaluate the performance, open source and publish Qualifications Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience PhD degree in AI, computer science, data science, or related technical fields, or equivalent practical experience 2+ years of experience holding an industry, faculty, academic, or government researcher position Research publications reflecting experience in related research fields: audio (speech, sound, or music) generation, text-to-speech (TTS) synthesis, text-to-music generation, text-to-sound generation, speech recognition, speech / audio representation learning, vision perception, image / video generation, video-to-audio generation, audio-visual learning, audio language models, lip sync, lip movement generation / correction, lip reading, etc Familiarity with one or more deep learning frameworks (e.g. pytorch, tensorflow, …) Experienced in Python programming language Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment Publication record at peer-reviewed AI conferences (e.g. ACL, EMNLP, NeurIPS, ICLR, ICML or similar)

Responsibilities

Develop algorithms based on state-of-the-art machine learning methodologies and conduct research to advance intelligent machines. Work towards ambitious research goals while designing and implementing models and algorithms.