Start Date
Immediate
Expiry Date
04 Dec, 25
Salary
300000.0
Posted On
05 Sep, 25
Experience
0 year(s) or above
Remote Job
Yes
Telecommute
Yes
Sponsor Visa
No
Skills
Optimization, Empirical Research, Computer Science, Triton, Experimental Design, Rapid Prototyping, Software Engineering Practices, Cuda, Machine Learning, Testing, Analytical Skills, Code Review, Training, Dynamics, Diffusion, Optimization Techniques
Industry
Information Technology/IT
Mirage is redefining short-form video with frontier AI research.
We’re building full-stack foundation models and products that are changing the future of this format and video creation, production and editing more broadly. Over 20 million creators and businesses use Mirage’s products to reach their full creative and commercial potential.
We are a rapidly growing team of ambitious, experienced, and devoted engineers, researchers, designers, marketers, and operators based in NYC. As an early member of our team, you’ll have an opportunity to have an outsized impact on our products and our company’s culture.
REQUIREMENTS:
Research Experience:
Technical Expertise:
Engineering Capabilities:
How To Apply:
Incase you would like to apply to this job directly from the source, please click here
ABOUT THE ROLE:
Captions is seeking an exceptional Research Engineer (MOTS) to advance the state-of-the-art in large-scale multimodal video diffusion models. You’ll conduct novel research on generative modeling architectures, develop new training techniques, and scale models to billions of parameters. As a key member of our ML Research team, you’ll work at the cutting edge of multimodal generation while building systems that enable natural, controllable video creation. We’re already training large-scale models with demonstrated product impact, and we’re excited to continue expanding the scope and capabilities of our research.
We’re especially excited about pushing the boundaries of audio-video generation, with a focus on realistic and charismatic human behavior that enables natural storytelling and creative iteration. Our models power creative tools used by millions of creators, and we’re tackling fundamental challenges in how to generate compelling human motion, expression, and speech.
KEY RESPONSIBILITIES:
Research & Architecture Development:
Model Training & Optimization:
Technical Innovation: