Start Date
Immediate
Expiry Date
17 Jun, 26
Salary
413160.0
Posted On
19 Mar, 26
Experience
5 year(s) or above
Remote Job
Yes
Telecommute
Yes
Sponsor Visa
No
Skills
Reinforcement Learning, Large Language Models, AI Agents, Policy Optimization, RLHF, RLAIF, Agent Training Pipelines, Benchmarking Systems, PPO, Actor-Critic, Policy Gradient Methods, Python, PyTorch, JAX, Preference Learning, Multi-Agent Systems
Industry
Motor Vehicle Manufacturing
How To Apply:
Incase you would like to apply to this job directly from the source, please click here