Start Date
Immediate
Expiry Date
15 Aug, 26
Salary
0.0
Posted On
17 May, 26
Experience
5 year(s) or above
Remote Job
Yes
Telecommute
Yes
Sponsor Visa
No
Skills
Reinforcement Learning, Python, Deep Learning Frameworks, Reward Modeling, Simulation Environments, Policy Gradient, Actor-Critic, Offline RL, RLHF, DPO, Distributed RL, Imitation Learning, GPU Clusters, Probability, Optimization, Neural Network Policies
Industry
Information Technology & Services