Research Scientist, Reinforcement Learning (Training)
Company: OpenAI
Location: San Francisco
Posted on: March 25, 2025
Job Description:
The Training Core Algorithms / Reinforcement Learning team is
responsible for researching and developing the next generation of
algorithms to power our RLHF (reinforcement learning from human
feedback) stack. The algorithms we develop are used in the ChatGPT
consumer product and the OpenAI API.

About the Role
As a Member of Technical Staff on our team, you will research and
develop improvements to all components of our RLHF stack, including
data collection, supervised finetuning, reward modeling, off- and
on-policy learning, active learning, and evaluations. The ultimate
test for our algorithms is how useful they are to our users, and we
often deploy our algorithms into new ChatGPT models.

We're looking for people who have an extensive background in
reinforcement learning research, are able to iterate quickly, and are
proficient at coding.

This role is based in San Francisco, CA. We use a hybrid work model
of 3 days in the office per week and offer relocation assistance to
new employees.

In this role, you will: