Research Scientist, Reinforcement Learning (Training)

Company: OpenAI
Location: San Francisco
Posted on: March 25, 2025

Job Description:

The Training Core Algorithms / Reinforcement Learning team is responsible for researching and developing the next generation of algorithms to power our RLHF stack (reinforcement learning from human feedback). The algorithms we develop are used in ChatGPT consumer product and the OpenAI API.About the RoleAs a Member of Technical Staff on our team, you will research and develop improvements to all components of our RLHF stack, including data collection, supervised finetuning, reward modeling, off- and on-policy learning, active learning, and evaluations. The ultimate test for our algorithms is how useful they are to our users, and we often deploy our algorithms into new ChatGPT models.We're looking for people who have extensive background in reinforcement learning research, are able to iterate quickly, and are proficient at coding.This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.In this role, you will:

Come up with improvements to RLHF
Prototype and evaluate these ideas
Scale up your innovations to ChatGPT scaleYou might thrive in this role if you:
Love being on the cutting edge of RL and language model research
Can iterate fast on lots of ideas
Like doing research that has real-world impactAbout OpenAIOpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status.For US Based Candidates: Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.
#J-18808-Ljbffr

Keywords: OpenAI, San Francisco , Research Scientist, Reinforcement Learning (Training), Other , San Francisco, California

Click here to apply!

Didn't find what you're looking for? Search again!

Let San Francisco recruiters find you. Post your resume for free!

Get San Francisco Other jobs via email.

View more San Francisco Other jobs

Other Other Jobs

Senior Backend Engineer
Description: Senior Backend Engineer - Direct-Hire/FTE - San Francisco, CA Hybrid Title: Senior Backend EngineerLocation: San Francisco, CA Hybrid Terms: Direct-Hire/FTECompensation: 180,000 - 200,000 Annual Base (more...)
Company: INSPYR Solutions
Location: San Francisco
Posted on: 04/3/2025

AI Engineer
Description: About this roleWriter is looking for an AI engineer with a strong software engineering background to join our expanding team of AI experts.At Writer, we believe in using the power of AI to unlock the (more...)
Company: Writer
Location: San Francisco
Posted on: 04/3/2025

Founding ML Robotics Engineer
Description: Who is :Recruiting from Scratch is a talent firm that focuses on placing the best candidate for our clients.https://www.recruitingfromscratch.com/Founding ML Robotics EngineerLocation: New York, NY br (more...)
Company: Recruiting From Scratch
Location: San Francisco
Posted on: 04/3/2025

Salary in San Francisco, California Area | More details for San Francisco, California Jobs |Salary

Forward-Deployed Test Engineer
Description: Ranger, revolutionizing software testingRanger is an AI-powered platform redefining QA. Our mission is to power testing for the world's most ambitious engineering, product, and design teams so they can (more...)
Company: Ranger
Location: San Francisco
Posted on: 04/3/2025

Backend Engineer
Description: ABOUT PLAUD AIPLAUD AI is a pioneering AI-native hardware and software company that turns meetings and conversations into actionable insights with AI devices like PLAUD NOTE and PLAUD NotePin. By recording, (more...)
Company: PLAUD.AI
Location: San Francisco
Posted on: 04/3/2025

- Lead Frontend Engineer (React.js) -
Description: OverviewWe look for Senior Engineers seeking challenging work to bridge the gap between materials/chemistry, AI/ML, and computer science to help us develop a software framework for designing and discovering (more...)
Company: Mat3ra
Location: San Francisco
Posted on: 04/3/2025

Principal/Senior Backend Engineer
Description: The crypto world is changing-we've seen an explosion in innovation but a fragmentation of tools. Today, traders navigate a low visibility landscape, handling multiple chains, scattered data, and many (more...)
Company: Up Closets of North Cincinnati
Location: San Francisco
Posted on: 04/3/2025

Project Engineer
Description: Mercy Ships is a global faith-based charity that uses hospital ships to bring life-changing surgeries and transformational medical training to people in some of the most challenging contexts along the (more...)
Company: Mercy Ships
Location: San Francisco
Posted on: 04/3/2025

Research Engineer, Multimodal Safety
Description: Our team is dedicated to shaping the future of artificial intelligence by equipping ChatGPT with the ability to hear, see, speak, and create visually compelling images, transforming how people interact (more...)
Company: OpenAI
Location: San Francisco
Posted on: 04/3/2025

Sr. Field Service Engineer
Description: Sr. Field Service Engineer br br Job ID br br 211383 br br Posted br br 26-Mar-2025 br br Service line br br GWS Segment br br Role type br br Full-time br br Areas (more...)
Company: CBRE
Location: San Francisco
Posted on: 04/3/2025

Loading more jobs...

Research Scientist, Reinforcement Learning (Training)

Didn't find what you're looking for? Search again!

Other Other Jobs

Log In or Create An Account