Machine Learning Systems Engineer, Reinforcement Learning Engineering

Company: NLP PEOPLE
Location: San Francisco
Posted on: November 14, 2024

Job Description:

About the role:
You want to build the cutting-edge systems that train AI models like Claude. You're excited to work at the frontier of machine learning, implementing and improving advanced techniques to create ever more capable, reliable and steerable AI. As an ML Systems Engineer on our RL Eng team, you'll be responsible for the critical algorithms and infrastructure that our researchers depend on to train models. Your work will directly enable breakthroughs in AI capabilities and safety. You'll focus obsessively on improving the performance, robustness, and usability of these systems so our research can progress as quickly as possible. You're energized by the challenge of supporting and empowering our research team in the mission to build beneficial AI systems.
Our finetuning researchers train our production Claude models and internal research models using RLHF and other related methods. Your job will be to build, maintain, and improve the algorithms and systems that these researchers use to train models. You'll be responsible for improving the speed, reliability, and ease-of-use of these systems.
You may be a good fit if you:

Have 2+ years of software engineering experience
Like working on systems and tools that make other people more productive
Are results-oriented, with a bias towards flexibility and impact
Pick up slack, even if it goes outside your job description
Enjoy pair programming (we love to pair!)
Want to learn more about machine learning research
Care about the societal impacts of your work

Strong candidates may also have experience with:
High performance, large scale distributed systems
Kubernetes
Python
Machine learning
Implementing LLM finetuning algorithms, such as RLHF

Representative projects:
Profiling our reinforcement learning pipeline to find opportunities for improvement
Building a system that regularly launches training jobs in a test environment so that we can quickly detect problems in the training pipeline
Making changes to our finetuning systems so they work on new model architectures
Building instrumentation to detect and eliminate Python GIL contention in our training code
Diagnosing why training runs have started slowing down after some number of steps, and fixing it
Implementing a stable, fast version of a new training algorithm proposed by a researcher

Deadline to apply: None. Applications will be reviewed on a rolling basis.
Company:
Anthropic
#J-18808-Ljbffr

Keywords: NLP PEOPLE, San Francisco , Machine Learning Systems Engineer, Reinforcement Learning Engineering, Other , San Francisco, California

Click here to apply!

Didn't find what you're looking for? Search again!

Let San Francisco recruiters find you. Post your resume for free!

Get San Francisco Other jobs via email.

View more San Francisco Other jobs

Other Other Jobs

WATER FEATURE INSTALLATION SPECIALIST POSITION: TRANSFORM LANDSCAPES WITH STUNNING AQUATIC DESIGNS
Description: Water Feature Installation Specialist Position: Transform Landscapes with Stunning Aquatic Designs br Read on to find out what you will need to succeed in this position,
Company: Job Hopper
Location: Redwood City
Posted on: 11/21/2024

Relationship Banker - Napa area
Description: Job Description:At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. Responsible Growth is how we run our company and how we (more...)
Company: Disability Solutions
Location: Napa
Posted on: 11/21/2024

Serial Crystallography Research Scientist
Description: Serial Crystallography Research ScientistBerkeley Lab's LBNL Molecular Biophysics and Integrated Bioimaging Division MBIB has an opening for a Computational Research Scientist to join the Structural (more...)
Company: Lawrence Berkeley National Laboratory
Location: Berkeley
Posted on: 11/21/2024

Salary in San Francisco, California Area | More details for San Francisco, California Jobs |Salary

Screenwriter
Description: Since 2016, CMS has trailblazed the way for quality interactive storytelling. Through CHAPTERS: INTERACTIVE STORIES, readers immerse themselves in compelling playable novels licensed from best selling (more...)
Company: Crazy Maple Studio
Location: Sunnyvale
Posted on: 11/21/2024

Workday HCM Functional Lead
Description: A company is looking for a Workday HCM Functional Lead to provide expertise in Workday HCM configuration and
Company: VirtualVocations
Location: Concord
Posted on: 11/21/2024

Senior Quality Partner
Description: Roche fosters diversity, equity and inclusion, representing the communities we serve. When dealing with healthcare on a global scale, diversity is an essential ingredient to success. We believe that inclusion (more...)
Company: F. Hoffmann-La Roche AG
Location: Pleasanton
Posted on: 11/21/2024

Variable Universal Life Insurance Specialist
Description: A company is looking for a Variable Universal Life Insurance Operations Specialist. br br br br
Company: VirtualVocations
Location: Concord
Posted on: 11/21/2024

Drive with Lyft
Description: What is Lyft br br Lyft is a flexible earning opportunity and a platform that connects drivers with individuals that need rides. Driving with Lyft is the perfect way to earn money on any schedule (more...)
Company: Lyft
Location: Walnut Creek
Posted on: 11/21/2024

Personal Care Attendant II - Day Center
Description: The Position: The Personal Care Attendant II provides assistance to CEI participants in the participant's home or day center. The Personal Care Attendant II demonstrates knowledge and skills necessary (more...)
Company: Center for Elders' Independence
Location: Livermore
Posted on: 11/21/2024

DevRel Community Advocate
Description: A company is looking for a DevRel Community Advocate to shape and grow their developer community. br br br br Key
Company: VirtualVocations
Location: Sunnyvale
Posted on: 11/21/2024

Loading more jobs...

Machine Learning Systems Engineer, Reinforcement Learning Engineering

Didn't find what you're looking for? Search again!

Other Other Jobs

Log In or Create An Account