SEAL Research Scientist, Frontier Risk Evaluations
Company: Tbwa Chiat/Day Inc
Location: San Francisco
Posted on: February 11, 2025
Job Description:
As the leading
data and evaluation partner for frontier AI companies, Scale plays
an integral role in understanding the capabilities of large language
models (LLMs) and in safeguarding them. The Safety, Evaluations and Alignment Lab
(SEAL) is Scale's frontier research effort dedicated to tackling
the challenging research problems in evaluation, red teaming, and
alignment of advanced AI systems.

We are actively seeking talented
researchers to join us in shaping the landscape for safety and
transparency for the entire AI industry. We support collaborations
across the industry and academia and the publication of our
research findings.

As a Research Scientist focused on Frontier Risk
Evaluations, you will design and create evaluation measures,
harnesses, and datasets for measuring the risks posed by frontier
AI systems. For example, you might do any or all of the
following:
- Design and build harnesses to test AI agents for dangerous
capabilities such as hacking or exploiting security
vulnerabilities;
- Develop and run human-in-the-loop tests of AI capabilities to
deceive, manipulate, blackmail, or otherwise engage in social
engineering;
- Work with government agencies or other labs to collectively
scope and design evaluations to measure and mitigate risks posed by
advanced AI systems.

Ideally you'd have:
- Commitment to our mission of promoting safe, secure, and
trustworthy AI deployments in the industry as frontier AI
capabilities continue to advance.
- Practical experience conducting technical research
collaboratively, with proficiency in frameworks like PyTorch, JAX,
or TensorFlow. You should also be adept at interpreting research
literature and quickly turning new ideas into prototypes.
- A track record of published research in machine learning,
particularly in generative AI.
- At least three years of experience addressing sophisticated ML
problems, whether in a research setting or in product
development.
- Strong written and verbal communication skills to operate in a
cross-functional team.

Nice to have:
- Hands-on experience with open-source LLM fine-tuning or
involvement in bespoke LLM fine-tuning projects using
PyTorch/JAX.
- Experience in crafting evaluations or a background in data
science roles related to LLM technologies.
- Experience working with a cloud technology stack (e.g., AWS or
GCP) and developing machine learning models in a cloud
environment.

Our research interviews are crafted to assess
candidates' skills in practical ML prototyping and debugging, their
grasp of research concepts, and their alignment with our
organizational culture. We will not ask any LeetCode-style
questions.

Compensation packages at Scale for eligible roles include
base salary, equity, and benefits. The range displayed on each job
posting reflects the minimum and maximum target for new hire
salaries for the position, determined by work location and
additional factors, including job-related skills, experience,
interview performance, and relevant education or training. Scale
employees in eligible roles are also granted equity-based
compensation, subject to Board of Directors approval.

About Us:
At
Scale, we believe that the transition from traditional software to
AI is one of the most important shifts of our time. Our mission is
to make that happen faster across every industry, and our team is
transforming how organizations build and deploy AI. Our products
power the world's most advanced LLMs, generative models, and
computer vision models. We are trusted by generative AI companies
such as OpenAI, Meta, and Microsoft, government agencies like the
U.S. Army and U.S. Air Force, and enterprises including GM and
Accenture. We are expanding our team to accelerate the development
of AI applications.

We believe that everyone should be able to bring
their whole selves to work, which is why we are proud to be an
affirmative action employer and an inclusive, equal opportunity
workplace. We are committed to equal employment opportunity
regardless of race, color, ancestry, religion, sex, national
origin, sexual orientation, age, citizenship, marital status,
disability status, gender identity, or Veteran status.