SEAL Research Scientist, Frontier Risk Evaluations
Company: Tbwa Chiat/Day Inc
Location: San Francisco
Posted on: February 18, 2025
Job Description:
SEAL Research Scientist, Frontier Risk EvaluationsAs the leading
data and evaluation partner for frontier AI companies, Scale plays
an integral role in understanding the capabilities and safeguarding
large language models (LLMs). Safety, Evaluations and Alignment Lab
(SEAL) is Scale's frontier research effort dedicated to tackling
the challenging research problems in evaluation, red teaming, and
alignment of advanced AI systems.We are actively seeking talented
researchers to join us in shaping the landscape for safety and
transparency for the entire AI industry. We support collaborations
across the industry and academia and the publication of our
research findings.As a Research Scientist focused on Frontier Risk
Evaluations, you will design and create evaluation measures,
harnesses and datasets for measuring the risks posed by frontier AI
systems. For example, you might do any or all of the following:
- Design and build harnesses to test AI agents for dangerous
capabilities such as hacking or exploiting security
vulnerabilities;
- Develop and run human-in-the-loop tests of AI capabilities to
deceive, manipulate, blackmail, or otherwise engage in social
engineering;
- Work with government agencies or other labs to collectively
scope and design evaluations to measure and mitigate risks posed by
advanced AI systems.Ideally you'd have:
- Commitment to our mission of promoting safe, secure, and
trustworthy AI deployments in the industry as frontier AI
capabilities continue to advance.
- Practical experience conducting technical research
collaboratively, with proficiency in frameworks like Pytorch, Jax,
or Tensorflow. You should also be adept at interpreting research
literature and quickly turning new ideas into prototypes.
- A track record of published research in machine learning,
particularly in generative AI.
- At least three years of experience addressing sophisticated ML
problems, whether in a research setting or in product
development.
- Strong written and verbal communication skills to operate in a
cross functional team.Nice to have:
- Hands-on experience with open source LLM fine-tuning or
involvement in bespoke LLM fine-tuning projects using
Pytorch/Jax.
- Experience in crafting evaluations or a background in data
science roles related to LLM technologies.
- Experience working with cloud technology stack (eg. AWS or GCP)
and developing machine learning models in a cloud environment.Our
research interviews are crafted to assess candidates' skills in
practical ML prototyping and debugging, their grasp of research
concepts, and their alignment with our organizational culture. We
will not ask any LeetCode-style questions.Compensation packages at
Scale for eligible roles include base salary, equity, and benefits.
The range displayed on each job posting reflects the minimum and
maximum target for new hire salaries for the position, determined
by work location and additional factors, including job-related
skills, experience, interview performance, and relevant education
or training. Scale employees in eligible roles are also granted
equity based compensation, subject to Board of Director approval.
Your recruiter can share more about the specific salary range for
your preferred location during the hiring process, and confirm
whether the hired role will be eligible for equity grant. You'll
also receive benefits including, but not limited to: Comprehensive
health, dental and vision coverage, retirement benefits, a learning
and development stipend, and generous PTO. Additionally, this role
may be eligible for additional benefits such as a commuter
stipend.The base salary range for this full-time position in the
location of San Francisco is:$176,000 - $300,000 USDPLEASE NOTE:Our
policy requires a 90-day waiting period before reconsidering
candidates for the same role. This allows us to ensure a fair and
thorough evaluation of all applicants.About Us:At Scale, we believe
that the transition from traditional software to AI is one of the
most important shifts of our time. Our mission is to make that
happen faster across every industry, and our team is transforming
how organizations build and deploy AI. Our products power the
world's most advanced LLMs, generative models, and computer vision
models. We are trusted by generative AI companies such as OpenAI,
Meta, and Microsoft, government agencies like the U.S. Army and
U.S. Air Force, and enterprises including GM and Accenture. We are
expanding our team to accelerate the development of AI
applications.We believe that everyone should be able to bring their
whole selves to work, which is why we are proud to be an inclusive
and equal opportunity workplace. We are committed to equal
employment opportunity regardless of race, color, ancestry,
religion, sex, national origin, sexual orientation, age,
citizenship, marital status, disability status, gender identity or
Veteran status.We are committed to working with and providing
reasonable accommodations to applicants with physical and mental
disabilities. If you need assistance and/or a reasonable
accommodation in the application or recruiting process due to a
disability, please contact us at accommodations@scale.com. Please
see the United States Department of Labor's Know Your Rights poster
for additional information.PLEASE NOTE: We collect, retain and use
personal data for our professional business purposes, including
notifying you of job opportunities that may be of interest and
sharing with our affiliates. We limit the personal data we collect
to that which we believe is appropriate and necessary to manage
applicants' needs, provide our services, and comply with applicable
laws. Any information we collect in connection with your
application will be treated in accordance with our internal
policies and programs designed to protect personal data. Please see
our privacy policy for additional information.Apply for this job*
indicates a required fieldFirst Name *Last Name *Email
*PhoneResume/CVEnter manuallyAccepted file types: pdf, doc, docx,
txt, rtfEnter manuallyAccepted file types: pdf, doc, docx, txt,
rtf
#J-18808-Ljbffr
Keywords: Tbwa Chiat/Day Inc, San Francisco , SEAL Research Scientist, Frontier Risk Evaluations, Other , San Francisco, California
Didn't find what you're looking for? Search again!
Loading more jobs...