Software Engineer, Data Infrastructure

Company: OpenAI
Location: San Francisco
Posted on: March 27, 2025

Job Description:

The Research Platform Analytics team designs, builds, and operates the critical foundational data and analytics infrastructure that enables research at OpenAI.

Our goal is one, and one only: accelerate the progress of research towards AGI. We do this by owning a variety of observability and analytics systems that provide quality signals about our research. We own their entire lifecycle, from data production in training workloads through ingestion and post-processing to end-user analytics products, all at large scale.

About the Role

As we scale up with more researchers and engineers joining OpenAI, we seek a pragmatic and passionate engineer with a strong focus on the experience of the engineers and scientists who work with our large datasets.

Our work involves building a generic data processing platform that enables researchers to store, query, and process petabyte-scale datasets efficiently. This includes developing and maintaining large-scale stream and batch data pipelines, ensuring our infrastructure scales to support ML workloads, and making trade-offs to deliver impact quickly. We work across distributed data systems, infrastructure, and observability, ensuring reliability while moving fast.

You will find yourself at home if you are comfortable with work such as scaling Kubernetes services, debugging Kafka consumer lag, diagnosing distributed systems failures, and developing new end-to-end data processing pipelines, from raw data capture to analytics using Presto, Trino, or Flink. A portion of this role involves hands-on infrastructure work, including deploying and troubleshooting core services.

This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.

In this role, you will:

Build and maintain large-scale stream and batch processing pipelines (Kafka, Spark, Flink, Trino/Presto).
Develop a general-purpose data processing platform for handling massive datasets.
Scale applications for ML research, ensuring smooth operation as workloads grow.
Ensure the security, integrity, and compliance of data according to industry and company standards.
Ensure our analytics and data platforms can scale reliably to the next several orders of magnitude.
Accelerate company productivity by empowering your fellow engineers, researchers, and teammates with excellent data tooling and systems, providing a best-in-class experience.
Bring new features and capabilities to the world by partnering with product engineers, trust & safety, and other teams to build the technical foundations.
Like all other teams, we are responsible for the reliability of the systems we build. This includes an on-call rotation to respond to critical incidents as needed.

You might thrive in this role if you have:
Proficiency in Python and backend development, with experience working in large codebases (monorepos).
Experience building and operating large-scale stream and batch processing pipelines (Kafka, Spark, Flink, Presto/Trino).
Hands-on experience with Kubernetes, Terraform, and deploying/troubleshooting production systems.
Experience working on access control, provenance, auditing, and large-scale data movement.
Passion for building systems that provide key insights, especially in ML training workflows.
Comfort in a fast-moving environment, making trade-offs to deliver impact quickly.
Understanding of data transformations in ML training and inference workflows is a plus.

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status.

For US Based Candidates: Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.