FewerJobs.

Software Engineer, Data Acquisition

OpenAI - San Francisco, California, United States

Posted Sep 22, 2023

Benefits

Parental leave
Not verified
Non-birth-parent leave
Not verified
Family-building benefits
  • Fertility benefits: Not verified
  • Adoption assistance: Not verified
  • Surrogacy assistance: Not verified
Mental health support
Not verified
Relocation assistance
Not verified
Childcare support
Not verified
Learning budget
Not verified
Verification
Not verified
Salary
Not verified
401(k) match
Not verified

Schedule

Shift type
Not verified
Weekend work
Not verified

Application

Cover letter
Not verified
Assessment
Not verified
Deadline
Not stated

Where they hire

State eligibility is not yet verified.

About this role

Overview:

The Data Acquisition team within the Foundations organization at OpenAI is responsible for all aspects of data collection to support our model training operations. Our team manages web crawling and GPTBot services and works closely with Data Processing, Architecture, and Scaling teams. We are looking for a skilled Software Engineer to join our Data Acquisition team.

Responsibilities:

- Own and lead engineering projects in the area of data acquisition including web crawling, data ingestion, and search.
- Collaborate with other sub-teams, such as Data Processing, Architecture, and Scaling, to ensure smooth data flow and system operability.
- Work closely with the legal team to handle any compliance or data privacy-related matters.
- Develop and deploy highly scalable distributed systems capable of handling petabytes of data.
- Architect and implement algorithms for data indexing and search capabilities.
- Build and maintain backend services for data storage, including work with key-value databases and synchronization.
- Deploy solutions in a Kubernetes Infrastructure-as-Code environment and perform routine system checks.
- Conduct and analyze experiments on data to provide insights into system performance.

Qualifications:

- BS/MS/PhD in Computer Science or a related field.
- 4+ years of industry experience in software development.
- Experience with large web crawlers a plus.
- Strong expertise in large stateful distributed systems and data processing.
- Proficiency in Kubernetes and Infrastructure-as-Code concepts.
- Willingness and enthusiasm for trying new approaches and technologies.
- Ability to handle multiple tasks and

Read the full description at jobs.ashbyhq.com. FewerJobs shows a source-linked preview and links to the original posting.

Apply at jobs.ashbyhq.com

Apply link not verified; last-live date unavailable.

What verified means

Verified means a displayed claim has a recorded source field, a source URL when available, and a timestamp showing when FewerJobs checked or enriched the evidence.
