Software Engineer, Data Acquisition
OpenAI - San Francisco, California, United States
Posted Sep 22, 2023
Benefits
- Parental leave
- Not verified
- Non-birth-parent leave
- Not verified
- Family-building benefits
-
- Fertility benefits: Not verified
- Adoption assistance: Not verified
- Surrogacy assistance: Not verified
- Mental health support
- Not verified
- Relocation assistance
- Not verified
- Childcare support
- Not verified
- Learning budget
- Not verified
- Verification
- Not verified
- Salary
- Not verified
- 401(k) match
- Not verified
Was this benefit information wrong? Tell us.
Schedule
- Shift type
- Not verified
- Weekend work
- Not verified
Application
- Cover letter
- Not verified
- Assessment
- Not verified
- Deadline
- Not stated
Where they hire
State eligibility is not yet verified.
About this role
Software Engineer, Data Acquisition San Francisco, California, United States Overview: The Data Acquisition team within the Foundations organization at OpenAI is responsible for all aspects of data collection to support our model training operations. Our team manages web crawling and GPTBot services and works closely with Data Processing, Architecture, and Scaling teams. We are looking for a skilled Software Engineer to join our Data Acquisition team. Responsibilities: - Own and lead engineering projects in the area of data acquisition including web crawling, data ingestion, and search. - Collaborate with other sub-teams, such as Data Processing, Architecture, and Scaling, to ensure smooth data flow and system operability. - Work closely with the legal team to handle any compliance or data privacy-related matters. - Develop and deploy highly scalable distributed systems capable of handling petabytes of data. - Architect and implement algorithms for data indexing and search capabilities. - Build and maintain backend services for data storage, including work with key-value databases and synchronization. - Deploy solutions in a Kubernetes Infrastructure-as-Code environment and perform routine system checks. - Conduct and analyze experiments on data to provide insights into system performance. Qualifications: - BS/MS/PhD in Computer Science or a related field. - 4+ years of industry experience in software development. - Experience with large web crawlers a plus - Strong expertise in large stateful distributed systems and data processing. - Proficiency in Kubernetes, and Infrastructure-as-Code concepts. - Willingness and enthusiasm for trying new approaches and technologies. - Ability to handle multiple tasks and
Read the full description at jobs.ashbyhq.com. FewerJobs shows a source-linked preview and links to the original posting.
Apply link not verified; last-live date unavailable.
What verified means
Verified means a displayed claim has a recorded source field, a source URL when available, and a timestamp showing when FewerJobs checked or enriched the evidence.
Related jobs
-
Systems Engineer - (Execution) - Level 3/4
Northrop Grumman - United States-Alabama-Huntsville
-
Business Analyst (Top Secret cleared)
ICF International INC - Washington, DC
-
Engineering Project Specialist II (Full Time) - United State
Cisco - San Jose, California, US
-
Automation AI Ops Engineer
Cisco - 2 Locations