Research Web3 Jobs in North America

480 jobs found

web3.career is now part of the Bondex Ecosystem

Company         Location                          Salary
OpenAI          San Francisco, CA, United States  $90k - $90k
Ripple          San Francisco, CA, United States  $94k - $96k
Kraken          United States                     $91k - $150k
Uniswap Labs    New York, NY, United States
The Block       New York, NY, United States
Aptos           San Francisco, CA, United States  $45k - $75k
Qarik Group     United States                     $84k - $120k
The Tie         New York, NY, United States       $50k - $80k
marginfi        New York, NY, United States       $23k - $750k
The Tie         New York, NY, United States       $90k - $110k
OKX             San Jose, CA, United States       $136k - $166k
Ancilia, Inc.   Mountain View, CA, United States  $89k - $110k
Logos           New York, NY, United States       $90k - $190k
Polymer Labs    New York, NY, United States       $81k - $95k
Coinbase        Miami, FL, United States          $144k - $170k

Research Scientist, Post-Training Core Algorithms

OpenAI
$90k - $90k estimated

This job is closed

About the Team

The Post-Training - Core Algorithms team is responsible for researching and developing the next generation of algorithms to power our reinforcement learning from human feedback (RLHF) stack. The algorithms we develop are used in the ChatGPT consumer product and the OpenAI API.

About the Role

As a Member of Technical Staff on our team, you will research and develop improvements to all components of our RLHF stack, including data collection, supervised fine-tuning, reward modeling, off- and on-policy learning, active learning, and evaluations. The ultimate test for our algorithms is how useful they are to our users, and we often deploy our algorithms into new ChatGPT models.

We’re looking for people who have an extensive background in reinforcement learning research, are able to iterate quickly, and are proficient at coding.

This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.

In this role, you will:

  • Come up with improvements to RLHF
  • Prototype and evaluate these ideas
  • Scale up your innovations to ChatGPT scale

You might thrive in this role if you:

  • Love being on the cutting edge of RL and language model research
  • Can iterate fast on lots of ideas
  • Like doing research that has real-world impact