ai analyst backend bitcoin blockchain community manager crypto cryptography cto customer support dao data science defi design developer relations devops discord economy designer entry level erc erc 20 evm front end full stack gaming ganache golang hardhat intern java javascript layer 2 marketing mobile moderator nft node non tech open source openzeppelin pay in crypto product manager project manager react refi research ruby rust sales smart contract solana solidity truffle web3js zero knowledge
| Job Position | Company | Posted | Location | Salary | Tags |
|---|---|---|---|---|---|
Binance | Bangkok, Thailand |
| |||
Binance | Bangkok, Thailand |
| |||
Binance | Bangkok, Thailand |
| |||
Coins.ph | Manila, Philippines | $140k - $171k | |||
| Learn job-ready web3 skills on your schedule with 1-on-1 support & get a job, or your money back. | | by Metana Bootcamp Info | |||
Binance | Taipei, Taiwan |
| |||
1inch | Dubai, United Arab Emirates | $122k - $159k | |||
1inch | Dubai, United Arab Emirates | $124k - $159k | |||
Envision Employment Solutions | Abu Dhabi, United Arab Emirates | $90k - $115k | |||
Flipster | APAC | $88k - $105k | |||
Crypto.com | Hong Kong, Hong Kong | $185k | |||
CleanSpark | Georgia | $81k - $84k | |||
CertiK | Dubai, United Arab Emirates | $45k - $80k | |||
CertiK | South Korea | $72k - $100k | |||
Binance | Taipei, Taiwan |
| |||
Crypto.com | Singapore, Singapore | $54k - $90k |
Binance
Thailand, Bangkok
Data Scientist, Reinforcement Learning
Taiwan, Taipei / Thailand, Bangkok / Australia, Brisbane / Australia, Melbourne / Australia, Sydney / Indonesia, Jakarta
Engineering â Data Science/AI /
Full-time: Remote /
Remote
Apply for this job
Binance is a leading global blockchain ecosystem behind the worldâs largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance offerings range from trading and finance to education, research, payments, institutional services, Web3 features, and more. We leverage the power of digital assets and blockchain to build an inclusive financial ecosystem to advance the freedom of money and improve financial access for people around the world.
About the Role
You will develop and optimize RL models for enterprise-scale applications such as customer service, token reporting, compliance, and Web3 domain reasoning. You will explore and evaluate advanced algorithms including PPO, GRPO, DPO, RLHF, RLAIF, and Agentic RL to enhance the capabilities of LLMs, VLMs, and Agentic AI at Binance. The role requires a strong theoretical foundation in RLâcovering policy optimization, reward modeling, and planningâpaired with the engineering skills to build scalable production systems. You will take full ownership from research through deployment, driving experimentation with systematic evaluation and benchmarking. Collaboration across research, infrastructure, and application teams will be key to delivering impactful AI solutions.
Responsibilities:
- Research and develop state-of-the-art RL algorithms, focusing on large model optimization and alignment techniques.
- Design and implement RL training pipelines, including environment simulation, data generation, and reward function design.
- Apply RL methods to enhance LLM/VLM/Agentic AI capabilities in reasoning, planning, and autonomous decision-making.
- Collaborate with engineers and researchers to integrate RL solutions into enterprise AI platforms.
- Monitor model performance in production and continuously improve through iterative training and fine-tuning.
Requirements:
- Masterâs degree in Computer Science, Applied Mathematics, Machine Learning, or related fields.
- 3+ years of hands-on experience in RL or LLM/VLM/Agentic AI optimization.
- Strong coding skills in Python, with experience in ML frameworks and RL libraries.
- Experience with large-scale distributed training and optimization.
- Self-driven, ownership mindset, and strong problem-solving skills. Excellent communication skills for cross-functional collaboration.
Why Binance
⢠Shape the future with the worldâs leading blockchain ecosystem
⢠Collaborate with world-class talent in a user-centric global organization with a flat structure
⢠Tackle unique, fast-paced projects with autonomy in an innovative environment
⢠Thrive in a results-driven workplace with opportunities for career growth and continuous learning
⢠Competitive salary and company benefits
⢠Work-from-home arrangement (the arrangement may vary depending on the work nature of the business team)
Binance is committed to being an equal opportunity employer. We believe that having a diverse workforce is fundamental to our success.
By submitting a job application, you confirm that you have read and agree to our Candidate Privacy Notice.
Apply for this job