Remote Quality Assurance Jobs in Web3
469 jobs found
Company | Location | Salary |
---|---|---|
Sentient | Remote | $90k - $96k |
Bitpanda | Remote | $105k - $115k |
Bitpanda | Remote | $106k - $114k |
Woo | Remote | $81k - $96k |
Bitgo | Remote | $70k - $105k |
Coin Market Cap Ltd | Remote | $40k - $67k |
Coin Market Cap Ltd | Hong Kong, Hong Kong | $77k - $106k |
Bitgo | Remote | $115k - $138k |
Falconx | Remote | $153k - $207k |
Token Metrics | Athens, Greece | $90k - $90k |
Token Metrics | Athens, Greece | $90k - $90k |
Token Metrics | Sao Paulo, Brazil | $45k - $80k |
SwissBorg | Remote | $90k - $100k |
Zinnia | Remote | $84k - $106k |
Bitpanda | Remote | $106k - $114k |
As part of our fast-moving team, you'll help shape the quality and reliability of how our AI interacts with consumers.
Role Overview
We're looking for a highly independent AI QA Engineer who thrives at the intersection of traditional QA and cutting-edge AI evaluation. This role spans frontend automation, backend API testing, and the challenge of evaluating AI output for quality, correctness, and speed.
Responsibilities
Design and implement automated UI regression tests
Write and maintain robust API test suites to validate backend functionality with each release
Create and run AI evaluation tests to assess:
Quality and correctness of chatbot responses compared to ground truth and other products
Time to first token (TTFT), end-to-end latency, and other timing metrics
Build internal tools and datasets for testing and benchmarking
Collaborate closely with product, engineering, and AI teams to ensure end-to-end quality of the AI product
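To make the AI evaluation responsibilities above concrete, here is a minimal Python sketch of how TTFT, end-to-end latency, and a simple ground-truth check might be measured over a streamed chatbot response. It is an illustration only: `measure_stream`, `fake_stream`, and `exact_match` are hypothetical helpers invented for this example, and the simulated stream stands in for a real chatbot client, which the posting does not describe.

```python
import time
from typing import Callable, Dict, Iterable, Iterator, List, Optional


def measure_stream(
    chunks: Iterable[str],
    clock: Callable[[], float] = time.perf_counter,
) -> Dict[str, object]:
    """Consume a stream of text chunks and record basic timing metrics."""
    start = clock()
    ttft: Optional[float] = None
    parts: List[str] = []
    for chunk in chunks:
        # TTFT is the delay until the first non-empty chunk arrives.
        if ttft is None and chunk:
            ttft = clock() - start
        parts.append(chunk)
    # End-to-end latency covers the whole stream, first chunk to last.
    latency = clock() - start
    return {"text": "".join(parts), "ttft_s": ttft, "latency_s": latency}


def fake_stream(tokens: List[str], delay_s: float = 0.01) -> Iterator[str]:
    """Hypothetical stand-in for a streaming chatbot response."""
    for token in tokens:
        time.sleep(delay_s)  # simulate generation or network delay
        yield token


def exact_match(response: str, ground_truth: str) -> bool:
    """Simplest possible correctness check: case-insensitive exact match."""
    return response.strip().lower() == ground_truth.strip().lower()


result = measure_stream(fake_stream(["Bitcoin ", "is ", "a ", "cryptocurrency."]))
is_correct = exact_match(str(result["text"]), "bitcoin is a cryptocurrency.")
```

Exact match is only a baseline. Free-form chatbot answers usually need a more tolerant comparison, such as a rubric or a similarity score, and the choice depends on the product being evaluated.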
Requirements
Strong experience with frontend testing tools (e.g., Playwright, Cypress)
Proficient in backend/API testing (e.g., Postman, pytest, REST/GraphQL)
Familiarity with evaluating LLM/AI-generated content
Comfortable designing and analyzing test datasets for AI performance and functionality
Strong scripting skills (e.g., Python)
Highly autonomous with excellent problem-solving skills