Reliability Engineer

491 jobs found

web3.career is now part of the Bondex Logo Bondex Ecosystem

Receive emails of Reliability Engineer
Job Position Company Posted Location Salary Tags

OKX

Singapore, Singapore

$105k - $120k

Coinmarketcap

Remote

$129k - $149k

OKX

Singapore, Singapore

$105k - $120k

bloXroute Labs

Tel Aviv, Israel

$77k - $87k

Solana US

United States

$120k

TRM Labs

Remote

$54k - $80k

TRM Labs

Remote

TRM Labs

Remote

$54k - $90k

RealtyBits

Remote

$54k - $90k

Upshot

Rome, Italy

Ticketmaster

Los Angeles, CA, United States

$72k - $100k

Landry's

Los Angeles, CA, United States

$72k - $100k

Blockdaemon

San Francisco, CA, United States

coto by Eve World

India

$17k - $31k

Triton

Remote

$72k - $75k

Site Reliability Software Engineer Java Spring Boot

OKX
$105k - $120k estimated
03 Singapore, North West Community Development Council, Singapore
Join Talent Pool

This job is closed

About OKX:

Founded in 2017, OKX is one of the world’s leading cryptocurrency spot and derivatives exchanges. OKX innovatively adopted blockchain technology to reshape the financial ecosystem by offering some of the most diverse and sophisticated products, solutions, and trading tools on the market. Trusted by more than 20 million users in over 180 regions globally, OKX strives to provide an engaging platform that empowers everyone to explore the crypto world. In addition to its world-class Defi exchange, OKX serves its users with OKX Insights, a research arm at the cutting edge of the latest trends in the cryptocurrency industry. With its extensive range of crypto products and services and unwavering commitment to innovation, OKX’s vision is a world of financial access backed by the blockchain and the power of decentralised finance.

Responsibilities
  • Responsible for the reliable, stable and efficient operation of OK large-scale distributed system services;
  • Responsible for developing distributed system infrastructure, including framework, micro-service components, middleware, and storage;
  • Designed and developed service stability, automated operation and maintenance solutions, including system observability, system pressure measurement, complex engineering, agile, continuous delivery, capacity planning and resilience, traffic load balancing, site acceleration, security, performance tuning, etc.
  • Coordinating with business, testing, and ops teams in the line of work.
  • Responsible for research and exploration of new technologies in the field and reasonable implementation to maintain technical advancement;
Requirements
  • Bachelor's degree or above, more than three years of working experience;
  • Familiar with Java development and spring boot, concurrent programming, and solid computer foundation (network, algorithm);
  • Familiar with the principle and application of middleware and storage technology;
  • Have specific practical experience and understanding of distributed system design;

What does Reliability Engineer do?

A Reliability Engineer is a professional who is responsible for ensuring the reliability and availability of systems and equipment in an organization

They use their knowledge of engineering principles, statistical analysis, and data science to identify and mitigate risks, prevent failures, and optimize system performance

Here are some of the typical tasks and responsibilities of a Reliability Engineer:

  1. Analyze data and perform statistical modeling: Reliability Engineers analyze data related to equipment performance, failure rates, and maintenance history to identify trends and patterns. They use statistical modeling to predict future failures and plan maintenance activities accordingly.
  2. Develop and implement reliability strategies: Reliability Engineers develop and implement strategies to improve the reliability and availability of equipment and systems. This may include performing root cause analysis, implementing preventive maintenance programs, and conducting failure mode and effects analysis (FMEA).
  3. Collaborate with other teams: Reliability Engineers collaborate with other teams such as operations, maintenance, and engineering to identify and address reliability issues. They may also work with suppliers to ensure the reliability of equipment and materials.
  4. Monitor and evaluate performance: Reliability Engineers monitor the performance of systems and equipment to identify areas for improvement. They use data to evaluate the effectiveness of reliability strategies and make adjustments as necessary.
  5. Provide technical support: Reliability Engineers provide technical support to other teams and stakeholders, answering questions and providing guidance on reliability-related issues.
  6. Continuously improve processes: Reliability Engineers are responsible for continuously improving reliability processes and methodologies. They stay up-to-date with the latest technologies and best practices in the field and identify opportunities for improvement.