Reliability Engineer

489 jobs found

web3.career is now part of the Bondex Logo Bondex Ecosystem

Receive emails of Reliability Engineer
Job Position Company Posted Location Salary Tags

Gsrmarkets

Remote

$80k - $100k

Douro Labs

North America

$112k - $156k

Polygon Labs

LATAM

$84k - $100k

Zetachain

Remote

$157k - $171k

Parity

Remote

$80k - $120k

D3

Remote

$112k - $156k

Binance

Dublin, Ireland

Autumn Compass

Sydney, Australia

$120k - $150k

Chainlink Labs

United States

$98k - $112k

Wormholefoundation

Remote

$112k - $156k

Kiln

Paris, France

$112k - $156k

Kiln

Paris, France

$84k - $112k

Zscaler

Remote

$140k - $200k

Zscaler

Remote

$130k - $131k

Zenith

Remote

Gsrmarkets
$80k - $100k estimated
Remote

Global Site Reliability Engineer Location: London About Us Founded in 2013, GSR is a leading market maker and programmatic trading firm in the fast-evolving world of cryptocurrency trading. With over 200 employees across seven countries, we provide billions of dollars in liquidity daily to cryptocurrency protocols and exchanges. We build long-term relationships with crypto communities and institutional investors by offering exceptional service, expertise, and tailored trading solutions. GSR works with token issuers, exchanges, investors, miners, and more than 30 cryptocurrency exchanges around the world. In volatile markets we are a trusted partner to crypto native builders and to those exploring the industry for the first time. Our team of veteran finance and technology executives from Goldman Sachs, Two Sigma, and Citadel, among others, has developed one of the world’s most robust trading platforms designed to navigate issues unique to the digital asset markets.  We have continuously improved our technology throughout our history, allowing for our clients to scale and execute their strategies with the highest level of efficiency. Working at GSR is an opportunity to be deeply embedded in every major sector of the cryptocurrency ecosystem. About the Role We are seeking a Site Reliability Engineer (SRE) to design, optimize, and support highly available systems across our global trading infrastructure. As part of GSR’s SRE team, you will manage a multi-regional cloud environment while integrating and automating our physical server inventory using Infrastructure as Code (IaC). You will work across all layers of infrastructure, including:

Networking & Exchange Connectivity Linux Systems & Kubernetes Administration Microservice Orchestration & Observability Disaster Recovery & Security Optimization

Your mission is to improve latency, scalability, and reliability, ensuring GSR remains a best-in-class market maker. We value engineers who drive automation, reduce friction, and enhance developer velocity through better tooling, CI/CD, and infrastructure design.

Who We’re Looking For Core Skills

Containers & Orchestration: Strong expertise in container security and Kubernetes (multi-cluster/global deployment is a plus). Distributed Systems & Messaging: Knowledge of clusters, storage, Kafka, Aeron, and experience with multicast or HPC. Automation & IaC: Proficiency in Python, Golang, or Rust with experience in IaC tools and immutable infrastructure. Continuous Delivery & Config Management: Familiarity with FluxCD, ArgoCD, and custom CD deployments. Strong grasp of CI/CD pipelines. Linux & Networking: Solid understanding of Linux internals, cgroups, routing, switching, firewalls, and DNS/service discovery. Databases: Experience with MySQL, MongoDB, and database administration (Flyway or Liquibase a plus).

Bonus Experience

Data center operations Crypto, fintech,bare-metal provisioning or trading experience

What we offer: 

A collaborative and transparent company culture founded on Integrity, Innovation and Performance.  Competitive Salary with two discretionary bonus’ payments a year. Benefits such as Healthcare, Dental, Vision, Retirement Planning, 30 days holiday and free lunches when in the office.  Hybrid working pattern in all of our offices from London, New York, Singapore, Zug and Malaga. Regular Town Halls and offsites, team lunches and drinks.  A Corporate and Social Responsibility program as well as charity fundraising matching and volunteer days.    Immigration and relocation support where required.

GSR is proudly an Equal Employment Opportunity employer. We do not discriminate based upon any applicable legally protected characteristics such as race, religion, colour, country of origin, sexual orientation, gender, gender identity, gender expression or age. We operate a meritocracy, all aspects of people engagement from the decision to hire or promote as well as our performance management process will be based on the business needs and individual merit, competence in the role. Learn more about us at www.gsr.io.

What does Reliability Engineer do?

A Reliability Engineer is a professional who is responsible for ensuring the reliability and availability of systems and equipment in an organization

They use their knowledge of engineering principles, statistical analysis, and data science to identify and mitigate risks, prevent failures, and optimize system performance

Here are some of the typical tasks and responsibilities of a Reliability Engineer:

  1. Analyze data and perform statistical modeling: Reliability Engineers analyze data related to equipment performance, failure rates, and maintenance history to identify trends and patterns. They use statistical modeling to predict future failures and plan maintenance activities accordingly.
  2. Develop and implement reliability strategies: Reliability Engineers develop and implement strategies to improve the reliability and availability of equipment and systems. This may include performing root cause analysis, implementing preventive maintenance programs, and conducting failure mode and effects analysis (FMEA).
  3. Collaborate with other teams: Reliability Engineers collaborate with other teams such as operations, maintenance, and engineering to identify and address reliability issues. They may also work with suppliers to ensure the reliability of equipment and materials.
  4. Monitor and evaluate performance: Reliability Engineers monitor the performance of systems and equipment to identify areas for improvement. They use data to evaluate the effectiveness of reliability strategies and make adjustments as necessary.
  5. Provide technical support: Reliability Engineers provide technical support to other teams and stakeholders, answering questions and providing guidance on reliability-related issues.
  6. Continuously improve processes: Reliability Engineers are responsible for continuously improving reliability processes and methodologies. They stay up-to-date with the latest technologies and best practices in the field and identify opportunities for improvement.