Reliability Engineer

494 jobs found

web3.career is now part of the Bondex Logo Bondex Ecosystem

Receive emails of Reliability Engineer
Job Position Company Posted Location Salary Tags

asymmetric.re

Remote

$124k - $150k

Chainlink Labs

Argentina

$112k - $156k

Kraken

Remote

$88k - $101k

Layerzerolabs

Remote

$86k - $110k

Zinnia

Remote

$126k - $127k

Gsrmarkets

Remote

$80k - $100k

Douro Labs

North America

$112k - $156k

Polygon Labs

LATAM

$84k - $100k

Zetachain

Remote

$157k - $171k

Parity

Remote

$80k - $120k

D3

Remote

$112k - $156k

Binance

Dublin, Ireland

Autumn Compass

Sydney, Australia

$120k - $150k

Chainlink Labs

United States

$98k - $112k

Wormholefoundation

Remote

$112k - $156k

asymmetric.re
$124k - $150k estimated
Remote

Asymmetric Research:

Asymmetric Research ("AR") is a boutique security venture focused on deep partnerships with L1/L2 blockchains and DeFi protocols in an effort to keep them safe. We specialize in four core domains of web3 security: research, engineering, incident response, and infrastructure services.


Culture:

AR is a fully remote organization. Our team has deep roots in open-source development and decades of security-first experience at organizations such as Google, Netflix, Mozilla, Stripe, and Jump Crypto. We value autonomy, professionalism, and a commitment to excellence.

About the Role:

We are looking for a Site Reliability Engineer to join Asymmetric Research initially on a six-month contract engagement, with a strong opportunity to extend into a full-time position. In this role, you will design, operate, and scale mission-critical blockchain infrastructure supporting leading L1/L2 networks and DeFi protocols, working within a high-trust team to deliver secure, highly available, production-grade systems while driving automation, reliability, and operational excellence across our globally distributed environments.

Responsibilities

  • Manage and maintain a globally distributed blockchain infrastructure fleet

  • Design, architect, deploy, and operate production-grade infrastructure services

  • Implement and maintain infrastructure-as-code across development, staging, and production environments

  • Ensure high availability and performance of mission-critical systems

  • Contribute to automation, CI/CD pipelines, and operational tooling

  • Monitor system health and respond to incidents with strong troubleshooting fundamentals

  • Uphold the highest standards of integrity, professionalism, and operational discipline

Requirements

  • 2+ years of experience in a Site Reliability, DevOps, or Infrastructure Engineering role

  • Strong experience managing Linux systems and network infrastructure

  • Hands-on experience with load balancers and high-availability technologies (e.g., HAProxy, ALB/ELB)

  • Experience with configuration management tools (e.g., Ansible, Chef, Puppet, SaltStack)

  • Solid troubleshooting skills across hardware, networking, and software systems

  • Development experience in Go, Python, or Rust

  • Experience building and maintaining CI/CD pipelines and automated deployment workflows

  • Experience with open-source monitoring and observability tools (e.g., Grafana, Loki, Prometheus, Alertmanager)

Nice to Have

  • Experience operating distributed systems using tools such as Nomad or Kubernetes

  • Familiarity with blockchain infrastructure, including Bitcoin, Ethereum, Solana, Cosmos, or Move-based ecosystems

Benefits:

  • 25-days paid vacation

  • Office and equipment stipend

  • Pension / 401K programs

  • Life Insurance

  • Premium Healthcare

  • Competitive Base Salary

  • Lucrative Bonus Programs

What does Reliability Engineer do?

A Reliability Engineer is a professional who is responsible for ensuring the reliability and availability of systems and equipment in an organization

They use their knowledge of engineering principles, statistical analysis, and data science to identify and mitigate risks, prevent failures, and optimize system performance

Here are some of the typical tasks and responsibilities of a Reliability Engineer:

  1. Analyze data and perform statistical modeling: Reliability Engineers analyze data related to equipment performance, failure rates, and maintenance history to identify trends and patterns. They use statistical modeling to predict future failures and plan maintenance activities accordingly.
  2. Develop and implement reliability strategies: Reliability Engineers develop and implement strategies to improve the reliability and availability of equipment and systems. This may include performing root cause analysis, implementing preventive maintenance programs, and conducting failure mode and effects analysis (FMEA).
  3. Collaborate with other teams: Reliability Engineers collaborate with other teams such as operations, maintenance, and engineering to identify and address reliability issues. They may also work with suppliers to ensure the reliability of equipment and materials.
  4. Monitor and evaluate performance: Reliability Engineers monitor the performance of systems and equipment to identify areas for improvement. They use data to evaluate the effectiveness of reliability strategies and make adjustments as necessary.
  5. Provide technical support: Reliability Engineers provide technical support to other teams and stakeholders, answering questions and providing guidance on reliability-related issues.
  6. Continuously improve processes: Reliability Engineers are responsible for continuously improving reliability processes and methodologies. They stay up-to-date with the latest technologies and best practices in the field and identify opportunities for improvement.