Reliability Engineer

420 jobs found

Receive emails of Reliability Engineer
Job Position Company Posted Location Salary Tags

Consensys

Remote

$118k - $209k

Blockdaemon

Singapore, Singapore

$87k - $120k

Triton

Remote

$72k - $75k

Crypto.com

Shenzhen, China

$185k

Galaxy

Remote

$90k - $100k

Nethermind

Istanbul, Turkey

$90k - $100k

Ledger

Paris, France

$185k

Crypto.com

Taipei, Taiwan

$185k

Coinbase

Remote

$185k

Fuel Labs

Web3

$103k - $156k

Chainlink Labs

Remote

Ramp

Poland

$90k - $100k

Chainlink Labs

Remote

Chainlink Labs

Remote

Wallet

Remote

$90k - $100k

Consensys
$118k - $209k
CANADA - Remote, UNITED STATES - Remote
Apply

Our mission is to unlock the collaborative power of communities by making Web3 universally easy to use, access, and build on.

Working with Consensys puts you at the forefront of an evolving paradigm, transforming our society for the better. We fundamentally believe blockchain is the next generation of technology that can lay the foundation for a more just and equitable society. 

Blockchain tech is just over 10 years old. Ethereum itself is still a toddler and we’re far from reaching our full potential. You’ll get to work on the tools, infrastructure, and apps that scale these platforms to billions of users. 

You’ll be constantly exposed to new concepts, ideas, and frameworks from your peers, and as you work on different projects — challenging you to stay at the top of your game. You’ll join a network of entrepreneurs and technologists that reaches the edge of our ecosystem. Consensys alumni have moved on to become tech entrepreneurs, CEOs, and team leads at tech companies. 

 

About Infura
Infura is the leading infrastructure platform providing scalable and easy-to-use APIs and cloud solutions for blockchain and decentralized application developers. Infura currently offers APIs for Ethereum, IPFS, and other leading decentralized networks, allowing developers to immediately begin building instead of spending time on architecting and maintaining their own infrastructure.

Infura provides developers instant access to decentralized and blockchain networks without the need to sync or understand complicated peer to peer infrastructure. Similar to how one might access a cloud service to alleviate the friction of owning proprietary compute and storage, Infura lowers the barrier to entry for engineers to begin building.

We believe in a more equitable and decentralized future, and we know developers will be at the forefront of that progress. We want to empower developers to spend their time building, creating, and evolving by providing them with the best and easiest solution to their infrastructure needs.

Some things to know about the Infura team, we are a remote first company, with team members all over the world. We strive to reduce meetings, respect work life balance, promote diversity, and generally be good people. We depend on each other not only to make an amazing product but to be a group that wants to succeed together. 

 

What you’ll do

Be part of a world-class infrastructure and tools team responsible for core Infura engineering projects. Infura builds and supports a globally available, high-performance service. Modern tools and practices help our developers work safely and efficiently.

As a reliability engineer, you would be instrumental in building tooling for test automation, infrastructure as code, single panes of glass observability, Kubernetes, and execution tracing. Our reliability engineers are key in helping our SRE teams, engineering teams, and testing teams run efficient and secure infrastructure. Security, platform engineering, and right-sized capacity planning are key aspects of the reliability engineering work stream.

We work with infrastructure as code tools (e.g. Terraform), containers and container orchestration (e.g. Kubernetes, EKS, ECS); event source / message based architectures (e.g. Kinesis, Kafka); frontend applications (e.g. React, nodejs, etc); both RDBMS and NoSQL Datastores (e.g. Postgres, Mongo, Redis, DynamoDb); Linux based operating systems; and of course blockchain and DLT. Understanding both traditional virtual server infrastructure, containerized workloads, and modern serverless workloads is a key part of the expertise we seek.

Would be great if you brought this to the role

  • The ideal candidate would have 7+ years of experience in a DevOps, system operations, or SRE role. 
  • Extensive experience troubleshooting, debugging, and working with microservices, Cloud Native, or Kubernetes/EKS.
  • Experience with Kubernetes/EKS and supporting components utilized to run k8s (e.g. karpenter, kyverno, promtail, prometheus, etc)
  • Experience with at least one or more of the following Infrastructure as Code frameworks: Terraform or Amazon CDK.
  • Infura maintains cloud-based systems, so deep experience with one or more cloud providers (bonus for AWS) is preferable.

Don't meet all the requirements? Don't sweat it. We’re passionate about building a diverse team of humans and as such, if you think you've got what it takes for our chaotic-but-fun, remote-friendly, start-up environment—apply anyway, detailing your relevant transferable skills in your cover letter. While we have a pretty good idea of what we need, we're ready for you to challenge our thinking on who needs to be in this role.

It is a requirement of employment in this position that applicants will be required to submit to background checks including but not limited to employment, education and criminal record checks. Further details will be provided to applicants that successfully meet the criteria for the position as determined by the company in its sole discretion. By submitting an application for employment, you are acknowledging and consenting to this requirement.

Consensys is an equal opportunity employer. We encourage people from all backgrounds to apply. We are committed to ensuring that our technology is made available and accessible to everyone. All employment decisions are made without regard to race, color, national origin, ancestry, sex, gender, gender identity or expression, sexual orientation, age, genetic information, religion, disability, medical condition, pregnancy, marital status, family status, veteran status, or any other characteristic protected by law. Consensys is aware of fraudulent recruitment practices and we encourage all applicants to review our best practices to protect yourself which can be found (https://consensys.io/careers/best-practices-to-avoid-recruitment-fraud/).

The salary range for US-based candidates only will be determined throughout the interview process depending on experience and skills.

US pay range (not including bonus, equity or other benefits)
$118,000—$209,000 USD

What does Reliability Engineer do?

A Reliability Engineer is a professional who is responsible for ensuring the reliability and availability of systems and equipment in an organization

They use their knowledge of engineering principles, statistical analysis, and data science to identify and mitigate risks, prevent failures, and optimize system performance

Here are some of the typical tasks and responsibilities of a Reliability Engineer:

  1. Analyze data and perform statistical modeling: Reliability Engineers analyze data related to equipment performance, failure rates, and maintenance history to identify trends and patterns. They use statistical modeling to predict future failures and plan maintenance activities accordingly.
  2. Develop and implement reliability strategies: Reliability Engineers develop and implement strategies to improve the reliability and availability of equipment and systems. This may include performing root cause analysis, implementing preventive maintenance programs, and conducting failure mode and effects analysis (FMEA).
  3. Collaborate with other teams: Reliability Engineers collaborate with other teams such as operations, maintenance, and engineering to identify and address reliability issues. They may also work with suppliers to ensure the reliability of equipment and materials.
  4. Monitor and evaluate performance: Reliability Engineers monitor the performance of systems and equipment to identify areas for improvement. They use data to evaluate the effectiveness of reliability strategies and make adjustments as necessary.
  5. Provide technical support: Reliability Engineers provide technical support to other teams and stakeholders, answering questions and providing guidance on reliability-related issues.
  6. Continuously improve processes: Reliability Engineers are responsible for continuously improving reliability processes and methodologies. They stay up-to-date with the latest technologies and best practices in the field and identify opportunities for improvement.