Reliability Engineer

420 jobs found

Receive emails of Reliability Engineer
Job Position Company Posted Location Salary Tags

Ankr Network

San Francisco, CA, United States

$63k - $75k

Ankr Network

Amsterdam, Netherlands

$63k - $75k

Ankr Network

Australia

$63k - $75k

BlockFi

New York, NY, United States

Ankr Network

San Francisco, CA, United States

$63k - $75k

Ankr Network

Amsterdam, Netherlands

$63k - $75k

Ankr Network

Sydney, Australia

$63k - $75k

Ankr Network

San Francisco, CA, United States

$63k - $75k

NFT.Kred

Remote

$130k - $160k

Messari

New York, NY, United States

$135k - $155k

Binance

Sydney, Australia

Binance

Singapore, Singapore

DFINITY

Zurich, Switzerland

Ankr Network

San Francisco, CA, United States

$63k - $75k

koodos

New York, NY, United States

$94k - $102k

Junior Site Reliability Engineer

Ankr Network
$63k - $75k estimated

This job is closed

About Ankr

Paving the way to the open internet of the future, Ankr offers node solutions for over 50 different chains and a “1 click” API service for Ethereum, Binance Smart Chain, Polygon, Avalanche and more. Our primary mission is to help usher in developers into the web3 ecosystem. To do this, we pioneer new solutions to solve some of the most pressing problems across decentralized systems and the DeFi movement, to lower the entry barrier for everyday people, enterprises, and developers to contribute to blockchain ecosystems.

Check us out:https://www.ankr.com/

Ankr was founded in 2017 in Berkeley, California. The founding team and headquarters are based in San Francisco. Ankr has a distributed team of over 150+ people operating remotely and from offices in San Francisco, Shanghai, Moscow, and Amsterdam.

The next phase of the internet is based on distributed networks which make the new generation of platforms, applications and services more private, secure, reliable and censorship resistant.

By cutting out intermediaries and gatekeepers, builders and users gain back control over their applications and data.

Our mission is to make web3 easy to use for everyone!

What we’re looking for:

  • Someone who grabs on to problems, mitigates their impact and then resolves the underlying issue.
  • Someone who likes to operate, secure and orchestrate Linux servers to provide a service.
  • Someone who can thrive in a 100% remote environment.

What you will do:

  • Investigate anomalies in a load balanced, global Linux environment.
  • Leverage and help maintain server automation for globally distributed bare metal servers (Ansible, Helm, Etcetera).
  • Help other teams as a jack of all trades sysadmin.
  • Participate in a follow the sun support model.

Requirements:

  • 2+ years in a Server Infrastructure role.
  • Functional knowledge of orchestration tools (e.g Ansible, SaltStack)
  • Experience with Linux systems administration.
  • Experience with scripting tasks (e.g Bash, Python, Ansible, Go)
  • Proficient in IPv4 networking
  • Awareness of IPv6 networking
  • Awareness of firewall policy (client side or external)
  • Great listener and communicator
  • Bilingual or high fluency in English
  • Troubleshooting and Teamwork skills

ANKR is an equal opportunity employer. All applicants will receive consideration for employment without regard to race, religion, color, national origin, sex, sexual orientation, gender identity, age, status as a protected veteran, or status as a qualified individual with a disability.

What does Reliability Engineer do?

A Reliability Engineer is a professional who is responsible for ensuring the reliability and availability of systems and equipment in an organization

They use their knowledge of engineering principles, statistical analysis, and data science to identify and mitigate risks, prevent failures, and optimize system performance

Here are some of the typical tasks and responsibilities of a Reliability Engineer:

  1. Analyze data and perform statistical modeling: Reliability Engineers analyze data related to equipment performance, failure rates, and maintenance history to identify trends and patterns. They use statistical modeling to predict future failures and plan maintenance activities accordingly.
  2. Develop and implement reliability strategies: Reliability Engineers develop and implement strategies to improve the reliability and availability of equipment and systems. This may include performing root cause analysis, implementing preventive maintenance programs, and conducting failure mode and effects analysis (FMEA).
  3. Collaborate with other teams: Reliability Engineers collaborate with other teams such as operations, maintenance, and engineering to identify and address reliability issues. They may also work with suppliers to ensure the reliability of equipment and materials.
  4. Monitor and evaluate performance: Reliability Engineers monitor the performance of systems and equipment to identify areas for improvement. They use data to evaluate the effectiveness of reliability strategies and make adjustments as necessary.
  5. Provide technical support: Reliability Engineers provide technical support to other teams and stakeholders, answering questions and providing guidance on reliability-related issues.
  6. Continuously improve processes: Reliability Engineers are responsible for continuously improving reliability processes and methodologies. They stay up-to-date with the latest technologies and best practices in the field and identify opportunities for improvement.