Reliability Engineer

458 jobs found

web3.career is now part of the Bondex Logo Bondex Ecosystem

Receive emails of Reliability Engineer
Job Position Company Posted Location Salary Tags

Mojito

New York, NY, United States

$43k - $56k

MobileCoin

remote

Eco

remote

$61k - $80k

BlockFi

New York, NY, United States

$94k - $102k

BlockFi

New York, NY, United States

$63k - $90k

Newton

Toronto, Canada

$72k - $100k

GSR

Singapore, Singapore

$72k - $100k

GSR

New York, NY, United States

$72k - $100k

GSR

London, United Kingdom

$72k - $100k

GSR

Singapore, Singapore

$72k - $100k

GSR

London, United Kingdom

$72k - $100k

Coinbase

Remote

$39k - $60k

Coinbase

Remote

Binance

Asia

HQ Digital

New York, NY, United States

$36k - $75k

Software Engineer Site Reliability DevOps

Mojito
$43k - $56k estimated

This job is closed

About Mojito

Mojito is an end-to-end solution for fully branded NFT marketplaces and tokenized customer engagement. The space is growing at an incredible pace and we are in prime position to be a market leader. Mojito is the first project to graduate from the Serotonin Product Studio and is backed by industry veterans and world-class VCs and partners. We combine cutting edge technology with design thinking and user empathy to build products that make blockchain accessible to a mainstream audience.

We are passionate about building products for a diverse, global audience and think our team should be a reflection of that. Candidates from underrepresented minorities and groups are encouraged to apply.

About the role

To capitalize on this momentum, we are growing the team and are searching for a Software Engineer who can help build our our software delivery pipelines and improve the resiliency and scalability of our production systems. You will work closely with fellow engineers and cross-functional colleagues on both of these problems. Your work will bring the next generation of blockchain products to a global audience.

What you'll be doing

  • Design, implement, and operate our GCP infrastructure to enable reliable deployment of our services with resiliency and redundancy.
  • Support the design of scalable, reliable, cost efficient, and performant software system architecture.
  • Improve our team velocity and time to market via CI/CD, automation, and process improvements.
  • Establish end-to-end monitoring, alerting, and dashboards on all critical systems of our applications.
  • Setup and participate in on-call rotations with the engineering team.
  • Setup best practices around incident management.
  • Advise engineers on best practices for instrumentation of applications.

Requirements

  • 3+ years experience as a DevOps and/or SRE engineer supporting production systems and pipelines.
  • Experience managing and scaling production infrastructure in GCP or other cloud infrastructure services.
  • Experience with Terraform or other infrastructure-as-code platform.
  • Experience managing and scaling Postgres or other SQL databases in production.
  • Experience with CI/CD platforms like Github Actions or CircleCI.
  • Experience managing NATs or Kafka.
  • Ability to work cross functionally with product, project, engineering and QA.
  • Ability to navigate and work with bleeding edge technologies.
  • Believes in attention to detail and quality.

Bonuses

  • Knowledge of blockchain technologies like Ethereum and smart contracts.
  • Basic knowledge of Golang or gqlgen.

What does Reliability Engineer do?

A Reliability Engineer is a professional who is responsible for ensuring the reliability and availability of systems and equipment in an organization

They use their knowledge of engineering principles, statistical analysis, and data science to identify and mitigate risks, prevent failures, and optimize system performance

Here are some of the typical tasks and responsibilities of a Reliability Engineer:

  1. Analyze data and perform statistical modeling: Reliability Engineers analyze data related to equipment performance, failure rates, and maintenance history to identify trends and patterns. They use statistical modeling to predict future failures and plan maintenance activities accordingly.
  2. Develop and implement reliability strategies: Reliability Engineers develop and implement strategies to improve the reliability and availability of equipment and systems. This may include performing root cause analysis, implementing preventive maintenance programs, and conducting failure mode and effects analysis (FMEA).
  3. Collaborate with other teams: Reliability Engineers collaborate with other teams such as operations, maintenance, and engineering to identify and address reliability issues. They may also work with suppliers to ensure the reliability of equipment and materials.
  4. Monitor and evaluate performance: Reliability Engineers monitor the performance of systems and equipment to identify areas for improvement. They use data to evaluate the effectiveness of reliability strategies and make adjustments as necessary.
  5. Provide technical support: Reliability Engineers provide technical support to other teams and stakeholders, answering questions and providing guidance on reliability-related issues.
  6. Continuously improve processes: Reliability Engineers are responsible for continuously improving reliability processes and methodologies. They stay up-to-date with the latest technologies and best practices in the field and identify opportunities for improvement.