Reliability Engineer

420 jobs found

Receive emails of Reliability Engineer
Job Position Company Posted Location Salary Tags

Bitso

Latin America

$112k - $156k

Bitso

European Economic Area

$112k - $156k

Kraken

United States

$92k - $101k

Asymmetric Research

Remote

$105k - $180k

Limit Break

Tokyo, Japan

$90k - $145k

Asymmetric Research

Remote

$105k - $180k

Gemini

Remote

$136k - $170k

Kraken

United States

$63k - $87k

Token Metrics

Manila, Philippines

$73k - $95k

Syndr

Delhi, India

$98k - $114k

Kraken

United States

$92k - $101k

Kraken

European Union

$36k - $54k

Circle - Referrals

Remote

$157k - $175k

Token Metrics

Manila, Philippines

$73k - $95k

Token Metrics

Lisbon, Portugal

$73k - $95k

Bitso
$112k - $156k estimated
Latin America
Apply

Working At Bitso

We are a diverse team that takes pride in understanding the perspectives of others. We fully embrace working remotely and we are eager to act, improve and accelerate progress inside and outside of our organization.

To drive revolutionary changes in society and make crypto useful, we delight our customers with world-class products, deep care, and intentional empathy.

<span >

As a Site Reliability Engineer you will be in charge of collaborating with other squads and fleets to educate and mentor them, ensuring reliability is built from the ground up on their services. You will be building self-service functionalities to enable our backend, frontend and mobile developers so they can ship products fasterand will be helping improve the monitoring and scalability of our platform and services.

Who You Are

  • 2+ years of experience with AWS (and others)
  • Strong skills around observability, debugging and performance tuning
  • Experience with fine-tunning containerised java applications is an advantage
  • In-depth experience with multiple CI tools (e.g. Github Actions)
  • In-depth experience with CD tools (e.g ArgoCD)
  • 2+ years of experience managing Production Kubernetes clusters and deploying applications
  • 2+ years of experience writing scripts in different languages (e.g Java, TS, bash, Go)
  • Experience managing Infrastructure as Code (Terraform, Crossplane)
  • Experience with Software development (ideally in Java) - to better understand the needs and problems software engineers are facing with their services

What You Will Do

  • Improve observability, reliability and availability by defining and measuring key metrics
  • Closely collaborate with our Core Services fleet to performance tune and optimize our architecture
  • Proactively find and analyze reliability problems across our business units and stack, then design and implement solutions to create improvements
  • Educate, mentor, and hold accountable the engineering team to improve the reliability of our systems

Research in Diversity, Equity, and Inclusion suggests that individuals may hesitate to apply for jobs if they do not meet all the listed criteria. At Bitso, we value diversity and your unique strengths could be just what we're looking for. If this role excites you but you don't match every point in the description, we still want to hear from you.

#LI-Remote

<div class="content-conclusion">

Who We Are

With over 8 million users, Bitso is the leading cryptocurrency platform in Latin America. We are developing the cryptocurrency ecosystem in the region and enabling financial inclusion. We believe crypto is the future of finance, and we’re committed to making it useful by providing equal access to safe and intuitive financial products.

When we hire people for our team, we specifically test for the following traits in addition to our cultural values:

  • Mission-Driven: We seek individuals who are passionate about crypto and Bitso’s mission and resilient in facing industry challenges

  • High Sense of Urgency: We prioritize candidates who demonstrate a high sense of urgency and responsibility.

  • Exceptional Hard Skills: We seek individuals who possess exceptional skills in their respective fields, with no room for mediocrity.

  • Self-Management: We look for individuals who can independently manage their work, career, and professional development.

Compensation & Benefits

At Bitso, you are taking the front seat on the edge of crypto innovation, creating the next generation of crypto-powered products.

So for those willing to commit, adapt and pioneer the most important change of the century we offer:

  • Me Time program, including unlimited paid time off.
  • Remote-first work environment.
  • Employee Stock Option program.
  • Zero trading fees through our Bitso Alpha app.
  • Extended Family Leave Policy: all birthing parents, non-birthing parents and adopting parents are eligible for a 4-months leave.
  • Premium health, dental and life insurances in Mexico, Gibraltar, Colombia, USA, Brazil and Argentina.
  • Volunteering days.
  • Monthly stipend for gym memberships, relaxation activities, sports equipment, cooking classes, books, entertainment and more.

Want to leave an undoubtedly legacy with us? Fasten your seatbelt and join this spaceship, where you will find exponential growth and the opportunity to thrive!

  • These are the applicable requisites, although equivalent competencies in any of the above will also be considered.
  • To see our Privacy Policy please click here.

What does Reliability Engineer do?

A Reliability Engineer is a professional who is responsible for ensuring the reliability and availability of systems and equipment in an organization

They use their knowledge of engineering principles, statistical analysis, and data science to identify and mitigate risks, prevent failures, and optimize system performance

Here are some of the typical tasks and responsibilities of a Reliability Engineer:

  1. Analyze data and perform statistical modeling: Reliability Engineers analyze data related to equipment performance, failure rates, and maintenance history to identify trends and patterns. They use statistical modeling to predict future failures and plan maintenance activities accordingly.
  2. Develop and implement reliability strategies: Reliability Engineers develop and implement strategies to improve the reliability and availability of equipment and systems. This may include performing root cause analysis, implementing preventive maintenance programs, and conducting failure mode and effects analysis (FMEA).
  3. Collaborate with other teams: Reliability Engineers collaborate with other teams such as operations, maintenance, and engineering to identify and address reliability issues. They may also work with suppliers to ensure the reliability of equipment and materials.
  4. Monitor and evaluate performance: Reliability Engineers monitor the performance of systems and equipment to identify areas for improvement. They use data to evaluate the effectiveness of reliability strategies and make adjustments as necessary.
  5. Provide technical support: Reliability Engineers provide technical support to other teams and stakeholders, answering questions and providing guidance on reliability-related issues.
  6. Continuously improve processes: Reliability Engineers are responsible for continuously improving reliability processes and methodologies. They stay up-to-date with the latest technologies and best practices in the field and identify opportunities for improvement.