Reliability Engineer

485 jobs found

web3.career is now part of the Bondex Logo Bondex Ecosystem

Receive emails of Reliability Engineer
Job Position Company Posted Location Salary Tags

Gemini

Remote

$172k - $215k

Nethermind

Remote

$112k - $156k

Fmr

Bangalore, India

$105k - $120k

Coinbase

Remote

$211k - $249k

Alchemy

Bucharest, Romania

$80k - $85k

Bitso

Latin America

$112k - $156k

Bitso

European Economic Area

$112k - $156k

Kraken

United States

$92k - $101k

Asymmetric Research

Remote

$105k - $180k

Limit Break

Tokyo, Japan

$90k - $145k

Asymmetric Research

Remote

$105k - $180k

Gemini

Remote

$136k - $170k

Kraken

United States

$63k - $87k

Token Metrics

Manila, Philippines

$73k - $95k

Syndr

Delhi, India

$98k - $114k

Gemini
$172k - $215k
Remote (USA)

About the Company

Gemini is a global crypto and Web3 platform founded by Tyler Winklevoss and Cameron Winklevoss in 2014. Gemini offers a wide range of crypto products and services for individuals and institutions in over 70 countries.

Crypto is about giving you greater choice, independence, and opportunity. We are here to help you on your journey. We build crypto products that are simple, elegant, and secure. Whether you are an individual or an institution, we help you buy, sell, and store your bitcoin and cryptocurrency. 

At Gemini, our mission is to unlock the next era of financial, creative, and personal freedom.

In the United States, we have a flexible hybrid work policy for employees who live within 30 miles of our office headquartered in New York City and our office in Seattle. Employees within the New York and Seattle metropolitan areas are expected to work from the designated office twice a week, unless there is a job-specific requirement to be in the office every workday. Employees outside of these areas are considered part of our remote-first workforce. We believe our hybrid approach for those near our NYC and Seattle offices increases productivity through more in-person collaboration where possible.

The Department: Threat Detection & Response

The Role: Staff Site Reliability Engineer

Gemini is looking for a Staff Site Reliability Engineer, Threat Detection & Response to join our growing information security team.

In this role, you will be part of the team responsible for designing, building, and automating detection, response and intelligence gathering solutions, developing unique and creative detection mechanisms, monitoring security events, and leading responses to any security incidents.

Responsibilities:

  • Own individual security solutions throughout their lifecycle, including design, development, and deployment, in order to continuously improve Gemini’s ability to detect and respond to advanced, targeted threats
  • Develop and improve processes and tools that supports the team rapidly iterating and responding to threats Gemini faces
  • Build and improve security controls and capabilities at all layers of infrastructure
  • Produce well documented, resilient and manageable code that supports the streamlining and automation of the above
  • Provide mentorship and guidance to junior engineers on the team in their growth and implementation of the above

Minimum Qualifications:

  • Experience with distributed systems or cloud computing. We often use AWS
  • Experience with computer security engineering
  • Significant experience with configuration management and infrastructure as code. We often use Terraform
  • Significant software development experience. We often use Python
  • Able to self-scope, define, and manage short and long term technical goals
  • Aptitude in the use of containerization technologies (eg. Docker)
  • Able to troubleshoot and debug issues, and demonstrate a methodical approach to root cause analysis 
  • Excellent oral and written communication skills, including the ability to interact effectively with leadership, engineers, vendors and peers

Preferred Qualifications:

  • 7+ years of experience in SRE, systems engineering, or network engineering
  • Familiarity in the use of container orchestration systems (e.g. Kubernetes, EKS) 
  • Experience applying CI/CD concepts to the development and deployment of security detection mechanisms and tools
  • Understanding of ETL and Workflow engines (e.g. Argo Workflows, Airflow)
It Pays to Work Here
 
The compensation & benefits package for this role includes:
  • Competitive starting salary
  • A discretionary annual bonus
  • Long-term incentive in the form of a new hire equity grant
  • Comprehensive health plans
  • 401K with company matching
  • Paid Parental Leave
  • Flexible time off

Salary Range: The base salary range for this role is between $172,000 - $215,000 in the State of New York, the State of California and the State of Washington. This range is not inclusive of our discretionary bonus or equity package. When determining a candidate’s compensation, we consider a number of factors including skillset, experience, job scope, and current market data.

At Gemini, we strive to build diverse teams that reflect the people we want to empower through our products, and we are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or Veteran status. Equal Opportunity is the Law, and Gemini is proud to be an equal opportunity workplace. If you have a specific need that requires accommodation, please let a member of the People Team know.

What does Reliability Engineer do?

A Reliability Engineer is a professional who is responsible for ensuring the reliability and availability of systems and equipment in an organization

They use their knowledge of engineering principles, statistical analysis, and data science to identify and mitigate risks, prevent failures, and optimize system performance

Here are some of the typical tasks and responsibilities of a Reliability Engineer:

  1. Analyze data and perform statistical modeling: Reliability Engineers analyze data related to equipment performance, failure rates, and maintenance history to identify trends and patterns. They use statistical modeling to predict future failures and plan maintenance activities accordingly.
  2. Develop and implement reliability strategies: Reliability Engineers develop and implement strategies to improve the reliability and availability of equipment and systems. This may include performing root cause analysis, implementing preventive maintenance programs, and conducting failure mode and effects analysis (FMEA).
  3. Collaborate with other teams: Reliability Engineers collaborate with other teams such as operations, maintenance, and engineering to identify and address reliability issues. They may also work with suppliers to ensure the reliability of equipment and materials.
  4. Monitor and evaluate performance: Reliability Engineers monitor the performance of systems and equipment to identify areas for improvement. They use data to evaluate the effectiveness of reliability strategies and make adjustments as necessary.
  5. Provide technical support: Reliability Engineers provide technical support to other teams and stakeholders, answering questions and providing guidance on reliability-related issues.
  6. Continuously improve processes: Reliability Engineers are responsible for continuously improving reliability processes and methodologies. They stay up-to-date with the latest technologies and best practices in the field and identify opportunities for improvement.