Reliability Engineer

491 jobs found

Company | Location | Salary
Limit Break | Tokyo, Japan | $90k - $145k
Asymmetric Research | Remote | $105k - $180k
Gemini | Remote | $136k - $170k
Kraken | United States | $63k - $87k
Token Metrics | Manila, Philippines | $73k - $95k
Syndr | Delhi, India | $98k - $114k
Kraken | United States | $92k - $101k
Kraken | European Union | $36k - $54k
Circle - Referrals | Remote | $157k - $175k
Token Metrics | Manila, Philippines | $73k - $95k
Token Metrics | Lisbon, Portugal | $73k - $95k
Token Metrics | Cape Town, South Africa | $73k - $95k
Gemini | Remote | $172k - $215k
Stellar | New York, NY, United States | $150k - $200k
Uniswaplabs | Remote | $243k - $269k

Limit Break
$90k - $145k estimated
Tokyo

Senior Site Reliability Engineer

Tokyo / Engineering – Blockchain Engineering / Full-time / On-site


Location: Tokyo Onsite

About us:

Deep expertise. Personal and Industry evolution. Impeccable craft. These are Limit Break’s founding principles.

Limit Break was founded by global industry leaders in mobile gaming. We are unlocking its potential beyond games to transform digital markets into real-world economies and digital worlds into vibrant communities that will take gaming economies to new limits for both players and traders. We combine the power of technology, cryptocurrency, and creative vision to create experiences that connect people from all corners of the globe.

Limit Break is backed by leading investors that include Buckley Ventures & Paradigm Ventures. The total crypto market capitalization increased by 25 times in a single year. The adoption rate and the number of public and private partnerships are proving crypto to be the next wave of technology, and gaming is leading the way.

About the Role:

Limit Break is looking for Senior Site Reliability Engineers to join our team. As the first addition to the Limit Break Site Reliability Engineering (SRE) team, you will have the autonomy to influence our SRE practices, team, and processes, and to create a truly customer-driven culture of SRE from the ground up.

This is a unique opportunity for an intellectually curious and hardworking individual to help the organization shape what best practice SRE for blockchain technologies will be.

Responsibilities:

●      Identify, propose and execute improvements to performance and scalability bottlenecks in our current systems/infrastructure on AWS
●      Measure systems' health, scalability and performance metrics and identify areas of improvement
●      Use your knowledge of code to solve broad operational challenges within the Limit Break infrastructure and platform
●      Work with the wider engineering team to identify how we can provide the most production-like environment for running both manual and automated testing
●      Define SLOs, SLIs, monitoring, alerting and incident response practices
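
As an illustration of the last responsibility, here is a minimal sketch of how an availability SLI and its error budget might be computed. The function names, the request counts, and the 99.9% target are assumptions made for this example; they do not come from the posting.

```python
def availability_sli(good_requests: int, total_requests: int) -> float:
    """SLI: fraction of requests that were served successfully."""
    if total_requests == 0:
        return 1.0  # no traffic means nothing failed
    return good_requests / total_requests


def error_budget_remaining(sli: float, slo_target: float) -> float:
    """Share of the error budget still unspent.

    1.0 means untouched, 0.0 means fully spent, negative means the SLO is breached.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    allowed_failure = 1.0 - slo_target  # failure ratio the SLO tolerates
    observed_failure = 1.0 - sli        # failure ratio actually observed
    return 1.0 - observed_failure / allowed_failure


# Hypothetical numbers: 500 failed requests out of 1,000,000 against a 99.9% SLO.
sli = availability_sli(999_500, 1_000_000)
remaining = error_budget_remaining(sli, slo_target=0.999)
print(f"SLI = {sli:.4%}, error budget remaining = {remaining:.0%}")
```

An alert could then be tied to the remaining budget rather than to individual failures, which is one common way to connect SLOs to incident response.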

Qualifications:
 
●      5+ years of experience in SRE, DevOps, or systems engineering
●      Strong background in Kubernetes
●      Extensive experience in Terraform and Ansible
●      CI/CD and automation experience
●      Strong background in AWS
●      Ability to participate in an on-call rotation
●      Effective communication skills to be able to clearly explain your reasoning and thought process for anything you propose
●      Excellent collaboration skills to be able to work closely with product engineers and product owners to understand their context and co-design appropriate solutions which balance feature velocity with site reliability
●      Implementation of in-house monitoring and observability infrastructure
●      Implementation of an Elasticsearch stack or equivalent solution for capturing logs from all environments
●      Working with InfoSec to implement various tools to monitor and protect the environment in real-time
 
A Plus:

●      Experience with containerised and serverless architectures (we use both to keep our teams flexible)
●      Prior blockchain experience
●      Experience working on one or more large greenfield projects


Apply now:

If you are looking for an exciting role, this job could be a great opportunity for you. Apply to careers@limitbreak.com to explore this new opportunity. This is an international blockchain gaming company that operates regionally, and is opening offices in multiple locations around the globe. We value diversity at our company. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or any other applicable legally protected characteristics in the location in which the candidate is applying. 

What does a Reliability Engineer do?

A Reliability Engineer is a professional who is responsible for ensuring the reliability and availability of systems and equipment in an organization.

They use their knowledge of engineering principles, statistical analysis, and data science to identify and mitigate risks, prevent failures, and optimize system performance.

Here are some of the typical tasks and responsibilities of a Reliability Engineer:

  1. Analyze data and perform statistical modeling: Reliability Engineers analyze data related to equipment performance, failure rates, and maintenance history to identify trends and patterns. They use statistical modeling to predict future failures and plan maintenance activities accordingly.
  2. Develop and implement reliability strategies: Reliability Engineers develop and implement strategies to improve the reliability and availability of equipment and systems. This may include performing root cause analysis, implementing preventive maintenance programs, and conducting failure mode and effects analysis (FMEA).
  3. Collaborate with other teams: Reliability Engineers collaborate with other teams such as operations, maintenance, and engineering to identify and address reliability issues. They may also work with suppliers to ensure the reliability of equipment and materials.
  4. Monitor and evaluate performance: Reliability Engineers monitor the performance of systems and equipment to identify areas for improvement. They use data to evaluate the effectiveness of reliability strategies and make adjustments as necessary.
  5. Provide technical support: Reliability Engineers provide technical support to other teams and stakeholders, answering questions and providing guidance on reliability-related issues.
  6. Continuously improve processes: Reliability Engineers are responsible for continuously improving reliability processes and methodologies. They stay up-to-date with the latest technologies and best practices in the field and identify opportunities for improvement.
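
The data analysis in item 1 often starts from simple summary statistics. The sketch below, under assumed inputs, derives mean time between failures (MTBF), mean time to repair (MTTR), and steady-state availability from a list of outage records. The `Outage` record, the 1,000-hour observation window, and the outage durations are hypothetical values chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class Outage:
    """One recorded failure; the fields are hypothetical for this example."""
    start_hour: float      # hours since the start of the observation window
    duration_hours: float  # time until the system was restored


def mtbf_and_mttr(observation_hours: float, outages: list[Outage]) -> tuple[float, float]:
    """Return (MTBF, MTTR) in hours for the observation window."""
    if not outages:
        raise ValueError("at least one outage is needed to estimate MTBF and MTTR")
    downtime = sum(o.duration_hours for o in outages)
    uptime = observation_hours - downtime
    return uptime / len(outages), downtime / len(outages)


def steady_state_availability(mtbf: float, mttr: float) -> float:
    """Long-run fraction of time the system is up: MTBF / (MTBF + MTTR)."""
    return mtbf / (mtbf + mttr)


# Hypothetical history: three outages in a 1,000-hour window.
history = [Outage(120.0, 2.0), Outage(480.0, 3.0), Outage(910.0, 5.0)]
mtbf, mttr = mtbf_and_mttr(1000.0, history)
availability = steady_state_availability(mtbf, mttr)
print(f"MTBF = {mtbf:.1f} h, MTTR = {mttr:.2f} h, availability = {availability:.2%}")
```

Real reliability models usually go further, for example by fitting failure-time distributions, but these three figures are a common first step for spotting trends and planning maintenance.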