Reliability Engineer

420 jobs found

Receive emails of Reliability Engineer
Job Position Company Posted Location Salary Tags

Token Metrics

Islamabad, Pakistan

$90k - $95k

Token Metrics

Hyderabad, India

$90k - $95k

Token Metrics

Ho Chi Minh City, Vietnam

$90k - $95k

Token Metrics

Delhi, India

$90k - $95k

Token Metrics

Remote

$90k - $95k

Token Metrics

Bucharest, Romania

$90k - $95k

Token Metrics

Bengaluru, India

$90k - $95k

Galaxy

Remote

$90k - $100k

Gelato

Remote

Blockchain.com

London, United Kingdom

$120k - $144k

Chainlink Labs

Remote

Chainlink Labs

Remote

Circle

Los Angeles, CA, United States

$147k - $195k

Omni Network

Remote

$90k - $145k

Wormhole Foundation

New York, NY, United States

$98k - $145k

Token Metrics
$90k - $95k estimated
Islamabad
Apply

DevOps/Site Reliability Engineer (Islamabad-Remote)

Islamabad
Engineering Team /
Full-Time /
Remote

Apply for this job
Token Metrics is seeking a results-oriented IT administrator to manage our company's IT infrastructure. You will be upgrading and installing hardware and software, troubleshooting to resolve IT issues, and maintaining our networks and servers. Candidate should possess extensive experience in administration including system administration for cloud infrastructure (AWS primarily and knowledge of multi-cloud infrastructure), process automation, site reliability and the ability to optimize the performance of our IT infrastructure.

Responsibilities

    • Act as a cloud system admin (AWS primarily and knowledge of multi-cloud infrastructure).
    • Monitoring and maintaining networks and servers.
    • Creating and automating alerting and monitoring system logs.
    • Building tools to mitigate weaknesses in incident management or software delivery.
    • Troubleshooting Support Escalation requests.
    • Upgrading, installing and configuring new hardware and software to meet company objectives.
    • Implementing security protocols and procedures to prevent potential threats.
    • Creating user accounts and performing access control.
    • Performing diagnostic tests and debugging procedures to optimize computer systems.
    • Documenting processes, as well as backing up and archiving data.
    • Developing data retrieval and recovery procedures.
    • Designing and implementing efficient end-user feedback and error reporting systems.
    • Supervising and mentoring IT department employees, as well as providing IT support.
    • Keeping up to date with advancements and best practices in IT administration.

Requirements

    • Bachelor's degree in Computer Science, Information Technology, Information Systems, or similar.
    • Applicable professional qualification, such as Microsoft, Oracle, or Cisco certification.
    • At least two years' experience in a similar role.
    • Extensive experience with IT systems, networks, and related technologies.
    • Solid knowledge of best practices in IT administration and system security.
    • Exceptional leadership, organizational, and time management skills.
    • Strong analytical and problem-solving skills.
    • Excellent interpersonal and communication skills.
Token Metrics helps crypto investors build profitable portfolios using artificial intelligence based crypto indices, rankings, and price predictions. 

Token Metrics has a diverse set of customers, from retail investors and traders to crypto fund managers, in more than 50 countries.
Apply for this job
⬇
Apply Now

What does Reliability Engineer do?

A Reliability Engineer is a professional who is responsible for ensuring the reliability and availability of systems and equipment in an organization

They use their knowledge of engineering principles, statistical analysis, and data science to identify and mitigate risks, prevent failures, and optimize system performance

Here are some of the typical tasks and responsibilities of a Reliability Engineer:

  1. Analyze data and perform statistical modeling: Reliability Engineers analyze data related to equipment performance, failure rates, and maintenance history to identify trends and patterns. They use statistical modeling to predict future failures and plan maintenance activities accordingly.
  2. Develop and implement reliability strategies: Reliability Engineers develop and implement strategies to improve the reliability and availability of equipment and systems. This may include performing root cause analysis, implementing preventive maintenance programs, and conducting failure mode and effects analysis (FMEA).
  3. Collaborate with other teams: Reliability Engineers collaborate with other teams such as operations, maintenance, and engineering to identify and address reliability issues. They may also work with suppliers to ensure the reliability of equipment and materials.
  4. Monitor and evaluate performance: Reliability Engineers monitor the performance of systems and equipment to identify areas for improvement. They use data to evaluate the effectiveness of reliability strategies and make adjustments as necessary.
  5. Provide technical support: Reliability Engineers provide technical support to other teams and stakeholders, answering questions and providing guidance on reliability-related issues.
  6. Continuously improve processes: Reliability Engineers are responsible for continuously improving reliability processes and methodologies. They stay up-to-date with the latest technologies and best practices in the field and identify opportunities for improvement.