Reliability Engineer

485 jobs found

web3.career is now part of the Bondex Logo Bondex Ecosystem

Receive emails of Reliability Engineer
Job Position Company Posted Location Salary Tags

Chainlink Labs

Remote

Ramp

Poland

$90k - $100k

Chainlink Labs

Remote

Chainlink Labs

Remote

Wallet

Remote

$90k - $100k

Ripple

Lausanne, Switzerland

$95k - $144k

Ripple

Singapore, Singapore

$90k - $115k

Heretic

San Francisco, CA, United States

$103k - $117k

LayerZero Labs

Remote

$112k - $156k

Blockchain.com

London, United Kingdom

$105k - $180k

Flock Safety

Atlanta, GA, United States

$150k - $185k

Chainlink Labs

Remote

Circle

Seattle, WA, United States

$147k - $195k

Chainlink Labs

Remote

Chainlink Labs

Remote

Technical Program Manager - Reliability Engineering

Argentina / Remote / Toronto / Remote / United Kingdom / Remote / Barcelona / Remote / Lisbon / Remote / Dublin / Remote / Ciudad de México / Remote / São Paulo / Remote / Bogotá / Remote
Engineering /
Remote

Apply for this job
About Us 
Chainlink Labs is the primary contributing developer of Chainlink, the decentralized computing platform powering the verifiable web. Chainlink is the industry-standard platform for providing access to real-world data, offchain computation, and secure cross-chain interoperability across any blockchain. Chainlink Labs helps power verifiable applications for banking, DeFi, global trade, and gaming by collaborating with some of the world’s largest financial institutions, notably Swift, DTCC, and ANZ. Chainlink Labs also works with top Web3 teams, including Aave, Compound, GMX, Maker, and Synthetix. Chainlink Labs was ranked in Newsweek’s 100 Most Loved Workplaces 2023 in both the United States and United Kingdom.

The Engineering Team
At Chainlink Labs, our engineering team pushes the scale and capabilities of decentralized applications across the industry. The Chainlink Network holds >70% market share in the oracle space, solving real-world problems by enabling smart contracts to securely interact with off-chain data/computation.

We value talented and driven craftsmen who work collaboratively to tackle complex challenges, deliver product impact, and grow as builders. Join us and shape the future of blockchain technology and decentralized finance. 

Chainlink’s Technical Program Manager role is responsible for organizing, directing and managing team and cross-team initiatives across our Incident Response and Production Engineering (SREs) Teams. This includes managing cross functional dependencies, blockers and task tracking to deliver quality Chainlink Product or Services.  The TPM provides the overall direction of program activities.  This position works under limited supervision and direction, but often closely with product & engineering managers across multiple teams 

Roles and Responsibilities:

    • Closely work with Engineering Teams to develop team and cross-team plans to deliver quality products or services
    • Serve as the primary interface and point of contact for the program’s performance, schedule, quality, and technical issues
    • Build, develop, and grow any stakeholder relationships vital to the success of the program
    • Reviews and collaborate with Engineering Team on technical approach, level of effort estimates and feasibility 
    • Develop and maintains plans including schedules, cross team dependencies and milestones using tools such as JIRA / Confluence 
    • Drive the planning, execution, and controlling of the Program Ensure quality and timely delivery of deliverables
    • Support the development of ambitious quarterly objectives and key results (OKRs) 
    • Ensures that program objectives and key results (OKRs) are met in the most efficient manner
    • Ensure commitments are being satisfied against the schedule by team members and stakeholders and developing mitigations and escalating schedule risks and issues
    • Develop and communicate strategies, goals, and objectives as well as program status and health to stakeholders and leadership
    • Develop repeatable processes to scale the team  
    • Develop strategies to improve the Program’s delivery efficiency 
    • Manage program risks and conditions, and facilitate development of mitigation strategies
    • Provide leadership and guidance to coach, motivate, and lead team members to their optimum performance levels and career development.

Must Have:

    • At least 3-5+ years of experience in the domain of Incident Response and Site Reliability Engineering
    • At least 3-5+ years of demonstrated Agile TPM experience, across multiple globally dispersed cross functional Agile Teams 
    • Expertise with the use of JIRA roadmapping and advanced features 
    • Strong presence with excellent written and verbal communications

Nice to Have:

    • Experience maturing Teams Agile Practices
    • Experience working in an async / remote-first environment across time zones
    • Agile / Program Management / Service Management Certifications such as SAFe, PMP, PgMP, CAP, ITIL
    • Prior experience with Blockchain/Smart Contracts/Web3
    • Technical experience (was an SWE/SRE at some point)
All roles with Chainlink Labs are global and remote-based. Unless otherwise stated, we ask that you try to overlap some working hours with Eastern Standard Time (EST).

Commitment to Equal Opportunity
Chainlink Labs is an equal opportunity employer. All qualified applicants will receive equal consideration for employment in compliance with applicable laws, regulations, or ordinances. If you need assistance or accommodation due to a disability or special need when applying for a role or in our recruitment process, please contact us via this form.

Global Data Privacy Notice for Job Candidates and Applicants
Information collected and processed as part of your Chainlink Labs Careers profile, and any job applications you choose to submit is subject to our Privacy Policy. By submitting your application, you are agreeing to our use and processing of your data as required.
Apply for this job

What does Reliability Engineer do?

A Reliability Engineer is a professional who is responsible for ensuring the reliability and availability of systems and equipment in an organization

They use their knowledge of engineering principles, statistical analysis, and data science to identify and mitigate risks, prevent failures, and optimize system performance

Here are some of the typical tasks and responsibilities of a Reliability Engineer:

  1. Analyze data and perform statistical modeling: Reliability Engineers analyze data related to equipment performance, failure rates, and maintenance history to identify trends and patterns. They use statistical modeling to predict future failures and plan maintenance activities accordingly.
  2. Develop and implement reliability strategies: Reliability Engineers develop and implement strategies to improve the reliability and availability of equipment and systems. This may include performing root cause analysis, implementing preventive maintenance programs, and conducting failure mode and effects analysis (FMEA).
  3. Collaborate with other teams: Reliability Engineers collaborate with other teams such as operations, maintenance, and engineering to identify and address reliability issues. They may also work with suppliers to ensure the reliability of equipment and materials.
  4. Monitor and evaluate performance: Reliability Engineers monitor the performance of systems and equipment to identify areas for improvement. They use data to evaluate the effectiveness of reliability strategies and make adjustments as necessary.
  5. Provide technical support: Reliability Engineers provide technical support to other teams and stakeholders, answering questions and providing guidance on reliability-related issues.
  6. Continuously improve processes: Reliability Engineers are responsible for continuously improving reliability processes and methodologies. They stay up-to-date with the latest technologies and best practices in the field and identify opportunities for improvement.