Reliability Engineer

890 jobs found

Job Position Company Posted Location Salary Tags

Chainlink Labs

Remote

$100k - $200k

Ignite

San Francisco, CA, United States

$179k - $200k

OKX

Singapore, Singapore

$30k - $100k

OKX

Hong Kong, Hong Kong

$29k - $68k

Terraform Labs

Remote

$25k - $35k

Nethermind

London, United Kingdom

$59k - $182k

Pintu

Setiabudi, Indonesia

$70k - $200k

Consensys

Remote

$100k - $200k

Pyth Network

$70k - $250k

Worldcoin

$80k - $366k

Stellar Development Foundation

New York, NY, United States

$165k - $205k

CoinDesk

New York, NY, United States

$135k - $195k

CoinDesk

Sao Paulo, Brazil

$100k - $200k

CoinDesk

London, United Kingdom

$100k - $200k

CoinDesk

Bangalore, India

$100k - $200k

Gemini

Gurgaon, India

$70k - $140k

Chainlink Labs

Remote

$80k - $295k

Cake DeFi

Singapore, Singapore

$60k - $100k

Art Blocks

Remote

$70k - $180k

Consensys

Remote

$99k - $212k

Phantom

Remote

$170k - $220k

Coinmarketcap

London, United Kingdom

$69k - $165k

Coinmarketcap

London, United Kingdom

$80k - $180k

Cake Pte Ltd

Singapore, Singapore

$60k - $100k

NFT Now

Remote

$60k - $200k

Chainlink Labs
$100k - $200k*
United States / Remote

Senior Site Reliability Engineer, Node Operations

United States / Remote / Engineering / Remote - Full-time / Remote

All roles with Chainlink Labs are global and remote-based. Unless otherwise stated, we ask that you try to overlap some working hours with Eastern Standard Time (EST). We encourage you to apply regardless of your location. 

About Us 
Chainlink is the industry-standard Web3 services platform that enables developers to build feature-rich Web3 applications with seamless access to real-world data and off-chain computation.

• Chainlink has helped enable $7T+ in transaction value since the start of 2022.
• Over 1,700 Web3 projects have integrated Chainlink services.
• Chainlink is live on 15+ blockchains with many having joined the Chainlink SCALE program.
• Chainlink is relied upon by industry-leading protocols like Aave, Compound, Paxos, Synthetix, and ENS.
• Chainlink has delivered 7.4B+ data points on-chain and onboarded 900+ decentralized oracle networks.
• Chainlink has established collaborations with Associated Press, Accuweather, AWS, Google Cloud, Meta, and Twilio.
• The world-class Chainlink Labs research team has won various awards for its work on distributed systems, security, and more.

Who we’re looking for: 
• You’re focused on what matters most and ignore unimportant industry distractions. 
• You take extreme ownership and deliver outstanding results. 
• You have a growth mindset, seek out feedback and engage in constructive dialogue with others to help them grow.
• You move fast and evolve with rapidly advancing technologies. 
• You want to be part of a team that excels and is committed to building the Chainlink Network and growing the Web3 ecosystem over the long term. 
• You are welcoming toward a diverse network of participants joining an open, global standard.
• You’re excited about the future of Web3 and building a world powered by cryptographic truth. 

At Chainlink Labs, our engineering team pushes the scale and capabilities of decentralized applications across the industry. The Chainlink Network holds >70% market share in the oracle space, solving real-world problems by enabling smart contracts to securely interact with off-chain data/computation.

We value talented and driven craftsmen who work collaboratively to tackle complex challenges, deliver product impact, and grow as builders. Join us and shape the future of blockchain technology and decentralized finance. 

The infrastructure org enables Chainlink development and maintains services that support the health of the most widely adopted oracle network in the world. The Node Operations team is the ‘Gold Standard’ when it comes to running Chainlink, streamlining the experience for any would-be node operator. The team manages all of the internally deployed nodes, such as testnet OCR, VRF, Automation, and blockchain nodes, both internal and external. As a Site Reliability Engineer, you will help us solve some of the unique challenges of blockchain oracle architecture and be primarily responsible for the off-chain part of the Chainlink ecosystem.

We are distributed across time zones and continents, and we embrace remote work. In the Infrastructure team, we follow the infrastructure-as-code approach and practice GitOps. Our on-call rotation uses the follow-the-sun pattern: you will be on call some of the time, but there should not be any overnight shifts.

We all have different backgrounds and are determined to help you succeed no matter where you are or who you are. If you think you would do a great job at Chainlink, we are looking forward to speaking with you, even if you don't match 100% of the job requirements: those describe people we've usually had a great time working with, but they're not a tick-box exercise.

Your Impact

    • Run internal Chainlink Labs nodes
    • Provide enterprise-level blockchain connectivity to those who are trying to build something
    • Pair with engineers from across the company to help with troubleshooting, deploy new services, and figure out how to increase developer velocity and eliminate pain points
    • Build automation and tooling to make those things easier
    • Support monitoring services that watch over the entire Chainlink network
    • Deploy and maintain various externally-facing services like reference Chainlink nodes used by developers and customers (including critical services such as Chainlink VRF)
    • Improve the reliability and observability of our internal infrastructure
    • Provide our engineers with a reliable release pipeline and empower them to release and deploy Chainlink and adjacent tools extremely quickly

Requirements

    • Excitement for blockchain, Web 3.0, and similar decentralized technologies
    • Bachelor's degree in Computer Science, similar technical field of study, or equivalent practical experience
    • 4+ years of relevant professional experience, with a background in SRE and software engineering
    • Experience with a modern cloud platform, infrastructure-as-code, containers, container-orchestration and observability
    • We use AWS, Terraform, Docker, Kubernetes, Grafana and Prometheus
    • Experience building automation and/or power tooling to reduce toil
    • Strong communication skills. It’s a fully remote job that requires working with multiple stakeholders across all of Engineering. Great communication is as essential as the technical knowledge for this role
    • Experience with distributed systems and container orchestration. You have maintained or even built Kubernetes clusters before and feel comfortable deploying completely new services on them, or you have familiarity with Docker
    • Experience working closely with software engineering teams to promote best practices such as CI/CD and security
    • Awareness of reliability concepts such as SL(A/O/I)s, building scalable systems, and what it means to be “production ready”

Desired Qualifications

    • Experience with Go, Python, TypeScript
    • Familiarity with GitOps and tools like ArgoCD and GitHub Actions
    • Industry-recognised certifications such as the CKA and AWS Certified Solutions Architect
    • Experience running blockchain or Chainlink nodes and/or creating smart contracts using Solidity
    • Experience working in scale-up companies of a similar size or early-stage startups

Privacy Policy and Equal Opportunity Employer:
Chainlink Labs is an Equal Opportunity Employer. To request an accommodation in our recruitment process, please contact us at [email protected]

Please see our Privacy Policy for more information about how we collect and use your application information.


What does a Reliability Engineer do?

A Reliability Engineer is a professional who is responsible for ensuring the reliability and availability of systems and equipment in an organization.

They use their knowledge of engineering principles, statistical analysis, and data science to identify and mitigate risks, prevent failures, and optimize system performance.

Here are some of the typical tasks and responsibilities of a Reliability Engineer:

  1. Analyze data and perform statistical modeling: Reliability Engineers analyze data related to equipment performance, failure rates, and maintenance history to identify trends and patterns. They use statistical modeling to predict future failures and plan maintenance activities accordingly.
  2. Develop and implement reliability strategies: Reliability Engineers develop and implement strategies to improve the reliability and availability of equipment and systems. This may include performing root cause analysis, implementing preventive maintenance programs, and conducting failure mode and effects analysis (FMEA).
  3. Collaborate with other teams: Reliability Engineers collaborate with other teams such as operations, maintenance, and engineering to identify and address reliability issues. They may also work with suppliers to ensure the reliability of equipment and materials.
  4. Monitor and evaluate performance: Reliability Engineers monitor the performance of systems and equipment to identify areas for improvement. They use data to evaluate the effectiveness of reliability strategies and make adjustments as necessary.
  5. Provide technical support: Reliability Engineers provide technical support to other teams and stakeholders, answering questions and providing guidance on reliability-related issues.
  6. Continuously improve processes: Reliability Engineers are responsible for continuously improving reliability processes and methodologies. They stay up-to-date with the latest technologies and best practices in the field and identify opportunities for improvement.
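
To make the statistical side of item 1 concrete, here is a minimal sketch of three figures commonly derived from failure and maintenance history: mean time between failures (MTBF), steady-state availability, and the probability of running failure-free for a given period. The function names and the sample numbers are hypothetical, and the survival estimate assumes a constant failure rate (an exponential model), which is a simplification that real equipment often does not follow.

```python
import math


def mtbf(total_uptime_hours: float, failure_count: int) -> float:
    """Mean time between failures: operating hours divided by number of failures."""
    if failure_count <= 0:
        raise ValueError("failure_count must be positive")
    return total_uptime_hours / failure_count


def availability(mtbf_hours: float, mttr_hours: float) -> float:
    """Steady-state availability: MTBF / (MTBF + MTTR)."""
    return mtbf_hours / (mtbf_hours + mttr_hours)


def reliability(t_hours: float, mtbf_hours: float) -> float:
    """Probability of no failure within t_hours, assuming a constant
    failure rate (exponential model): R(t) = exp(-t / MTBF)."""
    return math.exp(-t_hours / mtbf_hours)


if __name__ == "__main__":
    # Hypothetical maintenance history: 900 operating hours, three failures,
    # with repairs that took 2, 4, and 3 hours.
    repair_hours = [2.0, 4.0, 3.0]
    m = mtbf(900.0, len(repair_hours))
    mttr = sum(repair_hours) / len(repair_hours)
    print(f"MTBF: {m:.1f} h, MTTR: {mttr:.1f} h")
    print(f"Availability: {availability(m, mttr):.4f}")
    print(f"P(no failure in 100 h): {reliability(100.0, m):.4f}")
```

In practice these figures would feed the maintenance planning and monitoring described in items 2 and 4, for example by scheduling preventive work before the estimated survival probability drops below a chosen threshold.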