Reliability Engineer

466 jobs found

web3.career is now part of the Bondex Logo Bondex Ecosystem

Receive emails of Reliability Engineer
Job Position Company Posted Location Salary Tags

Seedify

Remote

$36k - $48k

Chainlink Labs

San Francisco, CA, United States

$98k - $112k

Coinbase

Remote

$186k - $218k

Tenderly

Remote

Avalabs

Remote

$90k - $100k

CleanSpark

Las Vegas, NV, United States

$90k - $110k

Zamp

Bangalore, India

$77k - $85k

Stellar

Remote

$210k - $310k

Parity

Remote

$72k - $72k

Chainlink Labs

United States

$98k - $112k

Coinbase

Remote

$180k - $218k

Zscaler

Remote

$140k - $200k

Coinbase

Remote

$211k - $249k

Coinbase

Remote

$122k - $140k

Seedify
$36k - $48k
Remote
Apply

Site Reliability Engineer

Remote
Seedify Product Team /
Full-time /
Remote

Apply for this job
Seedify is a leading cryptocurrency launchpad platform dedicated to fostering innovation and success in the Web3 space. Our mission is to identify and assist promising teams and projects and offer outstanding returns to our investor base.

Job Description
We are seeking a highly skilled Site Reliability Engineer with extensive experience in DevOps, infrastructure optimization, and incident reporting & monitoring. In this role, you will be working alongside other DevOps Engineers, Technical Architect and Developers to optimize performance of the Seedify platform.
 
Responsibilities:

Infrastructure & IaC: Manage AWS infrastructure using Terraform/Terragrunt; optionally Pulumi or AWS CDK. Optimize cost, reliability, and scalability.
Kubernetes Ops: Deploy and maintain Kubernetes clusters with Helm and Kustomize. Architect for high availability and zero downtime.
CI/CD Automation: Own pipelines in GitHub Actions to improve release velocity.
Observability: Implement monitoring and alerting using New Relic, Prometheus, Grafana, and OpenTelemetry. Create health dashboards and custom metrics.
Incident & SLA Management: Define SLAs, lead incident response, and run postmortems to improve reliability.
Dev Collaboration: Partner with engineers to embed reliability, monitoring, and alerting into the SDLC.

Skills & Qualifications:
Core Tools: Kubernetes, Helm, Kustomize, Docker, Bash, Ansible.
Cloud: Strong AWS experience (EC2, S3, EKS, RDS, Lambda, etc.).
Observability Stack:, Prometheus, Grafana, New Relic, OpenTelemetry.
CI/CD: ArgoCD, GitHub
IaC: Terraform, Terragrunt; optional Pulumi/AWS CDK.
Languages: Optional NodeJS or similar for automation.
Certifications (optional but preferred): AWS Solutions Architect, Kubernetes Admin/Developer, or other cloud/DevOps certifications.

Experience:
3+ years in SRE or related roles.
Hands-on infra ownership, incident response, and system optimization.
Designed reliability-focused SDLC integrations and dashboards.
 Soft Skills:
Collaboration: Works well across engineering, product, and ops.
Ownership: Drives initiatives end-to-end, especially under pressure.
Adaptability: Thrives in fast-paced, shifting environments
$3,000 - $4,000 a month
Apply for this job

What does Reliability Engineer do?

A Reliability Engineer is a professional who is responsible for ensuring the reliability and availability of systems and equipment in an organization

They use their knowledge of engineering principles, statistical analysis, and data science to identify and mitigate risks, prevent failures, and optimize system performance

Here are some of the typical tasks and responsibilities of a Reliability Engineer:

  1. Analyze data and perform statistical modeling: Reliability Engineers analyze data related to equipment performance, failure rates, and maintenance history to identify trends and patterns. They use statistical modeling to predict future failures and plan maintenance activities accordingly.
  2. Develop and implement reliability strategies: Reliability Engineers develop and implement strategies to improve the reliability and availability of equipment and systems. This may include performing root cause analysis, implementing preventive maintenance programs, and conducting failure mode and effects analysis (FMEA).
  3. Collaborate with other teams: Reliability Engineers collaborate with other teams such as operations, maintenance, and engineering to identify and address reliability issues. They may also work with suppliers to ensure the reliability of equipment and materials.
  4. Monitor and evaluate performance: Reliability Engineers monitor the performance of systems and equipment to identify areas for improvement. They use data to evaluate the effectiveness of reliability strategies and make adjustments as necessary.
  5. Provide technical support: Reliability Engineers provide technical support to other teams and stakeholders, answering questions and providing guidance on reliability-related issues.
  6. Continuously improve processes: Reliability Engineers are responsible for continuously improving reliability processes and methodologies. They stay up-to-date with the latest technologies and best practices in the field and identify opportunities for improvement.