Reliability Engineer
420 jobs found
Job Position | Company | Posted | Location | Salary | Tags |
---|---|---|---|---|---|
Aztec | Remote | $165k - $209k | |||
Mythical East | Lisbon, Portugal | $80k - $100k | |||
Startale | Remote | $103k - $117k | |||
Osmosis | United States |
| |||
Learn job-ready web3 skills on your schedule with 1-on-1 support & get a job, or your money back. | | by Metana Bootcamp Info | |||
Circle | San Francisco, CA, United States | $147k - $195k | |||
Circle | Washington, United States | $147k - $195k | |||
Circle | San Francisco, CA, United States | $147k - $195k | |||
Circle | London, United Kingdom | $103k - $117k | |||
Circle | Los Angeles, CA, United States | $147k - $195k | |||
Ava Labs | Seoul, South Korea | $99k - $124k | |||
Token Metrics | Remote | $90k - $95k | |||
Token Metrics | Remote | $90k - $95k | |||
Token Metrics | Remote | $90k - $95k | |||
Token Metrics | Remote | $90k - $95k | |||
Token Metrics | Remote | $90k - $95k |
This job is closed
The Role:
Aztec is gearing up to launch a sophisticated privacy-preserving blockchain on Ethereum. To achieve this, our engineers need support making our code run through first continuous integration, then test networks, and finally peer-to-peer software that anyone in the world can run. We need you! We seek engineers versed in devops and system architecture that are enthusiastic about building production systems with unique and challenging constraints
- Design and build the foundation for monitoring a decentralized private network.
- Develop metrics and monitoring of our hosted services.
- Automate infrastructure provisioning and configuration.
- Manage AWS services and resources ensuring cost effectiveness.
- Collaborate with development teams to ensure they have the best development/CI experience.
- Maintain a CI/CD pipeline for 30 engineers working across C++, Rust, Noir, Solidity, and Typescript.
- Monitor system issues and create strategies for their detection.
Responsibilities:
- CI platforms such Github Actions, CircleCI.
- AWS cloud services, EC2, ECS, ECR, etc.
- Expert in Docker and creating fast efficient builds and containers. Earthly is a nice-to-have.
- Terraform for defining IAC.
- Prometheus and Grafana for metrics and monitoring, and working with codebases to extract required information.
- Toolchain experience: C++ (clang, gcc, cmake). Rust. Solidity. NPM/Yarn.
- Demonstrable experience in enabling an engineering team of our size have a robust and efficient development experience.
- Self starter. Able to clearly identify areas to prioritise to deliver most value.
Qualifications:
- Bachelor's or Master's degree in Computer Science, Information Technology, or a related field.
- 4+ years of experience in site reliability engineering or DevOps roles, preferably in the cryptocurrency or financial services industry.
- Strong communication and collaboration skills, with the ability to work effectively in a fast-paced, dynamic environment.
- Can operate in Greenwich Mean Time Zone
What we offer:
- Flexible and remote work environment
- 25 days holiday + bank holidays annually
- Additional benefits include health insurance, retirement plans, and opportunities for professional development.
- Quarterly offsite travel for collaboration
- Events and conference budget
- An opportunity to work at the cutting edge of blockchain and FinTech with a world class cryptography and engineering team
Compensation Range: $165,000 - $209,000 + equity + additional benefits. The salary for this position will be commensurate with experience and qualifications.
What does Reliability Engineer do?
A Reliability Engineer is a professional who is responsible for ensuring the reliability and availability of systems and equipment in an organization
They use their knowledge of engineering principles, statistical analysis, and data science to identify and mitigate risks, prevent failures, and optimize system performance
Here are some of the typical tasks and responsibilities of a Reliability Engineer:
- Analyze data and perform statistical modeling: Reliability Engineers analyze data related to equipment performance, failure rates, and maintenance history to identify trends and patterns. They use statistical modeling to predict future failures and plan maintenance activities accordingly.
- Develop and implement reliability strategies: Reliability Engineers develop and implement strategies to improve the reliability and availability of equipment and systems. This may include performing root cause analysis, implementing preventive maintenance programs, and conducting failure mode and effects analysis (FMEA).
- Collaborate with other teams: Reliability Engineers collaborate with other teams such as operations, maintenance, and engineering to identify and address reliability issues. They may also work with suppliers to ensure the reliability of equipment and materials.
- Monitor and evaluate performance: Reliability Engineers monitor the performance of systems and equipment to identify areas for improvement. They use data to evaluate the effectiveness of reliability strategies and make adjustments as necessary.
- Provide technical support: Reliability Engineers provide technical support to other teams and stakeholders, answering questions and providing guidance on reliability-related issues.
- Continuously improve processes: Reliability Engineers are responsible for continuously improving reliability processes and methodologies. They stay up-to-date with the latest technologies and best practices in the field and identify opportunities for improvement.