| Company | Location | Salary |
|---|---|---|
| Osmosis | Remote | |
| Myshell | Remote | $105k - $150k |
| Blockdaemon | EMEA | $200k |
| Alchemy | New York, NY, United States | $135k - $350k |
| Helius | Remote | $225k - $350k |
| Gemini | Remote | $172k - $215k |
| Crypto.com | Shenzhen, China | $185k |
| Alchemy | Remote | $135k - $350k |
| Uniswap Labs | New York, NY, United States | $243k - $269k |
| Genies, Inc. | San Mateo, CA, United States | $160k - $190k |
| SwissBorg | Krakow, Poland | $112k - $156k |
| Flow Traders | Amsterdam, Netherlands | $77k - $106k |
| Bitso | Latin America | $112k - $156k |
| Ava Labs | New York, NY, United States | $89k - $94k |
| Circle | Chicago, IL, United States | $147k - $195k |

About Us

Do you dream of a future where finance is open, accessible, and user-driven? Are you passionate about Decentralized Exchanges (DEXs) and the potential of DeFi to revolutionize financial markets? If so, then we want to hear from you! Osmosis is the leading interchain DEX built on the Cosmos ecosystem. We're on a mission to build the future of DeFi, and we're searching for a talented and visionary Site Reliability Engineer (SRE) to join our growing team.

About the Role:

We are looking for a Site Reliability Engineer with a passion for blockchain technology to be responsible for operating and scaling the infrastructure and services that power our platform. You will work closely with the chain development team to ensure that our systems are reliable, scalable, and maintainable.

What you could work on:
- Operating and optimizing backend infrastructure and services, including Osmosis nodes, testnets, and data services
- Improving the current observability, monitoring, and alerting systems to provide better insight into system behavior and performance
- Sharing observability best practices across the organization
- Developing internal tools that integrate with existing infrastructure (e.g., controllers, node health checks, custom CLIs)
- Participating actively in the on-call rotation, contributing to the rapid identification and resolution of production issues through effective debugging and troubleshooting
- Documenting processes, procedures, post-incident reports, and best practices for running services in production, ensuring consistency and quality across the team
- Establishing and maintaining robust CI/CD pipelines to automate the deployment process, enabling faster and more reliable releases of new features and updates
You may be a fit for this role if you:
- Are familiar with SRE best practices and passionate about observability (Datadog, New Relic, Prometheus/Grafana, …)
- Have strong experience with containerization and orchestration technologies (Docker, Kubernetes)
- Have experience running and operating production workloads
- Have a strong background in Infrastructure as Code (preferably Terraform)
- Have experience with Google Cloud Platform
- Have experience with Cloudflare and Cloudflare Workers
- Have a strong understanding of distributed systems and how they can be operated at scale
- Are passionate about blockchains and decentralized technology
- Have great communication skills and the ability to collaborate with others
- Have a demonstrated ability to take ownership
Experience that will set you apart:
- Previous experience working on high-scale or highly critical systems
- Previous experience working on proof-of-stake (PoS) blockchain projects in production
- Experience operating Cosmos nodes and relayers
- Contributions to open-source projects
- Experience working in remote and globally distributed teams
At Osmosis, you'll join a passionate and talented team working at the forefront of DeFi innovation. We offer a competitive compensation package, a collaborative work environment, and the opportunity to make a real impact on the future of finance. Ready to join the DeFi revolution? We look forward to hearing from you!

Legal Stuff

Employees and contractors are engaged through contributing entities, including but not limited to Osmurica, the Osmosis Foundation, and Chainapsis. Contributing entities provide equal employment opportunities to all employees, contractors, and applicants for employment and prohibit discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation and training.

What does a Reliability Engineer do?
A Reliability Engineer is a professional who is responsible for ensuring the reliability and availability of systems and equipment in an organization.
They use their knowledge of engineering principles, statistical analysis, and data science to identify and mitigate risks, prevent failures, and optimize system performance.
Here are some of the typical tasks and responsibilities of a Reliability Engineer:
- Analyze data and perform statistical modeling: Reliability Engineers analyze data related to equipment performance, failure rates, and maintenance history to identify trends and patterns. They use statistical modeling to predict future failures and plan maintenance activities accordingly.
- Develop and implement reliability strategies: Reliability Engineers develop and implement strategies to improve the reliability and availability of equipment and systems. This may include performing root cause analysis, implementing preventive maintenance programs, and conducting failure mode and effects analysis (FMEA).
- Collaborate with other teams: Reliability Engineers collaborate with other teams such as operations, maintenance, and engineering to identify and address reliability issues. They may also work with suppliers to ensure the reliability of equipment and materials.
- Monitor and evaluate performance: Reliability Engineers monitor the performance of systems and equipment to identify areas for improvement. They use data to evaluate the effectiveness of reliability strategies and make adjustments as necessary.
- Provide technical support: Reliability Engineers provide technical support to other teams and stakeholders, answering questions and providing guidance on reliability-related issues.
- Continuously improve processes: Reliability Engineers are responsible for continuously improving reliability processes and methodologies. They stay up-to-date with the latest technologies and best practices in the field and identify opportunities for improvement.
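
As a rough illustration of the failure-data analysis mentioned in the list above, the following minimal Python sketch computes three commonly used reliability figures from a list of outages: mean time between failures (MTBF), mean time to repair (MTTR), and availability. The `Outage` record, the `reliability_summary` helper, and the sample numbers are hypothetical and chosen only for illustration; in practice this data would come from monitoring or maintenance systems.

```python
from dataclasses import dataclass


@dataclass
class Outage:
    """One failure, expressed in hours since the start of the period (hypothetical record)."""
    start_hour: float
    end_hour: float


def reliability_summary(outages, period_hours):
    """Return MTBF, MTTR, and availability for an observation period.

    Assumes outages do not overlap and all fall inside the period.
    """
    downtime = sum(o.end_hour - o.start_hour for o in outages)
    uptime = period_hours - downtime
    failures = len(outages)
    return {
        # Average operating time between failures.
        "mtbf_hours": uptime / failures if failures else float("inf"),
        # Average time needed to restore service after a failure.
        "mttr_hours": downtime / failures if failures else 0.0,
        # Fraction of the period during which the system was up.
        "availability": uptime / period_hours,
    }


# Illustrative data: three outages during a 30-day (720-hour) period.
outages = [Outage(100, 102), Outage(400, 401), Outage(650, 653)]
summary = reliability_summary(outages, period_hours=720)

print(f"MTBF: {summary['mtbf_hours']:.1f} h")          # MTBF: 238.0 h
print(f"MTTR: {summary['mttr_hours']:.1f} h")          # MTTR: 2.0 h
print(f"Availability: {summary['availability']:.4f}")  # Availability: 0.9917
```

Tracking these figures over successive periods is one simple way to judge whether a reliability strategy is having an effect.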