Reliability Engineer

468 jobs found

web3.career is now part of the Bondex Ecosystem

Company               Location                  Salary
Fmr                   Bangalore, India          $105k - $120k
Coinbase              Remote                    $211k - $249k
Alchemy               Bucharest, Romania        $80k - $85k
Bitso                 Latin America             $112k - $156k
Bitso                 European Economic Area    $112k - $156k
Kraken                United States             $92k - $101k
Asymmetric Research   Remote                    $105k - $180k
Limit Break           Tokyo, Japan              $90k - $145k
Asymmetric Research   Remote                    $105k - $180k
Gemini                Remote                    $136k - $170k
Kraken                United States             $63k - $87k
Token Metrics         Manila, Philippines       $73k - $95k
Syndr                 Delhi, India              $98k - $114k
Kraken                United States             $92k - $101k
Kraken                European Union            $36k - $54k

Fmr
$105k - $120k (estimated)
Off Embassy Golf Links Business Park, Bangalore, India

Job Description:

Job Title: Lead - Cloud Site Reliability Engineer

The Purpose of this Role

As a member of the TechOps SRE team, you'll work closely with our engineering partners to help enable and drive initiatives from design to implementation. Our highly available multi-region Kubernetes (AWS EKS) environments are best-in-class and central to our enterprise-grade infrastructure strategy. These growing environments currently support numerous mission-critical workloads. In this exciting role, you'll have the opportunity to further develop and refine your skills, collaborate across numerous Fidelity teams, and continue to grow in a fun, collaborative, and rapidly changing environment. This is a phenomenal opportunity to have a direct impact on the emerging strategies of our infrastructure and deployments, while at the same time, helping enable the expansion of our business.

The Value You Deliver

- Leading the initiative to craft and deploy our applications to the cloud
- Promoting a DevOps mentality, providing mentorship and establishing development standard methodologies for AWS infrastructure-as-code
- Championing automation tools to improve software delivery and reduce risk

  The Expertise You Bring

- 6-8 years of hands-on experience with AWS in a production environment
- Experience building and deploying Docker images including Docker Compose
- Production experience running Kubernetes workloads ideally on AWS EKS
- Experience managing and maintaining Kubernetes Clusters on AWS EKS
- Experience creating and deploying Helm charts & libraries
- Production experience with infrastructure-as-code (IaC), Terraform preferred
- Hands-on experience with Jenkins Core, including authoring and maintaining declarative CI/CD pipelines and libraries
- Experience with monitoring tools e.g., CloudWatch, Datadog & Splunk Cloud
- Proficiency with UNIX operating systems and shell scripting
- Programming experience, e.g., Python preferred
- Experience with distributed version control systems, Git preferred
- Experience with the agile software development lifecycle and Kanban preferred
- Experience with CDN Providers e.g., Akamai preferred

  The Skills that are good to have for this Role

- Experience with Amazon Web Services (AWS), having managed services and applications in a large AWS cross-account environment using IAM and federated SSO
- Experience crafting and maintaining logging, monitoring, and alerting capabilities using tools like Datadog and Splunk
- Ability to communicate at all levels with track record of strong written and verbal communications
- See problems as opportunities to automate
- Ability to work independently with minimal direction
- Drive and champion the overall design of highly available, secure, scalable microservices-based applications in AWS

How your Work Impacts the Organization

The Team

Fidelity Digital Assets, a Fidelity Investments Company, is developing a full-service enterprise-grade platform for storing, trading, and servicing digital assets, such as Bitcoin and Ethereum. Fidelity Digital Assets embraces an entrepreneurial culture and startup mindset while serving as one of the most innovative business units within Fidelity Investments. Our global, diverse team of hundreds of forward-thinking professionals leads with agility and creativity to build solutions that bridge the gap between traditional institutional investors and their exposure to digital assets. The firm's tenure and experience across multiple business lines present our employees with unprecedented access to knowledge, technology, and resources that help our team reshape the future of finance.

Within Fidelity Digital Assets, the Technical Operations team is central to our initiative of moving to the cloud. The team uses AWS services to secure our network and scale our applications to ensure their uptime and reliability. Team members are hands-on Site Reliability Engineers who promote a DevOps approach, with a focus on infrastructure-as-code, security, and automation.

Cryptojobs

Certifications:

Category: Information Technology

What does a Reliability Engineer do?

A Reliability Engineer is responsible for ensuring the reliability and availability of an organization's systems and equipment.

They use their knowledge of engineering principles, statistical analysis, and data science to identify and mitigate risks, prevent failures, and optimize system performance.

Here are some of the typical tasks and responsibilities of a Reliability Engineer:

  1. Analyze data and perform statistical modeling: Reliability Engineers analyze data related to equipment performance, failure rates, and maintenance history to identify trends and patterns. They use statistical modeling to predict future failures and plan maintenance activities accordingly.
  2. Develop and implement reliability strategies: Reliability Engineers develop and implement strategies to improve the reliability and availability of equipment and systems. This may include performing root cause analysis, implementing preventive maintenance programs, and conducting failure mode and effects analysis (FMEA).
  3. Collaborate with other teams: Reliability Engineers collaborate with other teams such as operations, maintenance, and engineering to identify and address reliability issues. They may also work with suppliers to ensure the reliability of equipment and materials.
  4. Monitor and evaluate performance: Reliability Engineers monitor the performance of systems and equipment to identify areas for improvement. They use data to evaluate the effectiveness of reliability strategies and make adjustments as necessary.
  5. Provide technical support: Reliability Engineers provide technical support to other teams and stakeholders, answering questions and providing guidance on reliability-related issues.
  6. Continuously improve processes: Reliability Engineers are responsible for continuously improving reliability processes and methodologies. They stay up-to-date with the latest technologies and best practices in the field and identify opportunities for improvement.
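
As a rough illustration of the statistical modeling mentioned in item 1, the sketch below estimates mean time between failures (MTBF) from an operating history and uses it to predict survival probability and steady-state availability. It assumes a constant failure rate (the exponential model), which is a simplification and not something the listing prescribes. The function names and the sample figures are hypothetical.

```python
import math


def mtbf(total_operating_hours: float, failure_count: int) -> float:
    """Mean time between failures: operating hours divided by failures seen."""
    if failure_count <= 0:
        raise ValueError("need at least one observed failure to estimate MTBF")
    return total_operating_hours / failure_count


def reliability(t_hours: float, mtbf_hours: float) -> float:
    """Probability of running t_hours without failure.

    Assumes a constant failure rate (exponential model): R(t) = exp(-t / MTBF).
    """
    return math.exp(-t_hours / mtbf_hours)


def availability(mtbf_hours: float, mttr_hours: float) -> float:
    """Steady-state availability: MTBF / (MTBF + mean time to repair)."""
    return mtbf_hours / (mtbf_hours + mttr_hours)


if __name__ == "__main__":
    # Hypothetical history: 10,000 operating hours, 4 failures, 5 h average repair.
    m = mtbf(10_000, 4)
    print(f"MTBF: {m:.0f} h")
    print(f"P(no failure in 500 h): {reliability(500, m):.3f}")
    print(f"Availability: {availability(m, 5):.4%}")
```

In practice the maintenance history would come from monitoring or ticketing data, and a Weibull or similar model may fit better when failure rates change as equipment ages.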