| Job Position | Company | Posted | Location | Salary | Tags |
|---|---|---|---|---|---|
Fmr | Bangalore, India | $105k - $120k | |||
Coinbase | Remote | $211k - $249k | |||
Alchemy | Bucharest, Romania | $80k - $85k | |||
Bitso | Latin America | $112k - $156k | |||
| Learn job-ready web3 skills on your schedule with 1-on-1 support & get a job, or your money back. | | by Metana Bootcamp Info | |||
Bitso | European Economic Area | $112k - $156k | |||
Kraken | United States | $92k - $101k | |||
Asymmetric Research | Remote | $105k - $180k | |||
Limit Break | Tokyo, Japan | $90k - $145k | |||
Asymmetric Research | Remote | $105k - $180k | |||
Gemini | Remote | $136k - $170k | |||
Kraken | United States | $63k - $87k | |||
Token Metrics | Manila, Philippines | $73k - $95k | |||
Syndr | Delhi, India | $98k - $114k | |||
Kraken | United States | $92k - $101k | |||
Kraken | European Union | $36k - $54k |
Job Description:
Job Title : Lead - Cloud Site Reliability Engineer The Purpose of this Role As a member of the TechOps SRE team, you'll work closely with our engineering partners to help enable and drive initiatives from design to implementation. Our highly available multi-region Kubernetes (AWS EKS) environments are best-in-class and central to our enterprise-grade infrastructure strategy. These growing environments currently support numerous mission-critical workloads. In this exciting role, you’ll have the opportunity to further develop and refine your skills, collaborate across numerous Fidelity teams, and continue to grow in a fun, collaborative, and rapidly changing environment. This is a phenomenal opportunity to have a direct impact on the emerging strategies of our infrastructure and deployments, while at the same time, helping enable the expansion of our business. The Value You Deliver
Leading the initiative to craft and deploy our applications to the cloud Promoting a DevOps mentality, providing mentorship and establishing development standard methodologies for AWS infrastructure-as-code Championing automation tools to improve software delivery and reduce risk
The Expertise You Bring
6-8 years of hands-on experience with AWS in a production environment Experience building and deploying Docker images including Docker Compose Production experience running Kubernetes workloads ideally on AWS EKS Experience managing and maintaining Kubernetes Clusters on AWS EKS Experience creating and deploying Helm charts & libraries Production experience with infrastructure-as-code (IaC), Terraform preferred Hands-on experience with Jenkins Core, including authoring and maintaining declarative CI/CD pipelines and libraries Experience with monitoring tools e.g., CloudWatch, Datadog & Splunk Cloud Proficiency with UNIX operating systems and shell scripting Programming experience, e.g., Python preferred Experience with distributed version control systems, Git preferred Experience with the agile software development lifecycle and Kanban preferred Experience with CDN Providers e.g., Akamai preferred
The Skills that are good to have for this Role
Experience with Amazon Web Services (AWS), having managed services and applications in a large AWS cross-account environment using IAM and federated SSO Experience crafting and maintaining logging, monitoring, and alerting capabilities using tools like Datadog and Splunk Ability to communicate at all levels with track record of strong written and verbal communications See problems as opportunities to automate Ability to work independently with minimal direction Drive and champion the overall design of highly available, secure, scalable microservices-based applications in AWS
How your Work Impacts the Organization The Team Fidelity Digital Assets, a Fidelity Investments Company, is developing a full-service enterprise-grade platform for storing, trading, and servicing digital assets, such as Bitcoin and Ethereum. Fidelity Digital Assets embraces an entrepreneurial culture and startup mindset while serving as one of the most innovative business units within Fidelity Investments. Our global, diverse team of hundreds of forward-thinking professionals lead with agility and creativity to build solutions that bridge the gap between traditional institutional investors and their exposure to digital assets. The firm’s tenure and experience across multiple business lines present our employees with unprecedented access to knowledge, technology, and resources that help our team reshape the future of finance. Within Fidelity Digital Assets, Technical Operations team is central to our initiative of moving to the cloud. The team uses AWS services to secure our network and scale our applications to ensure their up-time and reliability. Team members are hands-on Site Reliability Engineers who promote a DevOps approach, with a focus on infrastructure-as-code, security, and automation.
Cryptojobs
Certifications:
Category: Information Technology
What does Reliability Engineer do?
A Reliability Engineer is a professional who is responsible for ensuring the reliability and availability of systems and equipment in an organization
They use their knowledge of engineering principles, statistical analysis, and data science to identify and mitigate risks, prevent failures, and optimize system performance
Here are some of the typical tasks and responsibilities of a Reliability Engineer:
- Analyze data and perform statistical modeling: Reliability Engineers analyze data related to equipment performance, failure rates, and maintenance history to identify trends and patterns. They use statistical modeling to predict future failures and plan maintenance activities accordingly.
- Develop and implement reliability strategies: Reliability Engineers develop and implement strategies to improve the reliability and availability of equipment and systems. This may include performing root cause analysis, implementing preventive maintenance programs, and conducting failure mode and effects analysis (FMEA).
- Collaborate with other teams: Reliability Engineers collaborate with other teams such as operations, maintenance, and engineering to identify and address reliability issues. They may also work with suppliers to ensure the reliability of equipment and materials.
- Monitor and evaluate performance: Reliability Engineers monitor the performance of systems and equipment to identify areas for improvement. They use data to evaluate the effectiveness of reliability strategies and make adjustments as necessary.
- Provide technical support: Reliability Engineers provide technical support to other teams and stakeholders, answering questions and providing guidance on reliability-related issues.
- Continuously improve processes: Reliability Engineers are responsible for continuously improving reliability processes and methodologies. They stay up-to-date with the latest technologies and best practices in the field and identify opportunities for improvement.