Stellar Development Foundation is hiring a
Web3 Senior Site Reliability Engineer

Compensation: $29k - $56k *

Location: San Francisco, California, United States

Open to considering remote candidates in the USA

Interested in working on cutting-edge blockchain technology and creating equitable access to the global financial system? Since 2014, the mission-driven team at the Stellar Development Foundation (SDF) has helped fuel the tremendous growth of the Stellar blockchain network, an open-source platform that operates at high scale today. Developers and companies around the world build on it, and the SDF team is expanding to support the rapidly growing and changing Stellar ecosystem.

SDF is looking for a talented and hands-on Site Reliability Engineer to join our team. In this role, you will be ensuring the reliability of our services, building infrastructure to enable our team's production and testing environments, and greasing the rails of our systems to ensure they're robust, efficient, and easy to deploy.

In this role, you will:

  • Maintain, improve, scale and secure our AWS infrastructure and Ubuntu Linux systems.
  • Assist our development teams in running, packaging, deploying and troubleshooting applications.
  • Work with developers on streamlining deployment processes with Jenkins and other tooling.
  • Maintain, monitor and improve our Kubernetes clusters.
  • Work with development teams on migrating applications to Kubernetes.
  • Be responsible for maintenance and improvements to multiple internal services, for example Kubernetes, Prometheus, ELK and LDAP.
  • Monitor, triage and respond to alerts in our 24/7/365 environment.
  • Participate in design and code reviews, and ensure that the foundation for our services is best in class.
  • Evaluate new technologies, and design and implement them as appropriate.
  • Identify automation opportunities and implement them by building custom solutions or using off-the-shelf ones.

You have:

  • 3+ years of experience working in cloud-based systems operations, as a Linux systems administrator, SRE or DevOps engineer.
  • Comfort with the Linux command line.
  • First-hand experience with configuration management tools (Puppet, Chef, etc.); preferably Puppet.
  • The natural ability to troubleshoot and debug - no issue is impossible to solve.
  • A good understanding of computer networking, TCP/UDP, load balancing, distributed computing, web services, and the fundamental protocols used by the internet (HTTP, HTTPS, DNS, etc.).
  • Experience supporting production workloads and familiarity with monitoring concepts and tooling. You’re able to take part in an on-call rotation.
  • Proficiency in at least one scripting language and familiarity with a few (Ruby, Perl, Python, Bash, etc.).
  • The willingness to do what it takes to help your teammates - especially in stressful situations.
  • Enthusiasm about working in a small and growing team. You are open, empathetic, and care about putting the best ideas forward in a collaborative and helpful manner.
  • The ability to work independently and deliver results without supervision.

Bonus points for:

  • Experience with Docker and Kubernetes
  • Experience with Prometheus and Grafana
  • Experience with AWS
  • Ability to understand Go, C++ and TypeScript source code
  • Experience with CI pipelines and Jenkins
  • Deb or RPM packaging experience

Why work for us:

  • You’ll have a lot of autonomy in the team
  • You’ll work with Kubernetes in production and we’ll help you get up to speed if needed
  • You will be able to make visible impact quickly and will have a strong influence on the team’s direction, tooling, processes and technology choices
  • You will work on many open source projects that aim to improve financial inclusion on a global scale

Apply Now:

This job is closed