BitMEX is hiring a
Web3 Site Reliability Engineer: Database & Messaging

Compensation: $105k - $120k *

Location: Vancouver, British Columbia, Canada

Company Overview

BitMEX explores, incubates, and pursues opportunities and investments, as part of its mission to reshape the modern digital financial system into one which is inclusive and empowering. BitMEX is a pioneer in the industry whose trading platform handles tens of thousands of low latency transactions per second, representing several billions of dollars traded every day.

Job Purpose

The BitMEX infrastructure team is responsible for the reliability and scalability of all the services that power the BitMEX exchange, and for providing turn-key self-service platforms to the developers. As a Site Reliability Engineer focused on databases and messaging technologies, you will be in charge of setting and improving our databases/messaging standards & practices to match our stringent consistency and reliability needs. You will be collaborating closely with our Product Engineering, Trading Technology and Kubernetes teams, leveraging “infrastructure as code”, to build resilient databases and messaging systems that spans datacenters and monitor their performance through observatibility.

Responsibilities

  • Improving resiliency, reliability, scalability of our production databases and messaging systems (e.g. PostgreSQL, Kafka, etcd, Clickhouse, Chronicle Queue, etc)
  • Participating in application architecture/design with cross-functional teams to ensure the highest technical standards are practiced
  • Monitoring and observability of databases
  • Organizing BCP/DR/Chaos events for our databases
  • Building of self-service databases and messaging systems on Kubernetes
  • Pro-active remediation of database operational problems
  • Pro-active development/improvement of procedures for automated monitoring, proactive intervention, and remediation of problems related to database availability/stability/data integrity
  • Database deployments and modifications in support of application development activities
  • Database capacity planning (storage, load, etc.)

Qualifications

  • 5 years of relevant experience with at least 4 years experience supporting production critical workloads on PostgreSQL
  • 2 years Docker experience
  • Proven experience with other database or/and distributed platforms such as Cassandra, Kafka, etcd, Clickhouse, etc
  • Technical certifications for DBMS platforms, AWS, or Linux/Unix is a plus
  • Familiarity with or knowledge of Terraform (or similar product)
  • Strong AWS, Linux/UNIV knowledge

  • Experience working with offshore support teams
  • Experience with database architecture, logical and physical design, installations, catalog navigation, monitoring and tuning (system, DB, resource contention), backup and recovery, replication, HA/DR
  • Experience with automation, documentation, shell scripting, PL/SQL programming, query tuning, system tuning, resource contention analysis, backup and recovery, standby, replication, etc.
  • Strong collaboration, analytical, verbal, and written communication skills
  • Experience working with offshore support teams
  • Utilizes sound decision-making skills and communicates well with other team members and business users. Identifies problems and recommends solutions.
  • Works in a team environment, including cross-functional teams and teams with business users throughout the company. Interacts with all levels of management and staff across the organization
  • You are comfortable context-switching across a wide variety of platforms and technologies and are able to find ways to clue different technologies together
  • You are comfortable managing a complex, polyglot, and global infrastructure as code, and you understand how to fully automate their management from a centralized git repository.



Apply Now:

This job is closed

Compensation: $105k - $120k *

Location: Vancouver, British Columbia, Canada

This job is closed


Receive similar jobs:

Cover Letter / AI Interview