Grafana Jobs in Web3

6 jobs found

web3.career is now part of the Bondex Logo Bondex Ecosystem

Receive emails of Grafana Jobs in Web3

About BitFit Labs

 
We are building an enterprise-grade blockchain custody platform powered by advanced Multi-Party Computation (MPC) Threshold Signature Scheme (TSS). Our solution provides institutional-grade security for digital asset custody through distributed key management technology — designed for the security requirements of banks, funds, and Web3 institutions.
 
Core platform features:

  • 2/3 threshold signature distribution across isolated key nodes
  • Multi-chain support: Bitcoin, Solana, EVM-compatible chains (Ethereum, BNB Chain, Avalanche, Arbitrum, Polygon, etc.), and TRON
  • Hardware wallet integration: Ledger and Trezor
  • Secure key management: TSS nodes distributed across isolated environments with HSM integration

 

About the Role

You will be the primary infrastructure owner for a hybrid cloud + on-premise architecture: AWS-hosted application and API layers, combined with on-premise TEE (Trusted Execution Environment) servers used exclusively for MPC key computation. Both environments are equally important — you will design, build, and operate both.
 
This is a high-ownership, high-impact role at an early stage. You will work directly with the founding team and have real influence over architecture decisions.
 
 


Key Responsibilities


 
Infrastructure & Environment Management

  • Design, build, and maintain secure, highly available AWS environments using Infrastructure as Code (Terraform / CDK)
  • Set up and manage on-premise TEE infrastructure for secure MPC key computation
  • Deploy and maintain all critical components: MPC TSS nodes (BTC, SOL, EVM, TRON), backend API servers, frontend web application, and blockchain full nodes


Blockchain Infrastructure

  • Configure and monitor full node synchronization for Ethereum and other supported chains
  • Manage RPC endpoints and load balancing across multiple nodes
  • Ensure high availability of blockchain connectivity with automated failover and health checks


CI/CD & Deployment Automation

  • Design and maintain end-to-end CI/CD pipelines using GitHub Actions or GitLab CI
  • Implement blue-green and canary deployment strategies for zero-downtime releases
  • Provide self-service build, test, and deployment tooling for the development team


Monitoring & Observability

  • Build and maintain monitoring, logging, alerting, and tracing systems (Prometheus, Grafana, ELK Stack, Jaeger)
  • Monitor all layers: AWS infrastructure, on-prem TEE nodes, blockchain nodes, MPC computation, and application performance
  • Maintain SLA commitments with proactive alerting and incident response


Security & Compliance

  • Enforce strict IAM, network segmentation, and zero-trust principles across both cloud and on-prem environments
  • Integrate HSM/KMS for secure key management and MPC TSS operations
  • Conduct regular security scans, vulnerability assessments, and penetration testing
  • Manage SSL/TLS, Nginx reverse proxies, and firewall rules


High Availability & Disaster Recovery

  • Design and implement cross-AZ and cross-environment disaster recovery plans
  • Execute regular DR drills and maintain runbooks
  • Maintain 99.9%+ uptime for all critical wallet services


 

Requirements

  • 5+ years of DevOps, SRE, or cloud infrastructure engineering experience with large-scale distributed systems
  • Expert-level knowledge of AWS and its managed services
  • Deep hands-on experience with Docker and Kubernetes in production environments
  • Proficiency with GitHub Actions or GitLab CI/CD
  • Strong scripting or programming ability in at least one of: Go, Python, or Shell
  • Solid Linux administration experience (Ubuntu / RedHat)
  • Strong understanding of networking: firewalls, SSL/TLS, load balancing, DNS
  • Hands-on experience with Prometheus, Grafana, or equivalent monitoring solutions
  • Excellent technical documentation and cross-team communication skills


 

Nice to Have

  • Direct experience operating Ethereum, Bitcoin, or other blockchain full nodes
  • Familiarity with Web3 technologies, smart contracts, or DeFi protocols
  • Understanding of MPC, cryptographic key management, or HSM integration
  • Experience with service mesh technologies (Istio, Linkerd)
  • Chaos engineering or fault injection testing experience
  • Experience with HashiCorp Vault, AWS Secrets Manager, or similar secrets management tools
  • Familiarity with Redis, TimescaleDB, or InfluxDB
  • Security certifications (CISSP, AWS Security Specialty, or equivalent)
  • Experience with multi-cloud or hybrid cloud architectures