We are hiring a Senior Site Reliability Engineer to join the HashSphere engineering team.
This role is hands-on and high impact. You will own the design, deployment, and reliability of mission-critical, multi-region infrastructure used by large organizations across financial services, supply chain, and healthcare. You will partner closely with product, security, and platform leadership to shape architecture, reliability strategy, and operational standards from the ground up.
This is not a support role. We are looking for someone who has operated real systems at scale, measured outcomes, and taken end-to-end ownership.
Design, build, and operate highly available, multi-region distributed systems with clear recovery strategies and tested RTO/RPO.
Partner with the Head of SRE to define the reliability roadmap, platform architecture, and operational standards.
Own large-scale Infrastructure as Code using Terraform, including reusable modules, multi-account patterns, and policy guardrails.
Operate and scale Kubernetes environments (EKS, GKE, or AKS) using GitOps practices (ArgoCD), Helm, and strong RBAC and network policies.
Build and maintain secure CI/CD pipelines, including blue/green and canary deployments, promotion and rollback strategies, and artifact integrity (SBOM, signing).
Define and improve SRE practices, including SLOs, error budgets, observability, and measurable reductions in MTTR/MTTA.
Work closely with product and engineering teams to translate customer and business requirements into reliable, secure platform services.
Contribute to the operational support and continuous improvement of customer-facing HashSphere deployments.
Required Qualifications
The candidate should demonstrate true ownership of Azure, not just hands-on implementation. Specifically, they should be able to describe an end-to-end system they owned (from design → production → operations), clearly outline their areas of responsibility, and show experience building infrastructure from scratch (greenfield) with key architectural decisions. They should also have experience handling production incidents. Most importantly, they must be able to distinguish between basic hands-on work and real ownership in a production environment.
7+ years of experience in SRE, platform engineering, or infrastructure engineering operating production distributed systems.
Strong multi-cloud experience (AWS, GCP, or Azure), with SME-level depth in AWS or GCP.
Proven experience running multi-region production systems, including disaster recovery testing, runbooks, and real incident ownership.
Deep, hands-on experience with Kubernetes at scale (EKS/GKE/AKS), including GitOps workflows and production-grade security controls.
Extensive experience with Terraform-first Infrastructure as Code in large, real-world environments (not POCs).
Strong security and compliance mindset, including Zero Trust principles, secrets management (Vault or cloud-native equivalents), and exposure to regulated environments (PCI, SOC 2, HIPAA, NIST).
Comfortable owning systems end to end, with clear metrics and outcomes to show impact.
Nice to Have
Experience with distributed ledger or blockchain systems, particularly private or consortium deployments.
Familiarity with Hedera services such as HCS, HTS, Hedera SDKs, or the Smart Contract Service.
Understanding of EVM-based systems and smart contract tooling (Solidity, Hardhat).
Experience operating active-active, globally distributed architectures.
Prior experience supporting financial services or other highly regulated industries.
Compensation & Package
Equity & Tokens
Performance Bonuses
Health insurance & 401k for US employees only.
Interview Process
1.Recruiter / HR Call
Technical question interview with Ani + Introductory questions
2.Hiring Manager Interview
3.Technical Interview
4.Final Interview
Interview with the VP of Engineering
Build a compelling profile, add your portfolio, and start applying instantly.
Moderation tools and safe messaging keep conversations professional.
Live, transparent vacancies — no gimmicks, just real opportunities.
Negotiate terms quickly and move projects forward without friction.
Create your profile with skills, languages, and availability.
Use filters to find jobs that match your strengths.
Chat with employers and secure your next contract.
Join Online.jobs today and access remote opportunities from anywhere.