Blockchain site reliability engineer
InfStones
About the role
About
Inf Stones is an advanced, enterprise-grade Platform as a Service (PaaS) blockchain infrastructure provider trusted by the top blockchain companies in the world. Inf Stones’ AI-based infrastructure provides developers worldwide with a rugged, powerful node management platform alongside an easy-to-use API. With over 20,000 nodes supported on over 80 blockchains, Inf Stones gives developers all the control they need - reliability, speed, efficiency, security, and scalability - for cross-chain DeFi, NFT, GameFi, and decentralized application development. Inf Stones is trusted by the biggest blockchain companies in the world including Binance, CoinList, BitGo, OKX, Chainlink, Polygon, Harmony, and KuCoin, among a hundred other customers. Inf Stones is dedicated to developing the next evolution of a better world through limitless Web3 innovation.
To date, Inf Stones has raised over $110 million in capital and is backed by Softbank, GGV Capital, Susquehanna International Group (SIG), Dragonfly Capital, Qiming Venture Partners, Plug and Play, and many renowned institutional investors.
If you enjoy being on the cutting edge of technology, we encourage you to
Requirements
- Strong Linux system administration skills (networking, performance tuning, debugging, security).
- Expertise with at least one mainstream programming language such as Golang, Python, Javascript, Rust, etc., and have good programming skills and programming habits.
- Experience with monitoring/alerting tools (e.g., Prometheus, Grafana, ELK, etc.).
- Strong problem-solving skills and the ability to respond quickly under pressure.
- Solid technical documentation skills.
Responsibilities
- Deploy, monitor, and maintain blockchain nodes across multiple networks.
- Ensure system reliability and uptime by actively managing incidents, troubleshooting, and resolving node failures.
- Develop automation and maintenance tools (using Golang, Shell, Python, etc.) to streamline operations.
- Build and maintain monitoring, alerting, and logging systems to proactively detect and address issues.
- Collaborate with engineering teams and solution architects on reliability improvements and incident prevention.
- Participate in the on-call rotation to provide timely incident response and resolution.
Skills
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free