Site Reliability Engineer (CET +/- 2)
Join a world-class team reshaping cloud infrastructure from the ground up. Work on cutting-edge tech, learn from OS veterans, operate at scale—fully remote, deeply technical, and zero bureaucracy.
We usually respond within a day
Site Reliability Engineer (CET +/- 2)
About Us
We’re a fast-growing startup in the cloud computing space. We believe that while cloud platforms are functionally great and quite powerful, they are built on legacy software and are irritably inefficient (and expensive!). Based on award-winning research and open-source tech, we have built Unikraft Cloud, a next-generation cloud platform that allows for order-of-magnitude better efficiency, performance, and security.
If you’re passionate about cloud infrastructure, love solving real-world problems, thrive in customer-facing roles, and enjoy working with cutting-edge technologies we want you on our team!
What You’ll Do
- Maintain and operate customer on-prem and cloud deployments of our platform, ensuring reliability and rapid troubleshooting of technical issues.
- Plan, package, and roll out software updates both internally and to customers, including testing and validation.
- Collaborate with engineering to ensure quality deployments and maintain a high standard of product reliability.
- Set up and manage monitoring systems to proactively detect and resolve issues in production environments.
- Write scripts and automation for deployment, infrastructure management, and CI/CD workflows.
- Deploy, manage, and troubleshoot Kubernetes clusters for reliable, scalable infrastructure.
- Build tooling and automation to streamline deployment and platform integration.
- Contribute to continuous integration pipelines that catch regressions across components and system integrations.
- Create and maintain clear documentation for systems, processes, and tools to support team effectiveness.
What We’re Looking For
- Proven experience in Linux system administration, software packaging, and delivery.
- Solid understanding of Linux networking fundamentals including firewalls, DNS, proxies, and best practices.
- Experience managing and troubleshooting Kubernetes clusters in production.
- Good understanding of the CNCF/cloud-native landscape and associated tools.
- Familiarity with observability tools such as Prometheus and Grafana.
- Basic scripting skills (e.g., Bash, Python).
- Familiarity with cloud platforms (e.g., AWS, GCP, Azure).
- Interest in automation tools like Ansible, Terraform, or similar.
- Exposure to CI/CD pipelines (e.g., GitHub Actions, Jenkins, GitLab CI).
- Familiarity with microservice architectures, Serverless, and DevOps best practices.
- Familiarity with virtualization solutions like QEMU/KVM. Micro-VMMs like Cloud-Hypervisor or Firecracker are a plus.
Mindset
- Eagerness to learn and take on new challenges.
- Strong problem-solving skills and a curious, analytical mindset.
- Enthusiasm for building reliable, high-performance systems.
- Team player with good communication skills.
- Ability to quickly adapt to new programming languages, runtimes, and environments.
Why This Role is Career-Defining
- Help revolutionize the future of cloud compute runtime while embracing continuously evolving modern technologies.
- Work alongside a high-energy, top-notch, technical, and entrepreneurial team.
- Make impactful contributions and help shape our rapidly growing company.
- Gain deep hands-on experience with infrastructure and modern DevOps practices while learning from experienced engineers.
Why You’ll Love This Team
World-class Engineering: Collaborate with OS veterans, kernel hackers, and distributed systems experts.
No Crud: Founder-led, product-obsessed, and deeply technical. The best tech argument wins!
Groundbreaking Technology: Our tech powers the future of cloud infrastructure – come build it.
Build your Favorite Work Set up: A generous equipment budget to spend on anything you need to do your best work.
Fully Remote, Fully Flexible: Work from your favorite place, work at your favorite and most productive times.
Retreats, Game Nights and More: Fun-focused team retreats and other events to recharge and build great relationships.
The Standard Stuff: Competitive Salary, 6 weeks vacation, development opportunities.
- Department
- Platform Team
- Role
- Platform Engineer
- Remote status
- Fully Remote
