Full-time

Site Reliability Engineer in Growing Team

Posted by Hiive • vancouver, metro vancouver regional district, Canada

📍 vancouver, metro vancouver regional district 🕒 June 05, 2026

About the Role

Join a dynamic infrastructure team as a Site Reliability Engineer. Focus on enhancing platform reliability, ensuring availability, and supporting AI workloads for improved system performance.
In this role, you'll directly impact platform operational performance and reliability. Collaborating with DevOps and engineering teams, you will help build scalable infrastructure and address incident responses. You'll play a key role in implementing security measures and improving observability for AI systems.
Key Responsibilities:
• Maintain platform reliability and availability
• Optimize and secure infrastructure systems
• Proactively address scaling and reliability challenges
• Configure monitoring and incident response strategies
• Support AI/ML infrastructure and workloads
Requirements:
• Experience in Site Reliability Engineering or similar
• Proven skills with AWS, particularly EKS and RDS
• Familiarity with Kubernetes for production environments
• Prof...

Ready to Apply?

Submit your application today and take the next step in your career journey with Hiive.

Apply Now