About the Role
Required Skills & Experience
~6–8+ years of experience in a Site Reliability Engineer or similar role (10+ years total IT experience)
Deep expertise working in Azure cloud environments at scale
Extensive hands-on experience with monitoring and observability tools (e.g., Elastic, Prometheus, Grafana, or similar), including designing and architecting monitoring strategies
Proven experience supporting production applications in complex, high-availability environments (application-focused SRE vs. infrastructure-only)
Strong knowledge of Kubernetes (AKS) for monitoring, alerting, administration, and troubleshooting
Ability to troubleshoot and debug applications at a deep level, including reading, understanding, and reviewing code
Solid experience with .NET/C# application environments
Experience with databases (SQL and/or NoSQL such as Cosmos DB, PostgreSQL, etc.)
Demonstrated ability to mentor engineers and drive SRE adoption across teams
Ni...
Ready to Apply?
Submit your application today and take the next step in your career journey with Insight Global.