About the Role
Key Responsibilities:
Reporting and Data Management:
- Maintain accurate, up-to-date operational dashboards and system performance reports.
- Generate regular service reliability, incident, and availability reports that provide actionable insights for engineering and leadership teams.
- Analyze trends in incidents, alerts, capacity, and system performance to identify opportunities for optimization.
- Ensure data quality, consistency, and completeness across monitoring and observability tools.
- Collaborate with DevOps and Cloud Engineering to collect and consolidate metrics required for SLO/SLI reporting.
- Support automation of reporting processes to reduce manual effort and improve real-time visibility.
- Organize and maintain documentation repositories, runbooks, and technical knowledge bases.
Ticketing Tool Support:
- Monitor, triage, and manage operational tickets related to infrastructure, system ...
Ready to Apply?
Submit your application today and take the next step in your career journey with Proactis.