About the Role
A large Wealth Management firm operating under a Broker-Dealer model is seeking an experienced **Site Reliability Engineer** to support feature development on its newly built Trading Platform. The platform has been in development for two years and is currently in a stabilization phase, with a production launch targeted in four months.
Req# 1023988611
**Responsibilities**
+ Implement and champion DevOps and SRE best practices across the organization
+ Drive technology roadmap discussions for the SRE team
+ Define, craft, and maintain SLIs and SLOs, along with key metrics including MTTR, Lead Time for Change, Deployment Frequency, and Change Failure Rate
+ Design, develop, and manage monitoring, alerting, and observability solutions using Dynatrace, Splunk, and Grafana
+ Conduct performance assessments, identify bottlenecks, and recommend enhancements to improve system performance
+ Partner with application teams to enforce performance and...
Req# 1023988611
**Responsibilities**
+ Implement and champion DevOps and SRE best practices across the organization
+ Drive technology roadmap discussions for the SRE team
+ Define, craft, and maintain SLIs and SLOs, along with key metrics including MTTR, Lead Time for Change, Deployment Frequency, and Change Failure Rate
+ Design, develop, and manage monitoring, alerting, and observability solutions using Dynatrace, Splunk, and Grafana
+ Conduct performance assessments, identify bottlenecks, and recommend enhancements to improve system performance
+ Partner with application teams to enforce performance and...
Ready to Apply?
Submit your application today and take the next step in your career journey with EPAM Systems.
Apply Now