We are looking for an excellent Site Reliability Engineer who brings extensive experience working in 24/7, highly available production environments that operate at scale. If you know cloud-based infrastructure through and through and have experience implementing robust monitoring and observability plans, then check this out!
Who are we?
We are a mission-driven, leading provider of End-to-End Electronic Medical Record and Practice Management technology in Applied Behavioral Analysis (ABA). We are transforming the landscape of healthcare and education, producing superior outcomes for people with Autism and Intellectual or Developmental Disabilities. With over 150,000 active users and growing, our technology is changing lives on a daily basis.
What is the job?
As a Senior Site Reliability Engineer on our team, you will support a 24/7, highly available production environment. Your responsibilities will be heavily weighted towards availability, latency, performance, and monitoring/observability.
You will develop automated observability, APM, and monitoring plans for capacity prediction and planning, including setting and maintaining SLOs and SLIs based on current and future SLAs. You will implement the latest and greatest tooling, including Grafana, New Relic, Splunk, and Prometheus, to ensure the multi-environment observability stack is fully automated.
What technical skills are necessary for this role?
What is in it for you?
This is an immediate hire and we are looking to move quickly. Please apply today for consideration!