Build reliable, “fit for purpose” tools and processes to enable our engineers to become more productive and autonomous (drive a shift left)Optimise, report and forecast our licensing usage for key vendors we use to manage our Cloud (GitLab, HashiCorp, Datadog, Atlassian for example).Take responsibility for driving the optimisation of our AWS costs.Manage and drive the adoption of observability within the engineering teams (Service Level Indicators & Objectives), best practices for monitoring various cloud services.Ensure key operational processes (on-call, incident, ORR etc.) are continually improved, documented and communicated.Collaborate with Engineers, QA, Security and Architecture teams to enable you to be successful.Maintain up-to-date documentation.Perform scheduled maintenance.Spread a SRE culture within Aircall teams.Able to context switch when required.mortar_board: 5% of your time will be dedicated to learning and improving soft skills or technical skills!
You are pragmatic, can challenge the status-quo, can work autonomously and are curious. You are able to embrace change at pace.You have hands-on experience with containerized environments.You have hands-on experience with a cloud provider (AWS would be the preference).You have strong experience with continuous integration and continuous deployment pipelines, GitlabCI, ArgoCDYou have strong experience implementing, running and optimising modern Observability solutions (ideally Datadog but experience with NewRelic, Dynatrace, Prometheus/Grafana/ELK etc)You have experience with large scale, complex distributed systems.You have strong coding skills.You have a good experience with Infrastructure as Code (IAC) – Terraform and Cloudformation.