Intermediate Site Reliability Engineer, Cloud Cost Utilization
United Kingdom, Full-Time, Middle
Salary not disclosed
Job Details
- Required Skills: AWS, GCP, Grafana, Prometheus, Terraform, Ansible
Requirements
- Hands-on experience with cloud cost management in GCP and/or AWS.
- Familiarity with billing data, pricing models, and cloud optimization approaches.
- Familiarity with or interest in adopting the FinOps FOCUS specification for multi-cloud cost analysis.
- Experience designing or implementing cloud resource tagging and labeling strategies.
- Experience with infrastructure as code tools, specifically Terraform and Ansible.
- Familiarity with observability tooling, including Grafana.
- Ability to connect reliability and cost signals with operational data.
- Strong cross-functional collaboration skills to work with Engineering and Finance.
- Ability to explain technical cost data clearly to non-engineering audiences.
- Self-directed approach suitable for a fully remote and asynchronous environment.
Responsibilities
- Design and maintain cloud resource tagging and labeling strategies across GCP and AWS to support accurate cost attribution.
- Develop tooling and pipelines to ingest, normalize, and report on cloud billing data using the FOCUS specification.
- Automate cost anomaly detection, forecasting, and alerting to help engineering teams respond to infrastructure spend changes.
- Contribute to observability and monitoring stacks (Prometheus, the LGTM stack of Loki, Grafana, Tempo, and Mimir, and ELK) to surface cost efficiency signals.
- Partner with Finance and Engineering leadership to support cloud cost forecasting and planning.
- Act as a subject matter expert for cloud cost attribution and tagging strategy.
- Collaborate with Finance and Compliance teams on audits and financial reporting requirements.
- Integrate cost controls and tagging requirements into provisioning workflows using Terraform and Ansible.