Site Reliability Engineer – Google Cloud Platform (GCP)
Location:Leeds, West Yorkshire
Salary & Benefits:£47,790-£53,100
At Lloyds Banking Group, we’re driven by our purpose –Helping Britain Prosper. It guides the decisions we make, how we show up each day, and the impact we aim to have for customers, colleagues and communities!
The world is changing rapidly, and we’re evolving with it. This is an exciting time to join us as we modernise our technology platforms and reshape the future of financial services for the better.
About the Team
Our Cloud Platform team is a well‑established, solution‑focused engineering community, delivering one of the UK’s largest technology transformations. We're modernising the bank’s next‑generation cloud platform and partnering closely with product teams to enable secure, scalable and compliant cloud solutions across:
- Analytics
- GenAI/ML
- Databases
- Storage
- Serverless HPC
- Application workloads
Our engineering work spans product curation, data‑platform capability, data segregation, automation, quality assurance, and embedding AI into our workflows. Everything we build aims to empower engineering teams, improve the developer experience and raise delivery standards across the Group.
About the Role
We’re looking for aSite Reliability Engineerwith strong experience inGoogle Cloud Platform (GCP).
You’ll collaborate with Engineering Leads and Product Owners to shape and deliver our platform roadmap. You’ll help plan and prioritise work, automate processes using both traditional and GenAI tooling, remove impediments, and contribute to our continuous improvement culture.
You’ll also have the opportunity to participate in technical communities, work with internal customers across multiple domains, and support early‑career engineers through role‑modelling and mentoring.
Core technology areas for this role include:
- Google Cloud Platform:Analytics, AI, Databases, Serverless products
- Engineering fundamentals:Networking, Security, IAM, Platform Engineering
- Tooling:Terraform, CI/CD (Harness or GitHub Actions), Python, Git workflows, Backstage
- Security & Policy‑as‑Code:Open Policy Agent, Organisation Policy, Security Health Analytics, Wiz
- Observability:Dynatrace
In this role, you’ll spend around half your time resolving production incidents and ensuring operational health, and the other half improving our platform through engineering and automation. Participation in an out‑of‑hours support rota is required.
About You
You enjoy solving complex engineering challenges and improving systems over time. You work collaboratively, communicate clearly and proactively share knowledge. You’re comfortable working in various fields, adapting your approach, and continuously learning as technology evolves.
You value inclusion, teamwork and mentorship, and you enjoy contributing to community‑based learning and capability uplift across the platform.
Key Responsibilities
- Apply hands‑on engineering to maintain Infrastructure‑as‑Code and CI/CD‑based services
- Deliver enhancements that improve reliability, scalability and customer experience
- Reduce toil and improve efficiency through automation and new tooling adoption
- Drive operational perfection across monitoring, incident management, problem resolution, cost optimisation and reliability
- Be responsible for the health of production and non‑production environments and lead incident response activities
- Investigate and fix service‑related issues using code‑first engineering approaches
- Contribute to Agile ceremonies and support continuous team improvement
- Provide clear and regular communication of incident status to stakeholders
- Apply SRE practices and introduce chaos engineering where appropriate to strengthen resilience
Essential Skills & Experience
- Strong DevOps and cloud‑engineering background, including IaC (Terraform) and CI/CD pipelines (Jenkins, Harness, Azure DevOps or similar)
- Experience working with a broad range of public‑cloud technologies
- Ability to write, update and maintain scripts (Python, Groovy, PowerShell, Bash)
- Strong understanding of cloud security principles
- Excellent problem‑solving skills and structured logical thinking
- Experience with observability and monitoring tools
Desirable Skills
- Experience using SDKs and APIs to deliver automation
- Certifications in GCP or another cloud provider (e.g., Azure)
- Transferable experience from sysadmin, software engineering or other technical subject areas
- Technology‑agnostic approach and willingness to adopt the best tool for the job
- Curiosity and aim to learn continuously, applying emerging cloud best practices
What You’ll Get in Return
We’re committed to equal opportunity and ensuring colleagues represent the communities we serve. Your growth, wellbeing and development matter to us.
You’ll receive:
- A performance‑based share bonus
- A flexible benefits allowance
- A generous pension contribution
- Private health cover
- Up to 30 days annual leave (plus ability to purchase more)
- Access to a range of colleague share schemes
If this sounds like the kind of work you want to be part of – we’d love to hear from you.
Working With Us
We’re committed to building an inclusive organisation that reflects modern society and celebrates diversity in all its forms. We want colleagues to feel they belong and can be their best, regardless of background, identity or culture.
If you’d like reasonable adjustments made during the recruitment process, please let us know.