Public Cloud - Google SRE Engineer

Lloyds Banking Group • Leeds, UK • 1d ago

Site Reliability Engineer – Google Cloud Platform (GCP)

Location:Leeds, West Yorkshire

Salary & Benefits:£47,790-£53,100

At Lloyds Banking Group, we’re driven by our purpose –Helping Britain Prosper. It guides the decisions we make, how we show up each day, and the impact we aim to have for customers, colleagues and communities!

The world is changing rapidly, and we’re evolving with it. This is an exciting time to join us as we modernise our technology platforms and reshape the future of financial services for the better.

About the Team

Our Cloud Platform team is a well‑established, solution‑focused engineering community, delivering one of the UK’s largest technology transformations. We're modernising the bank’s next‑generation cloud platform and partnering closely with product teams to enable secure, scalable and compliant cloud solutions across:

Analytics
GenAI/ML
Databases
Storage
Serverless HPC
Application workloads

Our engineering work spans product curation, data‑platform capability, data segregation, automation, quality assurance, and embedding AI into our workflows. Everything we build aims to empower engineering teams, improve the developer experience and raise delivery standards across the Group.

About the Role

We’re looking for aSite Reliability Engineerwith strong experience inGoogle Cloud Platform (GCP).

You’ll collaborate with Engineering Leads and Product Owners to shape and deliver our platform roadmap. You’ll help plan and prioritise work, automate processes using both traditional and GenAI tooling, remove impediments, and contribute to our continuous improvement culture.

You’ll also have the opportunity to participate in technical communities, work with internal customers across multiple domains, and support early‑career engineers through role‑modelling and mentoring.

Core technology areas for this role include:

Google Cloud Platform:Analytics, AI, Databases, Serverless products
Engineering fundamentals:Networking, Security, IAM, Platform Engineering
Tooling:Terraform, CI/CD (Harness or GitHub Actions), Python, Git workflows, Backstage
Security & Policy‑as‑Code:Open Policy Agent, Organisation Policy, Security Health Analytics, Wiz
Observability:Dynatrace

In this role, you’ll spend around half your time resolving production incidents and ensuring operational health, and the other half improving our platform through engineering and automation. Participation in an out‑of‑hours support rota is required.

About You

You enjoy solving complex engineering challenges and improving systems over time. You work collaboratively, communicate clearly and proactively share knowledge. You’re comfortable working in various fields, adapting your approach, and continuously learning as technology evolves.

You value inclusion, teamwork and mentorship, and you enjoy contributing to community‑based learning and capability uplift across the platform.

Key Responsibilities

Apply hands‑on engineering to maintain Infrastructure‑as‑Code and CI/CD‑based services
Deliver enhancements that improve reliability, scalability and customer experience
Reduce toil and improve efficiency through automation and new tooling adoption
Drive operational perfection across monitoring, incident management, problem resolution, cost optimisation and reliability
Be responsible for the health of production and non‑production environments and lead incident response activities
Investigate and fix service‑related issues using code‑first engineering approaches
Contribute to Agile ceremonies and support continuous team improvement
Provide clear and regular communication of incident status to stakeholders
Apply SRE practices and introduce chaos engineering where appropriate to strengthen resilience

Essential Skills & Experience

Strong DevOps and cloud‑engineering background, including IaC (Terraform) and CI/CD pipelines (Jenkins, Harness, Azure DevOps or similar)
Experience working with a broad range of public‑cloud technologies
Ability to write, update and maintain scripts (Python, Groovy, PowerShell, Bash)
Strong understanding of cloud security principles
Excellent problem‑solving skills and structured logical thinking
Experience with observability and monitoring tools

Desirable Skills

Experience using SDKs and APIs to deliver automation
Certifications in GCP or another cloud provider (e.g., Azure)
Transferable experience from sysadmin, software engineering or other technical subject areas
Technology‑agnostic approach and willingness to adopt the best tool for the job
Curiosity and aim to learn continuously, applying emerging cloud best practices

What You’ll Get in Return

We’re committed to equal opportunity and ensuring colleagues represent the communities we serve. Your growth, wellbeing and development matter to us.

You’ll receive:

A performance‑based share bonus
A flexible benefits allowance
A generous pension contribution
Private health cover
Up to 30 days annual leave (plus ability to purchase more)
Access to a range of colleague share schemes

If this sounds like the kind of work you want to be part of – we’d love to hear from you.

Working With Us

We’re committed to building an inclusive organisation that reflects modern society and celebrates diversity in all its forms. We want colleagues to feel they belong and can be their best, regardless of background, identity or culture.

If you’d like reasonable adjustments made during the recruitment process, please let us know.