Principal Site Reliability Engineer

I’m interested


Leeds, London and Krakow



Job description

We're building our Site Reliability capability in order to support William Hill's vast tech landscape. As a Principal Site Reliability Engineer, you'll work with an autonomous team to run operations and help improve development pipelines and infrastructure as well as potentially acting as a mentor to others within the team.

About us 

Working with us, you'll be at the heart of the technological revolution of one of the world's most trusted betting and gaming companies. We deal with projects ranging from mobile Casinos to online Sportsbooks and everything in between. The software you write will process 500 online bets per second, accommodate 20 million users, and process 160 terabytes a day. You can be sure there are many more challenges waiting for you. 

Your role in the team 

The Principal SRE has in-depth knowledge and experience of large-scale E-Commerce platforms, combining analytical, engineering and operations skills. They are focused on automating and improving operations whilst leading, shaping and mentoring a high-performing team of SREs.  Their job is to guarantee system reliability, performance, and supportability with an emphasis on building autonomous solutions that deliver value to end-users early, often, & fast. They are central to the reputation and trustworthiness of the product and act as an advocate for engineering best practices - they are laser focused on improving the customer experience 

Leadership - Lead through example, mentoring SRE’s and driving quality through continuous improvement.

Platform improvement - Use a data driven approach and strong analytical skills to tenaciously improve the reliability, and scalability of the platform

Communicate – Promotes and communicates across product team on reliability topics. Is an advocate for Engineering best practices, drafts design documents and presents solutions to stakeholders

Infrastructure liaison – Help Infrastructure teams plan disaster recovery drills. Coordinate patching, upgrades, and compliance with best practices 

Skills and experience

Technical skills:

·       Strong understanding of cloud and on-premise infrastructure, such as AWS, VM’s physical servers, networks

·       Strong understanding of containerization (Docker, Helm) and orchestration (Kubernetes, Istio)

·       Strong experience with monitoring and logging stacks/tools e.g. NewRelic, Splunk, Promotheus

·       Strong experience of troubleshooting complex problems, working across different development teams, programming languages and design patterns

Agile/digital experience:

·       Experience working in Agile teams and Agile SDLC practices

·       Experience managing technical priorities within the Backlog

·       Experience with constant upkeep of documentation and runbooks

Individual skills:

·       Leadership – Ability to lead through examples, mentoring and training SRE’s whilst remaining hands on

·       Communication - Displays strong communication skills with ability to align diverse stakeholder groups on complex technical decisions

·       Influencing skills - Ability to influence business stakeholders to align on arch changes

·       Entrepreneurial - Is excited about trying new technologies and architectural solutions, but doesn’t shy away of maintaining and upgrading legacy systems

·       Active coach and mentor whose goals are to grow and maximize the team’s potential


What we offer


Our flexible benefits package includes a competitive package and bonus, a range of lifestyle benefits plus a heavily discounted zone 1-6 Oyster card if you join us in London. In addition to the standard 25 days holiday, you'll get an extra day off for your birthday. If you're looking for long term career development, our global business is the ideal place to establish yourself and make an impact


We take the safety and wellbeing of our employees seriously especially at this challenging time. We have put in place robust Covid 19 measures and are supporting our new employees with a smooth mixture of in-person & remote onboarding and training programme to make you feel safe, welcome and part of our team.


We offer a balanced approach to office and home working - now and for the long-term future. We know that not everyone is the same, many people have embraced the home working whilst others can't wait to get back. Therefore, we are offering our employees the opportunity to work from home up to 80% of the time with 20% of office time built in to ensure we get some face-to-face collaborative team time - and the chance for a coffee and catch up!


Join us #behindthebet