Site Reliability Engineer / Senior DevOps Engineer

Location

UK

Hours

37.5

Job description

We're building our Site Reliability capability to support William Hill's vast tech landscape. As a Site Reliability Engineer on our Platform Team, you'll work with an autonomous team to run operations and help improve development pipelines and infrastructure.

About us

Working with us, you'll be at the heart of the technological revolution of one of the world's most trusted betting and gaming companies. We deal with projects ranging from mobile casinos to online sportsbooks and everything in between. The software you write will process 500 online bets per second, accommodate 20 million users, and handle 160 terabytes of data a day. You can be sure there are many more challenges waiting for you.

Your role in the team

Our team is building the next-generation Sports Betting platform, optimised for flexibility, performance, and responsiveness. The project's technologies include Scala (ZIO/Cats/Akka-HTTP/Http4s/Alpakka), Cassandra, Kafka, Kubernetes, AWS, Postgres, and JavaScript (React/Redux). As an SRE, your role will focus on guaranteeing system reliability, performance, and supportability.

SRE is a discipline that combines engineering and operations skills, and is designed to automate and improve operations. You'll place a strong engineering emphasis on building bespoke autonomous solutions that deliver value to end-users early, often, and at a fast pace. Your work will be central to the reputation and trustworthiness of the product, and we'll rely on you to act as an advocate for engineering best practices.

You'll take ownership of process management: automating provisioning, quality controls, security auditing, and maintenance. You'll own continuous improvement of the platform by analysing and improving its efficiency, reliability, and scalability, including deployment tooling and infrastructure. You'll create software delivery pipelines to increase development teams' velocity and confidence.

Communication is key: you'll advocate for reliability across product teams, draft design documents, and present solutions to stakeholders. As infrastructure liaison, you'll help infrastructure teams plan disaster recovery drills, coordinate patching, oversee upgrades, and ensure compliance with best practice.

The role requires the flexibility to take part in rotating on-call duties and in timely post-mortems of production incidents.

Skills and experience

You're fluent in cloud / AWS infrastructure and confident in deploying automation solutions. You have strong experience with DevOps practices and toolsets, such as Terraform, Packer, Lambda, GitLab, CI/CD delivery models and infrastructure-as-code. You're confident using containerisation (Docker, Helm) and monitoring and logging stacks / tools, e.g. New Relic, Splunk. You're fluent in at least one widely used front-end or back-end programming language (e.g. Java / JavaScript) and can debug code locally and remotely, with a strong understanding of all levels of a distributed system. Any knowledge of Scala would be an advantage, but is not essential.

You'll need strong communication skills and the ability to align diverse stakeholder groups on complex technical decisions. You can influence business stakeholders to align on architectural changes, and you can use your expertise to coach and mentor colleagues around you, harnessing the team’s potential.

You're entrepreneurial and excited about trying new technologies and architectural solutions, but you don't shy away from maintaining and upgrading legacy systems.

Desirable

  • Familiarity with orchestration tools (Kubernetes, Istio)
  • Experience working in Agile teams and adopting Agile SDLC practices
  • Experience managing technical priorities within the backlog, plus constant upkeep of documentation and runbooks

What we offer

We take the safety and wellbeing of our employees seriously, especially at this challenging time. We have put in place robust Covid-19 measures and are supporting our new employees with a smooth remote onboarding and training programme to make you feel welcome and part of our team. We offer a balanced approach to office and home working, now and for the long-term future.

We’ll welcome you on board with 25 days' holiday (not to mention 8 bank holidays and an extra day for your birthday), a rewarding bonus scheme, healthcare, an attractive pension package, healthy gym discounts and a staff benefits scheme.

Our SRE roles are based in the UK, where we have offices in Leeds and London. We embrace flexible working and will consider all remote working options, but we will want you to join us in the office on a reasonably regular basis for some collaborative team time!

Join us #behindthebet