We use cookies on our site to track usage and preferences. Learn more

Site Reliability Engineer

  • Closing date 24 Apr 2020
  • Type Full-time
  • Duration Temporary

Site Reliability Engineer - Full-Time / Temporary

 

The Royal Society of Chemistry (RSC) is the world’s leading chemistry community, advancing excellence in the chemical sciences. The UK’s professional body for chemical scientists with over 50,000 members worldwide, we are a £50m-turnover not-for-profit organisation with 600 staff operating around the world and an award-winning global knowledge business. For 175 years we have promoted, supported, and celebrated chemistry. We work to shape the future of the chemical sciences – for the benefit of science and humanity.

 

Working within the SRE team and your colleagues across the directorate, you will focus on continually improving our ability to monitor, alert and automate manual operational work.  Your work will enable the Royal Society of Chemistry to maximise service delivery, high availability, performance optimisation and ensuring that services are efficient at accomplishing their duties, even as those duties scale and evolve.

 

We are looking for:

 

Essential:

 

  • Experience of managing and securing Windows server at scale.
  • Experience with cloud hosting datacentres (Amazon Web Services, Google Cloud Platform. Microsoft Azure) at scale.
  • Experience with PowerShell or similar scripting language.
  • Knowledge of software security vulnerabilities and mitigation.
  • Experience working within Agile pratices, particularly Scrum and Kanban.
  • Able to perform detailed technical analysis.
  • Experience writing technical and project documentation.
  • Experience working in a DevOps environment.
  • Experience with engineering best practice, such as continuous deployment, configuration management experience, process automation.
  • End to end understanding of modern web architectures (DNS, HTTP, SSL, TCP/IP, Load-balancing, edge-delivery (CDN) to persistence layers), and how to effectively monitor and measure availability and reliability of web applications.
  • Be a fast learner and capable of working with minimal supervision.
  • An ability to explain technical concepts to both technical and non-technical colleagues.

 

Desirable:

  • ITIL certification.
  • Experience of operating in an SRE role at a previous company.
  • Some development experience.
  • Experience with load-balancing and/or traffic management.
  • Experience with APM and system monitoring tools (for example New Relic and/or PRTG).
  • Experience with Microservices Architecture.
  • Experience with Continuous Delivery/Deployment.
  • Ability to quickly learn and implement unfamiliar technologies.

If you are interested, please apply now. If selected for interview, interviews will be arranged remotely via Zoom video call. 

 

Download the full description here

 

Contact us

Thank you for your enquiry!
We'll be in touch soon.

We couldn't send your message.
Please review the fields then try again