Cloud Infrastructure Site Reliability Engineer.

  • technology
  • permanent
  • London

Cloud Infrastructure Site Reliability Engineer (SRE)

£55,000 - £65,000

Fully remote

Due to the nature of the position, candidates must be eligible for and willing to undergo Security Clearance.

My client is a household name and global organisation that delivers innovative, digitally enabled solutions to transform, simplify and support its customers. They are recruiting for a Site Reliability Engineer to support customers using their public cloud infrastructure.

Job Description:

The Cloud Infrastructure Site Reliability Engineer (SRE) supports the public cloud infrastructure used to deliver public cloud hosted managed services to customers.

You will have a strong customer focus and be actively involved in the support and development of the service, including the resolution of support cases, live service monitoring and maintenance, new service provision and continuous improvement projects. You will provide high-quality operational and technical support to customers and will be responsible for availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning. You must have excellent technical knowledge across Microsoft public cloud services (Azure and Microsoft 365) and a good knowledge of security practices in a regulated environment. Flexibility to work out of hours, including on call, is required.

This is an exciting opportunity for a highly experienced Microsoft Azure Cloud Engineer with operational support and project delivery experience. You will provide L3/L4 analytical incident management and resolution, alongside project-based deliverables, across a large, expanding customer base to ensure quality service delivery and Service Level Agreement compliance.

What you will be doing:

  • Contribute to the planning of application / infrastructure releases and configuration changes
  • Resolve support requests from customers by phone, email and online, using the call logging system
  • Interact with key internal stakeholders and external third-party vendors to troubleshoot and resolve complex problems
  • Provide input into the administration and maintenance of all production and development environments
  • Create detailed technical and procedural documentation (e.g. architecture, configuration, and setup)
  • Design appropriate metrics for reporting on key performance and quality indicators, particularly in terms of in-depth trend analysis
  • Carry out service transition and complete Operational Acceptance (OA) of new customer services
  • Implement and deliver Microsoft Azure projects
  • Training and development - learn about the latest public cloud products and services and increase your knowledge

What we are looking for:

  • Strong knowledge of Microsoft Azure and its relevant build, deployment, automation, networking, and security technologies in cloud and hybrid environments
  • AZ-104 - Microsoft Certified: Azure Administrator Associate
  • Operational experience supporting Microsoft public cloud technologies and services at an enterprise level (multi-tenant), with in-depth knowledge of the following:
      • Azure Active Directory / Entra ID and Infrastructure Service
      • Azure Backups
      • Azure Compute (IaaS VMs)
      • Azure Migrate
      • Azure Monitor and Log Analytics
      • Azure Networking
      • Azure Site Recovery (ASR)
      • Azure Storage
      • ARM Templates (JSON)
      • Microsoft Defender for Cloud and Endpoint
  • In-depth knowledge of a scripting language (PowerShell, Bash, Azure CLI)
  • Experience with helpdesk IT Service Management tools (e.g. BMC Remedy / ServiceNow)
  • Experience with Azure DevOps - deploying Infrastructure using CI/CD pipelines
  • Previous experience with infrastructure-as-code and immutable builds (e.g. Terraform)
  • Experience with deployment and management of container technologies (e.g. Kubernetes, AKS and Docker)
  • Embraces challenges
  • Ability to quickly learn new technologies
  • Good problem-solving and communication skills
  • Ability to work well with individuals and teams

Desired skills and experience:

  • Experience of Infrastructure migrations to the Azure Cloud
  • Experience with other public cloud technologies and services (e.g. AWS / GCP)
  • Azure or AWS Certifications
  • Any exposure to Agile working practices
  • Experience with deployment and management of Azure PaaS database technologies (e.g. Azure SQL)
  • Experience of hardening IT infrastructure based on security audits, standards and industry best practice (e.g. vulnerability scanning, penetration testing and ISO 27001/27017/27018)