£650 Per day
Undetermined
Undetermined
Sheffield, South Yorkshire, UK
p> Principal responsibilities
- Own the day-to-day health, uptime, monitoring, and reliability of services and server infrastructure
- Participate in architecture and design reviews to provide recommended improvements to the development teams to improve the reliability and performance of applications
- Minimize manual involvement by implementing continuous improvements that create an operating environment, including the development of new tools, dynamically monitoring, alerting, automated self-healing recovery
- Engage in application performance analysis and system tuning, and capacity planning
- Perform root cause analysis to identify implement continuous improvements
- Capable of presenting analyses and recommendations to leadership or discussing the technical merits of solutions with engineers and architects
- Moderate level of technical complexity experience with multiple integrated applications
- Engineer solutions on GCP foundation platform using Infrastructure as Code methods (eg Terraform)
- Ensure compliancy with Operational risk standards (eg Network, Firewall, OS, Logging, Monitoring, Availability, Resiliency
- Build and support continuous integration (CI), continuous delivery (CD) and continuous testing activities
Requirements
- Hands on experience on Cassandra/MongoDB noSQL solutions in performance optimisation
- Experience of building a range of Services in a Cloud Service provider (ideally GCP)
- Demonstrable Cloud service provider experience (ideally GCP) - infrastructure build and configurations of a variety of services including Compute, Storage
- Experience in containerized solution, eg docker/GKE
- Security and Compliance, eg IAM and cloud compliance/auditing/monitoring tools
- Experience in Linux compute system, eg file system/schedule system/boot system and essential command with Shell.
- Excellent skills in at least one of following: Python, Ruby, Java, JavaScript, Go, Groovy, Scala
- Expert understanding of DevOps principles and Infrastructure as a Code concepts and techniques
- Strong understanding of DevOPS/CI/CD and available tools, including Jenkins, Cloudbees, Ansible
- Experienced in full automation and configuration management
- A track record of constantly looking for ways to do things better and an excellent understanding of the mechanism necessary to successfully implement change
- A successful track record of delivering complex projects and/or programmes, utilising appropriate techniques and tools to ensure and measure success, as well as making a good documentation to record what we have achieved.
- Built effective networks across business areas, developing relationships based on mutual trust and encouraging others to do the same
- Customer/stakeholder focus.
- Ability to build strong relationships with Application teams, cross functional IT and global/local IT teams
- Good leadership and teamwork skills - Works collaboratively in an agile environment with DevOps application pods' to provide GCP specific capability/skills required to deliver the service.
- Operational effectiveness - delivers solutions that align to approved design patterns and security standards
- A comprehensive understanding of risk management and proven experience of ensuring own/others' compliance with relevant regulatory processes.
- Hands-on with opensource products/services will be a plus, eg Kafka/Rabbit/Zookeeper/Keycloak/Cassandra/Mongo/ClickHouse/Airflow/Beam/Grafana/Promethuse/Telegraf/Nginx/Jupyter
- Excellent written and spoken communication skill will be a plus.
- Problem-solving and critical thinking mindset will be a plus.