The post holder is responsible for building running and supporting durable and efficient solutions for a world-leading petascale genomic endeavour. The post holder provides DevOps/SRE-aligned expertise, feeding into architectural solutions while seeking to identify improvements and efficiencies within new and existing tooling.
Genomics England is progressing a strategy to move key applications to AWS. However, currently we have @ 800 VM within UK Cloud. These systems are Linux (Ubuntu) based and utilise configuration management with SALT.
The role will be focused in the 3rd line engineering space supporting the operations function. Key responsibilities will be to ensure best practices are followed with respect to configuration management with a focus on ensuring SALT is optimised and used across the engineering and operations team supporting UK Cloud. Focus will be on developing long term remediation for deeper business issues and drive down repeat issues.
As the migration progresses from UK Cloud to AWS / Azure, the focus will move to SRE/Squad engineering work.
*Keeping repeat operational issues to a minimum through permanent long term fixes
*Designing, building and running secure, scalable, available and efficient cloud-based solutions
*Identifying repetitive, manual tasks and implement automation, eliminating toil and reducing waste where possible
*Developing and maintaining tools and services in a DevOps/SRE-aligned model using robust engineering practice, including clean and efficient infrastructure-as-code, thorough and consistent testing, dependable CI/CD pipelines and cloud resource creation processes
*Embodying best practices in your work with colleagues, delivering clear communication in your commits, your messaging, written communication, and generated documentation
*Reviewing proposed changes to platform systems and components and contribute constructive insights to the wider organisation
*Primary 3rd line engineer supporting operations
*Ensuring key infrastructure services e.g. SALT are running effectively with full coverage
*Pro-active collaboration and example setting across peer on standards and proper code development
*Pro-active collaboration with operations/SRE to ensure code is developed and released with full operational capability (automation, monitoring, documentation)
*Expertise in SALT configuration management is essential.
*Passionate about collaborative communication and work patterns, with an understanding of how individual excellence strengthens the team
*Approachable, open to change, and be able to challenge directly and professionally
*Hold subject matter expertise in Linux and at least two or more of the following technology areas: database, containerisation, application server, network, storage, authentication, monitoring
*Extensive experience in a major config management tool (e.g. Ansible, Salt, Puppet, Chef) and an excellent working knowledge of Terraform
*Have extensive experience building and managing resources and services in a public cloud provider (AWS, Azure or Google Cloud Platform) with an ability to design and deploy cloud-native solutions for application stacks in a hybrid cloud environment, and experience of the challenges in migrating services to the cloud while incorporating an engineered-for-failure approach. Ideally you have proven your experience in operating the cloud through professional certification
*Able to program in at least one general-purpose language, ideally Python, and demonstrate fluency in scripting languages
*Extensive experience in CI/CD tooling (e.g. GitLabCI, Jenkins, TravisCI), with a pragmatic and constructive approach to building, improving and automating pipelines
*Strong knowledge of Git at the command line and evangelise best practice
*Ability to design and implement solutions that incorporate standardised security practices (such as principle-of-least-privilege)
*Excellent knowledge of modern compute solutions, identifying the challenges and benefits of containers (Docker), serverless and emerging technologies
*Passionate about monitoring and continual service improvement, contributing to high service availability
*An automation enthusiast, continually looking for ways to automate, simplify and remove any manual tasks
*Significant experience integrating with restful APIs, following an API-first approach
*Experience in debugging a complex multi-components services, applications and stacks
*Preferably be educated to degree level in Computer Science (or with equivalent experience)
There are some great benefits on offer with this role including a competitive salary, pension, generous holidays and more.
Please do not hesitate to contact Harvey Uppal at firstname.lastname@example.org or call (+44) 121 616 3407 to discuss this opportunity further.
Keywords: Platform, Engineer, Operations, Cloud, AWS, Azure, SALT, DevOps, CI, CD, Pipelines, Automation, Docker, Ansible, Puppet, Git, Genomics, Bioinformatics, London.
Job Reference: PEGEL_1583169886
Broadcast As: 18372
Salary To: Negotiable
Job Type: Full Time
Job Term: Permanent
Recruitment Consultant: Harvey Uppal