Note: This position requires the Australian Baseline Security Clearance. If offered employment, you must be willing to complete and successfully pass the adjudication process. Only Australian citizens who have resided in Australia for at least the past 3 years will be considered.
The Team
Our Site Reliability Engineering (SRE) team consists of highly skilled engineers responsible for maintaining and enhancing the reliability, scalability, and performance of the ServiceNow infrastructure. Our SREs are empowered to resolve technical issues across the entire technology stack, from hardware to applications. Additionally, they work to improve the platform's operability, aiming to reduce the number of incidents and minimize Mean Time to Recovery (MTTR).
To achieve this, the team combines software development, networking, database, and systems engineering skills to tackle complex problems, striving to keep our platform operating for our customers.
The Role
As a Shift Manager, Site Reliability Engineering - Federal – 3rd Shift* at ServiceNow, you'll lead a team of SREs responsible for ensuring the reliability and availability of critical enterprise platforms and applications, with a focus on federal sector clients, while driving automation and continuous improvement.
*The expected workdays/hours for the Shift Manager are Mon-Fri, from 11 PM, 38 hours per week.
Let’s go over some questions to see if you are the right candidate:
- Do you have a technical background in roles such as systems engineering, systems administration, or DevOps?
- Are you proficient in troubleshooting and diagnosing issues in operating systems and across diverse aspects of the technology stack?
- Do you dislike repetitive tasks and prefer to automate your work?
- Do you have experience leading a diverse team of engineers and managing people?
If you answered 'yes' to these questions, we want to hear from you. Click the Apply button, and let's discuss the role, your skills, and your experience.
What you get to do in this role
- Team leadership: mentor and develop a team of SREs, and manage career development, project prioritization, and performance reviews.
- Drive initiatives to automate operational processes, reduce manual tasks, and improve overall efficiency.
- Work with other engineering teams, product managers, and stakeholders to ensure alignment and improve the reliability of the infrastructure.
- Orchestrate actions during incidents and outages to ensure swift resolution and minimize impact, then follow up with actions towards sustainable solutions.
- Analyse and evaluate existing processes to identify areas for improvement and implement best practices.
- Provide training and support to partner teams that interface with SRE.
- Onboard new hires to enable their success in their roles.
- Onboard new technologies, systems, and automations into the team.