- Define and implement the Site Reliability Engineering (SRE) strategy, vision, and roadmap for the company
- Manage, mentor, and grow a team of SRE & DevOps engineers, providing guidance, feedback, and career development opportunities
- Establish and monitor service level objectives, indicators, and agreements for TALs products and services
- Design and implement solutions for improving the reliability, availability, scalability, and performance of TALs systems and infrastructure both on premise and on the cloud
- Lead incident response, root cause analysis, and postmortem processes for major outages and service disruptions
- Drive the adoption of best practices and standards for SRE across the organization, such as automation, testing, monitoring, alerting, and documentation
- Collaborate with other engineering leaders and stakeholders to align SRE goals and priorities with the company's objectives and values
- Ensure a frictionless experience for our customers by managing the operational aspects of partnering, closely monitoring the incident, change and problem management process and playing a pivotal role in reducing the likelihood of breaches in service level agreements.
- Lead the effort in sizing and migrating appropriate workloads to the cloud by driving application discovery and migration services with the aim of simplifying and reducing infrastructure run costs.
- Collaborate with digital and infrastructure teams to drive adoption of modern practices related to cloud adoption and DevSecOps practices.
- Develop and foster positive working relationships within the organisation.
- Ensure appropriate governance, frameworks, knowledge & capabilities are adhered to enable compliance with all regulatory standards & requirements aligning to risk appetite.
- Drive business value by collaborating with business, technology partners and suppliers to define infrastructure roadmap and strategy.
- Support risk, audit, and compliance activities.
- Ensure partnering contracts account for sound business continuity plans in compliance with regulatory obligations.
- Identify and drive opportunities for automation and innovation.
- Embed cybersecurity controls in partnering agreements to ensure appropriate infrastructure controls are enforced and assets are safeguarded.
- Minimum of 15 years of experience in technology with a strong focus on partner and service management (with 10 in a leadership role).
- Strong technical acumen in cloud and infrastructure services (Azure, AWS, Data centre, Networks, Voice, Storage, Virtualisation, Windows, Linux/Unix, Backups and Citrix).
- Deep expertise in contract and vendor management
- Strong knowledge on cloud technologies (AWS/Azure, SaaS, PaaS and IaaS)
- Excellent knowledge of industry and market trends to determine potential impacts on technology environments
- Experience with a variety of tools and frameworks for automation, testing, monitoring, and alerting
- Excellent written and verbal communication skills, interpersonal and collaborative skills, and the ability to communicate infrastructure concepts to technical and nontechnical audiences at various hierarchical levels
- Poise and ability to act calmly and competently in high-pressure, high-stress situations.
- Critical thinker, with strong problem-solving and consulting skills.
- Sound knowledge and understanding of relevant legal and regulatory requirements
- Excellent analytical skills, the ability to manage multiple projects under strict timelines, as well as the ability to work well in a demanding, dynamic environment and meet overall objectives
- Project management skills: financial/budget management, scheduling and resource management.
- Ability to lead and motivate both direct and indirect team members.
- Sound knowledge of cyber and information frameworks.
- Experience in coaching and mentoring staff.
- Excellent stakeholder management skills.
- High level of personal integrity, as well as the ability to professionally handle confidential matters and show an appropriate level of judgment and maturity
- Relevant industry certifications
We extend this acknowledgment to the many Traditional Lands that we operate across and pay our respects to Elders past, present, and emerging.Everyone at TAL has a responsibility to do the right thing and is accountable for the way they conduct themselves. Our expectations are that you follow the principles set out in our Code of Conduct when you come to work every day. Risk management is everyone's responsibility.If you are already a TAL employee please apply via the SmartRecruiters button in Workday and navigate to the Employee Portal. This is important to ensure that your application is recorded accurately.