Our client, a major Financial Institution, is seeking an experienced Azure Cloud Platform SRE Engineering Lead to join the team. As the Azure Cloud Platform SRE Engineering Lead, you will play a pivotal role in ensuring the reliability, scalability, and efficiency of the Azure Cloud platform.
Responsibilities:
Technical Leadership:
Provide technical leadership and expertise in designing, implementing, and maintaining our Azure cloud platform infrastructure.
Drive the adoption of best practices and ensure compliance with industry standards.
Collaborate with cross-functional teams to define and implement cloud solutions aligned with business needs.
Strategy and Planning:
Develop and execute strategic plans, roadmaps, and project initiatives for the Azure cloud platform.
Identify areas for improvement, cost optimization, and innovation within the Azure ecosystem.
Stay up to date with the latest Azure technologies and trends to recommend enhancements and drive continuous improvement.
Infrastructure Reliability and Performance:
Establish and maintain a robust monitoring and alerting system to proactively identify and address potential issues.
Implement automation and scripting to streamline operational tasks and improve system performance.
Conduct regular performance testing and capacity planning exercises to ensure optimal platform performance and scalability.
Incident Management and Troubleshooting:
Lead the incident management process to promptly resolve issues and minimize system downtime.
Analyze and troubleshoot complex production incidents, applying root cause analysis and implementing preventive measures.
Coordinate with external vendors and support teams for issue resolution and escalation management.
Documentation and Training:
Create and maintain comprehensive documentation of system configurations, processes, and operational procedures.
Provide training and knowledge-sharing sessions to enhance the technical skills of the team.
Requirements:
Bachelor's degree in Computer Science, Information Technology, or a related field.
Strong experience working as a Cloud Engineer/SRE in an Azure environment for a financial institution or similar industry.
Expertise in Azure cloud services, including Azure Resource Manager, Azure Functions, Azure Storage, Azure Networking, Azure SQL Database, and Azure Active Directory.
Proficient in scripting and automation tools such as PowerShell, Python, or Azure CLI.
Solid understanding of DevOps principles, CI/CD pipelines, and Infrastructure as Code (IaC) using tools like Azure DevOps or Terraform.
Experience with monitoring and logging tools like Azure Monitor, Azure Log Analytics, and Azure Application Insights.
Strong analytical and problem-solving skills with the ability to troubleshoot complex issues efficiently.
Excellent communication and leadership skills to effectively collaborate with cross-functional teams and stakeholders.