JOB PROFILE FOR CLOUD OPERATION MANAGER
cloudEQ is looking for Cloud OPS Manager with proven experience in the Management of Cloud Operations and Maintenance teams for any of the Public Cloud Platform (AZURE/AWS/GCP).
Duties and Responsibilities
People Management and manage resources, both individual contributors and 1st line managers
Must understand the technical components of client server relationships and strong experience in managing tough customer conversations.
Hands on experience with minimum 2 Public Cloud environments like AWS, Azure , GCP
Ability to manage multiple projects / programs and influence cross functional contributors
Clear and concise written and verbal communications skills; ability to communicate with stakeholders effectively
Ability to build and maintain relationships with internal departments to enable cross functional relationships (engineering, service delivery and product)
Analytical ability to establish baselines and deviation from baselines in a clear manner
Manage and coordinate day to day activities of the Cloud Infrastructure and Cloud Automation teams in a global support model and continuously keep goals and deliverables.
Support management of colocation data centers, AWS / Azure / GCP cloud environments, and automation platforms to ensure operational excellence.
Provide guidance for resource management and operations coverage for 24x7 infrastructure hosting.
Guide engineers developing automation to provision and manage infrastructure resources in a diverse, hybrid cloud environment.
Support continued refinement and expansion of infrastructure health monitoring and incident response.
Develop and grow the team through recruitment, trainings, challenging assignments, and rewards/recognition on consistent basis.
Ensure teams follow the established change and incident and change management processes.
Understand and continuously recommend improvements to Cloud Operations processes, documentation, and tools.
Obtain and keep up to date the organizational and technical knowledge required to perform the role.
Candidate Requirements :
This role requires a seasoned, skilled, independent, self-motivated, and smart leader, who is experienced with 24x7 mission-critical cloud infrastructure, operations, processes, tools, and best practices.
8+ years of experience in IT including 5+ years in a management/leadership position in a cloud, online services based organization.
Manage a Cloud OPS team with specific focus of Cloud Services are fully up and running.
Work with Customer, Partner, SRE s and Engineering team on planning and upgrading the platform with minimal down time.
Responsible for Change management of the Cloud Platform and Cloud Services.
Build and maintain monthly shift schedules, track progress of the incidents and provide regular status updates on key project milestones to leadership team and other stakeholders.
Responsible for risk identification and mitigation
Provide technical and hands-on leadership across multiple project areas
Responsible for Initiating hiring/engagement, maintenance of employee data and all supervisor transactions for their direct reports
Manage a team of 7+ people. Perform all coaching and management functions including setting priorities, performance assessment with some upper management oversight.
Extensive knowledge of any of the public cloud platform (AZURE/GCP/AWS).
Extensive experience in Incident Change Management
Hands on experience in deployment of Microservices in Kubernetes based orchestration suites.
Should have working knowledge of Public/Hybrid Cloud/Virtualization, Windows/Linux servers, Database and Networking concepts.
Ability to quickly assimilate technical and non-technical information
Experience with monitoring tools like Prometheus, Grafana, Datadog, ELK etc.
Significant practical experience managing or supporting large-scale on-prem and public cloud infrastructure, including :
• Sizeable AWS / Azure / GCP public cloud infrastructure
• Large on-prem data-centers, servers, storage and VMware
• Configuration management and infrastructure automation
platforms, e.g. Chef, Github actions, Terraform, AWS
CloudFormation, Ansible etc
Significant practical experience with 24x7 support or administration of large-scale, transaction-intensive, multi-site, international IT infrastructure operations involving thousands of servers and 99.99% SLA.
Good working knowledge of Linux and Windows operating systems, virtualization, SQL and/or NoSQL databases, servers, storage, web servers, networking concepts, and infrastructure security.
Practical experience or a high level of comfort working with DevOps teams in a fast-moving, agile software development environment.
Due to 24x7x365 nature of cloud infrastructure operations, this role may require flexible work schedule and off-hours support from time to time.
Certification for Azure / AWS or GCP preferred.
• Competitive salary Total CTC (18,00,000 to 26,00,000) Per
• Opportunity for advancement
• Fun and exciting environment.