MANAGER, CLOUD OPERATIONS, SRE WANTED?

 

Talent Hunter is an IT and Telecom Recruitment Company ensuring the best professional opportunities for talents in the high-tech industry and providing quick and cost-effective solutions to client companies seeking the best talent for their business.

Talent Hunter is currently looking for an experienced Manager, Cloud Operations, SRE to lead a global team of 15+ engineers and drive operational excellence across complex cloud environments.

Role Overview

In this role, you will combine strategic leadership with deep technical expertise to ensure the stability, scalability, and reliability of modern cloud infrastructure. You will play a key role in implementing Site Reliability Engineering (SRE) principles, advancing automation initiatives, and aligning technical operations with broader business objectives.

Operational Strategy & SRE Governance:

  • Reliability Frameworks: Define and maintain SLIs (Service Level Indicators) and SLOs (Service Level Objectives), managing Error Budgets to balance reliability with delivery speed.
  • AIOps & Observability: Lead the adoption of AI-driven monitoring and observability solutions to proactively detect and address issues before they impact operations.
  • Automation & Toil Reduction: Identify repetitive manual processes and drive automation initiatives to improve efficiency and reduce operational overhead.

Global Team Leadership:

  • 24/7 Operational Model: Establish effective follow-the-sun processes across global regions to ensure uninterrupted service delivery.
  • Workload Optimization: Balance operational support (KTLO) activities with strategic engineering initiatives to maintain team sustainability.
  • People Development: Conduct regular 1:1 meetings, support career growth, and manage performance for a distributed and diverse engineering team.

Technical Leadership & Architecture:

  • Infrastructure as Code & GitOps: Promote and enforce IaC best practices using tools such as Terraform and Ansible, ensuring automated and version-controlled deployments.
  • Cloud Architecture Guidance: Provide architectural direction to ensure new services are scalable, reliable, and cost-efficient within Azure or AWS environments.
  • Kubernetes Governance: Oversee Kubernetes cluster lifecycle management, including upgrades, patching, security improvements, and custom operator implementation.

Incident & Reliability Management:

  • Major Incident Escalation: Act as the primary escalation point during critical incidents, ensuring timely resolution and transparent stakeholder communication.
  • Blameless Post-Mortems: Foster a culture of continuous improvement by leading root cause analyses and implementing preventative measures.

Required Skills & Experience:

Leadership & Background

  • Minimum 2+ years of experience managing teams of 10+ engineers.
  • 5+ years of hands-on experience in DevOps, SRE, or Cloud Operations roles.
  • Proven track record in managing distributed teams within 24/7 operational environments.

Technical Expertise:

  • Strong experience with Azure (preferred) or AWS.
  • Solid understanding of SRE methodologies, including SLO lifecycle management and toil reduction.
  • Hands-on experience with Terraform, Ansible, and GitOps practices (GitHub Action experience).
  • Deep experience with Observability and AIOps tools such as Prometheus, Grafana, New Relic, or Logz.io.
  • Advanced knowledge of Kubernetes, including cluster lifecycle management and Operators.
  • Strong system administration skills (Linux kernel tuning and/or Windows Server administration).

Personal Competencies:

  • Strategic mindset with the ability to align technical execution with business priorities.
  • Clear and effective communicator, comfortable engaging with senior leadership.
  • Strong analytical and problem-solving skills, particularly in high-pressure environments.
  • Advocate for engineering excellence and continuous improvement.

We offer:

  • Attractive compensation package;
  • Career and Development – worldwide career opportunities, access to a high-tech Engineering Lab;
  • Work That Fits Your Life- possibility to work from home, and transition support through life events.
  • Wellness and Health Programs;
  • Additional Health Insurance with Dentist (Luxury package);
  • Certification and Training Programs;
  • Performers Bonus Scheme;
  • Food Stamps (extra money to the salary for food);
  • Extra Days Paid Leave;
  • Secured Parking Space;
  • Exciting Workplace Experience;

Please send your recent CV stating the position title in the subject line and we will contact you if you have the required skill set!

Licensed by MLSP, license N 2651, valid from 29.10.2018

Talent Hunter Ltd. informs you that part of the data you provide by sending your application is personal data and falls under the special treatment and protection of the Data Protection Law and the 2016/679 Regulation. The provided personal data will be processed for legally acknowledged purposes, related to the present job ad, as well as to the realization of the legal interest of the personal data administrator. Talent Hunter Ltd. processes, stores and uses the voluntarily provided personal data in the legally determined timeframes, guaranteeing their security and confidentiality. Please be informed that hereby you agree that Talent Hunter Ltd. might provide your personal data to governmental bodies and institutions, or third parties when there is such obligation by law, or it is required for the realization of your rights and legal interests as a participant in a recruitment process with the purpose of concluding a future labor contract. As per the internal rules of Talent Hunter Ltd. you have the right to access and edit your personal data, the right to be deleted and the right to object against processing, presenting or revealing of your personal data for purposes different from the ones, stated above.

APPLY FOR THIS JOB OR CONTACT US TO RECOMMEND A PROFESSIONAL