As an Azure Cloud Engineer – Monitoring Specialist, you will play a pivotal role in ensuring the availability, performance, and security of our cloud-based infrastructure on Microsoft Azure. You will leverage your expertise in cloud monitoring to design, implement, and maintain comprehensive monitoring solutions that provide real-time insights into our cloud environment.
Responsibilities
- Monitoring Solution Design Collaborate with cross-functional teams to understand monitoring requirements, and design scalable and efficient monitoring solutions using ELK, Datadog or similar.
- Implementation Deploy and configure monitoring components to collect, aggregate, and analyze logs, metrics, and traces from Azure services and applications running on PaaS resources or Kubernetes clusters.
- Customization Develop custom dashboards, alerts, and reports to provide actionable insights into system performance, security, and availability.
- Performance Optimization Continuously optimize monitoring solutions to reduce noise and improve the accuracy of anomaly detection and alerting.
- Incident Response Act as the subject matter expert in monitoring-related incidents, troubleshoot issues, and provide timely resolution to minimize downtime.
- Documentation Create and maintain detailed documentation for monitoring solutions, including configuration, best practices, and troubleshooting guides.
- Automation Implement automation scripts and workflows to streamline monitoring processes and enable proactive responses to issues.
- Collaboration Collaborate with DevOps and development teams to ensure monitoring requirements are integrated into the CI/CD pipeline.
- Stay Current Keep up-to-date with industry trends and best practices in cloud monitoring and contribute to continuous improvement efforts.
Qualifications
- Bachelors degree in Computer Science, Information Technology, or related field (or equivalent work experience).
- Proven experience as a Cloud Engineer with a focus on Azure.
- Strong expertise in monitoring solutions like ELK stack (Elasticsearch, Logstash, Kibana), Datadog or similar.
- Proficiency in scripting and automation using tools such as PowerShell, Python, and Terraform.
- Experience with Azure Monitor, Azure PaaS, Azure IaaS, AKS.
- Knowledge of cloud-native observability and monitoring solutions.
- Excellent problem-solving skills and attention to detail.
- Strong communication and collaboration skills.
- Certifications such as AZ-900, AZ-104, Elastic Certified Engineer, and Datadog Certified Professional are a plus.
What we offer
- Competitive salary
- Bonus benefits
- Outstanding professional environment
- Option for full-time remote work
- Flexibility
- Exciting projects