Overview

Responsibilities:
  • Monitor infrastructure, applications, and services in real-time using tools like Zabbix, Grafana, Prometheus, ELK Stack, or custom monitoring systems.
  • Collaborate with DevOps, SysAdmin, and Support teams to triage, escalate, and resolve incidents.
  • Analyze system trends and logs to identify potential bottlenecks, risks, or anomalies.
  • Contribute to the development of monitoring automation and incident response playbooks.
  • Perform regular health checks and report on key system metrics (uptime, response time, load, etc.).
  • Participate in on-call rotations and respond to critical incidents as needed.
  • Ensure monitoring coverage for new features, services, or infrastructure as they are deployed.
Required Qualifications:
  • Knowledge of Windows Servers, Linux servers
  • The knowledge of Apache or Nginx
  • Knowledge of Monitoring systems (Zabbix, Grafana, Kibana)
  • Knowledge of the OSI model and/or TCP/IP protocols
  • Know the Proxy and DNS principles of work
  • Know Cloudflare features
  • Debugging and troubleshooting skills
  • Fast reaction
  • Prioritization and multi-tasking
  • Written and verbal communication skills
  • Willingness to work in shifts
Note:

✨ Our intelligent job search engine discovered this job and republished it for your convenience.
Please be aware that the job information may be incorrect or incomplete. The job announcement remains the property of its original publisher. To view the original job and its full details, please visit the job's URL on the owner’s page.

Please clearly mention that you have heard of this job opportunity on https://ijob.am.