Overview

The Site Reliability Engineer will be a key player in managing, optimizing, and ensuring the reliability and scalability of our SQL Server and PostgreSQL databases both in the cloud and on-premises. The ideal candidate will have extensive experience with Azure and AWS platforms, with a strong preference for Azure expertise. You will work closely with our development and operations teams to drive improvements in database performance, automate processes, and implement robust backup and recovery procedures.

Responsibilities:
  • Design, implement, and manage SQL Server and PostgreSQL database systems in both cloud (Azure and AWS) and on-premises environments.
  • Develop and enforce database administration and security standards.
  • Monitor database performance, implement changes, and apply new patches and versions when required.
  • Automate repetitive DBA tasks using scripting and automation tools.
  • Ensure high availability and acceptable levels of performance of mission-critical database resources.
  • Develop strategies for database disaster recovery, including defining RTO (Recovery Time Objective) and RPO (Recovery Point Objective) targets.
  • Work with cloud infrastructure, focusing on Infrastructure as Code (IaC) with Terraform, containerization with Docker, and container orchestration with Kubernetes.
  • Implement and manage continuous integration and deployment (CI/CD) systems using tools such as TeamCity, GitHub Actions, and Azure DevOps.
  • Utilize observability tools (Datadog, ELK stack, Grafana, Prometheus) to monitor systems and databases effectively.
  • Engage in and improve the whole lifecycle of services, from inception and design through deployment, operation, and refinement.
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
  • Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.

Required Qualifications:
  • Bachelor’s degree in Computer Science, Information Technology, or a related field.
  • 5+ years of experience as a DBA or SRE, with a focus on SQL Server and PostgreSQL.
  • Strong experience with Azure and AWS cloud platforms, with a preference for Azure expertise.
  • Solid understanding of SLIs/SLOs and general SRE practices.
  • Experience with Infrastructure as Code (IaC), preferably Terraform, and with containers (Docker and Kubernetes).
  • Proficiency in CI/CD tools such as TeamCity, GitHub Actions, and Azure DevOps.
  • Knowledge of observability tools like Datadog, ELK stack, Grafana, and Prometheus.
  • Strong problem-solving skills and the ability to work under pressure.

Benefits:
  • Flextime, recognition, and support for autonomous work: Flexible time off with ample learning and development opportunities to continue growing your career.
  • Holistic health and wellness benefits: Company-paid medical, dental, and vision coverage (available to employees and their dependents from day one), insurance for parents and siblings, a wellness benefit, office massage, and more.
  • Support for Titans at all stages of life: Parental leave and support, financial planning tools, Employee Assistance Program services, and more.

Nice To Have:
  • Familiarity with GitOps and experience with Flux/Argo CD is a big plus.