← all jobs

Site Reliability Engineer | Dayshift | Remote

Work from home Full-time role Hiring

ZigZag is looking for a Site Reliability Engineer to join our team! As a Site Reliability Engineer, you’ll design, build, and maintain the infrastructure and automation that power our platform. Working closely with software engineering teams and SRE peers, you'll embed reliability, performance, and compliance into the development lifecycle. Your focus will be on scalability, resilience, security, and operational efficiency across all environments.

Key Responsibilities

Infrastructure and Platform Engineering Design, build, and maintain scalable and reliable infrastructure and platform services. Develop and maintain infrastructure-as-code (e.g., CloudFormation, Terraform). Develop custom automation workflows and internal tools to support infrastructure provisioning, monitoring, and incident response. (e.g., Python leveraging libraries = such as boto3 for AWS automations) Liaise with vendors to assess and implement third-party solutions. Maintain well-documented system configurations to support maintainability and compliance. Reliability and Operations Monitor system performance, availability, and capacity using observability tools (e.g., SumoLogic, AWS CloudWatch). Create and maintain dashboards and monitoring solutions that offer deep insight into platform health and support rapid incident diagnosis. Automate operational processes (e.g., deployments, failovers, scaling) to reduce toil and enhance system resilience. Participate in incident response activities, including postmortems and root cause analysis, to drive continual improvement. Continuously evolve and maintain SLOs and SLIs, ensuring a balance between development velocity and system reliability. Work as part of a highly engaged team of SREs to ensure the stability, performance, cost-effectiveness, and observability of all environments. Build, Deploy, and Development Enablement Design and implement robust CI/CD pipelines and zero-downtime deployment strategies. Build efficient and reliable build systems to empower development teams with self-service deployment capabilities. Collaborate with engineering teams to embed reliability, scalability, performance, and security best practices into the SDLC. Security and Compliance: Maintain and monitor vulnerability scanning systems (e.g., Tenable Nessus, Lacework, Snyk) to work closely with Software Engineering teams to ensure the platform remains secure and up to date. Perform recurring security tasks such as reporting, maintaining security registers, and ensuring compliance with internal standards. Support the organisation in maintaining PCI-DSS certification by ensuring infrastructure is securely configured and well-documented. Skills & Experience Essential 2+ years of experience in a SRE role or similar (e.g. DevOps Engineer) Experience managing an AWS environment and working in a SaaS business. Strong knowledge and experience of infrastructure-as-code Experience with building and supporting robust CI/CD pipelines Strong problem solving and analytical skills Excellent communication and collaboration skills. Ability to work in a fast-paced, agile environment Desirable Experience with BuildKite Experience with distributed systems and microservice architecture Exposure to compliance frameworks (PCI-DSS, ISO27001). ZigZag is committed to building a diverse, inclusive, and equitable workplace. We believe that talent knows no borders, and we welcome individuals from all backgrounds to help us shape the future of work. Guided by transparency and agility, we foster an environment where everyone is valued and empowered to thrive. By submitting this application, you acknowledge that you have read and agree with the company’s Privacy Policy.

More open positions

Senior Key Account Manager (DACH)

Work from home Full-time role

Web Developer

Work from home Full-time role

HR Generalist

Work from home Full-time role

Online Cross-cultural trainers from Romania

Work from home Full-time role

Online Cross-cultural trainers from Bulgaria

Work from home Full-time role

Tech Lead, Android Core Product - Albuquerque, NM, USA

Work from home Full-time role

Claims Adjuster - Remote

Work from home Full-time role

[Remote] Attacker Operations Center (AOC) Analyst

Work from home Full-time role

Project Manager

Work from home Full-time role

Master Tech Serv Desk

Work from home Full-time role

AI Data Engineer

Work from home Full-time role

Patient Intake Coordinator (Part time, Remote)

Work from home Full-time role

Experienced Data Entry Analyst – Leisure Products Development

Work from home Full-time role

.Remote Operations & Support Role | Entry Level | Beginner Friendly

Work from home Full-time role

Support Worker 16-17 Year Olds (Sleeping Night)...

Work from home Full-time role

HCAI - HEALTH INFORMATION AND ELECTRONIC RECORDS ANALYST TRAINING PROGRAM

Work from home Full-time role

Experienced Remote Customer Service Support Agent – Delivering Exceptional Client Experiences at careerzynith

Work from home Full-time role

Workers Compensation Pricing Analyst

Work from home Full-time role

Associate Director/Senior Workday Engagement Manager

Work from home Full-time role

Remote Chat Support Specialist – Entry‑Level Customer Service Role (No Phone Calls) – Flexible Work‑From‑Home Position at careerzynith

Work from home Full-time role

Spanish Interpreter

Work from home Full-time role