    Site Reliability Engineer III - Operations & Observability
    Posted Apr 28, 2025
    Hybrid
    Galway, Éire / Ireland
    About Rent the Runway
    Design, implement, and maintain observability solutions using the Splunk Observability Platform, including Infrastructure Monitoring, APM (Application Performance Monitoring), RUM (Real User Monitoring), and Log Observer.
    Develop, manage, and optimise dashboards, alerts, and telemetry configurations to deliver full-stack observability across applications, infrastructure, and services.
    Manage and maintain scalable, reliable data ingestion pipelines using Terraform for infrastructure-as-code (IaC) deployments, ensuring seamless integration with the Splunk Observability Platform.
    Partner with Security and Engineering teams to embed observability best practices throughout the software development lifecycle.
    Optimise and manage Splunk Cloud environments, ensuring scalability, cost-effectiveness, performance, and high availability.
    Champion the adoption of standardised observability patterns, instrumentation frameworks, and service-level objectives (SLOs).
    Continuously refine monitoring, alerting, and telemetry pipelines to minimise noise and improve incident response.
    Provide technical guidance and support to engineering teams in diagnosing and resolving observability-related issues.
    Collaborate with Compliance, Security, and Infrastructure teams to ensure data governance, privacy, and regulatory compliance within observability pipelines.
    Requirements
    4+ years of hands-on experience in Observability, SRE, DevOps, or Platform Engineering roles.
    Proven experience in Splunk (or equivalent monitoring platform).
    Solid understanding of observability principles: metrics, traces, logs, and events.
    Experience operating in modern cloud environments (AWS, GCP, Azure) and working with distributed systems.
    Familiarity with Kubernetes, Docker, Infrastructure as Code (e.g., Terraform), and CI/CD pipelines is a strong plus.
    Strong analytical and problem-solving skills with the ability to understand complex system behaviours.
    Excellent communication and collaboration skills, able to work effectively across diverse teams.
    Benefits
    At Rent the Runway, we’re committed to the happiness and wellbeing of our employees, and aim to create a workplace that fosters both personal and professional growth. Our inclusive benefits include, but are not limited to:
    Generous Paid Time Off including annual leave, paid bereavement, and family sick leave - every employee needs time to take care of themselves and their family.
    Universal Paid Parental Leave for both parents + flexible return to work program - because we know your newest family member(s) deserve your undivided attention.
    Paid Sabbatical after 5 years of continuous service - unplug, recharge, and have some fun.
    Competitive Stakeholder Pension - taking care of your future.
    Comprehensive health, dental, and dependents' care from day 1 of employment - your health comes first and we’ve got you covered.
    Company-wide events and outings - our team spirit is no joke - we know how to have fun!
    Hybrid Work - This hybrid role requires 2-3 days per week in our Galway, Ireland office, with the option to work 2-3 days remotely.
    Rent the Runway is an equal opportunity employer.
    In accordance with applicable law, we prohibit discrimination against any applicant or employee on any legally recognised basis, including, but not limited to: gender, marital status, family status, age, disability, sexual orientation, race, religion, and membership of the Traveller community.
    By submitting your application below, you agree that you have read and acknowledge Rent the Runway's Candidate Privacy Policy, found here.