home-screen-logo
    Staff Site Reliability Engineer
    Posted Feb 19, 2025
    Remote
    $156000/ yearly
    Remote Usa
    About Attentive
    https://tech.attentive.com/About the Role Our Search Platform team is the backbone of Attentive’s data infrastructure, processing, storing, and optimizing data at massive scale and speed. We handle billions of events from over 100 million customers daily, enabling near-real-time data insights and AI-driven capabilities through our Data, Optimization, and ML Platforms. Joining our team offers a high-growth career opportunity to work with some of the world’s most talented engineers in a high-performance and high-impact culture.As part of the Infrastructure and Platform organization, the Production Engineering Team is focused on delivering a fast and reliable platform that empowers Attentive engineers to deliver solutions quickly and safely. We build scalable systems that automate routine tasks so we can focus on other impactful efforts. Reliability, scalability, and security are our areas of expertise. We focus on release, observability, and cost optimization. Our mission is to create robust platforms and tools that allow stakeholders to concentrate on delivering exceptional products.As a Staff Engineer, you will take a strategic role in designing and implementing solutions that enhance the reliability and scalability of our systems, while mentoring others and influencing technical roadmaps across the organization.What You'llDefine and enforce production standards, processes, and tools to ensure operational excellenceGuide and mentor team members, fostering technical growth and helping to develop the next generation of engineering leadersStrong coding ability in at least one language (e.g., Golang, Python, Java, Typescript) with the capability to solve complex issues through codeDeep understanding of production reliability concepts, including SLIs, SLOs, and incident managementFamiliarity with working in dynamic, reliability-focused production environments (preferred)What We UseOur infrastructure runs primarily in Kubernetes hosted in AWS’s EKSInfrastructure tooling includes Istio, Datadog, Terraform, CloudFlare, and HelmOur backend is Java / Spring Boot microservices, built with Gradle, coupled with things like DynamoDB, Kinesis, AirFlow, Postgres, Planetscale, and Redis, hosted via AWSOur frontend is built with React and TypeScript, and uses best practices like GraphQL, Storybook, Radix UI, Vite, esbuild, and PlaywrightOur automation is driven by custom and open source machine learning models, lots of data and built with Python, Metaflow, HuggingFace 🤗, PyTorch, TensorFlow, and PandasYou'll get competitive perks and benefits, from health & wellness to equity, to help you bring your best self to work.
    Requirements
    Design and implement systems that enhance reliability, observability, traceability, and incident management, ensuring the platform scales effectivelyCollaborate with engineers from AI/ML, Data, Platform, and Product teams to develop best-in-class servicesPartner with engineers from AI/ML, Data, Platform, Product, and other groups to deliver best-in-class servicesAdvocate for and implement SLIs, SLOs, and other reliability-focused metrics across the engineering organizationDrive continuous improvement by bringing creative ideas and challenging the status quoYour Expertise7+ years of experience in Production Engineering, Backend Engineering, SRE,- The US base salary range for this full-time position is $156,000 - $240,000 annually + equity + benefits- Our salary ranges are determined by role, level and location#LI-JK1Attentive Company ValuesDefault to Action - Move swiftly and with purposeBe One Unstoppable Team - Rally as each other’s championsChampion the Customer - Our success is defined by our customers' successAct Like an Owner - Take responsibility for Attentive’s successLearn more about AWAKE, Attentive’s collective of employee resource groups.If you do not meet all the requirements listed here, we still encourage you to apply! No job description is perfect, and we may also have another opportunity that closely matches your skills and experience.At Attentive, we know that our Company's strength lies in the diversity of our employees. Attentive is an Equal Opportunity Employer and we welcome applicants from all backgrounds. Our policy is to provide equal employment opportunities for all employees, applicants and covered individuals regardless of protected characteristics. We prioritize and maintain a fair, inclusive and equitable workplace free from discrimination, harassment, and retaliation. Attentive is also committed to providing reasonable accommodations for candidates with disabilities. If you need any assistance or reasonable accommodations, please let your recruiter know. 
    Staff Site Reliability Engineer at Attentive