Software Engineer II, Machine Learning Platform

New Jobs From DTC Brands Uploaded Daily!

Attentive

Posted Mar 9, 2025

Hybrid

$148000/ yearly

Hybrid

About Attentive

https://tech.attentive.com/About the RoleWe’re looking for a self-motivated, highly driven Software Engineer II to join our Machine Learning Platform (MLOps) team. As a team, we enable Attentive’s Machine Learning (ML) practice to directly impact Attentive’s AI product suite through the tools to train, inference, and deploy ML models with higher velocity and performance, while maintaining reliability. We build and maintain a foundational ML platform spanning the full ML lifecycle for consumption by ML engineers and data scientists. This is an exciting opportunity to join a rapidly growing ML Platform team at the ground floor with the ability to drive and influence the architectural roadmap enabling the entire ML organization at Attentive.This team and role is responsible for building and operating the ML data, tooling, serving, and inference layers of the ML platform. We are excited to bring on more engineers to continue expanding this stack. What You'll AccomplishExpand, mature, and optimize our ML platform built around cutting edge tooling like Ray, MLFlow, Argo, and Kubernetes to support traditional and deep learning ML modelsBuild and mature capabilities to support CPU / GPU clusters, model performance monitoring, drift detection, automated roll-outs, and improved developer experienceBuild, operate, and maintain a low-latency, high volume ML serving layer covering both online and batch inference use casesOrchestrate Kubernetes and ML training / inference infrastructure exposed as an ML platformExpose and manage environments, interfaces, and workflows to enable ML engineers to develop, build, and test ML models and servicesClose the latency gap on model inference to online, real-time model servingDevelop automation workflows to improve team efficiency and ML stabilityAnalyze and improve efficiency, scalability, and stability of various system resourcesPartner with other teams and business stakeholders to deliver business initiativesHelp onboard new team members, provide mentorship and enable successful ramp up on your team's code basesAbout youYou have been working in the areas of MLOps / Platform Engineering / DevOps / Infrastructure for 5+ years, and have an understanding of gold standard practices and best in class tooling for MLYour passion is exposing platform capabilities through interfaces that enable high performance ML practices, rather than designing ML experiments (this team does not directly develop ML models)You understand the key differences between online and offline ML inferences and can voice the critical elements to be successful with each to meet business needsYou have experience building infrastructure for an ML platform and managing CPU and GPU computeYou have a background in software development and are passionate about bringing that experience to bear on the world of ML infrastructureYou have experience with Infrastructure as Code using Terraform and can’t imagine a world without itYou understand the importance of CI/CD in building high-performing teams and have worked with tools like Jenkins, CircleCI, Argo Workflows, and ArgoCDYou are passionate about observability and worked with tools such as Splunk, Nagios, Sensu, Datadog, New RelicYou are very familiar with containers and container orchestration and have direct experience with vanilla Docker as well as Kubernetes as both a user and as an administratorYour ExpertiseYou have been working in the areas of ML Platform / MLOps / Platform Engineering / DevOps / Infrastructure for 3+ years, and have an understanding of gold standard practices and best in class tooling for MLYour passion is exposing platform capabilities through interfaces that enable high performance ML practices, rather than designing ML experiments (this team does not directly develop ML models)You understand the key differences between online and offline ML inferences and can voice the critical elements to be successful with each to meet business needsYou have experience building infrastructure for an ML platform and managing CPU and GPU computeYou have a background in software development and are passionate about bringing that experience to bear on the world of ML infrastructureYou have experience with Infrastructure as Code using Terraform and can’t imagine a world without itYou understand the importance of CI/CD in building high-performing teams and have worked with tools like Jenkins, CircleCI, Argo Workflows, and ArgoCDYou are passionate about observability and worked with tools such as Splunk, Nagios, Sensu, Datadog, New RelicYou are very familiar with containers and container orchestration and have direct experience with vanilla Docker as well as Kubernetes as both a user and as an administrator. What We UseOur infrastructure runs primarily in Kubernetes hosted in AWS’s EKSInfrastructure tooling includes Istio, Datadog, Terraform, CloudFlare, and HelmOur backend is Java / Spring Boot microservices, built with Gradle, coupled with things like DynamoDB, Kinesis, AirFlow, Postgres, Planetscale, and Redis, hosted via AWSOur frontend is built with React and TypeScript, and uses best practices like GraphQL, Storybook, Radix UI, Vite, esbuild, and PlaywrightOur automation is driven by custom and open source machine learning models, lots of data and built with Python, Metaflow, HuggingFace 🤗, PyTorch, TensorFlow, and PandasYou'll get competitive perks and benefits, from health & wellness to equity, to help you bring your best self to work.

Requirements

- The US base salary range for this full-time position is $148,000 - $195,000 annually + equity + benefits- Our salary ranges are determined by role, level, and location#LI-EZ1Attentive Company ValuesDefault to Action - Move swiftly and with purposeBe One Unstoppable Team - Rally as each other’s championsChampion the Customer - Our success is defined by our customers' successAct Like an Owner - Take responsibility for Attentive’s successLearn more about AWAKE, Attentive’s collective of employee resource groups.If you do not meet all the requirements listed here, we still encourage you to apply! No job description is perfect, and we may also have another opportunity that closely matches your skills and experience.At Attentive, we know that our Company's strength lies in the diversity of our employees. Attentive is an Equal Opportunity Employer and we welcome applicants from all backgrounds. Our policy is to provide equal employment opportunities for all employees, applicants and covered individuals regardless of protected characteristics. We prioritize and maintain a fair, inclusive and equitable workplace free from discrimination, harassment, and retaliation. Attentive is also committed to providing reasonable accommodations for candidates with disabilities. If you need any assistance or reasonable accommodations, please let your recruiter know.

Software Engineer II, Machine Learning Platform at Attentive