Roadie

Senior Site Reliability Engineer

Reposted 10 Days Ago
Remote
Hiring Remotely in USA
Senior level
The Senior Site Reliability Engineer will optimize platform reliability, manage Kubernetes infrastructure, deploy monitoring solutions, and collaborate on system performance.
The summary above was generated by AI

Roadie, a UPS company, is a leading logistics and delivery platform that helps businesses tackle the complexities of modern retail with unmatched delivery coverage, flexibility and visibility. Reaching 97% of U.S. households across more than 30,000 zip codes — from urban hubs to rural communities — Roadie provides seamless, scalable solutions that meet a variety of delivery needs. 

With a network of more than 310,000 independent drivers nationwide, Roadie offers flexible delivery solutions that make complex logistics challenges easy, including solutions for local same-day delivery, delivery of big and bulky items, ship-from-store and DC-to-door. 

Roadie is seeking a Senior Site Reliability Engineer to join our growing Technical Operations Team. We are looking for a candidate who has experience implementing site reliability principles, as well as production-level Kubernetes experience. The ideal candidate is a skilled problem solver with intimate knowledge of site reliability practices, standard DevOps principles, AWS, scripting languages and Kubernetes.

What You'll Do

  • Build systems that optimize the uptime and reliability of our platform, and support the management and optimization of our software delivery pipeline, observability and infrastructure operations
  • Maintain, support, and engineer production and non-production Kubernetes Clusters (EKS) as well as ES, MSK, RDS, and EC (Redis) clusters
  • Deploy and maintain monitoring and logging solutions based on Prometheus, Loki, Thanos, Grafana, OpenTelemetry and New Relic
  • Collaborate with cross-functional teams to identify and address potential bottlenecks, optimize resource utilization, and proactively prevent system failures
  • Define and manage SLOs, SLIs and error budgets
  • Develop processes, tools and automation to reduce toil across engineering teams
  • Plan and forecast service capacity and demand, assess cost optimization, and tune systems and software
  • Debug production/non-production issues
  • Take part in 24/7 on-call rotation

Technology We're Using Now

  • Python, Ruby on Rails, Golang
  • React/Redux, Objective-C and Swift, Android
  • Postgres, Redshift, Redis, Kafka
  • AWS/GCP
  • Docker/Kubernetes
  • OpenTelemetry/Prometheus/Thanos/Loki/Grafana/New Relic/Sentry
  • Git/CircleCI
  • ArgoCD

What You Bring

  • 6+ Years in various SRE roles
  • 6+ Years in various DevOps/Systems Engineering roles
  • 6+ Years of experience building and managing production Kubernetes infrastructure
  • 6+ Years of experience with popular scripting languages (Python, Ruby, Bash, etc.)
  • Experience with Infrastructure as code such as Terraform or Crossplane
  • Experience with CI/CD Development tools (CircleCI, etc.)
  • Experience with GitOps tools (ArgoCD)
  • Experience using a broad range of AWS technologies (RDS, Elasticsearch, VPC, EKS, S3, CloudFront, MSK, ElastiCache, CloudWatch, etc.)
  • Experience developing and maintaining YAML templating systems (Helm charts, Kustomize, etc.)
  • Must be able to work independently, be self-motivated and handle multiple priorities
  • Comfortable working in a fast-paced agile environment

Finally, a willingness to admit what you don’t know, and learn what you need to learn quickly.

Why Roadie? 

  • Competitive compensation packages 
  • 100% covered health insurance premiums for yourself
  • 401k with company match
  • Tuition and student loan repayment assistance (that’s right - Roadie will contribute directly to your existing student loans!) 
  • Flexible work schedule with unlimited PTO 
  • Monthly 3-day weekends
  • Monthly WFH stipend 
  • Paid sabbatical leave - tenured team members are given time to rest, relax, and explore
  • The technology you need to get the job done

Top Skills

Android
ArgoCD
AWS
CircleCI
Crossplane
Docker
GCP
Git
Go
Grafana
Kafka
Kubernetes
Loki
New Relic
Objective-C
OpenTelemetry
Postgres
Prometheus
Python
React
Redis
Redshift
Redux
Ruby on Rails
Sentry
Swift
Terraform
Thanos
