Systems Reliability Engineer

| Hybrid
Sorry, this job was removed at 6:00 p.m. (CST) on Thursday, October 8, 2020
Find out who's hiring in Austin.
See all Developer + Engineer jobs in Austin
Apply
By clicking Apply Now you agree to share your profile information with the hiring company.

About the Role

An engineering role at Cloudflare provides an opportunity to address some big challenges, at scale.  We believe that with our talented team, we can solve some of the biggest security, reliability and performance problems facing the Internet. Just how big?  

  • We have in excess of 15 Terabits of network transit capacity
  • We operate 153 Points-of-presence around the world
  • We serve more traffic than Twitter, Amazon, Apple, Instagram, Bing, & Wikipedia combined
  • Anytime we push code, it immediately affects over 200 million internet users
  • Every day, up to 20,000 new customers sign-up for Cloudflare service
  • Every week, the average Internet user touches us more than 500 times

We are looking for talented Systems Reliability Engineers to build and operate the platform which makes Cloudflare customers place their trust in us.  Our SREs come from a variety of technical backgrounds and have built up their knowledge working in different environments. But the common factors across all of our reliability-focused engineers include a passion for automation, scalability, and operational excellence.  Our SRE teams monitor our network in a “follow the sun” approach with offices in Singapore, London, and San Francisco.

We are still a small team, well-funded, growing quickly and focused on building an extraordinary company.  This is a superb opportunity to join a high-performing team and scale our high-growth network as Cloudflare’s business grows.  You will build tools to constantly improve availability, performance, uptime and response times. You will nurture a passion for an “automate everything” approach that makes systems failure-resistant and ready-to-scale.   

Cloudflare SREs work in one of these 4 teams:

  • Core Operations
  • Edge Operations
  • Core Platform
  • Edge Platform

The Operations teams focus on the immediate state and functionality of the Cloudflare platform around the world, leveraging an array of monitoring, alerting and diagnostics tools.  The Platform teams focus on developing and enhancing the Cloudflare platform and its capabilities. The Platform and Operations team are both “devops” teams, responsible for reliability engineering across a wide portfolio of applications and services, leveraging developer and operator patterns.  Many of our SREs have had the opportunity to work at multiple offices on interim and long-term project assignments. The ideal SRE candidate has a passionate curiosity about how the Internet fundamentally works and has a strong knowledge of DNS, Linux and TLS along with strong coding ability in Bash, Python or Go. We prefer to hire very experienced candidates; however raw skill trumps experience and we welcome strong junior applicants.

Requisite Skills

  • Linux systems administration experience
  • 3 years of relevant Site Reliability Engineering experience
  • Intermediate level software development skills in Python, Go or SQL
  • Strong skills in network services, including DNS, TLS/SSL and HTTP
  • Network fundamentals DHCP, ARP, subnetting, routing, firewalls, IPv6

Examples of desirable skills, knowledge and experience

  • 5 years of relevant work experience
  • Experience with the Linux kernel and Linux software packaging
  • Performance analysis and debugging with tools like perf, sar, strace, dtrace
  • Configuration management systems such as Saltstack, Chef, Puppet or Ansible
  • Load balancing and reverse proxies such as Nginx, Varnish, HAProxy, Apache
  • SQL databases (Postgres or MySQL)
  • Time series databases (OpenTSDB, Graphite, Prometheus, Grafana)
  • Key/Value stores (Redis, KyotoTycoon, Cassandra, LevelDB)
  • Internetworking and BGP

Bonus Points

  • Experience with network programming in C, C++ or Go
  • Experience with continuous / rapid release engineering
  • Strong tooling and automations development experience
  • Experience working in a 24/7/365 service environment
  • High-bandwidth transit Internetworking and routing experience

Some tools that we use

  • Nginx
  • Salt
  • Go
  • Rust
  • Python
  • Kubernetes 
  • PostgreSQL
  • Docker
  • Prometheus


Read Full Job Description
Apply Now
By clicking Apply Now you agree to share your profile information with the hiring company.

Technology we use

  • Engineering
  • Product
  • Sales & Marketing
    • C++Languages
    • GolangLanguages
    • JavascriptLanguages
    • PythonLanguages
    • RLanguages
    • SqlLanguages
    • rustLanguages
    • ReactLibraries
    • ConfluenceManagement
    • JIRAManagement
    • SmartsheetManagement
    • SalesforceCRM

Location

405 Comal St, Austin, TX 78702

What are Cloudflare Perks + Benefits

Culture
Volunteer in local community
Open door policy
OKR operational model
Team based strategic planning
Pair programming
Open office floor plan
Flexible work schedule
Remote work program
Diversity
Dedicated diversity and inclusion staff
Mandated unconscious bias training
Diversity employee resource groups
Cloudflare has many ERGs including Afroflare, Latinflare, Nativeflare, Asianflare, Desiflare, Womenflare, Women in Engineering, Proudflare, Vetflare, and more.
Hiring practices that promote diversity
Health Insurance & Wellness Benefits
Flexible Spending Account (FSA)
Disability insurance
Dental insurance
Vision insurance
Health insurance
Life insurance
Pet insurance
Wellness programs
Team workouts
Financial & Retirement
401(K)
Company equity
Employee stock purchase plan
Child Care & Parental Leave Benefits
Childcare benefits
Generous parental leave
Family medical leave
Return-to-work program post parental leave
Vacation & Time Off Benefits
Unlimited vacation policy
Generous PTO
Paid volunteer time
Sabbatical
Paid holidays
Paid sick days
Office Perks
Commuter benefits
Company-sponsored outings
Free snacks and drinks
Some meals provided
Company-sponsored happy hours
Relocation assistance
Fitness stipend
Professional Development Benefits
Job training & conferences
Lunch and learns
Promote from within
Online course subscriptions available

More Jobs at Cloudflare

Apply Now
By clicking Apply Now you agree to share your profile information with the hiring company.
Learn more about CloudflareFind similar jobs like this