Senior Site Reliability Engineer (Cloud Ops)

Sorry, this job was removed at 10:58 a.m. (CST) on Friday, November 2, 2018
Find out who's hiring remotely in Austin.
See all Remote Developer + Engineer jobs in Austin
Apply
By clicking Apply Now you agree to share your profile information with the hiring company.

Forcepoint is transforming cybersecurity by focusing on what matters most: understanding people’s intent as they interact with critical data and intellectual property wherever it resides. Our uncompromising systems enable companies to empower employees with unobstructed access to confidential data while protecting intellectual property and simplifying compliance. Based in Austin, Texas, Forcepoint supports more than 20,000 organizations worldwide. For more about Forcepoint, visit www.Forcepoint.com and follow us on Twitter at @ForcepointSec.

 

 

Senior Site Reliability Engineer

 

Job Description

 

Forcepoint is looking for a Senior Site Reliability Engineer to join the Cloud Operations SRE team.  This is a unique opportunity to join a newly formed team who will focus on world-class monitoring/alerting, platform performance, availability, reliability, and capacity planning.  The right candidate will have a software development mindset and will automate as much as possible to avoid repetitive tasks.  The individual will work closely with Engineering teams to optimize the deployment and monitoring of mission critical, customer-facing systems across private and public cloud environments. 

 

The successful candidate is customer focused, a self-starter, able and willing to work with geo-dispersed teams.  This role will also be responsible for mentoring less-experienced staff.

 

Responsibilities

 

  • Serve as a senior technical resource on the Cloud Operations team providing escalation support and mentorship for team members across the IT organization
  • Assist management to develop processes and programs that define how Cloud Operations interfaces with its customers
  • Monitor and debug issues across the platforms (applications, networks, databases)
  • Administer, maintain, automate systems to ensure reliability, resiliency, scalability and security
  • Deploy, maintain, and enhance monitoring solutions and provide technical resolutions and root cause analysis for high severity incidents
  • Work closely with Engineering and Software Development teams to design, deploy, and operate components/services that are automated, resilient, and scalable
  • Create, update, and maintain documentation for all configurations for the production environment
  • Develop and deliver timely reports on service metrics including but not limited to availability, capacity, performance, and latency across all production systems
  • Provide 24x7 on-call support as required

 

Skills & Qualifications

 

Must Haves

 

  • Be willing to bring your best to work every day, challenge the status quo, and move the Forcepoint culture forward
  • Bachelor’s Degree in Computer Science or equivalent experience related to Information Technology
  • 5+ years’ experience as a Cloud Systems Engineer or SRE managing a SaaS / PaaS environment
  • Deep experience managing Linux (RHEL/CentOS)
  • Deep experience with the configuration and automation toolsets such as Puppet, Chef and Ansible
  • Deep experience in monitoring a global Cloud footprint.  Hands-on with modern monitoring platforms and time-series databases, such as Graphite, Prometheus, Data Dog, or ScienceLogic mandatory
  • Experience in the design and/or deployment of Public Cloud technologies (AWS, Azure, GCP)
  • Experience in Network Services such as DNS, DHCP, WAN Routing, TCP and UDP based
  • Experience with containerization and container orchestration especially with Dockers, Kubernetes, or Mesos
  • Experience in the deployment and management of microservices
  • Experience maintaining and managing Spark, Kafka, Tomcat, Cassandra / Druid, and MySQL based systems
  • Proficient with C/C++, Python, or Java
  • Solid understanding of incident management, change management, and problem management

Nice to Haves

 

  • Experience working with a globally distributed team
  • Understanding of software development lifecycle and CI/CD pipelines
  • Experience architecting and optimizing cloud platforms
  • AWS Certifications
Read Full Job Description
Apply Now
By clicking Apply Now you agree to share your profile information with the hiring company.

Location

We are located in the North Austin area by The Domain, off of Braker between Mopac and 183. Our office patio overlooks Quarry Lake.

Similar Jobs

Apply Now
By clicking Apply Now you agree to share your profile information with the hiring company.
Learn more about ForcepointFind similar jobs