Site Reliability Engineer
Q2 is seeking a Site Reliability Engineer to help Q2 deliver industry leading uptime and exceptional end-to-end banking and lending services to nearly 20 million users nationally. The production systems you will design and sustain are critical to how we enable Banks, Credit Unions, and Fintech companies to create simple, smart, and reliable experiences for their customers.
Forward thinking and ever watchful for opportunities to improve performance and automate away toil, we are a collaborative and highly resourceful team entrusted with sustaining capacity and engineering the future. From developing deep Q2 application knowledge, managing our container orchestration, logging, and data platforms, supporting private and public cloud environments, learning how to leverage automation to drive efficiencies, and troubleshooting critical infrastructure, your opportunities to make a significant impact at Q2 are endless.
Named as one of Austin’s fastest-growing companies and one of the best places to work, Q2 offers our employees a culture fueled by engaged, motivated, and dedicated team. We love to solve complex problems, learn and apply new technologies and provide our customers with superior, resilient services.
If you were working for us, here are some of the things you would have done last week:
- Quickly restore system services based on your ability to diagnose problems, make critical decisions, investigate root cause and plan to mitigate future failures
- Implement automation to scale systems sustainably and minimize highly repetitive and error-prone tasks
- Support, maintain, and improve production container hosting environment
- Proactively analyze client environments and collaborate across business and technology organizations to identify opportunities to improve performance and resiliency
- Partner with our Incident Response Team to provide blameless post-mortem analysis of why services broke or became degraded
- Leverage your diverse toolkit to identify and fix software bugs and misconfigurations
- Engage in services design, environment maturation, and disaster recovery planning
- Share your knowledge with a peer, document your process, and facilitate lessons learned reviews
- Participate in an on-call rotation to assist Customer Success teams with client-impacting outages
Qualifications
- Knowledge of CI/CD Pipelines Implementation for applications and infrastructure
- Proficiency in HashiCorp tools such as Consul, Nomad, Vault, Packer and Terraform
- Advanced knowledge of Linux and Windows Systems Administration
- Troubleshooting experience with Docker containers and other container orchestration technologies including Nomad and Kubernetes
- Knowledge of best practices of running applications in containerized environments including health checks and rolling update strategies
- Experience with scripting languages such as Bash, Powershell, or Python
- Knowledge of VMWare and cloud environments such as AWS and Azure
- Understand how to read network packet captures and troubleshoot connectivity issues
- General understanding of Content Delivery Networks
- Foundational understanding of networks and the 7 layers of the OSI model
- Knowledge of development languages such as Python, C#, and Node.js
- Knowledge of T-SQL and ability to write complex queries
At Q2, our goal is to be a diverse and inclusive workforce that fosters mutual respect for our employees and the communities we serve. Q2 is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.