Take-Two Interactive Software Logo

Take-Two Interactive Software

Senior Director, Site Reliability Engineering, Technical Operations Center & Observability

Sorry, this job was removed at 08:12 p.m. (CST) on Tuesday, Aug 05, 2025
Hybrid
Austin, TX
Hybrid
Austin, TX

Similar Jobs at Take-Two Interactive Software

Yesterday
Hybrid
Austin, TX, USA
Senior level
Senior level
Gaming • Information Technology • Mobile • Software
Lead and mentor a team of ServiceNow developers; oversee the end-to-end development lifecycle; ensure platform governance and performance; collaborate with stakeholders to translate requirements into technical solutions, and manage the ServiceNow roadmap for continuous improvement.
Top Skills: App EngineHr Service DeliveryItamItsmJavaScriptRest ApisSecurity OperationsServicenowSourcing And Procurement
Yesterday
Hybrid
Austin, TX, USA
Mid level
Mid level
Gaming • Information Technology • Mobile • Software
The Business Analyst will bridge business needs with technical solutions, focusing on process improvements, data analysis, project management support, and ensuring successful technology implementation across teams.
Top Skills: AirtableItomItsmJIRALucid ChartMiroMonday.ComPower BIServicenowSmartsheetSQLTableau
3 Days Ago
Hybrid
Austin, TX, USA
Senior level
Senior level
Gaming • Information Technology • Mobile • Software
As a Senior Product Security Engineer, you will ensure product security by developing threat models, conducting penetration tests, and providing code reviews. You'll guide security architecture and promote security practices within the development teams.
Top Skills: C#CloudContainersJavaJavaScriptPython
Who We Are:

Headquartered in New York City, Take-Two Interactive Software, Inc. is a leading developer, publisher, and marketer of interactive entertainment for consumers around the globe. We develop and publish products principally through Rockstar Games, 2K, and Zynga. Our products are designed for console gaming systems, PC, and mobile, including smartphones and tablets. We deliver our products through physical retail, digital download, online platforms, and cloud streaming services. The Company’s common stock is publicly traded on NASDAQ under the symbol TTWO. For more corporate and product information please visit our website at http://www.take2games.com.

While our offices (physical and virtual) are casual and inviting, we are deeply committed to our core tenets of creativity, innovation and efficiency, and individual and team development opportunities. Our industry and business are continually evolving and fast-paced, providing numerous opportunities to learn and hone your skills. We work hard, but we also like to have fun, and believe that we provide a great place to come to work each day to pursue your passions.


The Challenge:

The Senior Director of SRE/TOC and Observability will lead global teams responsible for the reliability, scalability, and performance of critical systems across both cloud and on-prem environments. This role is responsible for Site Reliability Engineering, Technical Operations Center (TOC), and enterprise observability strategy, ensuring proactive monitoring, incident response, and platform stability. The ideal candidate combines deep technical expertise with strong leadership skills to drive operational excellence, minimize downtime, and deliver a seamless experience to internal and external collaborators.

What You’ll Take On:
  • Provide strategic leadership for global Site Reliability Engineering (SRE) and Technical Operations Center (TOC) teams, ensuring high availability and resilience of critical systems.
  • Supervise enterprise-wide observability initiatives, including logging, monitoring, tracing, and alerting frameworks to improve system visibility and incident response.
  • Establish and implement SLOs/SLIs, performance baselines, and reliability metrics aligned with business goals.
  • Develop and scale a 24/7 incident response model, including incident command practices, on-call rotations, and critical issue protocols.
  • Drive root cause analysis (RCA) and continuous improvement processes following major incidents.
  • Partner with engineering, infrastructure, and security teams to embed reliability and operational standard methodologies into system design and delivery pipelines.
  • Own and optimize TOC operations, including real-time monitoring, alert triage, and first-line response to critical issues.
  • Champion automation, tooling, and self-healing systems to reduce manual interventions and improve uptime.
  • Hire, mentor, and develop a high-performing team across multiple geographies and time zones.
  • Collaborate with product and business partners to align operational strategies with customer needs and growth plans.
  • Track and report on platform health, incident trends, and reliability critical metrics to executive leadership.
What You Bring:Infrastructure & Cloud:
  • Deep experience with cloud platforms: AWS, GCP, and/or Azure
  • Proven understanding of hybrid and on-prem infrastructure (VMware, bare metal, etc.)
  • Expertise in high-availability architecture, scalability, and disaster recovery planning
Monitoring & Observability:
  • Hands-on experience with observability tools: Datadog, Prometheus, Grafana, New Relic, Splunk, ELK stack, or similar
  • Building and tuning SLOs/SLIs, alerting thresholds, and dashboards
Automation & DevOps:
  • Proficiency in Infrastructure as Code (IaC) tools: Terraform, CloudFormation, Ansible
  • Understanding of CI/CD pipelines and DevOps tooling (e.g., Jenkins, GitLab CI, ArgoCD)
  • Experience with container orchestration platforms: Kubernetes, Docker, Helm
Incident Management & TOC Operations:
  • Experience with incident response tools: PagerDuty, ServiceNow
  • Familiarity with incident command processes, RCA frameworks, and postmortem best practices
  • Understanding of networking fundamentals, DNS, load balancing, and traffic routing
Security & Compliance:
  • Solid understanding of security standard processes, access control, and vulnerability management
  • Awareness of compliance standards (e.g., SOC 2, ISO 27001, HIPAA) relevant to operational reliability
Leadership & Communication:
  • Strong analytical and decision-making skills under pressure
  • Ability to lead multi-functional teamwork during high-severity incidents
  • Experience scaling and mentoring global technical teams

What We Offer You:
  • Great Company Culture. Ranked as one of the most creative and innovative places to work, creativity, innovation, efficiency, diversity and philanthropy are among the core tenets of our organization and are integral drivers of our continued success.
  • Growth: As a global entertainment company, we pride ourselves on creating environments where employees are encouraged to be themselves, inquisitive, collaborative and to grow within and around the company.
  • Work Hard, Play Hard. Our employees bond, blow-off steam, and flex some creative muscles – through corporate boot camp classes, company parties, game release events, monthly socials, and team challenges.
  • Benefits. Medical (HSA & FSA), dental, vision, 401(k) with company match, employee stock purchase plan, commuter benefits, in-house wellness program, broad learning & development opportunities, a charitable giving platform with company match and more!
  • Perks. Fitness allowance, employee discount programs, free games & events and stocked pantries.

Please be aware that Take-Two does not conduct job interviews or make job offers over third-party messaging apps such as Telegram, WhatsApp, or others. Take-Two also does not engage in any financial exchanges during the recruitment or onboarding process, and the Company will never ask a candidate for their personal or financial information over an app or other unofficial chat channel. Any attempt to do so may be the result of a scam or phishing exercise. Take-Two’s in-house recruitment team will only contact individuals through their official Company email addresses (i.e., via a take2games.com email domain). If you need to report an issue or otherwise have questions, please contact [email protected]

As an equal opportunity employer, Take-Two Interactive Software, Inc. (“Take-Two”) is committed to fostering and celebrating the diverse thoughts, cultures, and backgrounds of its talent, partners, and communities throughout its organization. Consistent with this commitment, Take-Two does not discriminate or retaliate against any employee or job applicant because of their race, color, religion, sex (including pregnancy, sexual orientation, and gender identity), national origin, age, disability, and genetic information (including family medical history), or on the basis of any other trait protected by applicable law. If you need to report a concern or have questions regarding Take-Two’s equal opportunity commitment, please contact [email protected].


What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account