Flex is a growth-stage, NYC-headquartered FinTech company that is creating the best rent payment experience. It’s hard to believe that it’s 2026 and paying rent on time is still expensive, inflexible, and difficult. We’re here to change that! Flex enables our users to pay rent throughout the month on a schedule that better fits their finances and budget. Our mission is to empower as many renters as possible with flexibility over their most significant recurring expense. After deliberately keeping a stealth profile as we built up unprecedented investor support and an enthusiastic user base, we are looking for motivated individuals to help us advance our mission. Will you be a part of the team?
Flex is looking for an exceptional Senior Staff Infrastructure Engineer with a passion for driving impact while managing high levels of ambiguity and Getting Stuff Done.
In this role, you will be the bar raiser for the Infrastructure Engineering team, a small team responsible for creating and maintaining a sustainable set of platforms that ensures the effectiveness, reliability and scalability of our systems. You'll lead and enable the team in designing, building, and maintaining our robust and scalable infrastructure for engineers, customers, AI agents and fellow employees. You'll collaborate closely with our service engineering teams and leaders to determine the direction of our platform, and get ahead of where they need to be.
At Flex, we are an AI-first engineering organization. We believe that the future of infrastructure isn't just about managing resources—it’s about building the intelligent, automated systems that manage them for us. We aren't looking for "task-takers"; we are looking for domain experts who use their deep knowledge of cloud architecture and SRE principles to steer these AI tools effectively. On this team, your value is defined by your ability to combine your technical mastery with an AI-augmented workflow to deliver world-class reliability at a growth-stage pace.
We are particularly interested in candidates with software engineering experience in languages like Java, Python, or TypeScript. This background will allow you to collaborate effectively with product teams, build tools and automation, and improve the developer experience across our engineering organization. You’ll have the opportunity to influence key infrastructure and architecture decisions while ensuring high reliability and smooth delivery pipelines.
This remote role requires a minimum of 10 years of cloud infrastructure experience.
What you’ll do
- Define the long-term technical strategy for scalable and resilient infrastructure, guiding cross-functional teams to implement solutions that optimize for performance, resilience, and cost at an organizational level.
- Serve as the top technical authority ensuring the entire infrastructure platform aligns with critical business objectives and sets a high bar for industry standards.
- Own the end-to-end "build vs. buy" evaluation and decision process for all major infrastructure technology, weighing long-term cost, maintenance, scalability, and strategic business alignment.
- Serve as a hands-on technical authority, with the ability to dive deep into any part of the technology stack to diagnose complex systemic issues and drive best-in-class engineering solutions.
- Establish and evangelize a world-class SRE culture and practices across all engineering teams, defining SLOs/SLIs and leading initiatives to achieve target reliability goals for critical systems.
- Own and drive significant improvements in the end-to-end developer experience, including the architecture of self-service platforms, advanced CI/CD systems, and deployment mechanisms to maximize organizational velocity.
- Take command of major, high-severity incident responses that cross team boundaries, instituting structured post-incident review processes to extract systemic lessons and drive fundamental, long-term resilience improvements.
- Lead the vision for hyper-automation across the infrastructure domain, building sophisticated automated pipelines and tools that reduce operational toil to near zero and enable a self-healing system.
- Communicate complex technical strategies and decisions with executive leadership, peer organizations, and the broader engineering community, driving alignment and buy-in for the infrastructure roadmaps.
What you’ll need
- Architectural-level mastery of designing, building, and operating highly scaled, resilient cloud infrastructure on AWS, with deep expertise in services like EKS, S3, RDS, API Gateway, and NoSQL database solutions (DocumentDB, DynamoDB).
- Expert-level proficiency and a track record of driving Infrastructure as Code (IaC) best practices using Terraform at an organizational scale, including developing reusable modules and governance frameworks.
- Extensive experience and leadership in architecting and managing enterprise-grade, highly-available Kubernetes (EKS) and microservice platforms, driving adoption of modern container orchestration patterns.
- Ownership of and demonstrated expertise in defining and implementing world-class CI/CD pipelines (e.g., GitHub Actions), significantly improving deployment speed, safety, and velocity across engineering teams.
- Proven track record of architecting and delivering internal self-service platforms and advanced developer tooling that significantly boosts organizational productivity and automates common infrastructure operations.
- Deep understanding and hands-on experience with advanced networking concepts (e.g., mesh, service discovery, cloud VPC design, security groups) to ensure global security and high performance.
- Exceptional technical communication and cross-functional leadership skills, with the ability to drive consensus and alignment on complex technical strategies across executive, product, and engineering teams.
- Demonstrated experience defining and leading the implementation of a unified, end-to-end observability (metrics, logs, traces) framework using industry-leading tools (Datadog preferred), transforming operational monitoring into predictive health insights.
- Experience writing and reading code in an industry-standard language such as Java, Python, or TypeScript.
Salary range by location tier:
- Tier A (NYC/SF/Seattle): $240,000 - $300,000 USD
- Tier B: $216,000 - $270,000 USD
- Tier C: $204,000 - $255,000 USD
Life at Flex:
We understand that it takes a diverse team of highly intelligent, curious, determined, empathetic, and self-aware people to grow a successful company. Our HQ is located in New York City, but we have employees located throughout the US, Australia, Canada, and South America. We are growing quickly, but deliberately, with a focus on building an inclusive culture. Our dynamic team has incredible perspectives to share, just as we know you do, and we take great pride in being an equal opportunity workplace.
We offer many employee benefits & perks. For full-time U.S.-based positions, we offer:
- Competitive medical, dental, and vision available from Day 1
- Company equity
- 401(k) plan with company match (our company match kicks off at the beginning of 2026)
- Unlimited paid time off + 13 company paid holidays
- Parental leave
- Flex Cares Program
- Free Flex subscription
For full-time non-U.S. employees, we offer:
- Competitive compensation + company equity
- Unlimited PTO