Oscilar Jobs

Sr./Staff - Infrastructure/Site Reliability Engineer (SRE)

Oscilar

Sr./Staff - Infrastructure/Site Reliability Engineer (SRE)

Reposted 23 Days Ago

Remote

Hiring Remotely in USA

Senior level

Remote

Hiring Remotely in USA

Senior level

The SRE will own reliability for a cloud-native platform, optimizing performance, availability, and observability, while mentoring engineering teams.

The summary above was generated by AI

Shape the future of trust in the age of AI
At Oscilar, we're building the most advanced AI Risk Decisioning™ Platform. Banks, fintechs, and digitally native organizations rely on us to manage their fraud, credit, and compliance risk with the power of AI. If you're passionate about solving complex problems and making the internet safer for everyone, this is your place.

Why join us:

Mission-driven teams: Work alongside industry veterans from Meta, Uber, Citi, and Confluent, all united by a shared goal to make the digital world safer.
Ownership and impact: We believe in extreme ownership. You'll be empowered to take responsibility, move fast, and make decisions that drive our mission forward.
Innovate at the cutting edge: Your work will shape how modern finance detects fraud and manages risk.

About the Role

Oscilar is growing fast, and so is the complexity of our systems. We’re looking for a experienced SRE to take ownership of reliability across our multi-region, cloud-native platform. You’ll have the mandate and autonomy to design, implement, and evolve systems that stay performant and resilient—through traffic spikes, dependency failures, and global deployments. You’ll be shaping how we scale, how we build observability, and how we run infrastructure that supports billions of events and large-scale data pipelines.

What You’ll Own

Architect and operate resilient cloud infrastructure (AWS, Pulumi, Kubernetes).
Lead initiatives to improve availability, latency, and performance at scale.
Design and evolve our CI/CD pipelines to optimize for speed, safety, and repeatability.
Define the metrics, alerts, and runbooks that form our observability backbone.
Run chaos experiments and failure simulations to harden the platform.
Mentor engineers and set best practices for SRE across the company.

What You Bring

Proven track record as a senior SRE or Infrastructure Engineer in high-scale environments.
Expert-level skills in AWS and Infrastructure as Code (Pulumi, Terraform).
Strong programming ability in Go or Python. We use Go.
Deep understanding of distributed systems (Kafka, ClickHouse) and microservices architecture.
Mastery of container orchestration (Kubernetes) and production debugging.
Strong sense of ownership, and the judgment to balance velocity with reliability.

Benefits

Compensation: Competitive salary and equity packages, including a 401k plan
Flexibility: Remote-first culture — work from anywhere
Health: 100% Employer covered comprehensive health, dental, and vision insurance with a top tier plan for you and your dependents (US)
Balance: Unlimited PTO policy
Technical: AI First company; both Co-Founders are engineers at heart; and over 50% of the company is Engineering and Product
Culture: Family-Friendly environment; Regular team events and offsites
Development: Unparalleled learning and professional development opportunities
Impact: Making the internet safer by protecting online transactions

Similar Jobs

AuthZed

Senior Site Reliability Engineer

2 Days Ago

Remote

United States

Senior level

Artificial Intelligence • Information Technology • Software • Database

As a Site Reliability Engineer, you will design, implement, and maintain scalable infrastructure, ensure system reliability, automate processes, and collaborate with engineering teams.

Top Skills: DockerElk StackGoGrafanaJavaKubernetesNode.jsPrometheusPulumiPythonRubyTerraform

Air Apps

Site Reliability Engineer

17 Days Ago

Remote

Mid level

Information Technology • Mobile • Software

As a Site Reliability Engineer at Air Apps, you'll enhance system reliability through automation, monitoring, and performance optimization, collaborating with development teams to build resilient cloud solutions.

Top Skills: AWSAzureBashCloudFormationDatadogDockerElkGCPGoGrafanaHelmKubernetesPrometheusPulumiPythonTerraform

Newton.co

Site Reliability Engineer

20 Days Ago

In-Office or Remote

Mid level

Blockchain • Financial Services • Cryptocurrency • Web3

The Site Reliability Engineer will enhance operational reliability, managing incidents, improving system performance, and maintaining metrics to ensure scalability and resilience.

Top Skills: AWSJavaJavaScriptLinux ShellPython

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center