Speechify Logo

Speechify

AI Infrastructure Engineer

Sorry, this job was removed at 08:35 p.m. (CST) on Tuesday, Feb 24, 2026
Remote
Hiring Remotely in USA
Remote
Hiring Remotely in USA

Similar Jobs

18 Days Ago
Remote or Hybrid
150K-170K Annually
Senior level
150K-170K Annually
Senior level
AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Lead full‑stack development of AI infrastructure prototypes and production systems. Build backend Python services, integrate front-end apps with APIs, AD and OAuth, deploy serverless cloud solutions (AWS/Azure/GCP), write automated tests and documentation, and collaborate across teams to deliver generative-AI agents using foundation model APIs and agent frameworks.
Top Skills: Active DirectoryAgent FrameworksAi GatewaysAngularAWSAzureCode Versioning ToolsFoundation Model ApisFront-End Testing FrameworksGCPJavascript (Es6+)OauthPythonReactRestful ApisServerlessTerraformVue
25 Days Ago
Easy Apply
Remote
USA
Easy Apply
186K-219K Annually
Senior level
186K-219K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Senior SRE on the IT Operations team owning reliability, monitoring, and incident response for AI infrastructure. Build automation, CI/CD and Kubernetes tooling, improve observability and documentation, and develop internal full-stack tools using Go or Python. Partner with Infrastructure, Security, and Compliance to scale secure, resilient AI deployment pipelines.
Top Skills: AnsibleAWSBashChefCi/CdDockerEc2GitGoKubernetesLinuxPuppetPythonRubySaltTerraform
25 Days Ago
Easy Apply
Remote
USA
Easy Apply
218K-257K Annually
Senior level
218K-257K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Own reliability, monitoring, and incident response for AI infrastructure; build automation and CI/CD tooling; manage Kubernetes/Docker production workloads; partner with infrastructure, security, and compliance; improve observability and documentation; develop internal full‑stack tooling in Go or Python.
Top Skills: AnsibleAWSBashChefCi/CdDockerEc2GitGoKubernetesLinuxLog AggregationNetwork SecurityPuppetPythonRubySaltTerraform

Mission

The mission of Speechify is to make sure that reading is never a barrier to learning.

Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember more. Speechify’s text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App. Google recently named Speechify the Chrome Extension of the Year and Apple named Speechify its 2025 Design Award winner for Inclusivity.  

Today, nearly 200 people around the globe work on Speechify in a 100% distributed setting – Speechify has no office. These include frontend and backend engineers, AI research scientists, and others from Amazon, Microsoft, and Google, leading PhD programs like Stanford, high growth startups like Stripe, Vercel, Bolt, and many founders of their own companies.

Overview

We are looking for AI Infrastructure engineer to help build and scale the infrastructure that powers our machine learning initiatives. In this role, you will design, develop, and optimize the core platforms and services that enable data scientists and ML engineers to train, deploy, and monitor models efficiently. You’ll partner closely with Data Science, Data Engineering, and Product teams to create a robust, self-service ML ecosystem that accelerates innovation.

What You’ll Do

Build & Scale AI Infrastructure: Design, implement, and maintain high-performance ML training and inference platforms. Develop MLOps Tools: Ship tools that allow any ML engineer to deploy a model in minutes, not days. Optimize Performance: Improve scalability, reliability, and cost efficiency of model training and serving systems. Collaborate Across Teams: Partner with researchers to turn experimental voice models into production-ready systems. Ensure Best Practices: Establish standards for model versioning, testing, monitoring, and governance. Drive Automation: Automate data/model pipelines to reduce manual intervention and speed up experimentation.

  • Experience: 3+ years in Software Engineering or ML Platform/Infrastructure roles, with a focus on distributed systems, cloud services, or MLOps.
  • Technical Expertise: Proficiency in Python (or similar), containerization (Docker, Kubernetes), CI/CD pipelines, Kubernetes, Cloud proficiency
  • Strong knowledge of cloud platforms (AWS, GCP, or Azure) and infrastructure-as-code tools (Terraform, CloudFormation).
  • Experience with ML frameworks (TensorFlow, PyTorch, or similar) and orchestration tools (Kubeflow, Airflow, MLflow).
  • Deep understanding of data pipelines, model deployment, real-time inference systems, and reliability of AI systems
  • Strong communication skills and the ability to work across engineering and data science teams.
  • Hands-on with CI/CD and Model Serving

Nice to Have

  • Experience with feature stores, vector databases, or large-scale model training.
  • Familiarity with streaming data technologies (Kafka, Spark Streaming, Flink).
  • Knowledge of monitoring/observability tools (Prometheus, Grafana, Datadog).
  • Contributions to open-source ML/MLOps projects.
  • GPU optimization (TensorRT, ONNX, vLLM, Triton)
  • Experience with low-latency audio/streaming systems
  • Familiarity with vector DBs or feature stores

What we offer

  • A dynamic environment where your contributions shape the company and its products
  • A team that values innovation, intuition, and drive
  • Autonomy, fostering focus and creativity
  • The opportunity to have a significant impact in a revolutionary industry
  • Competitive compensation, a welcoming atmosphere, and a commitment to an exceptional asynchronous work culture
  • The privilege of working on a product that changes lives, particularly for those with learning differences like dyslexia, ADD, and more
  • An active role at the intersection of artificial intelligence and audio – a rapidly evolving tech domain

Compensation: The United States base salary range for this full-time position is $140,000-$200,000 + bonus + equity depending on experience

Think you’re a good fit for this job? 

Tell us more about yourself and why you're interested in the role when you apply.
And don’t forget to include links to your portfolio and LinkedIn.

Not looking but know someone who would make a great fit? 

Refer them! 

Speechify is committed to a diverse and inclusive workplace. 

Speechify does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account