Speechify Logo

Speechify

AI Infrastructure Engineer

Reposted 14 Days Ago
Easy Apply
Remote
Hiring Remotely in USA
Mid level
Easy Apply
Remote
Hiring Remotely in USA
Mid level
As an AI Infrastructure Engineer, you will build and optimize ML platforms, ensure best practices in model management, and collaborate across teams to enhance AI solutions.
The summary above was generated by AI

Mission

The mission of Speechify is to make sure that reading is never a barrier to learning.

Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember more. Speechify’s text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App. Google recently named Speechify the Chrome Extension of the Year and Apple named Speechify its 2025 Design Award winner for Inclusivity.  

Today, nearly 200 people around the globe work on Speechify in a 100% distributed setting – Speechify has no office. These include frontend and backend engineers, AI research scientists, and others from Amazon, Microsoft, and Google, leading PhD programs like Stanford, high growth startups like Stripe, Vercel, Bolt, and many founders of their own companies.

Overview

We are looking for AI Infrastructure engineer to help build and scale the infrastructure that powers our machine learning initiatives. In this role, you will design, develop, and optimize the core platforms and services that enable data scientists and ML engineers to train, deploy, and monitor models efficiently. You’ll partner closely with Data Science, Data Engineering, and Product teams to create a robust, self-service ML ecosystem that accelerates innovation.

What You’ll Do

Build & Scale AI Infrastructure: Design, implement, and maintain high-performance ML training and inference platforms. Develop MLOps Tools: Ship tools that allow any ML engineer to deploy a model in minutes, not days. Optimize Performance: Improve scalability, reliability, and cost efficiency of model training and serving systems. Collaborate Across Teams: Partner with researchers to turn experimental voice models into production-ready systems. Ensure Best Practices: Establish standards for model versioning, testing, monitoring, and governance. Drive Automation: Automate data/model pipelines to reduce manual intervention and speed up experimentation.

  • Experience: 3+ years in Software Engineering or ML Platform/Infrastructure roles, with a focus on distributed systems, cloud services, or MLOps.
  • Technical Expertise: Proficiency in Python (or similar), containerization (Docker, Kubernetes), CI/CD pipelines, Kubernetes, Cloud proficiency
  • Strong knowledge of cloud platforms (AWS, GCP, or Azure) and infrastructure-as-code tools (Terraform, CloudFormation).
  • Experience with ML frameworks (TensorFlow, PyTorch, or similar) and orchestration tools (Kubeflow, Airflow, MLflow).
  • Deep understanding of data pipelines, model deployment, real-time inference systems, and reliability of AI systems
  • Strong communication skills and the ability to work across engineering and data science teams.
  • Hands-on with CI/CD and Model Serving

Nice to Have

  • Experience with feature stores, vector databases, or large-scale model training.
  • Familiarity with streaming data technologies (Kafka, Spark Streaming, Flink).
  • Knowledge of monitoring/observability tools (Prometheus, Grafana, Datadog).
  • Contributions to open-source ML/MLOps projects.
  • GPU optimization (TensorRT, ONNX, vLLM, Triton)
  • Experience with low-latency audio/streaming systems
  • Familiarity with vector DBs or feature stores

What we offer

  • A dynamic environment where your contributions shape the company and its products
  • A team that values innovation, intuition, and drive
  • Autonomy, fostering focus and creativity
  • The opportunity to have a significant impact in a revolutionary industry
  • Competitive compensation, a welcoming atmosphere, and a commitment to an exceptional asynchronous work culture
  • The privilege of working on a product that changes lives, particularly for those with learning differences like dyslexia, ADD, and more
  • An active role at the intersection of artificial intelligence and audio – a rapidly evolving tech domain

Think you’re a good fit for this job? 

Tell us more about yourself and why you're interested in the role when you apply.
And don’t forget to include links to your portfolio and LinkedIn.

Not looking but know someone who would make a great fit? 

Refer them! 

Speechify is committed to a diverse and inclusive workplace. 

Speechify does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.

Top Skills

Airflow
AWS
Azure
Ci/Cd
CloudFormation
Datadog
Docker
GCP
Grafana
Kubeflow
Kubernetes
Mlflow
Prometheus
Python
PyTorch
TensorFlow
Terraform

Similar Jobs

7 Days Ago
In-Office or Remote
8 Locations
168K-297K Annually
Senior level
168K-297K Annually
Senior level
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
The AI Research Infrastructure Engineer will design, build, and implement AI systems and automated data pipelines for research, focusing on workflow automation and integration with tech stacks.
Top Skills: Cloud PlatformsLlmsMcp IntegrationsPythonQualtricsRSnowflakeSQL
14 Days Ago
Remote or Hybrid
8 Locations
168K-297K Annually
Senior level
168K-297K Annually
Senior level
Blockchain • Fintech • Mobile • Payments • Software • Financial Services
The role involves designing and implementing AI research infrastructure, building automated data pipelines, and integrating AI systems to enhance research efficiency and insights delivery.
Top Skills: EltETLGooseLlmsModel Context Protocol (Mcp)PythonQualtricsRSnowflakeSQL
4 Days Ago
Remote
USA
160K-220K Annually
Senior level
160K-220K Annually
Senior level
Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Conversational AI
Lead the architecture and management of AI/ML infrastructure using Kubernetes and Terraform, optimizing hybrid environments for performance and scalability.
Top Skills: AWSBashGoKubernetesPythonSlurmTerraform

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account