Featherless AI Logo

Featherless AI

Machine Learning Engineer — Distillation

Posted Yesterday
In-Office or Remote
Hiring Remotely in World Golf Village, FL
Mid level
In-Office or Remote
Hiring Remotely in World Golf Village, FL
Mid level
Design and implement knowledge distillation pipelines, optimize training and inference performance, and collaborate with research on production-ready ML models.
The summary above was generated by AI
About the Role

We’re looking for a Machine Learning Engineer focused on model distillation to help us build smaller, faster, and more efficient models without sacrificing quality. You’ll work at the intersection of research and production—taking cutting-edge techniques and turning them into systems that scale.

This is a hands-on role with real ownership: you’ll design distillation pipelines, run large-scale experiments, and ship models used in production.

What You’ll Do
  • Design and implement knowledge distillation pipelines (teacher–student, self-distillation, multi-teacher, etc.)

  • Distill large foundation models into smaller, faster, and cheaper models for inference

  • Run and analyze large-scale training experiments to evaluate quality, latency, and cost tradeoffs

  • Collaborate with research to translate new distillation ideas into production-ready code

  • Optimize training and inference performance (memory, throughput, latency)

  • Contribute to internal tooling, evaluation frameworks, and experiment tracking

  • (Optional) Contribute back to open-source models, tooling, or research

What We’re Looking For
  • Strong background in machine learning or deep learning

  • Hands-on experience with model distillation (LLMs or other neural networks)

  • Solid understanding of training dynamics, loss functions, and optimization

  • Experience with PyTorch (or JAX) and modern ML tooling

  • Comfort running experiments on multi-GPU or distributed setups

  • Ability to reason about model quality vs. performance tradeoffs

  • Pragmatic mindset: you care about shipping, not just papers

Nice to Have
  • Experience distilling LLMs or large sequence models

  • Experience with inference optimization (quantization, pruning, kernels, etc.)

  • Familiarity with evaluation for language models

  • Open-source contributions or research publications

  • Experience in early-stage or fast-moving startups

Why Join
  • Work on core model quality and cost efficiency—not side projects

  • High ownership and direct impact on product and roadmap

  • Small, senior team with strong research + engineering culture

  • Competitive compensation + meaningful equity

  • Remote-friendly, async-first environment

Top Skills

Deep Learning
Distributed Computing
Jax
Machine Learning
Model Distillation
Multi-Gpu
Pruning
PyTorch
Quantization

Similar Jobs

5 Hours Ago
Remote
United States
Senior level
Senior level
Artificial Intelligence • Fintech • Software • Financial Services
The Senior Machine Learning Engineer will build and own production ML systems, manage end-to-end workflow, debug issues, and mentor others.
Top Skills: JaxPythonPyTorch
25 Days Ago
Remote
United States
Senior level
Senior level
Big Data • Analytics • Business Intelligence • Big Data Analytics
The Machine Learning Engineer will develop software solutions, optimize data pipelines, and deploy machine learning models using cloud platforms and best practices.
Top Skills: AzureAzuremlDatabricksDockerMlflowPythonSpark
10 Days Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
135K-228K Annually
Senior level
135K-228K Annually
Senior level
Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
As a Senior Machine Learning Engineer, you will develop scalable ML solutions, collaborate with teams, optimize models, and implement best practices for deployments and infrastructures.
Top Skills: C++GoJavaKubernetesPythonPyTorchRayScalaSparkTensorFlow

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account