TRM Labs Logo

TRM Labs

Staff MLOps Engineer – LLMOps

Posted 2 Days Ago
Easy Apply
Remote
Hiring Remotely in United States
220K-240K Annually
Senior level
Easy Apply
Remote
Hiring Remotely in United States
220K-240K Annually
Senior level
As a Staff MLOps Engineer, you'll build scalable AI infrastructures, integrate cutting-edge AI tools, and automate model deployments, with a focus on Large Language Models.
The summary above was generated by AI
Build to Protect Civilization

TRM is a blockchain intelligence company that’s on a mission to build a safer financial system for billions of people. We’re a lean, high-impact team tackling some of the world’s most critical challenges, ranging from human trafficking and financial fraud to terrorist financing. We are builders who power governments, financial institutions, and crypto companies when the clock is running and the consequences are real. This is why every TRMer is a bet on our future and has the power to change our trajectory.

The AI Engineering Team is chartered with enabling next-generation AI applications, with a special focus on Large Language Models (LLMs) and agentic systems. Our mission is to build robust pipelines, high-performance infrastructure, and operational tooling that allow AI systems to be deployed with speed, safety, and scale.

We manage petabyte-scale pipelines, serve models with millisecond-level latency, and provide the observability and governance needed to make AI production-ready. We’re also deeply involved in evaluating and integrating cutting-edge tools in the LLM and agent space — including open-source stacks, vector databases, evaluation frameworks, and orchestration tools that unlock TRM’s ability to innovate faster than the market.

As a Staff MLOps Engineer with a focus in LLMOps, you’ll be at the core of building and scaling the technical infrastructure for AI/ML systems. You will:

  • Build reusable CI/CD workflows for model training, evaluation, and deployment — integrating Langfuse, GitHub Actions, and experiment tracking, etc.
  • Automate model versioning, approval workflows, and compliance checks across environments.
  • Build out a modular and scalable AI infrastructure stack — including vector databases, feature stores, model registries, and observability tooling.
  • Partner with engineering and data science to embed AI models and agents into real-time applications and workflows.
  • Continuously evaluate and integrate state-of-the-art AI tools (e.g. LangChain, LlamaIndex, vLLM, MLflow, BentoML, etc.).
  • Drive AI reliability and governance, enabling experimentation while ensuring compliance, security, and uptime.
  • Build and enhance AI/ML Model Performance
  • Ensure data accuracy, consistency and reliability, leading to better model training and inferencing
  • Deploy infrastructure to support offline and online evaluation of LLMs and agents — including regression testing, cost monitoring, and human-in-the-loop workflows.
  • Enable researchers to iterate quickly by providing sandboxes, dashboards, and reproducible environments.
What We’re Looking For
  • Write high-quality, maintainable software — primarily in Python, but we value engineering ability over language familiarity.
  • Have a strong background in scalable infrastructure, including:
    • Containerization and orchestration (e.g. Docker, Kubernetes)
    • Infrastructure-as-code and deployment (e.g. Terraform, CI/CD pipelines)
    • Monitoring and logging frameworks (e.g. Datadog, Prometheus, OpenTelemetry)
  • Understand and implement ML Ops best practices, including:
    • Model versioning and rollback strategies
    • Automated evaluation and drift detection
    • Scalable model and agent serving infrastructure (e.g. vLLM, Triton, BentoML)
  • Deploy and maintain LLM and agentic workflows in production, including:
    • Monitoring cost, latency, and performance
    • Capturing traces for analysis and debugging
    • Optimizing prompt/response flows with real-time data access
  • Demonstrate strong ownership and pragmatism, balancing infrastructure elegance with iterative delivery and measurable impact.

Learn about TRM Speed in this position:

  • Rapid Issue Resolution. TRM Engineers identify and resolve critical onsite issues in minutes to hours, not weeks. We create virtual war rooms, implement fixes, and share lessons with both customer stakeholders and internal teams within 48 hours.
  • Navigating Bureaucracy. We anticipate and address procedural hurdles, build trust with key stakeholders, and find alternative pathways to approvals. This keeps projects moving even in complex environments.
  • Efficient Knowledge Transfer. Engineers document and share updates in real time, ensuring the entire team—onsite and remote—has full visibility into plans, blockers, and resolutions. Knowledge sharing sessions and clear documentation reduce friction and accelerate delivery.
About TRM's Engineering Levels:

Engineer: Responsible for helping to define project milestones and executing small decision decisions independently with the appropriate tradeoffs between simplicity, readability, and performance. Provides mentorship to junior engineers, and enhances operational excellence through tech debt reduction and knowledge sharing.

Senior Engineer: Successfully designs and documents system improvements and features for an OKR/project from the ground up. Consistently delivers efficient and reusable systems, optimizes team throughput with appropriate tradeoffs, mentors team members, and enhances cross-team collaboration through documentation and knowledge sharing.

Staff Engineer: Drives scoping and execution of one or more OKRs/projects that impact multiple teams. Partners with stakeholders to set the team vision and technical roadmaps for one or more products. Is a role model and mentor to the entire engineering organization. Ensures system health and quality with operational reviews, testing strategies, and monitoring rigor.

The following represents the expected range of compensation for this role:

  • Individual pay is determined by skills, qualifications, experience, and location. The compensation details listed in this posting reflect the US base salary only.
  • The estimated base salary range for this role is $220,000 - $240,000.
  • Additionally, this role may be eligible to participate in TRM’s equity plan.
  • Please note – we factor in the different costs for geographies outside the United States.

Life at TRM

We build to protect civilization. That promise shows up in how we work every day.

TRM runs fast. Really fast. We’re a high-velocity team that expects ownership, clarity, and follow-through. People who thrive here are inspired by hard problems, experimentation, direct feedback. If it takes months elsewhere, it often ships here in days. If you are optimizing primarily for consistent work-life balance, use the interview process to pressure-test fit. We want teammates who thrive here, not just survive here.

We coach directly, assume positive intent, and play for the front of the jersey.

Leadership Principles
  • Impact-Oriented Trailblazer: We put customers first, driving for speed, focus, and adaptability.
  • Master Craftsperson: We prioritize speed, high standards, and distributed ownership.
  • Inspiring Colleague: We value humility, candor, and a one-team mindset.

Want to learn more about how we interview at TRM Labs? Check out more about our leadership principles and hiring process here.

What You’ll Do Here

This work has teeth. At TRM, your week might include:

  • Driving critical investigations that can’t wait for typical business hours.
  • Shipping products in days when others would schedule quarters.
  • Partnering with teams across time zones to deliver insights while the story is still unfolding.
  • Building new solutions from first principles when the playbook doesn’t yet exist.
  • Protecting victims and customers by tracing illicit activity and disrupting criminal networks.
Join our Mission

We look for people who want their work to matter, who build with speed and rigor, and who take pride in protecting others through their craft. If you’re excited by TRM’s mission but don’t check every box, apply anyway. We hire for slope, judgment, and the will to learn fast.

Build to protect civilization. Let’s do it together.

Recruitment agencies

TRM Labs does not accept unsolicited agency resumes. Please do not forward resumes to TRM employees. TRM Labs is not responsible for any fees related to unsolicited resumes and will not pay fees to any third-party agency or company without a signed agreement.

Privacy Policy

By submitting your application, you are agreeing to allow TRM to process your personal information in accordance with the TRM Privacy Policy

Learn More: Company Values | Interviewing | FAQs

Top Skills

Bentoml
Ci/Cd
Datadog
Docker
Github Actions
Kubernetes
Langchain
Llamaindex
Mlflow
Opentelemetry
Prometheus
Python
Terraform
Triton
Vllm

Similar Jobs

5 Minutes Ago
Easy Apply
Remote
USA
Easy Apply
131K-154K Annually
Senior level
131K-154K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Senior Program Lead will manage the Proactive Support program, improving customer experience by rapidly addressing issues and optimizing operations through real-time data solutions and cross-functional collaboration.
Top Skills: Contact Center Task RoutingDecisioning SystemsEvent StreamingOperations Data Analysis
20 Minutes Ago
Remote or Hybrid
United States
42K-42K Annually
Junior
42K-42K Annually
Junior
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
The Telemarketing Representative will engage potential customers, manage sales leads, and enhance customer retention through effective communication and problem-solving in a call center environment.
Top Skills: ExcelMs Word
20 Minutes Ago
Remote or Hybrid
United States
157K-209K Annually
Senior level
157K-209K Annually
Senior level
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
The role involves managing reinsurance investment portfolios, developing investment strategies, and collaborating with actuarial and business teams for pricing support.
Top Skills: AladdinBlackrockBloombergExcel

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account