Rohirrim

Senior Data Engineer

Reposted 4 Days Ago
Remote
Hiring Remotely in United States
Senior level
As a Senior Data Engineer, you'll build and optimize data pipelines, manage Azure-based data infrastructure, and ensure compliance while collaborating with ML and product teams.
The summary above was generated by AI

Are you passionate about pushing the boundaries of technology in the Gen AI space? Rohirrim is seeking a Senior Data Engineer to mentor engineers, provide technical direction, and drive the development of cutting-edge applications. If you thrive in a fast-paced environment and enjoy leading by example while staying hands-on with coding, we want to hear from you!

Why Join Rohirrim?

At Rohirrim, we're at the forefront of innovation in the Gen AI space. Joining our team means being part of a dynamic environment where your leadership and expertise make a tangible impact on our products and team growth.


About the Role

As a Senior Data Engineer at Rohirrim, you’ll design, build, and optimize the data pipelines and infrastructure that fuel our AI products. You’ll work closely with our AI/ML teams, product teams, customer success managers, and security/compliance partners to transform complex enterprise datasets into clean, reliable, structured foundations for Rohan deployments, especially in controlled, secure, or GovTech environments.

You’ll help us scale:

  • ingestion pipelines
  • vector stores
  • embedding workflows
  • metadata & document-processing frameworks
  • Azure-native data services

…in a way that is fast, compliant, and deeply reliable.



What You’ll Do
  • Blend capabilities in software engineering, data engineering, and DevOps to build and maintain scalable data ingestion pipelines for structured and unstructured data (documents, PDFs, knowledge bases, enterprise systems, APIs, etc.).
  • Develop and operate ETL/ELT workflows that ensure data integrity, security, and lineage.
  • Implement and optimize vector database systems and embeddings pipelines supporting RAG and AI features.
  • Collaborate with ML engineers to support model training, evaluation, and feature engineering pipelines.
  • Architect and manage Azure-based data infrastructure (e.g., Azure Functions, Azure Storage, Azure SQL, Azure Kubernetes Service, Azure OpenAI integrations).
  • Build internal tools for metadata extraction, OCR/document parsing, text normalization, and validation.
  • Ensure pipelines meet compliance, auditability, and security requirements (SOC 2, FedRAMP, etc.).
  • Support customer-specific data onboarding workflows for government and enterprise deployments.
  • Monitor and improve pipeline performance, reliability, and scalability.



What Makes You a Great Fit
  • 10+ years in Data Engineering, Software Engineering, or ML/Data Infrastructure roles.
  • Strong experience with Python, SQL, and modern data engineering tools (Airflow, Dagster, dbt, Prefect, etc.).
  • Experience building large-scale document extraction ETL pipelines (OCR, PDF parsing, metadata extraction, NLP preprocessing).
  • Proficiency with Kubernetes, Docker, and containerized data pipelines deployed on Azure, AWS, and/or Google Cloud.
  • Hands-on experience with relational databases (Postgres, SQL Server, MySQL) and non-relational systems such as Elasticsearch, Redis, and graph databases.
  • Experience with document-heavy or text-heavy data processing (OCR, parsing, NLP preprocessing).
  • Strong data quality, governance, lineage, and validation mindset.
  • Excellent communicator who can align with ML, engineering, and product teams.



Bonus Skills
  • Experience building or supporting GenAI / LLM / RAG pipelines.
  • Experience with Azure OpenAI Service.
  • Experience with MinIO.
  • Background with knowledge graphs, semantic search, or indexing at scale.
  • Familiarity with CI/CD pipelines in Azure DevOps, GitHub Actions, or similar.

Top Skills

Airflow
Azure
Dagster
dbt
Docker
Elasticsearch
Kubernetes
MySQL
Postgres
Prefect
Python
Redis
SQL
SQL Server
