Sand Technologies Logo

Sand Technologies

Senior Data Engineer

Posted 4 Hours Ago
Remote
Hiring Remotely in United States
Senior level
Remote
Hiring Remotely in United States
Senior level
Design, build, and operate secure, scalable cloud-native data platforms (lakehouse/data mesh) supporting batch, streaming, ML, GIS and operational workloads. Lead ingestion/transformation pipelines, distributed processing (Spark/Kafka), data governance, observability, and mentor teams while ensuring compliance in regulated environments.
The summary above was generated by AI
About Sand

Sand Technologies is a global Physical AI company using data and AI to make critical industries work better. We partner with governments, cities and enterprises to improve how essential systems operate across healthcare, water, energy, telecommunications and infrastructure.

Our work delivers proven real-world impact. We have built AI systems that help manage London’s water supply, supported telecom network planning across hundreds of cities, and developed digital healthcare platforms serving tens of millions of people across Africa. From intelligent command centers to AI-powered infrastructure platforms, we help organizations sense, analyze and act in complex environments.

Our people are ambitious, curious and relentlessly practical. Our teams work alongside clients in the field, solving hard problems and deploying solutions that last. With colleagues across Africa, Europe, the UK and the US, we operate across the full stack - from research and engineering to deployment and capability building.

Our mission is simple: to harness AI to solve humanity’s most pressing challenges.

About the role

Sand Technologies build data-intensive systems that enable insight, intelligence, and informed decision-making. We typically work with hybrid data architectures with centralised lakehouses or data warehouses and distributed data products on top. Our stack includes tools such as Databricks, dbt, Docker, Python, SQL, and PySpark. We primarily work in cloud-native environments across AWS, Azure, and GCP, while occasionally supporting self-hosted open-source deployments.
A Senior Data Engineer is responsible for designing, building, and maintaining scalable data architecture that underpins our decision-support applications. Our decision-support applications range from traditional Analytics (data warehouse), to Machine Learning, to Digital Twins and on occasion serving LLMs and Agentic workflows, and as such your data architecture should support various use cases. You will work closely with cross-functional teams and contribute to the strategic direction of our data initiatives.
We operate with a strong code-first, “data as a product” mindset, where testing, reliability, observability, and performance are non-negotiable.

Specific Responsibilities
  • Architect and build a secure, scalable urban data platform integrating multi-agency and infrastructure datasets at scale.
  • Design resilient cloud-native architectures supporting batch, streaming, and near-real-time operational workloads.
  • Lead development of high-performance ingestion and transformation pipelines across legacy systems, APIs, IoT/telemetry, and structured data sources.
  • Implement distributed and event-driven processing systems (e.g., Spark, Kafka or equivalent) for large-scale analytical and operational use cases.
  • Establish platform reliability standards, including observability, automated data quality validation, lineage, monitoring, and defined SLAs/SLOs.
  • Design and enforce strong data governance and access control frameworks, including RBAC, encryption, auditability, and secure data handling practices.
  • Build modern lakehouse or equivalent architectures that enable advanced analytics, GIS, and production-grade machine learning.
  • Partner closely with data scientists, ML engineers, and senior stakeholders to operationalize AI and analytics at scale.
  • Optimize platform performance, scalability, and cost efficiency as adoption grows.
  • Contribute to long-term architectural direction and mentor engineering team members.
 Requirements - Essential
  • 6+ years designing and operating large-scale semi-distributed data platforms (hybrid centralised and distributed) in cloud or hybrid environments.
  • Proven experience architecting modern data systems (lakehouse, data mesh, or equivalent) supporting both analytical (descriptive and predictive) and operational workloads.
  • Deep hands-on expertise with distributed processing frameworks (e.g., Spark) and streaming/event systems (e.g., Kafka or similar).
  • Strong experience building secure, governed data environments with robust access controls, encryption, lineage, and audit capabilities.
  • Experience designing secure data platforms in regulated or government environments, with strong understanding of compliance, auditability, and data protection standards.
  • Experience integrating heterogeneous data sources, including legacy systems, APIs, telemetry/IoT systems, and relational databases.
  • Demonstrated ability to design highly available, observable, production-grade data systems.
  • Experience enabling machine learning and advanced analytics through robust data infrastructure and feature pipelines.
  • Strong proficiency in Python, SQL, and ideally DBT with a track record of writing clean, production-quality code.
  • Experience deploying and operating solutions in AWS, Azure, or GCP, including CI/CD and infrastructure-as-code is beneficial.
  • Ability to operate effectively in complex, multi-stakeholder environments.
  • Strong systems-thinking mindset with a focus on scalability, modularity, and long-term platform evolution.
  • Experience designing data platforms in U.S. public sector or highly regulated environments, with working knowledge of applicable federal and state data privacy and security requirements (e.g., HIPAA, CJIS, FERPA, state-level privacy acts), and the ability to embed compliance, auditability, and data governance principles into architectural design.
Location

This role is not a remote position. We would require our Senior Data Engineer to be able to travel to client sites in Baltimore 4 days a week.

Personal Attributes
  • Client Centricity & Integrity: We let Our Clients Run the Company, Surf Like Yvon to stay true to our values, and Play the Long Game with integrity.
  • Collaboration and Inclusion: We live by Each One, Teach Ten and ensure Everybody is Welcome.
  • Operational Excellence and Simplicity: We K.I.S.S. by keeping things simple while always striving to Raise the Bar.
  • Action, Ownership, and Execution: We Decide, Get Stuff Done, and Do Hard Things with accountability.
  • Growth, Innovation, and Resilience: We Choose Growth, Pioneer boldly, and remember There is No Failure.

Due to the considerable amount of virtual work and interaction with colleagues and customers in different physical locations internationally, it is essential that the successful applicant has the drive and ethic to succeed in working in small teams physically but in larger efforts virtually. Self-drive to communicate constantly using web collaboration and video conferencing is essential.

Similar Jobs

Yesterday
Easy Apply
Remote or Hybrid
United States
Easy Apply
118K-179K Annually
Senior level
118K-179K Annually
Senior level
Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
Design, build, and operate large-scale data platforms and Spark/PySpark pipelines. Enable data integration, modeling, quality, and observability. Build MCP servers and AI-augmented tooling, mentor engineers, and lead cross-functional projects to deliver reliable data products.
Top Skills: Ai AgentsApache IcebergAuroraAWSAws RdsAzureDatabricksDbtFivetranGCPGoogle BigqueryMcp ServersMs Sql ServerMySQLOraclePostgresPysparkPythonSnowflakeSparkSQL
4 Days Ago
Remote or Hybrid
Austin, TX, USA
124K-280K Annually
Senior level
124K-280K Annually
Senior level
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Lead data engineering efforts within Technology Consulting: design data architecture and pipelines, implement AWS/Redshift and ETL solutions, support BI (QlikView/Oracle BI), coach teams, manage client relationships and SLAs, apply systems thinking to optimize outcomes and validate solutions with stakeholders.
Top Skills: AWSDatastageDb2ETLJavaManaged ServicesOracle BiPythonQlikviewRedshiftSlasSQL ServerWorkload Orchestration And Scheduling
5 Days Ago
Remote or Hybrid
Austin, TX, USA
99K-232K Annually
Senior level
99K-232K Annually
Senior level
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Lead data engineering engagements to design, build, and maintain ETL/ELT pipelines and cloud data architectures. Manage client accounts and mentor teams, leverage tools like DataStage, AWS/Redshift, DB2/SQL Server, GoldenGate, and BI/visualization platforms to deliver analytics, performance tuning, and scalable reporting solutions.
Top Skills: AWSBirtCdcDatastageDb2Etl/EltGlueGoldengateJavaPythonQlikviewRedshiftS3SpotfireSQL Server

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account