Delos Data Logo

Delos Data

System Software Engineer - AI

Posted 6 Days Ago
Remote
Hiring Remotely in USA
140K-200K Annually
Mid level
Remote
Hiring Remotely in USA
140K-200K Annually
Mid level
As a System Software Engineer, you will design communication primitives for AI models, optimize performance for training workloads, and benchmark performance on clusters.
The summary above was generated by AI

System Software Engineer - AI

About us:

We are a stealth-mode startup building foundational technology to address performance, scalability, and resiliency challenges in large-scale AI data center clusters. We are backed by top-tier VC firms and notable angel investors.

The company is led by experienced builders and operators who have founded companies, taken them to scale, and exited successfully. We work with a strong sense of unity and shared responsibility, and we expect trust, integrity, and respect in how we collaborate and make decisions. We hold ourselves accountable to one another and to the quality of the work we deliver.

Headquartered in Silicon Valley, we operate across a mix of remote and on-site locations in the U.S. and Canada. We aim to create an environment where people are treated fairly, supported in their growth, and are empowered to do meaningful work alongside others who take the craft seriously.

We are looking for:

We are looking for a talented System Software Engineer to help us redefine the infrastructure layer of AI. In this role, you will bridge the gap between high-level AI frameworks and low-level system software. You will be responsible for designing and implementing the communication and execution primitives that allow large-scale AI models to run efficiently across thousands of GPUs. We are looking for a "builder" who thrives in the early stages of a product’s lifecycle and is passionate about solving the "hard" systems problems of the generative AI era.

Key Responsibilities:

  • Collaborate across the stack to influence the design of our foundational technology, ensuring it meets the needs of next-generation AI models.

  • Identify and resolve performance bottlenecks in distributed training and inference workloads through deep-dive analysis of the software-hardware interface.

  • Conduct rigorous performance benchmarking and characterization on multi-node clusters.

Required Skills and Qualifications:

  • Strong proficiency in C++ and Python, with a deep understanding of systems programming fundamentals (memory management, concurrency, OS internals).

  • Proficient in a Linux development environment.

Desired Skills:

  • Experience with GPU programming (CUDA) and performance optimization for parallel architectures.

  • Familiarity with distributed AI frameworks (PyTorch, JAX, or DeepSpeed) and/or inference engines (vLLM, SGLang, Dynamo/TRT-LLM).

  • Hands-on experience with large-scale cluster orchestration and telemetry tools.

Education:

  • Bachelor's or Master's degree in Computer Engineering, Computer Science, or a related field.

Compensation:

Target base salary for this role is $140,000 - $200,000 per year + meaningful equity + benefits + 401k. Our salary ranges are determined by role, level, experience, and location.

Agency Note:

We do not accept resumes from agencies or search firms. Please do not forward candidate profiles through our careers page, email, LinkedIn messages, or directly to company employees. Any resumes submitted will be deemed the property of the company, and no fees will be paid in the event the candidate is hired.

#LI-EW1

Top Skills

C++
Cuda
Deepspeed
Dynamo/Trt-Llm
Jax
Linux
Python
PyTorch
Sglang
Vllm

Similar Jobs

11 Minutes Ago
In-Office or Remote
60K-75K Hourly
Entry level
60K-75K Hourly
Entry level
Generative AI
The Remote Data Analyst will collect, organize, and analyze data to identify trends and improve decision-making, collaborating with teams remotely.
Top Skills: Google SheetsGoogle WorkspaceExcelMS OfficePower BIPythonRSQLTableau
An Hour Ago
Easy Apply
Remote
U.S.
Easy Apply
107K-158K Annually
Senior level
107K-158K Annually
Senior level
Artificial Intelligence • Enterprise Web • Software • Design • Generative AI
The Revenue Analytics Manager will analyze customer success and partner analytics, delivering insights to drive retention and revenue growth, working cross functionally with leadership and data teams.
Top Skills: SalesforceSnowflakeSQLTableau
An Hour Ago
Remote or Hybrid
3 Locations
130K-190K Annually
Expert/Leader
130K-190K Annually
Expert/Leader
Artificial Intelligence • Big Data • Healthtech • Machine Learning • Analytics • Biotech • Generative AI
The Key Account Director drives global account strategy, maintains relationships with pharmaceutical clients, negotiates contracts, and collaborates across teams to enhance Tempus' role in precision medicine.
Top Skills: AIClinical Trial DesignCroDiagnosticsMultiomicsR&DSequencing

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account