webAI

Senior AI Software Engineer

Reposted 8 Days Ago
In-Office
Austin, TX
Senior level
The Senior AI Software Engineer will design and optimize inference engines in C++, focusing on performance for on-device Apple applications, and implement advanced machine learning architectures.
The summary above was generated by AI

About Us:

webAI is pioneering the future of artificial intelligence by establishing the first distributed AI infrastructure dedicated to personalized AI. We recognize the evolving demands of a data-driven society for scalability and flexibility, and we firmly believe that the future of AI lies in distributed processing at the edge, bringing computation closer to the source of data generation. Our mission is to build a future where a company's valuable data and intellectual property remain entirely private, enabling the deployment of large-scale AI models directly on standard consumer hardware without compromising the information embedded within those models. We are developing an end-to-end platform that is secure, scalable, and fully under the control of our users, empowering enterprises with AI that understands their unique business. We are a team driven by truth, ownership, tenacity, and humility, and we seek individuals who resonate with these core values and are passionate about shaping the next generation of AI.

About the Role:

We are looking for a Senior C++ Systems Engineer with deep expertise in high-performance computing and machine learning inference. In this role, you’ll design and optimize inference engines for on-device applications on Apple platforms (macOS & iOS), leveraging Metal and MLX to push performance to its limits.

This is an ideal opportunity for someone who thrives at the intersection of low-level systems programming, applied machine learning, and hardware-aware optimization.

Key Responsibilities

  • Implement and optimize advanced ML architectures (Transformers, Mixture of Experts, Diffusion models) in C++, with a focus on performance and memory efficiency.

  • Develop and fine-tune custom Metal kernels for performance-critical inference operations.

  • Apply advanced model quantization techniques (low-bit, mixed-precision) to accelerate performance while minimizing footprint.

  • Profile, benchmark, and tune inference on Apple Silicon (M-series, A-series), identifying and eliminating bottlenecks.

  • Collaborate on API design and build Python bindings for C++ libraries.

  • Contribute to robust testing frameworks to ensure reliability and performance.

Requirements & Skills

  • Bachelor’s degree in CS, EE, or a related field, or equivalent experience.

  • 4+ years of professional experience in C++ systems programming.

  • Strong understanding of computer architecture, data structures, and algorithms.

  • Demonstrated experience with performance profiling and low-level optimization.

  • Familiarity with deep learning concepts and architectures (Transformers, Diffusion models, Mixture of Experts).

Preferred (but not required)

  • Deep expertise with Apple’s MLX framework.

  • Demonstrable experience writing and optimizing custom Metal kernels.

  • Experience with model quantization techniques and their performance implications.

  • Familiarity with the iOS/macOS development ecosystem and build systems (CMake).

  • Experience creating Python bindings for C++ libraries.

We at webAI are committed to living out the core values we have put in place as the foundation on which we operate as a team. We seek individuals who exemplify the following:

  • Truth - Emphasizing transparency and honesty in every interaction and decision.

  • Ownership - Taking full responsibility for one’s actions and decisions, demonstrating commitment to the success of our clients.

  • Tenacity - Persisting in the face of challenges and setbacks, continually striving for excellence and improvement.

  • Humility - Maintaining a respectful and learning-oriented mindset, acknowledging the strengths and contributions of others.

Benefits:

  • Competitive salary and performance-based incentives.

  • Comprehensive health, dental, and vision benefits package.

  • 401k Match (US-based only)

  • $200/month Health and Wellness Stipend

  • $400/year Continuing Education Credit

  • $500/year Function Health subscription (US-based only)

  • Free parking for in-office employees

  • Unlimited Approved PTO

  • Parental Leave for Eligible Employees

  • Supplemental Life Insurance


webAI is an Equal Opportunity Employer and does not discriminate against any employee or applicant on the basis of age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We adhere to these principles in all aspects of employment, including recruitment, hiring, training, compensation, promotion, benefits, social and recreational programs, and discipline. In addition, it is the policy of webAI to provide reasonable accommodation to qualified employees who have protected disabilities to the extent required by applicable laws, regulations and ordinances where a particular employee works.

Top Skills

C++
CMake
Metal
MLX
HQ

webAI Austin, Texas, USA Office

515 Congress Ave, Austin, Texas, United States, 78701


What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. Its food and music scene, low taxes and favorable climate have made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center
