xAI

RL Environments Specialist

Reposted 5 Days Ago

Be an Early Applicant

Easy Apply

Remote

Hiring Remotely in USA

100-200 Hourly

Mid level

Easy Apply

Remote

Hiring Remotely in USA

100-200 Hourly

Mid level

Create full reinforcement learning environments, including UI and backend, and manage task creation and validation processes for training AI agents.

The summary above was generated by AI

About xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

About the Role

We need talented engineers that will create full RL environments (UI, backend, programmatically generate tasks and validation) for training computer use agents. This means that we need you to take ownership of the entire task creation process for a given environment.

In this role, you will

Build sandbox UIs that our agents and RL actors will interact with.
Create tasks for built environments and programmatically validate task completion.
Enjoys working remotely

Qualifications

Strong professional experience with React.js (hooks, modern state management, TypeScript preferred) — required
Strong professional experience building backend services in Python (FastAPI, Flask, or Django) — required
Hands-on experience with containerization (Docker required; Docker Compose/Kubernetes a plus)
Strong front-end design skills and exceptionally high taste in UI/UX, polish, and visual detail
Proven ability to design a relational database schema in Python and populate it with large-scale, realistic mock data
Experience creating and exposing clean, well-documented API endpoints (REST or GraphQL)
Exceedingly high standards for code quality, readability, testing, and front-end craftsmanship
Extensive day-to-day experience using coding agents / AI assistants as a power user (Cursor, Claude, Copilot, Grok, Aider, etc.)
Good understanding of the Reinforcement Learning paradigm (RLHF, PPO, DPO, reward modeling, etc.)

Preferred Qualifications

Posses strong logical reasoning skills, is detail-oriented, and thrives in a fast-paced work environment.
Eager to teach to and learn from teammates.
Enthusiasm to collaboratively build the best truth-seeking AI out there!

Interview Process

Technical hands-on live coding round
Hiring Manager / Final interview round

Location & Other Expectations

Tutor roles may be offered as full-time, part-time, or contractor positions, depending on role needs and candidate fit. They may be performed remotely from any location worldwide, subject to legal eligibility, time-zone compatibility, and role specific needs.
For US based candidates, please note we are unable to hire in the states of Wyoming and Illinois at this time.
We are unable to provide visa sponsorship.
For those who will be working from a personal device, your computer must be a Chromebook, Mac with MacOS 11.0 or later, or Windows 10 or later.

Compensation

US based candidates: $35/hour - $100/hour depending on factors including relevant experience, skills, education, geographic location, and qualifications.

International candidates: Information will be provided to you during the recruitment process.

Benefits

Benefits vary based on employment type, location and jurisdiction. Benefits for eligible U.S. based positions include health insurance, 401(k) plan, and paid sick leave. Specific details and role specific information will be provided to you during the interview process.

xAI is an equal opportunity employer. For details on data processing, view our Recruitment Privacy Notice.

Top Skills

Containerization

Python

Similar Jobs

Cox Enterprises

Sales Representative

A Minute Ago

Remote or Hybrid

Wisconsin, USA

45K-88K Annually

Senior level

45K-88K Annually

Senior level

Automotive • Cloud • Greentech • Information Technology • Other • Software • Cybersecurity

The Senior Field Sales Representative develops new business and manages relationships in a defined territory for NextGear Capital, achieving sales targets and customer satisfaction.

Top Skills: Microsoft Salesforce

CrowdStrike

Marketing Manager

4 Minutes Ago

Remote or Hybrid

USA

125K-180K Annually

Senior level

125K-180K Annually

Senior level

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity

The Senior Technical Marketing Manager for Cloud Security drives product positioning and messaging, creates technical content, and collaborates across teams to enhance market success and customer adoption of cloud security solutions.

Top Skills: AWSAzureCi/CdCiemCloud Detection And ResponseCloud SecurityContainer SecurityCspmCwppCybersecurityDevsecopsGCPIacKubernetes SecurityRuntime Protection

CrowdStrike

Manager, Federal Civilian Sales Engineering (Remote)

4 Minutes Ago

Remote or Hybrid

VA, USA

135K-205K Annually

Senior level

135K-205K Annually

Senior level

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity

The Manager of Federal Civilian Sales Engineering leads a team of Sales Engineers, guiding pre-sales efforts and aligning solutions with federal requirements. They foster collaboration and mentor talent while presenting to diverse audiences.

Top Skills: AvAWSAzureBashEdrFirewallForensicsGCPHips/IdsIncident ResponseLinuxmacOSPowershellPythonSIEMWindows

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center