Careerflow.ai Logo

Careerflow.ai

AI/ML Software Engineer (RL Environments) (Contract)

Posted 7 Days Ago
Remote
Hiring Remotely in United States
Mid level
Remote
Hiring Remotely in United States
Mid level
Design and build reinforcement-learning training environments and diverse tasks to evaluate and improve LLM agents; iterate rapidly on task designs from customer feedback, deliver high-quality outputs with minimal supervision, and maintain PST overlap for collaboration.
The summary above was generated by AI
About the Role

We're seeking experienced Machine Learning Engineers and Software Engineers with ML experience to design and build high-quality RL training environments for LLM agents. As an RL Environment Engineer, you'll create diverse machine learning tasks that challenge and improve language models, working with minimal supervision to deliver consistent, quality outputs.

What You'll Do
  • Design and build tasks for machine learning domains that target specific language models and difficulty distributions

  • Iterate rapidly on task designs based on customer feedback, with 24-hour turnaround times

  • Create diverse, challenging scenarios that test language model capabilities and expose their limitations

  • Hit the ground running with minimal onboarding time

What We're Looking For
  • Strong machine learning background through coursework, previous work experience, or personal projects

  • Python fluency: you write clean, efficient Python code regularly

  • Heavy LLM user who understands current model capabilities and failure modes through daily hands-on experience

  • Self-directed and creative. You can generate novel ML task ideas in your domain without constant guidance

  • High responsibility and integrity. You deliver quality work consistently and meet deadlines

  • Availability overlap with PST 9am-5pm (minimum 3 hours required)

Work Details
  • Location: Remote

  • Type: Contractor

Time Commitment: 40 hours a week. Must have at least 3 hours of overlap with PST business hours (9am-5pm)

Selection Process:
  1. Screening

  2. Hacker rank assessment

  3. 1 Week paid task

  4. Full time

Similar Jobs

An Hour Ago
Remote
United States
100K-160K Annually
Entry level
100K-160K Annually
Entry level
Artificial Intelligence • Blockchain • Professional Services • Security • Consulting • Cybersecurity • Defense
Perform hands-on application and system security assessments: discover and validate vulnerabilities, develop proof-of-concepts and custom tooling, conduct threat modeling and architecture reviews, and communicate clear remediation guidance to clients while contributing to security research.
Top Skills: AslrCC++CfiDepGoJavaScriptPythonRustTypescript
2 Hours Ago
Remote or Hybrid
140K-165K Annually
Senior level
140K-165K Annually
Senior level
AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Create reusable "paved paths" (documentation, reference architectures, IaC modules, code templates, and tools) to simplify building on enterprise platforms. Partner with architects and platform teams, develop and maintain templates and AI-assisted developer workflows, gather feedback from application teams, and iterate to maximize usability and adoption across a large, federated engineering organization.
Top Skills: Agent-Based ToolsAWSAzureCi/CdCloudformation (Cft)GCPInfrastructure As Code (Iac)Internal Developer AssistantsPrompt EngineeringPulumiTerraform
4 Hours Ago
Remote
United States
155K-170K Annually
Senior level
155K-170K Annually
Senior level
Software
The role involves leading projects as a full-stack engineer, focusing on SaaS products, enhancing user experiences, and building accessible software.
Top Skills: CSSHTMLPostgresTypescript

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account