Dynatrace

Senior/Principal Engineer (m/f/x) for Evaluations Generative AI

Posted 13 Hours Ago
Remote or Hybrid
Hiring Remotely in Boston, MA
74K-112K Annually
Senior level
Design and build evaluation and simulation systems for generative AI agents using Dynatrace observability data. Create large-scale simulation pipelines, define metrics/datasets/judging strategies, build developer CLIs, generate adversarial scenarios, and measure agent tool-use and failure modes. Prototype, run feedback cycles, set technical strategy, and mentor engineers.
The summary above was generated by AI
Your role at Dynatrace

Most AI developer tools operate without any knowledge of how software actually behaves in production. Dynatrace is in a unique position to change that.

We're looking for a Senior or Principal Generative AI Engineer to design and build the evaluation and simulation capabilities at the core of our product. You'll work across the stack, from CLI tooling that engineers run locally, to large-scale simulation pipelines, to LLM-as-a-judge evaluation frameworks running against real Dynatrace AI Observability data.

This role sits inside Dynatrace's Engineering organization and works closely with product, design, and the platform teams that power Dynatrace's AI-observability stack.

Your responsibilities:

  • Conduct research in the field of Generative AI
  • Design and build systems that let users replay and stress-test AI Agents at scale. Detect regressions across model versions, prompt changes, and data drift. Define the metrics, datasets, and judging strategies that make results trustworthy.
  • Build infrastructure to simulate multi-turn, tool-using agents in realistic environments. Generate adversarial scenarios, measure task completion, tool-use correctness, and failure modes. Help teams ship agents with confidence.
  • Own developer-facing CLIs that run evaluations on top of Dynatrace AI Observability data, from trace ingestion to judge configuration to reporting. Make them the tools AI engineers reach for first when debugging production behavior.
  • Prototype quickly, run user feedback cycles, and ship to production
  • Define technical strategy for the team's AI systems, set architectural direction, and mentor other engineers
  • Collaborate with product and design to identify which developer problems are most worth solving
What will help you succeed
  • 5+ years (Senior) or 10+ years (Principal) of professional software engineering experience
  • Demonstrated experience shipping production systems that use LLMs, including prompting, tool calling, evaluation, and iteration
  • Strong foundation in at least one of: developer tooling (IDEs, compilers, static analysis, code intelligence), AI/ML engineering, or large-scale distributed systems
  • Hands-on experience with agentic patterns: planning, tool use, retrieval, memory management
  • Ability to evaluate and critique AI-generated output. You understand why a model is wrong, not just that it is.
  • Clear communication with cross-functional partners across product and engineering
  • Background in observability, APM, or infrastructure monitoring
  • Familiarity with engineering platforms at scale: CI/CD systems, developer portals, internal tooling
  • Hands-on experience with LLMs: prompt engineering, evaluation frameworks (e.g. LLM-as-a-judge, golden datasets, pairwise comparisons), or agent frameworks.
Why you will love being a Dynatracer
  • Dynatrace is a leader in unified observability and security.
  • We provide a culture of excellence with competitive compensation packages designed to recognize and reward performance.
  • Our employees work with the largest cloud providers, including AWS, Microsoft, and Google Cloud, and other leading partners worldwide to create strategic alliances.
  • You'll get to work at the forefront of innovation with Dynatrace Intelligence, the industry's first agentic operations system. Bringing together deterministic and agentic AI, it helps teams understand what's happening, why it matters, and what to do next, automatically.
  • Over 50% of the Fortune 100 companies are current customers of Dynatrace.
Compensation and Rewards
  • We offer attractive compensation packages and stock purchase options with numerous benefits and advantages.
  • For legal reasons, we are obliged to list a salary range for this position: €74,000 to €112,000 gross per year, based on full-time employment (38.5 h/week). We’ve listed the salary range for transparency, but if your experience and skills bring unique value, we’d still love to hear from you. Please apply even if you’re outside the range.
Equal Employment Opportunity

Dynatrace provides equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, veteran status, or any other protected characteristic. We actively foster an inclusive workplace that celebrates differences and promotes accessibility, collaboration, and growth for all.
