Contract Spy
Remote (United Kingdom)
Work Mode: Remote Engagement Type: Independent Contractor Schedule: Full-Time or Part-Time Contract
Language Requirement: Fluent English
Role :
Partners with leading AI teams to improve the quality, usefulness, and reliability of general-purpose conversational AI systems. These systems are used across a wide range of everyday and professional scenarios, and their effectiveness depends on how clearly, accurately, and helpfully they respond to real user questions.
In engineering-related contexts, conversational AI systems must demonstrate accurate applied reasoning, quantitative precision, and practical problem-solving aligned with real-world systems. This project focuses on evaluating and improving how models reason about and explain engineering concepts across multiple disciplines.
What You’ll Do
Write and refine prompts to guide model behavior in engineering scenarios
Evaluate LLM-generated responses to...