Contract Spy
  • Join Contract Spy
  • Member Area
  • Blog
  • Member Sign in
  • Sign up
  • Join Contract Spy
  • Member Area
  • Blog

17 research engineer jobs found in Remote

Refine Search
Current Search
Remote Remote research engineer
Refine by Contract Type
Duration not stated  (15) Hourly/Freelance  (2)
Refine by State
Remote  (17)
Contract Spy
Mar 12, 2026
Duration not stated
Python Developer - Remote $ 120/Hours at YO IT CONSULTING, United Kingdom, £80 - £200 an hour
Contract Spy Remote (United Kingdom)
hiring experienced Python Engineers to support a variety of high-impact research collaborations with leading AI labs. Freelancers will help improve AI systems through work extending coding benchmarks that reflect real-world development across diverse languages and domains. This is a unique opportunity to apply your engineering expertise toward shaping the next generation of intelligent systems. Key Responsibilities Develop and validate coding benchmarks in Python by curating issues, solutions, and test suites from real-world repositories Ensure benchmark tasks include comprehensive unit and integration tests for solution verification Maintain consistency and scalability of benchmark task distribution Provide structured feedback on solution quality and clarity Debug, optimize, and document benchmark code for reliability and reproducibility Ideal Qualifications 3–10 years of experience as a backend software engineer, ML engineer, or...
Contract Spy
Mar 11, 2026
Duration not stated
Senior Front End Developer at Bridebook, Remote, £Competitive Day Rate
Contract Spy Remote
Senior Front-End React Developer At Bridebook, we're digitising the wedding industry! Join us in transforming the wedding planning experience for engaged couples around the world. Full-time Contractor · Remote · Competitive Day Rate Who is Bridebook anyway? Bridebook is a revolutionary wedding planning platform that seeks to make wedding planning an enjoyable, intuitive, and rewarding experience for couples, guests, and industry partners. Launched in the UK, Bridebook has over 1,000,000 planning couples and has been featured in major outlets such as the New York Times, the BBC, and the App Store. After seven years and many iterations later, we have 2 in 3 engaged UK couples and 1 in 3 engaged German couples using Bridebook, and have recently expanded into the French and Irish markets. While we serve a very emotional and creative industry, we are disrupting the wedding industry by leveraging cutting-edge technology to provide seamless and...
Contract Spy
Mar 04, 2026
Hourly/Freelance
Machine Learning Engineer (Python) at Mindrift, Remote, £Freelance Rate
Contract Spy Remote (United Kingdom)
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI. What we do The Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe. About the Role GenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as a Machine Learning expert, you'll have the opportunity to collaborate on these projects. Although every project is unique, you might typically: Design original computational STEM problems that...
Contract Spy
Mar 04, 2026
Duration not stated
Senior Front End Developer at Bridebook, Remote, £Competitive Day Rate
Contract Spy Remote
Senior Front-End React Developer At Bridebook, we're digitising the wedding industry! Join us in transforming the wedding planning experience for engaged couples around the world. Full-time Contractor · Remote · Competitive Day Rate Who is Bridebook anyway? Bridebook is a revolutionary wedding planning platform that seeks to make wedding planning an enjoyable, intuitive, and rewarding experience for couples, guests, and industry partners. Launched in the UK, Bridebook has over 1,000,000 planning couples and has been featured in major outlets such as the New York Times, the BBC, and the App Store. After seven years and many iterations later, we have 2 in 3 engaged UK couples and 1 in 3 engaged German couples using Bridebook, and have recently expanded into the French and Irish markets. While we serve a very emotional and creative industry, we are disrupting the wedding industry by leveraging cutting-edge technology to provide seamless and...
Contract Spy
Feb 26, 2026
Hourly/Freelance
Python Developer at YO IT CONSULTING, Remote, £Hourly Rate
Contract Spy Remote (United Kingdom)
hiring experienced Python Engineers to support a variety of high-impact research collaborations with leading AI labs. Freelancers will help improve AI systems through work extending coding benchmarks that reflect real-world development across diverse languages and domains. This is a unique opportunity to apply your engineering expertise toward shaping the next generation of intelligent systems. Key Responsibilities Develop and validate coding benchmarks in Python by curating issues, solutions, and test suites from real-world repositories Ensure benchmark tasks include comprehensive unit and integration tests for solution verification Maintain consistency and scalability of benchmark task distribution Provide structured feedback on solution quality and clarity Debug, optimize, and document benchmark code for reliability and reproducibility Ideal Qualifications 3–10 years of experience as a backend software engineer, ML engineer,...
Contract Spy
Feb 25, 2026
Duration not stated
Senior Front End Developer at Bridebook, Remote, £Competitive Day Rate
Contract Spy Remote
Senior Front-End React Developer At Bridebook, we're digitising the wedding industry! Join us in transforming the wedding planning experience for engaged couples around the world. Full-time Contractor · Remote · Competitive Day Rate Who is Bridebook anyway? Bridebook is a revolutionary wedding planning platform that seeks to make wedding planning an enjoyable, intuitive, and rewarding experience for couples, guests, and industry partners. Launched in the UK, Bridebook has over 1,000,000 planning couples and has been featured in major outlets such as the New York Times, the BBC, and the App Store. After seven years and many iterations later, we have 2 in 3 engaged UK couples and 1 in 3 engaged German couples using Bridebook, and have recently expanded into the French and Irish markets. While we serve a very emotional and creative industry, we are disrupting the wedding industry by leveraging cutting-edge technology to provide seamless and...
Contract Spy
Mar 12, 2026
Duration not stated
Python Developer - LLM at YO IT CONSULTING, Remote, £Contract Rate
Contract Spy Remote
Engagement Type: Independent Contractor Work Mode: Remote Global Schedule: Part-Time (15–25 hrs/week, flexible up to 40 hrs) Duration: 1–3 Months Immediate Start Role Overview We are hiring experienced Software Engineers (Python-focused) to support high-impact research collaborations with leading AI labs. In this project-based engagement, you will contribute to building training datasets that improve AI model reasoning and problem-solving across real-world coding tasks. This is a unique opportunity to apply your software engineering expertise toward shaping the next generation of intelligent systems. About the Project You will work on SWE-bench–style tasks derived from real open-source repositories. Since frontier AI models do not expose their internal reasoning traces, your role will involve: Reconstructing model reasoning Annotating solution trajectories Inferring thought processes behind...
Contract Spy
Mar 11, 2026
Duration not stated
QA Software Engineer - Code Review Expert at YO IT CONSULTING, Remote, £Contract Rate
Contract Spy Remote
Partnering with a top AI research organization to evaluate and improve how coding assistants reason, act, and communicate during development workflows. We’re seeking technically sharp experts (especially those with experience in code review, testing, or documentation) to assess full transcripts of user–AI coding conversations. This short-term engagement helps shape the future of developer-assisting AI systems. Key Responsibilities Review long-form transcripts between users and AI coding assistants Analyze the AI’s logic, execution, and stated actions in detail Score each transcript using a 10-point rubric across multiple criteria Optionally write brief justifications citing examples from the dialogue Detect mismatches between claims and actions (e.g., saying “I’ll run tests” but not doing so) Ideal Qualifications Top choices: Senior or Staff Engineers with deep code review experience and execution insight QA Engineers...
Contract Spy
Mar 09, 2026
Duration not stated
Creative Software Expert - Remote at YO IT CONSULTING, United Kingdom, £Contract Rate
Contract Spy Remote (United Kingdom)
Location: Remote (must have access to a physical Mac) Fluent Language Skills Required: English Why This Role Exists Supporting a high-priority data collection initiative aimed at improving how AI systems understand complex software interfaces and real-world, multi-step workflows. Current datasets lack the fidelity and expert grounding needed to reflect authentic professional software usage. This project addresses that gap by collecting high-quality screen annotations and screen recordings performed by experienced domain experts working in real digital environments. What You’ll Do Depending on the task phase, you may be asked to complete one or both of the following: Record screen sessions demonstrating specific tasks, accompanied by clear verbal narration explaining each step Annotate screenshots of professional software by drawing precise bounding boxes around relevant UI elements Follow provided staging instructions to...
Contract Spy
Mar 09, 2026
Duration not stated
Software Test Engineer QA at YO IT CONSULTING, Remote, £Contract Rate
Contract Spy Remote
Partnering with a top AI research organization to evaluate and improve how coding assistants reason, act, and communicate during development workflows. We’re seeking technically sharp experts (especially those with experience in code review, testing, or documentation) to assess full transcripts of user–AI coding conversations. This short-term engagement helps shape the future of developer-assisting AI systems. Key Responsibilities Review long-form transcripts between users and AI coding assistants Analyze the AI’s logic, execution, and stated actions in detail Score each transcript using a 10-point rubric across multiple criteria Optionally write brief justifications citing examples from the dialogue Detect mismatches between claims and actions (e.g., saying “I’ll run tests” but not doing so) Ideal Qualifications Top choices: Senior or Staff Engineers with deep code review experience and execution insight QA Engineers...
Contract Spy
Mar 09, 2026
Duration not stated
Software Test Engineer QA at YO IT CONSULTING, Remote, £Contract Rate
Contract Spy Remote
Partnering with a top AI research organization to evaluate and improve how coding assistants reason, act, and communicate during development workflows. We’re seeking technically sharp experts (especially those with experience in code review, testing, or documentation) to assess full transcripts of user–AI coding conversations. This short-term engagement helps shape the future of developer-assisting AI systems. Key Responsibilities Review long-form transcripts between users and AI coding assistants Analyze the AI’s logic, execution, and stated actions in detail Score each transcript using a 10-point rubric across multiple criteria Optionally write brief justifications citing examples from the dialogue Detect mismatches between claims and actions (e.g., saying “I’ll run tests” but not doing so) Ideal Qualifications Top choices: Senior or Staff Engineers with deep code review experience and execution insight QA Engineers...
Contract Spy
Mar 05, 2026
Duration not stated
Python Developer - LLM at YO IT CONSULTING, Remote, £Contractor Rate
Contract Spy Remote
Engagement Type: Independent Contractor Work Mode: Remote | Global Schedule: Part-Time (15–25 hrs/week, flexible up to 40 hrs) Duration: 1–3 Months | Immediate Start Role Overview We are hiring experienced Software Engineers (Python-focused) to support high-impact research collaborations with leading AI labs. In this project-based engagement, you will contribute to building training datasets that improve AI model reasoning and problem-solving across real-world coding tasks. This is a unique opportunity to apply your software engineering expertise toward shaping the next generation of intelligent systems. About the Project You will work on SWE-bench–style tasks derived from real open-source repositories. Since frontier AI models do not expose their internal reasoning traces, your role will involve: Reconstructing model reasoning Annotating solution trajectories Inferring thought processes behind...
Contract Spy
Mar 04, 2026
Duration not stated
QA Software Engineer - Code Review Expert at YO IT CONSULTING, Remote, £Contract Rate
Contract Spy Remote
Partnering with a top AI research organization to evaluate and improve how coding assistants reason, act, and communicate during development workflows. We’re seeking technically sharp experts (especially those with experience in code review, testing, or documentation) to assess full transcripts of user–AI coding conversations. This short-term engagement helps shape the future of developer-assisting AI systems. Key Responsibilities Review long-form transcripts between users and AI coding assistants Analyze the AI’s logic, execution, and stated actions in detail Score each transcript using a 10-point rubric across multiple criteria Optionally write brief justifications citing examples from the dialogue Detect mismatches between claims and actions (e.g., saying “I’ll run tests” but not doing so) Ideal Qualifications Top choices: Senior or Staff Engineers with deep code review experience and execution insight QA Engineers...
Contract Spy
Mar 02, 2026
Duration not stated
Software Test Engineer QA at YO IT CONSULTING, Remote, £Contract Rate
Contract Spy Remote
Partnering with a top AI research organization to evaluate and improve how coding assistants reason, act, and communicate during development workflows. We’re seeking technically sharp experts (especially those with experience in code review, testing, or documentation) to assess full transcripts of user–AI coding conversations. This short-term engagement helps shape the future of developer-assisting AI systems. Key Responsibilities Review long-form transcripts between users and AI coding assistants Analyze the AI’s logic, execution, and stated actions in detail Score each transcript using a 10-point rubric across multiple criteria Optionally write brief justifications citing examples from the dialogue Detect mismatches between claims and actions (e.g., saying “I’ll run tests” but not doing so) Ideal Qualifications Top choices: Senior or Staff Engineers with deep code review experience and execution insight QA Engineers...
Contract Spy
Mar 02, 2026
Duration not stated
Creative Software Expert at YO IT CONSULTING, Remote, £Contractor Rate
Contract Spy Remote (United Kingdom)
Location: Remote (must have access to a physical Mac) Fluent Language Skills Required: English Why This Role Exists Supporting a high-priority data collection initiative aimed at improving how AI systems understand complex software interfaces and real-world, multi-step workflows. Current datasets lack the fidelity and expert grounding needed to reflect authentic professional software usage. This project addresses that gap by collecting high-quality screen annotations and screen recordings performed by experienced domain experts working in real digital environments. What You’ll Do Depending on the task phase, you may be asked to complete one or both of the following: Record screen sessions demonstrating specific tasks, accompanied by clear verbal narration explaining each step Annotate screenshots of professional software by drawing precise bounding boxes around relevant UI elements Follow provided staging...
Contract Spy
Feb 27, 2026
Duration not stated
Python Developer - LLM at YO IT CONSULTING, Remote, £Contractor Rate
Contract Spy Remote
Engagement Type: Independent Contractor Work Mode: Remote Global Schedule: Part-Time (15–25 hrs/week, flexible up to 40 hrs) Duration: 1–3 Months Immediate Start Role Overview We are hiring experienced Software Engineers (Python-focused) to support high-impact research collaborations with leading AI labs. In this project-based engagement, you will contribute to building training datasets that improve AI model reasoning and problem-solving across real-world coding tasks. This is a unique opportunity to apply your software engineering expertise toward shaping the next generation of intelligent systems. About the Project You will work on SWE-bench–style tasks derived from real open-source repositories. Since frontier AI models do not expose their internal reasoning traces, your role will involve: Reconstructing model reasoning Annotating solution trajectories Inferring thought processes...
Contract Spy
Feb 25, 2026
Duration not stated
QA Software Engineer - Code Review Expert at YO IT CONSULTING, Remote, £Contract Rate
Contract Spy Remote
Partnering with a top AI research organization to evaluate and improve how coding assistants reason, act, and communicate during development workflows. We’re seeking technically sharp experts (especially those with experience in code review, testing, or documentation) to assess full transcripts of user–AI coding conversations. This short-term engagement helps shape the future of developer-assisting AI systems. Key Responsibilities Review long-form transcripts between users and AI coding assistants Analyze the AI’s logic, execution, and stated actions in detail Score each transcript using a 10-point rubric across multiple criteria Optionally write brief justifications citing examples from the dialogue Detect mismatches between claims and actions (e.g., saying “I’ll run tests” but not doing so) Ideal Qualifications Top choices: Senior or Staff Engineers with deep code review experience and execution insight QA Engineers...
  • Contact
  • How it Works
  • Terms & Conditions
  • Help on Search & Alerts
  • Employer
  • Post a Contract Job
  • Employer Sign in
  • Contractor
  • Subscribe
  • Twitter
  • LinkedIn
  • Facebook
© 2008-2026 ContractSpy