Data Engineer at Tractable AI, Remote then London, £Contract Rate

  • Contract Spy
  • Remote (London, UK)
  • Dec 30, 2020
Duration not stated

Contract Description

Data Engineer - Contract

London, UK


Who are we? 


Welcome to Tractable. We’re an AI company with a mission of transforming breakthrough discoveries in deep supervised and semi-supervised learning into products that actually help people.


Engineering @ Tractable


We are a tight-knit team, taking a data-driven approach to solving real-world machine learning problems. We have a track record of taking cutting-edge research and making it work in real products: we have the necessary resources (in-house labelling and domain experts) and a large volume of data to train our algorithms, so we can focus on solving those big problems and adding value for our customers!


The project you will be working on:


You will be joining our talented team working on our new Data Platform, the tool that will help bring data usage at Tractable to a new level. The Platform will be the backbone of Tractable's internal reporting and of AI model training and improvement.


You’ll be focussing on developing, architecting and scaling our data services, data lakes and ETLs to enable reporting, ad-hoc querying and model training. You will be required to gather context about the wider data space and how our data is collected and processed. Your day-to-day tasks will involve thinking about data models, ETLs, data lakes, data warehouses and feature stores. You will also work closely with our research team to understand how the AI / ML training occurs and what data needs to be provided, so we can maximize accuracy and shorten the data preparation step for new models.


We work in a lean team comprising data engineers and product managers. We continuously learn how to best achieve our mission: we make the best of feedback and experiences brought in by everyone and we look forward to hearing about what you can bring!


The role: 


You'll play a key role in developing our platform, as part of a small but high-performing team. You will join as a Data Engineer and influence the data engineering strategy at Tractable, with plenty of scope and autonomy to make a meaningful impact and grow your skills.


You will:


  • Help build the next-generation data platform at Tractable
  • Make sure that data is secure, reliable and easily accessible across the company by leveraging the latest technologies
  • Build tools for automation, monitoring and alerting of the data pipelines
  • Write complex ETLs in Python/PySpark to generate reports from a variety of sources
  • Build internal tools and libraries for our engineers and internal customers
  • Write Infrastructure as Code using Terraform
  • Collaborate in design and problem-solving sessions
  • Research and implement new tools and technologies in the data space
  • Work with our stakeholders to iterate on our data products
  • Suggest improvements and introduce best practices into the team


Our Tech Stack:


  • AWS - IAM, Glue, Athena, Lambda, SQS, SNS, S3 
  • Python, Apache Spark
  • PostgreSQL, Kafka
  • JSON, Parquet
  • Terraform
  • Airflow


What we’re looking for: 


  • This is a mid-senior level role and we're looking for strong skills in Python and Apache Spark
  • Strong experience developing, architecting and scaling data services 
  • Previous experience introducing best practices or processes into a team and a strong desire to help others succeed
  • Strong architectural design skills and the ability to discuss the merits and trade-offs of any particular design approach or technology
  • Bonus: Experience working with an AWS data stack (AWS Glue, Lambda, S3, Athena)
  • Bonus: Experience using Apache Spark for data processing
  • Bonus: Experience with data streaming middleware (e.g. Kafka, RabbitMQ)
  • Bonus: Experience using Terraform for Infrastructure as Code



Location - Old Street, City of London (we're fully remote until at least Jan 2021 due to COVID; we're ordinarily based in our office in Old Street, London)