Jakub Gajdul

Data Consulting · jakub.gajdul@gmail.com

Versatile data consultant with experience in engineering, analytics, cloud, and DevOps.
I combine strong technical skills with understanding the business and clear communication.
I've worked at leading companies including Mastercard, NatWest, and Accenture.


Experience

Founder, Data Consultant

Jakub Gajdul
  • I provide technical expertise as Big Data Engineer on a large scale media project.
  • Building scalable big data processes on Databricks, transforming data into actionable insights
  • October 2024 - now

    Cofounder

  • We help companies get the most value out of data to grow their business and make better informed decisions.
  • Data engineering consulting.
  • Case studies: Forex data pipeline (GCP Bigquery, terraform), Ecommerce Sales Data Dashboard (Tableau)
  • slothdrop.io
  • Built instagram automation platform on serverless AWS stack
  • Implemented GPT-4 Turbo with Vision model from OpenAI for image labelling in production
  • January 2024 - now

    Senior Data Engineering Consultant

    Infinity Works, Accenture
    Senior Engineer
  • Led and implemented platform improvements that enabled higher scalability and resulted in $5k+ monthly cost savings
  • Improved reliability of the services by increasing the unit test coverage
  • tech: AWS, terraform, Python, Jenkins, Kafka

    Senior Engineer
  • Led a small team on Snowflake migration project for a major UK bank
  • Automated the environment promotion process, making the process of building data assets quicker and more reliable
  • Introduced testing to the client and evangelised good practices (peer reviews, version control)
  • Delivered proof of concepts: near real time ELT to Data Vault, Data Visualisation web app with Streamlit
  • tech: snowflake, SQL, dbt, javascript, GitHub actions, Streamlit, Snowpark

    Lead Instructor
  • Lead instructor for a 3-month data engineering bootcamp
  • Trained, coached and mentored 20 career changers on their transition to tech
  • Acted as a Tech Lead and Product Owner during the 5 weeks final project where each of the 4 squads successfully delivered an end to end data pipeline on AWS
  • tech: AWS, Python, SQL, Cloudformation, Docker, github Actions

    June 2022 - January 2024

    Data Engineering Consultant

    Infinity Works, Accenture
  • Built, tested, and deployed an ETL pipeline, enabling reporting for a nation-wide health programme
  • Contributed production-level code to the serverless system built on AWS
  • Collaborated with Cloud Architects and Business Analysts to ensure accurate reporting.
  • Built a range of statistical reports using Tableau
  • Onboarded and trained junior developers
  • tech: python, terraform, AWS (lambda, s3, state machines, dynamodb, athena, glue), gitlab CI/CD

    June 2021 - June 2022

    Data Scientist

    NatWest
  • Used PySpark to engineer big data features for the AML model with an estimated benefit of £566k per annum.
  • Delivered automated Model performance validation process, introducing CI/CD tools and DevOps practices in Data Science Team.
  • Developed Python packages that help to get insights from data quicker. Automated Word documents analysis & NLP data pipelines.
  • Orchestrated processes in development environment configuration (AWS EC2 bootstrapping), achieving significant time savings and 50% cost reduction.
  • tech: python, PySpark, SQL, git, bash, AWS, data science, machine learning, big data, DevOps

  • 1st Prize Winner of the 2019 Innovation Challenge and the Internal Kaggle Challenge
  • September 2019 - June 2021

    Data Analyst & Data Engineer

    Self-employed
  • Managing a competitor-based pricing project for a client
  • Collecting and analysing the commercial data from eCommerce websites
  • Conducted workshops on Business Intelligence software including MS PowerBI
  • keywords: Python, Web scraping, Automation, Competitive intelligence

    October 2018 - March 2019

    Data Analyst

    Vocalink, a Mastercard company
  • Provided effective data analytics and reporting to the Vocalink CFO Procurement by extracting and querying data, guiding Procurement Team to improve sourcing decisions, reducing FY spend by £5M.
  • Optimised the procedures within Procurement Team data analytics, facilitating significant reduction in time spent on monthly updates and increased data integrity.
  • Proposed a structured plan on automating the MI & Reporting Model across wider Mastercard Group and wrote a Python script automating high volume data analysis (>200k rows).
  • Introduced dynamic performance visualisation in line with requirements from senior stakeholders.
  • June 2017 - August 2018

    Education

    Udacity

    AWS Cloud DevOps Engineer nanodegree
  • Cloud fundamentals: Cloudfront & S3
  • Infrastructure as code: Cloudformation
  • Full stack web application CI/CD: CircleCI, Ansible, Prometheus
  • Machine learning model as API: Docker, Kubernetes
  • Flask web application CI/CD
  • 2021

    Udacity

    Data Scientist Nanodegree
  • 5 Personality Test- analysis of the results for European countries
  • Disaster Response Data Engineering & Machine Learning Pipelines (including a Flask web app with interactive visualisations)
  • Personalised recommendations with IBM
  • Churn prediction on music streaming platform
  • 2020

    Coventry University

    Financial Economics
  • Advanced Economic issues 83.5%
  • Investment Analysis 83.5%
  • Advanced Issues in Banking 75%
  • Dissertation: The Relationship Between the Treasury Yield Spreads and the Unemployment Rate: the Analysis of Historical U.S. Data (1982-2019) 74%
  • 2015- 2019

    Skills

    Programming Languages & Tools
    Workflow
    • OOP Programming
    • Big Data & Distributed Computing
    • Cloud Engineering
    • Machine Learning
    • Infrastructure as a code & CI/CD
    • Business Inteligence & Dashboards


    Awards & Certifications

    • AWS Certified Solutions Architect – Associate
    • Udacity Cloud DevOps engineer nanodegree
    • Udacity Data Science nanodegree
    • Databricks Lakehouse Fundamentals
    • Google Analytics 4 Certification
    • 1st Prize Winner of the 2019 Innovation Challenge at NatWest
    • 1st Prize Winner of Internal Kaggle Challenge at NatWest