Jakub Gajdul

Data Consulting · jakub.gajdul@gmail.com

Versatile data engineer with experience in data engineering, BI, cloud, and DevOps. Proven track record of technical delivery on multiple projects and accounts. Combines strong technical skills with understanding the business and clear communication.


Experience

Founder, Data Consultant

Jakub Gajdul
  • I provide technical expertise as Big Data Engineer on a large scale project using Databricks and AWS.
  • October 2024 - now

    Cofounder

  • We help companies get the most value out of data to grow their business and make better informed decisions.
  • Data engineering consulting.
  • Case studies: Forex data pipeline (GCP Bigquery, terraform), Ecommerce Sales Data Dashboard (Tableau)
  • slothdrop.io
  • Built instagram automation platform on serverless AWS stack
  • Implemented GPT-4 Turbo with Vision model from OpenAI for image labelling in production
  • January 2024 - now

    Senior Data Engineering Consultant

    Infinity Works, Accenture
    Senior Engineer
  • Led and implemented platform improvements that enabled higher scalability and resulted in \$5k+ monthly cost savings
  • Improved reliability of the services by increasing the unit test coverage
  • tech: AWS, terraform, Python, Jenkins, Kafka

    Senior Engineer
  • Led a small team on Snowflake migration project for a major UK bank
  • Automated the environment promotion process, making the process of building data assets quicker and more reliable
  • Introduced testing to the client and evangelised good practices (peer reviews, version control)
  • Delivered proof of concepts: near real time ELT to Data Vault, Data Visualisation web app with Streamlit
  • tech: snowflake, SQL, dbt, javascript, GitHub actions, Streamlit, Snowpark

    Lead Instructor
  • Lead instructor for a 3-month data engineering bootcamp
  • Trained, coached and mentored 20 career changers on their transition to tech
  • Acted as a Tech Lead and Product Owner during the 5 weeks final project where each of the 4 squads successfully delivered an end to end data pipeline on AWS
  • tech: AWS, Python, SQL, Cloudformation, Docker, github Actions

    June 2022 - January 2024

    Data Engineering Consultant

    Infinity Works, Accenture
  • Built, tested, and deployed an ETL pipeline, enabling reporting for a nation-wide health programme
  • Contributed production-level code to the serverless system built on AWS
  • Collaborated with Cloud Architects and Business Analysts to ensure accurate reporting.
  • Built a range of statistical reports using Tableau
  • Onboarded and trained junior developers
  • tech: python, terraform, AWS (lambda, s3, state machines, dynamodb, athena, glue), gitlab CI/CD

    June 2021 - June 2022

    Data Scientist

    NatWest
  • Used PySpark to engineer big data features for the AML model with an estimated benefit of £566k per annum.
  • Delivered automated Model performance validation process, introducing CI/CD tools and DevOps practices in Data Science Team.
  • Developed Python packages that help to get insights from data quicker. Automated Word documents analysis & NLP data pipelines.
  • Orchestrated processes in development environment configuration (AWS EC2 bootstrapping), achieving significant time savings and 50% cost reduction.
  • tech: python, PySpark, SQL, git, bash, AWS, data science, machine learning, big data, DevOps

  • 1st Prize Winner of the 2019 Innovation Challenge and the Internal Kaggle Challenge
  • September 2019 - June 2021

    Data Analyst & Data Engineer

    Self-employed
  • Managing a competitor-based pricing project for a client
  • Collecting and analysing the commercial data from eCommerce websites
  • Conducted workshops on Business Intelligence software including MS PowerBI
  • keywords: Python, Web scraping, Automation, Competitive intelligence

    October 2018 - March 2019

    Data Analyst

    Vocalink, a Mastercard company
  • Provided effective data analytics and reporting to the Vocalink CFO Procurement by extracting and querying data, guiding Procurement Team to improve sourcing decisions, reducing FY spend by £5M.
  • Optimised the procedures within Procurement Team data analytics, facilitating significant reduction in time spent on monthly updates and increased data integrity.
  • Proposed a structured plan on automating the MI & Reporting Model across wider Mastercard Group and wrote a Python script automating high volume data analysis (>200k rows).
  • Introduced dynamic performance visualisation in line with requirements from senior stakeholders.
  • June 2017 - August 2018

    Education

    Udacity

    AWS Cloud DevOps Engineer nanodegree
  • Cloud fundamentals: Cloudfront & S3
  • Infrastructure as code: Cloudformation
  • Full stack web application CI/CD: CircleCI, Ansible, Prometheus
  • Machine learning model as API: Docker, Kubernetes
  • Flask web application CI/CD
  • 2021

    Udacity

    Data Scientist Nanodegree
  • 5 Personality Test- analysis of the results for European countries
  • Disaster Response Data Engineering & Machine Learning Pipelines (including a Flask web app with interactive visualisations)
  • Personalised recommendations with IBM
  • Churn prediction on music streaming platform
  • 2020

    Coventry University

    Financial Economics
  • Advanced Economic issues 83.5%
  • Investment Analysis 83.5%
  • Advanced Issues in Banking 75%
  • Dissertation: The Relationship Between the Treasury Yield Spreads and the Unemployment Rate: the Analysis of Historical U.S. Data (1982-2019) 74%
  • 2015- 2019

    Skills

    Programming Languages & Tools
    Workflow
    • OOP Programming
    • Big Data & Distributed Computing
    • Cloud Engineering
    • Machine Learning
    • Infrastructure as a code & CI/CD
    • Business Inteligence & Dashboards


    Awards & Certifications

    • AWS Certified Solutions Architect – Associate
    • Udacity Cloud DevOps engineer nanodegree
    • Udacity Data Science nanodegree
    • Databricks Lakehouse Fundamentals
    • Google Analytics 4 Certification
    • 1st Prize Winner of the 2019 Innovation Challenge at NatWest
    • 1st Prize Winner of Internal Kaggle Challenge at NatWest