Senior Data Engineering Consultant
Infinity Works, Accenture
Senior Engineer
Led and implemented platform improvements that enabled higher scalability and resulted in \$5k+ monthly cost savings
Improved reliability of the services by increasing the unit test coverage
tech: AWS, terraform, Python, Jenkins, Kafka
Senior Engineer
Led a small team on Snowflake migration project for a major UK bank
Automated the environment promotion process, making the process of building data assets quicker and more reliable
Introduced testing to the client and evangelised good practices (peer reviews, version control)
Delivered proof of concepts: near real time ELT to Data Vault, Data Visualisation web app with Streamlit
tech: snowflake, SQL, dbt, javascript, GitHub actions, Streamlit, Snowpark
Lead Instructor
Lead instructor for a 3-month data engineering bootcamp
Trained, coached and mentored 20 career changers on their transition to tech
Acted as a Tech Lead and Product Owner during the 5 weeks final project where each of the 4 squads successfully delivered an end to end data pipeline on AWS
tech: AWS, Python, SQL, Cloudformation, Docker, github Actions
June 2022 - January 2024
Data Engineering Consultant
Infinity Works, Accenture
Built, tested, and deployed an ETL pipeline, enabling reporting for a nation-wide health programme
Contributed production-level code to the serverless system built on AWS
Collaborated with Cloud Architects and Business Analysts to ensure accurate reporting.
Built a range of statistical reports using Tableau
Onboarded and trained junior developers
tech: python, terraform, AWS (lambda, s3, state machines, dynamodb, athena, glue), gitlab CI/CD
June 2021 - June 2022
Data Scientist
NatWest
Used PySpark to engineer big data features for the AML model with an estimated benefit of £566k per annum.
Delivered automated Model performance validation process, introducing CI/CD tools and DevOps practices in Data Science Team.
Developed Python packages that help to get insights from data quicker. Automated Word documents analysis & NLP data pipelines.
Orchestrated processes in development environment configuration (AWS EC2 bootstrapping), achieving significant time savings and 50% cost reduction.
tech: python, PySpark, SQL, git, bash, AWS, data science, machine learning, big data, DevOps
1st Prize Winner of the 2019 Innovation Challenge and the Internal Kaggle Challenge
September 2019 - June 2021
Data Analyst & Data Engineer
Self-employed
Managing a competitor-based pricing project for a client
Collecting and analysing the commercial data from eCommerce websites
Conducted workshops on Business Intelligence software including MS PowerBI
keywords: Python, Web scraping, Automation, Competitive intelligence
October 2018 - March 2019
Data Analyst
Vocalink, a Mastercard company
Provided effective data analytics and reporting to the Vocalink CFO Procurement by extracting and querying data, guiding Procurement Team to improve sourcing decisions, reducing FY spend by £5M.
Optimised the procedures within Procurement Team data analytics, facilitating significant reduction in time spent on monthly updates and increased data integrity.
Proposed a structured plan on automating the MI & Reporting Model across wider Mastercard Group and wrote a Python script automating high volume data analysis (>200k rows).
Introduced dynamic performance visualisation in line with requirements from senior stakeholders.
June 2017 - August 2018