Hi, I'm YOUSEF MAHMOUD

Data Engineer

Building scalable ETL pipelines and modern Data Warehouses. I leverage a strong CS & AI foundation to optimize data architectures using Apache Airflow, Spark, and Snowflake.

About Me

I am a Computer Science graduate passionate about data engineering and automation. I recently completed an intensive training program at ITI, where I gained hands-on experience designing robust data solutions. My expertise lies in bridging software engineering and data infrastructure, using tools like Docker and Linux to build efficient, automated workflows. I am driven by the challenge of transforming raw data into actionable business insights.

Featured Projects

Real-Time NYC Transit Analytics

Real-time streaming platform for monitoring the NYC transit system, built with Apache Kafka, Spark Streaming, and interactive dashboards. Processes 10M+ transit events per hour with under 500 ms latency.

Apache Kafka
Spark Streaming
PostgreSQL
Streamlit
Real-Time Processing
<500ms Latency
10M+ Events/Hour
Live Dashboard

Repo: github.com/Y0U5F/Real-Time_NYC_Transit_Monitoring

View Real-Time Analytics Project →
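
As an illustration of how a latency target like the one quoted for this project could be checked, here is a minimal sketch in plain Python. The `TransitEvent` fields, the nearest-rank 95th percentile, and the helper names are assumptions made for this example and are not taken from the repository.

```python
import math
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class TransitEvent:
    """Hypothetical event shape; the field names are illustrative only."""
    vehicle_id: str
    produced_at_ms: int   # epoch milliseconds when the feed emitted the update
    processed_at_ms: int  # epoch milliseconds when the pipeline finished with it


def latency_summary(events: Iterable[TransitEvent]) -> dict:
    """Summarize end-to-end latency (processed minus produced) for a batch."""
    latencies = sorted(e.processed_at_ms - e.produced_at_ms for e in events)
    if not latencies:
        raise ValueError("at least one event is required")
    # Nearest-rank 95th percentile: smallest value with >= 95% of samples at or below it.
    rank = math.ceil(0.95 * len(latencies))
    return {
        "count": len(latencies),
        "mean_ms": sum(latencies) / len(latencies),
        "p95_ms": latencies[rank - 1],
        "max_ms": latencies[-1],
    }


def meets_latency_target(events: Iterable[TransitEvent], target_ms: int = 500) -> bool:
    """Return True when the batch's p95 latency is strictly below the target."""
    return latency_summary(events)["p95_ms"] < target_ms
```

Checking a percentile rather than the mean is a design choice for this sketch: a few slow events can hide behind a healthy average, so the p95 gives a stricter view of the latency claim.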

Olist E-commerce Data Warehouse

Architected a scalable ELT pipeline processing 100K+ records of Brazilian e-commerce data using a Medallion Architecture. Orchestrated daily workflows with Apache Airflow and added 26+ automated data quality tests in dbt.

Snowflake
Apache Airflow
dbt
Power BI
Medallion Architecture
26+ Data Quality Tests
Star Schema Design
Business Intelligence

Repo: github.com/Y0U5F/Olist_ETL_Project

View Enterprise DWH Project →
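
The Medallion layering mentioned for this project can be sketched in plain Python as three steps: raw rows (bronze), cleaned rows (silver), and an aggregate (gold). This is a toy illustration, not the project's Snowflake or dbt code; the column names `order_id`, `customer_state`, and `price`, and all function names, are assumptions for the example. The final helper mimics the idea behind dbt's built-in `unique` and `not_null` tests.

```python
from collections import defaultdict


def to_silver(bronze_rows):
    """Clean raw rows: drop missing or duplicate ids, cast price, normalize state."""
    seen = set()
    silver = []
    for row in bronze_rows:
        order_id = row.get("order_id")
        if not order_id or order_id in seen:
            continue  # skip rows without an id and repeated ids
        try:
            price = float(row["price"])
        except (KeyError, TypeError, ValueError):
            continue  # skip rows whose price is missing or not numeric
        state = (row.get("customer_state") or "").strip().upper() or "UNKNOWN"
        seen.add(order_id)
        silver.append({"order_id": order_id, "customer_state": state, "price": price})
    return silver


def to_gold(silver_rows):
    """Aggregate cleaned rows into revenue per customer state."""
    revenue = defaultdict(float)
    for row in silver_rows:
        revenue[row["customer_state"]] += row["price"]
    return dict(revenue)


def is_unique_and_not_null(rows, column):
    """Pass only when every row has a non-null value and no value repeats."""
    values = [row.get(column) for row in rows]
    return all(v is not None for v in values) and len(values) == len(set(values))
```

Keeping each layer as a separate function mirrors the point of the pattern: quality checks can run on the silver output before anything is aggregated into gold.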

Hollywood in the Cloud

Serverless data pipeline on AWS analyzing 100+ years of US Box Office data (1902-2024) using Lambda, S3, Glue, Athena, and Power BI.

AWS Lambda
AWS S3
AWS Athena
Power BI
Serverless Architecture
Event-Driven ETL
Data Lake Design
BI Dashboards

Repo: github.com/Y0U5F/Hollywood_in-the-Cloud

View Project Details →
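
For readers unfamiliar with event-driven ETL on AWS, the sketch below shows how a Lambda handler might read an S3 event notification and pick out the uploaded CSV files. It uses only the standard library and does not call AWS; the handler's return shape and the `.csv` filter are assumptions for this example, not the project's actual code. Object keys in S3 event notifications arrive URL-encoded, which is why the key is decoded with `unquote_plus`.

```python
from urllib.parse import unquote_plus


def extract_s3_objects(event):
    """Return (bucket, key) pairs from an S3 event notification payload."""
    objects = []
    for record in event.get("Records", []):
        s3 = record.get("s3", {})
        bucket = s3.get("bucket", {}).get("name")
        key = s3.get("object", {}).get("key")
        if bucket and key:
            # Keys are URL-encoded in the notification (spaces arrive as '+').
            objects.append((bucket, unquote_plus(key)))
    return objects


def lambda_handler(event, context):
    """Hypothetical entry point: report which uploaded objects are CSV files."""
    objects = extract_s3_objects(event)
    csv_keys = [key for _, key in objects if key.lower().endswith(".csv")]
    return {"received": len(objects), "csv_keys": csv_keys}
```

In a real pipeline the handler would go on to transform or catalog each file; the sketch stops at parsing so that it can run locally with a hand-built event dictionary.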

ITI Examination System

Engineered a normalized relational database using SQL Server (T-SQL) to manage students, courses, and exams with automated processes.

SQL Server
T-SQL
SSMS
Docker
Stored Procedures
RBAC System
Audit Triggers
90% Work Reduction

Repo: github.com/Y0U5F/CTRL-EXAM

View Database Project →
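
The audit-trigger idea listed for this project can be illustrated with a small runnable sketch. It uses SQLite through Python's standard library instead of SQL Server and T-SQL, so the trigger syntax differs from the project's; the table names, columns, and helper functions are hypothetical.

```python
import sqlite3

# Hypothetical schema: one results table, one audit table, one trigger.
SCHEMA = """
CREATE TABLE exam_results (
    id INTEGER PRIMARY KEY,
    student_id INTEGER NOT NULL,
    grade INTEGER NOT NULL
);
CREATE TABLE grade_audit (
    audit_id INTEGER PRIMARY KEY AUTOINCREMENT,
    result_id INTEGER NOT NULL,
    old_grade INTEGER NOT NULL,
    new_grade INTEGER NOT NULL,
    changed_at TEXT NOT NULL DEFAULT CURRENT_TIMESTAMP
);
CREATE TRIGGER trg_grade_audit
AFTER UPDATE OF grade ON exam_results
FOR EACH ROW
WHEN OLD.grade <> NEW.grade
BEGIN
    INSERT INTO grade_audit (result_id, old_grade, new_grade)
    VALUES (OLD.id, OLD.grade, NEW.grade);
END;
"""


def build_demo_db():
    """Create an in-memory database with the schema and trigger above."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    return conn


def update_grade(conn, result_id, new_grade):
    """Change a grade; the trigger records the change without extra application code."""
    with conn:
        conn.execute(
            "UPDATE exam_results SET grade = ? WHERE id = ?",
            (new_grade, result_id),
        )


def audit_trail(conn, result_id):
    """Return (old_grade, new_grade) pairs for one result, oldest first."""
    rows = conn.execute(
        "SELECT old_grade, new_grade FROM grade_audit "
        "WHERE result_id = ? ORDER BY audit_id",
        (result_id,),
    )
    return rows.fetchall()
```

Putting the audit logic in a trigger means every grade change is logged regardless of which client or stored procedure performs the update.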

Skinca - AI Skin Diagnosis

Mobile healthcare app for skin disease diagnosis, combining AI-based image analysis with an intelligent chatbot. Built with Flutter and Firebase.

Flutter
Firebase
TensorFlow
Mobile App
AI Image Diagnosis
Chatbot Integration
Doctor Search
Cross-Platform

🎓 Graduation Project - Beni-Suef University 2024

View Project Details →

Library Management System

Python desktop application for library record management with Tkinter GUI, SQLite persistence, and OOP design patterns.

Python
Tkinter
SQLite
OOP
CRUD Operations
Check-in/Check-out
Catalog Search
Class Inheritance

🐍 Python Desktop Application

View Project Details →
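
A minimal sketch of how check-in, check-out, and catalog search could sit on top of SQLite with a small class hierarchy is shown below. It omits the Tkinter GUI, and the class names, table layout, and method names are assumptions for this example, not the application's actual design.

```python
import sqlite3


class Repository:
    """Base class that owns the database connection."""

    def __init__(self, conn):
        self.conn = conn


class LibraryCatalog(Repository):
    """Hypothetical catalog that inherits connection handling from Repository."""

    def __init__(self, conn):
        super().__init__(conn)
        self.conn.execute(
            "CREATE TABLE IF NOT EXISTS books ("
            "id INTEGER PRIMARY KEY, title TEXT NOT NULL, "
            "author TEXT NOT NULL, available INTEGER NOT NULL DEFAULT 1)"
        )

    def add_book(self, title, author):
        """Insert a book and return its new id."""
        with self.conn:
            cur = self.conn.execute(
                "INSERT INTO books (title, author) VALUES (?, ?)", (title, author)
            )
        return cur.lastrowid

    def search(self, term):
        """Find books whose title or author contains the term."""
        pattern = f"%{term}%"
        rows = self.conn.execute(
            "SELECT id, title, author FROM books "
            "WHERE title LIKE ? OR author LIKE ? ORDER BY id",
            (pattern, pattern),
        )
        return rows.fetchall()

    def _set_available(self, book_id, available):
        # Only flip the flag when it actually changes, so repeated calls report failure.
        with self.conn:
            cur = self.conn.execute(
                "UPDATE books SET available = ? WHERE id = ? AND available = ?",
                (int(available), book_id, int(not available)),
            )
        return cur.rowcount == 1

    def check_out(self, book_id):
        """Mark a book as borrowed; False if it is missing or already out."""
        return self._set_available(book_id, False)

    def check_in(self, book_id):
        """Mark a book as returned; False if it is missing or already in."""
        return self._set_available(book_id, True)
```

Guarding the update with the current `available` value keeps a book from being checked out twice without a separate read-then-write step.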

Skills & Technologies

Programming Languages

Python
SQL
Bash
Java
C++

Data Engineering

Apache Airflow
dbt
SSIS
AWS Glue
Apache Spark
Kafka
Snowflake
Prefect
dlt

Python Libraries

Pandas
NumPy
Scikit-learn

Cloud & DevOps

AWS Lambda
AWS Glue
AWS Athena
AWS S3
AWS EMR
AWS Redshift
AWS Kinesis
Docker
Linux

Databases

SQL Server
PostgreSQL
MongoDB
DuckDB

Visualization & ML

Power BI
Grafana
Matplotlib
NLP
Computer Vision
TensorFlow

Frameworks & Tools

Anaconda
Flask
Streamlit

Soft Skills

Problem Solving
Agile Project Management
Critical Thinking

Version Control & CI/CD

Git
GitHub Actions
Jenkins

Education

Bachelor's in Computer Science & AI

Beni-Suef University

2020 - 2024

Graduation Project Grade: Excellent

Data Engineering Training Program

Information Technology Institute (ITI)

June 2025 – Nov 2025

Completed an intensive Data Engineering specialization. Developed end-to-end data pipelines using Apache Airflow & Spark, and designed optimized Data Warehouses with Snowflake & dbt.

Let's Work Together

Get in Touch

Email

yousef.soliman.de@gmail.com

Phone

0100-438-9030

Location

Cairo, Egypt
