- Amazon Web Services - AWS Professional ServicesData & ML ArchitectTECHSeptember 2022 - Today (2 years and 8 months)Paris, FranceLead Data Architect – Leading Aerospace CompanyDelivered a group-wide streaming data platform supporting 100+ real-time data and AI use cases. Led technical architecture, FinOps, monitoring, and security.Stack: Spark (EMR), Kafka (MSK), Apache Iceberg, Lake Formation, AWS CDK, JenkinsFeedback: “Mehdi is a key resource, a true builder able to deliver massive high-quality work. He juggles multiple topics and supports wherever needed.”Lead Data Engineer – Leading Aerospace CompanyMigrated a Shopfloor Monitoring app to real-time (13s latency vs 24h before). Designed pipelines consolidating SAP events and IoT data for a front-end.Stack: Python, AWS Lambda, Aurora PostgreSQL, SQS, API Gateway, dbt, Solace, CDK, JenkinsFeedback: “Strong customer focus and active listening. Understands business needs and delivers appropriate designs.”Data Engineer – Major Energy GroupRedesigned an email ingestion system, migrating from SQL Server to Redshift Serverless. Legacy ingestion (via email attachments and stored procs) replaced with dbt models.Stack: dbt, SQL (Redshift), AWS Lambda, Fargate, S3Data Engineer – Major Luxury Goods GroupBuilt a real-time Customer 360 syncing online and in-store profiles across 6 regions. Reduced sync time from 8h to 5 min. Implemented GDPR “droit à l’oubli”.Stack: Flink (KDA), Java, DynamoDB, Terraform, JenkinsML Engineer – Major Energy GroupIntegrated SageMaker Studio as an MLOps service. Trained 30+ data scientists. Platform now supports 50+ ML use cases.Stack: CloudFormation, SageMaker
- Octo Technology (part of Accenture)Data & ML ConsultantTECHApril 2021 - September 2022 (1 year and 5 months)Paris, FranceEnd-of-studies internship andfull-time position Paris, France• Data Engineer for a Heating Network Optimization Project: ∗ Developed multiple data use cases to enhance monitoring and optimization of the heating network. ∗ Implemented data ingestion & transformation from various source systems into the company's Snowflake Data Warehouse sourcing Power BI dashboards for business end-users, including critical production monitoring. ∗ Stack: dbt, Python, AWS Glue, Amazon S3, Amazon Athena, Snowflake, PowerBI• Data Scientist for an EV Charging Station Provider: Performed data analysis to identify low and underperforming stations, driving optimization of the deployment strategy for charging stations.
- Master of ScienceEcole Centrale Marseille2021Relevant Courses: • Machine Learning, Deep Learning, Applied Statistics, Large-Scale Data Processing with PySpark • Algorithms and Data Structures, Object-Oriented Programming in Python/C++ • Sentiment Analysis on Health Startup Data (Pandas, Seaborn, scikit-learn, Keras, FastAPI, GitHub) • Research Project on Open Set Machine Learning (Marseille Computer Science and Systems Lab – Machine Learning Team)
- Master of ScienceUniversity van Amsterdam2019Exchange Program, Stochastic and Financial MathematicsExchange Program, Stochastic and Financial Mathematics - Computational Finance - Stochastic Calculus - Stochastic Processes - Data-Driven Decision Making in Operations Research
- AWS Certified AI PractitionerAmazon Web Services Training and Certification2024
- AWS Certified Data Engineer – AssociateAmazon Web Services Training and Certification2024
- AWS Certified Data Analytics – SpecialtyAmazon Web Services Training and Certification2023
- HashiCorp Certified: Terraform Associate (003)HashiCorp2023
- AWS Certified Developer – AssociateAmazon Web Services Training and Certification2022
- AWS Certified Solutions Architect – AssociateAmazon Web Services Training and Certification2022
- SnowPro Core CertificationSnowflake2021