About Me
I am a Data Engineer with over three years of experience building data-intensive applications for clinical and other complex datasets. I specialize in building robust ETL pipelines, managing centralized databases, and turning data into insights. I enjoy solving intricate problems, and I build scalable solutions and intuitive dashboards that support decision-making. I am eager to contribute to projects that bridge the gap between raw data and actionable insights.
Proficient in: Python, SQL, NoSQL, clinical data management, ETL, Google Cloud, Airflow, and scientific visualization.
I am currently preparing for the Google Cloud Professional Data Engineer certification, which I aim to complete by February 2025, to expand my expertise in cloud-based data engineering solutions.
Work Experience
IHU LIRYC - Université de Bordeaux - May 2021 - Present
Location: Pessac, Nouvelle-Aquitaine, France
- Led data engineering efforts for Beat-AF, an EU-funded clinical research project focused on atrial fibrillation.
- Designed and implemented a web server using Girder and MongoDB to manage complex data types, including DICOM files and electrophysiology (EP) signals.
- Developed ETL pipelines to extract ECG data from the Kardia Pro API, anonymize it, and load it into a centralized database for further analysis.
- Created an interactive dashboard using scientific visualization tools (VTK) and Pandas, enabling real-time insights for clinicians and researchers.
- Designed proof-of-concept (PoC) pipelines with Apache Airflow that anonymize sensitive clinical data to support GDPR compliance.
- Streamlined data integration from diverse sources into a centralized database, improving reliability and accessibility for clinical research teams.
- Cleaned clinical datasets and improved their quality, ensuring data integrity and usability.
Vectorive - Jan 2020 - May 2021
Location: Paris, France
- Registered and integrated a Dynamics 365 app with Azure Active Directory.
- Authenticated apps with access tokens obtained through the Python libraries MSAL and ADAL.
- Extracted CRM data through the Web API using OData queries.
- Built a streaming environment with Apache Flink using PyFlink.
- Designed a data source fed by an Apache Kafka producer.
- Created a sink in Apache Flink and registered tables in PostgreSQL.
- Performed transformations such as select, aggregate, and flat map using PyFlink.