s
Data

Data Scientist · Algiers, Algeria

tree
Hana
Yasmine

Data Engineer & AI Builder focused on Automation,

Analytics systems,

and turning data into clear Business decisions.

Behind every dataset, there is a story trying to be understood. I help it surface clearly, without noise or confusion.

View my work Get in touch

Skills

What I work with

Languages

  • Python
  • SQL
  • R

Data Engineering

  • Pandas
  • ETL pipelines
  • Data cleaning & transformation

Machine Learning

  • scikit-learn
  • Feature engineering
  • Anomaly detection

Data & Visualization

  • R
  • Matplotlib
  • Altair

Automation & Tools

  • Git
  • Docker (basic)
  • Python automation scripts

Concepts

  • Data pipelines
  • Business analytics
  • AI readiness & reporting systems

Selected Work

Projects that mattered

01

Medical Indication Prediction

Built a deep learning system that predicts medical indications from molecular combinations stored in structured datasets. The project focused on transforming raw biomedical data into a trained neural network capable of automated prediction and classification.

Python TensorFlow / Keras Deep Learning Data Preprocessing One-Hot Encoding Predictive Modeling
02

Automated Pharmaceutical Data Analysis

Built an automated system that processes pharmaceutical datasets and generates structured insights at multiple levels, including global trends, city-level breakdowns, and individual analysis, with automatic Excel report generation.

Python Pandas Data Analysis Automation Excel Reporting
03

Supplier Management Web Application

Developed a web-based system to manage supplier data, enabling creation, updates, and deletion of supplier records. The application integrates a structured SQL database with a Java EE backend to ensure reliable data handling.

Java EE SQL Web Development CRUD System Backend Systems
04

Pharmacy Matching System

Built an intelligent matching tool to link pharmacy records across multiple datasets using NLP, fuzzy matching, and geospatial signals. The system improves data consistency by detecting duplicates, uncertain matches, and aligning records across sources with minimal manual intervention.

Python NLP Fuzzy Matching Geospatial Data Cosine Similarity DBSCAN Data Cleaning
05

Client Data Automation System

Built a set of Python automation scripts to sync and update a SQL Server database from structured Excel templates. The system standardizes client, hierarchy, and territory data updates while reducing manual errors in recurring data integration workflows.

Python SQL Server Automation Excel Processing Data Engineering ETL
06

Universal Account System

Designed and structured a unified account system to consolidate fragmented client and pharmacy data into a single reliable source of truth. The project focuses on deduplication, data enrichment, and automated synchronization between SaaS inputs and a centralized universal database.

Data Engineering SQL Data Modeling ETL Pipelines Fuzzy Matching Data Deduplication System Design
07

Market Performance Reporting API

Built a reporting system to track key marketplace KPIs and support compliance requirements for e-commerce licensing. The solution aggregates transactional and operational data into validated metrics exposed through a web report and API layer.

Python SQL APIs Data Aggregation Reporting Systems Data Validation Backend Engineering
08

Automated QC Reporting System

Developed an automated quality control reporting system to detect anomalies and data inconsistencies in panel datasets. The solution reduces manual QC effort and provides interactive dashboards for exploring data quality issues in real time.

Python Streamlit Data Quality Anomaly Detection R Dashboarding Automation
09

Pharmacy Image Intelligence (OpenAI Vision)

Built an image analysis pipeline using OpenAI vision models to extract structured insights from pharmacy images via public URLs. The system identifies objects, reads text (OCR), and classifies visual content to support large-scale pharmaceutical data enrichment.

Python OpenAI API Computer Vision OCR Image Classification Data Enrichment API Integration
10

E-commerce Data System

Designed and managed a full e-commerce data system, transforming a completely unstructured environment into a structured platform for products and users. The system improved data visibility, consistency, and operational efficiency, enabling faster decision-making and better data control across the platform.

Data Engineering SQL Python ETL Data Modeling E-commerce Systems Data Cleaning
00

Experience

Where I have been

October 2022 — Present

Data Scientist

Sanisphere – POC Pharma

Developed and delivered data solutions using Python, SQL, and R to answer business needs. Worked on data cleaning, exploratory analysis, NLP tasks, automation of Excel reporting, and data model design. Also contributed to automation of ETL pipelines and recurring data workflows.

September 2024 — February 2025

Data Engineer

Dats Connexion

Built, tested, and optimized data pipelines and analytics solutions. Improved data flow reliability and ensured data quality across multiple sources. Contributed to the design and maintenance of scalable data processing workflows.

February 2022 — September 2022

Software Engineer

Bank of Algeria

Developed a web application for supplier management using JEE and SQL. Worked on backend logic, database integration, and application features to support internal operations.

January 2021 — January 2022

Assistant Research Engineer

CERIST

Developed a VOD platform using Python, Bash, and MediaCMS. Delivered training sessions in Python for state executives. Worked on infrastructure-related skills including Docker and Kolla-Ansible, and system deployments.

February 2020 — November 2020

Freelance Projects

Self-employed

Assisted in teaching LaTeX for scientific writing. Built a standalone ML application for medical data classification. Worked on data processing and classification using Python and R. Automated Excel-based reporting and calculation workflows.

Beyond Work

Other passions

✍️

Writing

I write stories mostly, and sometimes serious texts

📚

Reading

I like to write so logically reading is a must

🎮

Video Games

Outside of playing games, I like to understand how they were made: story, design and logic

🌱

Gardening & Animals

It's important to take care of the nature we live in

🎵

Music

I enjoy listening to music and sometimes learning how create it

Contact

I always want to create something meaningful

hana.eddoud@gmail.com

About

Data scientist with more thn 6 years of experience, I design and deploy real solutions: ML models, tasks automations, robust pipelines, BI dashboards.
My role is to understand both technical and business parts. I won't deliver empty models: only what's real and with measurable results.
Available for remote, French, English and Arabic speaking environments.
What interests me is to join an ambitious team, invest and build something valuable and durable.