Mohammed Zniber

Logo


Data Scientist with 6 years of experience in Data collection, processing, exploration, and modelling.

View my LinkedIn Profile

View My GitHub Profile

Selected Projects in Data Science, Chemometrics, Applied AI and Product Optimization

Computer Vision-based Defect Detection for Augmented Reality Glasses Manufacturing

We collaborated with OptoFidelity to develop an automated defect detection workflow for augmented reality glasses using microscopy images.

Dataset:

Model development:

Deployment:

Results:

Bridging the Gap between Simulation and Experiment in Antireflective Coatings: A Data-Driven Approach

We collaborated with Senop to optimize antireflective coatings in simulation and validate experiments, aligning experimental results with simulation predictions.

Key contributions:

Results:

🔒 Proprietary project:
Code and detailed results cannot be publicly shared at the moment.


Data-driven Optimization of Laser-Assisted Bonding Process for Hybrid Integration in Silicon Photonics

We collaborated with Tampere University to optimize the laser-assisted bonding of a Si-PIC–GaSb chip by maximizing bond strength.

Process parameters:

Objective:

Optimization workflow:

Results:

Data-driven Optimization of Semiconductor Saturable Absorber Mirrors (SESAMs)

We collaborated with Reflekron to optimize oscillator performance and reduce experimentation by identifying optimal input parameters and chip properties for SESAMs using machine learning.

Research Questions:

Approach:

Results:


Hackathon : Development of a Computer Vision Model for Pumps and Valves Detection

In this project, we participated in a challenge for Valmet organized by Since AI, focusing on building a complete solution for pumps and valves detection. Our team developed a web-based interface integrated with a YOLO-based computer vision model to detect and classify components in industrial environments. The work included data collection and preparation, model training, evaluation, and deployment within a user-friendly interface.

View a demo on Youtube


Protomix : A Python Package for 1H-NMR Metabolomics Data Preprocessing

This project focused on the development of a preprocessing pipeline for NMR data in Python. For more details, check out the scientific publication describing the package. You can also access the full documentation on ReadTheDocs and the notebook below illustrates a glimpse of the capabilities that this library has to offer.

View code on Colab


PyShift: NMR Peak Alignment using Icoshift Algorithm in Python

This project aims to implement the Icoshift algorithm in Python to achieve accurate alignment of NMR spectra. The implementation’s performance will be assessed using the Wine NMR dataset, showcasing its ability to effectively handle variations in peak positions.

View code on Colab


A Potentiometric Multisensor Array for Prostate Cancer Diagnosis

A potentiometric electronic tongue combined with machine learning was used to distinguish between prostate cancer patients and those with benign prostate hyperplasia by analyzing urine samples.

View code on Colab


A Custom Data Collection Pipeline Using BeautifulSoup

The objective of this project is to utilize BeautifulSoup for gathering pertinent data and valuable insights from a rental website. This will enable users to access a comprehensive list of apartments along with their specific details. Additionally, the project includes the creation of an interactive Power BI dashboard, facilitating data visualization for a more engaging user experience.

View code on Colab


An Interactive Power BI Dashboard for Apartments’ data

The project includes the creation of an interactive Power BI dashboard, facilitating data visualization for a more engaging user experience. The data was retrieved using BeautifulSoup in the aforementioned project.

View dashboard on PowerPoint

Important note: Power BI must be installed on your computer in order to interact with the dashboard.


A Comparative Analysis of Rice Varieties Osmanjik and Cameo using CatBoost Classifier and SHAPley values

This project conducts a comparative analysis of rice varieties Osmanjik and Cameo using the CatBoost classifier for accurate classification. SHAPley values are employed to interpret the model’s decisions and identify discriminative features between the two varieties.

View code on Colab