E-Procurement Data Platform

Led the development of a scalable e-procurement marketplace with ML-based fraud detection, achieving 89% accuracy. Implemented real-time data processing pipelines using Apache Kafka and Spark Streaming, reducing data latency by 70%.

Next.js
FastAPI
Elasticsearch
PostgreSQL
PySpark
Airflow
Machine Learning
Apache Kafka
In Progress

Product Data Aggregator

Built a comprehensive product data aggregator from multiple supplier APIs, enhancing data integration and accessibility for the e-procurement platform.

Python
PySpark
Apache Airflow
Elasticsearch
API Integration
Completed

MLOps Implementation

Implemented MLOps best practices to streamline machine learning model deployment, significantly reducing technical debt and deployment time by 60%.

MLOps
CI/CD
Docker
Kubernetes
Git
GitHub Actions
Ongoing

Business Intelligence Dashboard

Created and deployed BI dashboards using SAP Analytics Cloud, increasing BI solution adoption by 30%. Performed detailed analysis of invoices, purchase orders, and suppliers.

SAP Analytics Cloud
Data Visualization
Business Intelligence
Completed

Product Classification Model

Developed a machine learning model for product classification to optimize product repatriation in SAP S4/Hana, achieving 72% precision. Implemented data augmentation and annotation techniques for unbalanced data.

Python
Machine Learning
SAP S4/Hana
imblearn
nlpaug
spacy
Azure ML
Completed

Image Classification System

Developed and deployed an advanced image classification system using OpenCV, TensorFlow, and Keras. Achieved high accuracy with various models including SVM (93%), LSTM (94%), and CNN (94%).

OpenCV
TensorFlow
Keras
Python
SVM
LSTM
CNN
Completed

Data Warehouse Optimization

Redesigned the data warehouse schema, implementing a star schema that improved query performance by 200%. Introduced data partitioning and indexing strategies, reducing storage costs by 40%.

SQL
Data Warehousing
Database Optimization
Star Schema
Completed

E-commerce Predictive Analysis Tool

Created a predictive analysis tool for e-commerce, focusing on customer behavior analysis. Integrated with existing systems to provide actionable insights for marketing and sales teams.

Python
Machine Learning
Data Analysis
Predictive Modeling
Completed

Real-time Analytics Dashboard

Developed a real-time analytics dashboard for monitoring and visualizing data streams from IoT devices. Implemented data processing logic using Apache Flink and created visualizations with Grafana.

Apache Flink
Kafka
Grafana
Python
IoT
Real-time Processing
Completed

Face Mask Detector

Built a face mask detector app using OpenCV and deep learning. The project involved data collection, preparation, and using a DL algorithm to classify images. OpenCV was used to infer the classifier and display results.

OpenCV
Deep Learning
Python
Image Classification
Completed

AWS Word Count Application

Created a word count application using AWS EMR and Spark. Generated a 20 GB corpus using NLTK, set up an EMR cluster, loaded data to S3, and defined a Spark application for processing.

AWS EMR
Apache Spark
S3
NLTK
Big Data Processing
Completed

Neural Machine Translation Model

Created a translation system from a parallel corpus using OpenNMT. The process included sub-tokenization, training, translation, detokenization, and evaluation of the machine translation model.

OpenNMT
Machine Translation
NLP
Python
Completed

Movie Recommender System

Developed a recommendation system using the MovieLens dataset. Created a pipeline for data loading, preparation, model training, cross-validation, and evaluation. Compared SVM and KNN models, with KNN (item-based using cosine similarity) performing best despite longer training time.

Python
Machine Learning
SVM
KNN
Recommendation Systems
Completed

EU Crime Statistics Analysis

Analyzed crime statistics from the European Union to determine the most dangerous countries. Used datasets on assaults, intentional homicides, car thefts, and robberies. Created visualizations using chloropleth maps, tree maps, and bar charts in Plotly.

Data Analysis
Plotly
Python
Data Visualization
Completed