Data Analyst

Transforming raw data into actionable insights that drive business decisions and growth

Learn More

Full Stack Developer

Building robust, scalable web applications with modern technologies

About Me

UI/UX Designer

Crafting intuitive and visually appealing user experiences that delight users

Get In Touch

My Projects

Student Dropout Predictor

Student Dropout Predictor

I built a Random Forest based student dropout risk system using behavioral, academic, and engagement data, achieving an ROC AUC of 0.86 (precision 78%, recall 82%, F1 0.80). SHAP analysis revealed attendance, assignment scores, and financial stability as the top risk drivers. A Python and SQLite pipeline then scores new data in real time and issues early-warning alerts.

Python, Flask, SQLite Random Forest Monte Carlo Simulation
Amazon Sales Analysis

Amazon Sales Trend Analysis

I built an end-to-end Amazon review analysis dashboard using Python, leveraging pandas for data ingestion and cleaning, NLTK’s VADER for sentiment scoring, and Matplotlib, Seaborn, and Plotly for interactive visualizations. The workflow processes line-delimited JSON review data, handles missing fields, converts timestamps, standardizes votes, computes sentiment and rating distributions, and exports standalone HTML dashboards for each product.

Python NLP Data Visualization
Dermatology Data Analysis

Automated Dermatology Data Analysis Pipeline

  • Data Prep: Cleans and imputes a raw dermatology CSV, normalizes features, and splits into train/test sets.
  • Modeling:
    • Custom gradient-descent regression on age.
    • Random Forest & k-NN classifiers with grid search, cross-validation, and stability testing.
    • K-Means & DBSCAN clustering, tuned by ARI/silhouette and visualized with PCA.
  • Results: Logs all metrics and parameters to JSON, generates publication-quality plots comparing models.
  • Tech Stack: Python · NumPy · pandas · scikit-learn · Matplotlib
Machine Learning Python scikit-learn Data Pipeline
Sales Trends

Analysing & Predicting Sales Trends

Built ML models (Logistic Regression, Random Forest) to analyze and predict product sales trends with high accuracy.

Machine Learning Python Pandas
MediCareDB: Healthcare Management

MediCareDB: Healthcare Management

NoSQL-based healthcare database system built using MongoDB and Python with automated data generation and aggregation pipelines.

MongoDB NoSQL Python
Sales Dashboard
Customer Segmentation
Customer Segmentation
Customer Segmentation
Customer Segmentation
Customer Segmentation
Customer Segmentation
Customer Segmentation
Customer Segmentation
Customer Segmentation
Customer Segmentation
Customer Segmentation
Customer Segmentation
Customer Segmentation
Customer Segmentation

Get In Touch

Let's work together

I'm currently available for freelance projects or full-time positions. Feel free to reach out to discuss how I can contribute to your team or project.

Location

Liverpool, United Kingdom.

YouTube Blogs

Visit My Channel
Message sent successfully!