data_science_portfolio

Data Science and Machine Learning Portfolio

Welcome to my data science and machine learning portfolio! This repository showcases a diverse collection of projects demonstrating my skills in data cleaning, exploratory data analysis (EDA), regression, classification, clustering, time series analysis, machine learning, and data visualization. The projects are implemented using Python, Stata, and R to highlight my proficiency across these tools.

Highlights

Explore the projects to see detailed documentation, code, and results. Each project is designed to solve real-world problems and demonstrate practical applications of data science and machine learning techniques.

Python Projects

1. Data Cleaning

Project: Customer Data Cleaning

Project: Web Scraped Data Cleaning

2. Exploratory Data Analysis (EDA)

Project: EDA on Movie Data

Project: EDA on Sales Data

3. Regression Analysis

Project: House Price Prediction

Project: Car Price Prediction

4. Classification Projects

Project: Customer Churn Prediction

Project: Spam Email Detection

5. Clustering Projects

Project: Customer Segmentation

Project: Market Basket Analysis

6. Time Series Analysis

Project: Stock Price Prediction

Project: Weather Forecasting

7. Machine Learning Projects

Project: Image Classification with Convolutional Neural Networks (CNN)

Project: Natural Language Processing (NLP) for Sentiment Analysis

8. Dashboards and Visualization

Project: Interactive Sales Dashboard

Project: COVID-19 Data Dashboard

9. Deployment Projects

Project: Deploying a Machine Learning Model as an API

Project: Building a Web Application with Streamlit

10. Capstone Project

Project: End-to-End Data Science Project

Stata Projects

1. Data Cleaning

Project: Socioeconomic Data Cleaning

2. Exploratory Data Analysis (EDA)

Project: EDA on Health Data

3. Regression Analysis

Project: Wage Determinants Analysis

4. Classification Projects

Project: Loan Default Prediction

5. Clustering Projects

Project: Household Segmentation

6. Time Series Analysis

Project: Economic Indicators Forecasting

7. Machine Learning Projects

Project: Logistic Regression for Health Outcomes

8. Dashboards and Visualization

Project: Economic Data Dashboard

9. Deployment Projects

Project: Deploying a Predictive Model

10. Capstone Project

Project: End-to-End Data Science Project

R Projects

1. Data Cleaning

Project: Financial Data Cleaning

2. Exploratory Data Analysis (EDA)

Project: EDA on mtcars dataset

3. Regression Analysis

Project: Sales Forecasting

4. Classification Projects

Project: Customer Segmentation with Decision Trees

5. Clustering Projects

Project: Market Segmentation

6. Time Series Analysis

Project: Monthly Sales Forecasting

7. Machine Learning Projects

Project: Random Forest for Classification

8. Dashboards and Visualization

Project: Interactive Data Dashboard with Shiny

9. Deployment Projects

Project: Deploying a Machine Learning Model with Plumber

10. Capstone Project

Project: End-to-End Data Science Project