Welcome to a showcase of some of the projects from my Master’s in Data Science journey at Ferris State University, MI, USA, where I’ve refined my skills in R, Python, Machine Learning, Predictive Analytics, Data Mining, Data Visualization, and Statistical Analysis. Each project highlights my ability to tackle complex challenges through rigorous, data-driven solutions, combining technical expertise with statistical precision and creative problem-solving. Read More
With a solid foundation in statistics, computer science, and business, as well as a passion for research, I have applied advanced statistical methodologies to uncover meaningful insights and drive data-informed decision-making. My hands-on experience as a Research Assistant has deepened my expertise in academic research, contributing to innovative projects at the intersection of data science and statistical analysis. Additionally, my diverse work experience across multiple sectors has strengthened my ability to adapt data science solutions to real-world business challenges. Now, I am eager to pursue a Ph.D., push the boundaries of research and analytical methodologies, and contribute to the evolving field of data science.
This project demonstrates the use of loops and supervised machine learning in R, showcasing iterative techniques like for-loops and while-loops. It also includes the implementation of basic machine learning models for predictive analysis.
ViewThis project implements classification algorithms such as Logistic Regression, Linear Discriminant Analysis, Quadratic Discriminant Analysis, Naive Bayes, and K-Nearest Neighbors using python to analyze data.
ViewThis project demonstrates data manipulation, statistical analysis, and visualizations using R programming. It involves building linear models to explore relationships between variables. It also includes creating various visualizations.
ViewIn this project I used Python to build and evaluate various machine learning models with differernt variations to predict the number of medals won by participating countries in international sporting events.
ViewThis project uses TensorFlow, leveraging convolutional neural networks (CNNs) for image classification. The project showcases how TensorFlow can be effectively utilized for real-world image recognition problems.
Viewexplores the fundamentals of using loops to perform repetitive tasks efficiently. The project covers different types of loops, including for-loops, while-loops, and nested loops, to automate tasks such as iterating over data, controlling program flow, and optimizing code performance.
ViewThis project focuses on analyzing customer churn data to identify patterns and trends that affect customer retention. Through visualizations like bar plots, bubble charts, and correlation heatmaps, insights are drawn from features such as credit scores, geography, and customer activity.
ViewIn this project, SAS Help datasets like Stock, Class, Cars, Heart, and Wages were utilized to perform comprehensive data analysis and visualization. Key insights were drawn by analyzing trends, distributions, and relationships.
ViewThis project explores the use of Numpy for efficient numerical computations and Pandas for data manipulation tasks, such as data cleaning and aggregation. It demonstrates how these libraries simplify handling and analyzing large datasets in Python.
ViewThis project focuses on Data Wrangling in Python, using famous library like Pandas to clean, transform, and prepare data for analysis. It showcases techniques for handling missing values, filtering data, and reshaping datasets to derive meaningful insights.
ViewThis data mining project analyzes three distinct datasets following the CRISP-DM methodology, involves clustering countries based on their IT usage, uncovering association rules for cross-selling products, and predicting house prices in U.S. towns.
ViewThis project applied supervised machine learning models in IBM SPSS Modeler to estimate sales representatives' salaries in the tech industry. Various models including Multiple Linear Rrgression, KNN and Neural Networks were evaluated using Mean Squared Error.
ViewThis project aimed to develop an accurate predictive model for college admissions outcomes, focusing on whether applicants would be admitted based on various attributes. A dataset containing information on 17,339 applicants across three colleges within a university was analyzed.
ViewThis project involved creating an interactive dashboard to visualize Olympic gold medal data, leveraging MongoDB for data storage and management.MongoDB was used for its flexibility in handling complex, unstructured data, allowing easy aggregation and retrieval for dynamic chart generation.
ViewThis project examines the multifaceted aspects of database security and ethics, exploring critical issues such as responsible data usage, breach accountability, and legal frameworks. It delves into managing sensitive data, emphasizing the need for clarity in intentions when using or collecting data.
ViewThis project presents a detailed disaster management plan for a large-scale retail organization that depends on outsourced systems for its operations. The plan addresses four critical disaster scenarios: wide-scale power outages, system failures, ransomware attacks, and unauthorized access.
ViewThis white paper focuses on evaluating the critical factors involved in choosing between cloud-hosted databases and in-house database management. The white paper explores key considerations such as the costs associated with each option, potential for early termination, the impact on staff levels, and security concerns.
ViewThis project focuses on applying logistic regression using SAS software to analyze data from the Behavioral Risk Factor Surveillance System (BRFSS). I used logistic regression to model relationships between various behavioral risk factors (such as smoking, alcohol use, and physical activity) and health outcomes like chronic diseases or conditions.
ViewThis in-class project focused on applying simple linear regression using SAS to solve statistical problems and analyze provided data. The project involved answering various questions related to the concepts of linear regression, including understanding relationships between variables, calculating regression coefficients, and interpreting statistical outputs.
ViewThis in-class project focused on the theoretical aspects of multiple linear regression as part of an applied statistical methods course. The objective was to understand and explore how multiple independent variables can be used to predict the outcome of a dependent variable and how multiple linear regression can be applied to complex datasets and scenarios.
ViewThis project involved applying multiple linear regression using R to analyze the "Graduate Admission 2" dataset, downloaded from Kaggle.com.The goal of this project was to model and predict the likelihood of a student being admitted to a graduate program based on these features. R was used for statistical analysis and evaluating model fitness.
ViewThis midterm exam for the Applied Statistical Methods course focused on solving various statistical questions using theoretical knowledge and practical data analysis with SAS software. The exam tested a range of concepts, particularly regression analysis, hypothesis testing, and estimation techniques.
ViewThis project explores interactive data visualization techniques using R and JavaScript libraries. The goal was to integrate various JavaScript-based libraries, such as threejs, leaflet, visNetwork, and highcharter, with R to create dynamic and interactive visualizations that enhance data exploration and analysis.
View