Welcome to my repository where I document my hands-on learning journey in Machine Learning and Data Science through real-world datasets and end-to-end projects.
Each folder represents a self-contained project where I explore algorithms, data preprocessing techniques, model evaluation strategies, and best practices.
Teen Phone Addiction Analysis using Linear Regression
- π Explored and cleaned survey-based dataset
- π Performed EDA and correlation heatmaps
- π§ Built and evaluated models: Linear, Ridge, Lasso, ElasticNet
- β Compared model performance and visualized predictions
- πΎ Cleaned data and saved models included
Titanic Survival Prediction using Logistic Regression
- π§Ό Data Cleaning & Null Value Treatment
- π§ Feature Engineering (FamilySize, IsAlone)
- π EDA with visualizations and correlation analysis
- βοΈ Preprocessing Pipelines with ColumnTransformer
- π Model Training with Logistic Regression
- π Hyperparameter Tuning with GridSearchCV
- π§ͺ Final predictions tested on Kaggle dataset
- Strengthen core ML concepts through real-world application
- Practice full ML workflows: EDA β Preprocessing β Modeling β Evaluation
- Build a clean and collaborative portfolio on GitHub
- Learn by doing and iterate through trial-and-error
- Python (Pandas, NumPy, Scikit-learn)
- Visualization (Matplotlib, Seaborn)
- Jupyter Notebooks
- Git & GitHub
- Kaggle Datasets
- VS Code
- 𧬠Classification projects with Random Forest, SVM, XGBoost
- π Unsupervised Learning: Clustering (K-Means, DBSCAN)
- π§° Feature Selection Techniques (RFE, SHAP)
- π Deployment with Streamlit & FastAPI
- π― Participation in more Kaggle competitions
I'm actively learning and building in the ML & Data Science space.
If you're working on similar projects or want to collaborate, feel free to reach out or connect on LinkedIn!