Skip to content
View Achintya-data's full-sized avatar

Block or report Achintya-data

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Achintya-data/README.md

Achintya

Master's in Data Science at Stevens Institute of Technology

Data Science | AI/ML | Analytics | Big Data

LinkedIn Stevens Institute of Technology

About Me

I am a Master's student in Data Science at Stevens Institute of Technology with interests in machine learning, predictive modeling, time series, financial analytics, and large-scale data systems.

I like building projects that take messy data through a full workflow: cleaning, feature engineering, modeling, evaluation, and communication. This GitHub is being organized as a portfolio of practical data science and analytics work across forecasting, statistical learning, visualization, climate analytics, and distributed machine learning.

  • M.S. in Data Science, Stevens Institute of Technology, 2024-2026
  • GPA: 4.0/4.0
  • Presidential Merit Scholarship
  • Graduate Teaching Assistant for Mathematical Department

Current Focus

  • Building end-to-end machine learning and analytics projects with cleaner documentation
  • Strengthening work in forecasting, model evaluation, and feature engineering
  • Exploring big data workflows with Spark and cloud-based pipelines
  • Publishing more applied projects in forecasting, analytics, and machine learning

Tech Stack

Languages and querying

Python SQL R PySpark C++

Machine learning and analytics

Scikit-learn Pandas NumPy TensorFlow Keras Matplotlib Seaborn Tableau LangChain

Data engineering and cloud

Apache Spark PostgreSQL dbt AWS Docker Kubernetes Git Jupyter

Featured Projects

Project Focus
U.S. Airline Departure Delay Prediction at Scale Distributed PySpark and Spark MLlib pipeline with feature engineering, clustering, anomaly monitoring, and Dataproc scaling analysis
Climate Data Analysis and Rainfall Prediction PCA, SVD, clustering, and machine learning on multi-decade climate data
Time Series Forecasting and Risk Analysis ARIMA and SARIMA modeling, stationarity testing, residual diagnostics, and forecasting
Spotify Track Popularity Prediction with Statistical Learning Statistical testing, regression modeling, and feature-driven popularity analysis
Analyzing EV Adoption: Mapping the Future of Clean Mobility Tableau-based data visualization and storytelling around EV adoption trends

Building Next

  • S&P 500 sector classification using unsupervised learning and financial data
  • Synthetic fraud data generation and fraud detection modeling
  • Additional academic and applied projects being cleaned and published one by one

Certifications

  • Databricks Certified Data Analyst Associate
  • AWS Certified Data Engineer Associate

Socials

Popular repositories Loading

  1. Achintya-data Achintya-data Public

    GitHub profile and project portfolio for Achintya

  2. time-series-forecasting-risk-analysis time-series-forecasting-risk-analysis Public

    Time series forecasting and risk analysis with ARIMA and SARIMA models in R

    Jupyter Notebook

  3. spotify-track-popularity-prediction-statistical-learning spotify-track-popularity-prediction-statistical-learning Public

    Spotify track popularity prediction with statistical analysis and machine learning in Python

    Jupyter Notebook

  4. climate-data-analysis-rainfall-prediction climate-data-analysis-rainfall-prediction Public

    Climate data analysis and rainfall prediction using PCA, SVD, clustering, and machine learning

    Jupyter Notebook

  5. analyzing-ev-adoption-clean-mobility analyzing-ev-adoption-clean-mobility Public

    Tableau project analyzing EV adoption, clean mobility trends, and environmental impact

  6. us-airline-departure-delay-prediction-at-scale us-airline-departure-delay-prediction-at-scale Public

    Distributed PySpark and Spark MLlib pipeline for U.S. airline delay prediction with feature engineering, clustering, anomaly monitoring, and Dataproc scaling analysis

    Jupyter Notebook