Data science, machine learning books and resources
-
Updated
May 6, 2023
Data science, machine learning books and resources
Data science for beginners involves learning to extract insights from data using statistics, programming (Python/R), and visualization. Key steps include data collection, cleaning, analysis, modeling, and communicating findings. Beginners should start with Python, basic math (linear algebra/calculus), and build projects to create a portfolio.
Welcome to the Multiverse of Data Science — a comprehensive, ever-expanding collection of over 100 real-world projects covering the entire data science pipeline!
Analyzing the safety (311) dataset published by Azure Open Datasets for Chicago, Boston and New York City using SparkR, SParkSQL, Azure Databricks, visualization using ggplot2 and leaflet. Focus is on descriptive analytics, visualization, clustering, time series forecasting and anomaly detection.
70+ DataCamp Course Notes, Projects, Codes, Exercises on Python, R and SQL with full DS & ML Certification,
This project analyzes and visualizes the Used Car Prices from the Automobile dataset in order to predict the most probable car price
This Repository contains the real life use cases of GenAI (LLM+RAG) in Finance Domain. I covers many projects use cases with theory and projects.
🚂 Data 🚃 Scientist 🚋 is a curated 🚑 end to end 🚒 showcasing 🚞 real world ✈ data science 🚀 projects 🛸 machine 🚁 learning 🚟 models and ⛴ data 🛳 engineering 🛸 workflows 🚤 From data 🛼 wrangling to 🚒 deployment this 🚝 repo is proof ☂ of work and ⛱ personal 🛑 lab everything 🎳 data driven ⚽ Classification ⚾ regression 🥎 clustering NLP
a tool for comparing the predictions of any text classifiers
Ethereum Fraud Detection Models
This Repo contains tools that allow us to import, clean, manipulate, and visualize data —Includes Python libraries, like pandas, NumPy, Matplotlib, and many more to work with real-world datasets to learn the statistical and machine learning techniques.
Demonstrating the efficiency of pmdarima’s auto_arima() function compared to implementing a traditional ARIMA model.
Learn Retrieval-Augmented Generation (RAG) from Scratch using LLMs from Hugging Face and Langchain or Python
Data Career Handbook for all
The dataset builder script extracts the most relevant market data straight from Binance's API and builds a series of datasets that can be used in data science and machine learning projects.
Все о машинном обучении и не только...
A credit scoring web app based on an ML model trained on relevant data.
I’ll break down how hyperpersonalization is reshaping industries, from finance to banking, and how you can build your very own hyperpersonalization web app using Python, Machine Learning, Generative AI (GenAI), and open-source LLMs from Hugging Face.
Open-source compilation of free AI/ML learning materials from universities, companies, and industry experts.
Add a description, image, and links to the datascience-machinelearning topic page so that developers can more easily learn about it.
To associate your repository with the datascience-machinelearning topic, visit your repo's landing page and select "manage topics."