Data_Wrangling_Visualization_Project

Project Overview

This project presents an analytical web platform that helps evaluate the quality of movies based on publicly available data. By analyzing over 4,000 films, we identified how factors like IMDb rating, Metascore, genre, release year, budget, and box office returns influence the perception of a movie.

The platform provides interactive visualizations and allows users with known parameters of a film to get data-driven insights into its quality. This tool can benefit viewers, investors, analysts, and studios looking for objective evaluations.

Link to the deployed website: https://data-wrangling-visualization-project.onrender.com/

Our key observations:

• Metascore and IMDb ratings are generally consistent, but viewers tend to avoid extreme ratings.

• Genre and year of release greatly influence how a film is perceived.

• Budget and box office are important, but they don't tell the whole story without genre.

• Box office success ≠ successful film.

Checkpoint 1:

At this stage, our team:

Collected data from the imdb website about 4,000 films using Scrapy
Cleaned and preprocessed data about films
Prepared Advanced Data Analysis including analysis of the completeness of the dataset

Checkpoint 2:

For the second checkpoint, our team:

Prepared a full analysis of the dataset
Developed a unique interactive website with a stylish modern design
Conducted an analysis of patterns between research factors such as:

IMDB rating
Metascore rating
Genre
Year of creation
Box office receipts
Production budget
Popularity of the cast

Added interactive graphs visualizing these dependencies to the website
Deployed Project to: https://data-wrangling-visualization-project.onrender.com/
Added Docker

Tools used:

HTML + CSS + JavaScript for Frontend
Charts.js, D3.js, Plotly for Visualizations
Flask for Backend
Scrapy for parsing
All parsed data are in JSONs
Python (MatPlotLib, Numpy, Seaborn, Pandas) for EDA

Repository structure:

In the data_wrangling folder you can find files created before checkpoint 1:

In the starwars folder there is code for web scraping of the site.
Scraped data is in the films_data.json file
In the data_preparation.ipynb and Advanced_Data_Analysis.ipynb files there is code for cleaning and analyzing the dataset
Cleaned and grouped datasets can be found in the data folder

The img folder:

There are screenshots of our site

In the static folder there are:

assets folder - it contains various icons and pictures for the site
lib folder - it contains library files for the site
index.html
script.js
Styles.css

Also in the main repository is the file app.py

This file contains all the main functions and backend part for the site

Screenshots of the website

Below are attached screenshots of our website:

How to Build and Run the Project Using Docker Compose

Ensure you have Docker and Docker Compose installed on your system. You can download them from Docker's official website.

Clone the repository:

git clone <repository-url>
cd <repository-folder>

Ensure you have a docker-compose.yml file in the root of your project with the following content:

services:
  app:
    build:
      context: .
    ports:
      - "8080:8080"

Build and start the services:
```
docker-compose up --build
```
Open your browser and navigate to: http://127.0.0.1:8080
To stop the services, press Ctrl+C and run:
```
docker-compose down
```

How to Run the Project if you don't have docker

To run the project locally, follow these steps:

Clone the repository:

git clone <repository-url>
cd <repository-folder>

Set up a virtual environment (optional but recommended):
```
python3 -m venv venv
source venv/bin/activate
```
Install the required dependencies:
```
pip install -r requirements.txt
```
Run the Flask application:
```
flask run --port=8080
```
Open your browser and navigate to: http://127.0.0.1:8080

Name		Name	Last commit message	Last commit date
Latest commit History 68 Commits
data_wrangling		data_wrangling
img		img
static		static
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
app.py		app.py
docker-compose.yml		docker-compose.yml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Data_Wrangling_Visualization_Project

Project Overview

Our key observations:

Checkpoint 1:

Checkpoint 2:

Tools used:

Repository structure:

In the data_wrangling folder you can find files created before checkpoint 1:

The img folder:

In the static folder there are:

Also in the main repository is the file app.py

Screenshots of the website

How to Build and Run the Project Using Docker Compose

How to Run the Project if you don't have docker

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 3

Uh oh!

Languages

License

Data-Wrangling-and-Visualisation/StarWars

Folders and files

Latest commit

History

Repository files navigation

Data_Wrangling_Visualization_Project

Project Overview

Our key observations:

Checkpoint 1:

Checkpoint 2:

Tools used:

Repository structure:

In the data_wrangling folder you can find files created before checkpoint 1:

The img folder:

In the static folder there are:

Also in the main repository is the file app.py

Screenshots of the website

How to Build and Run the Project Using Docker Compose

How to Run the Project if you don't have docker

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 3

Uh oh!

Languages

Packages