
Book Depository Dataset EDA
kaggle dataset: Book Depository Dataset kaggle notebook: Introduction to Book Depository Dataset github repo: book-depository-dataset Book Depository Dataset EDA This notebook explores the Book Depository Dataset and extracts useful insights. The goal is to provide an introductory overview of the dataset. import pandas as pd import os import json from glob import glob import matplotlib.pyplot as plt import seaborn as sns % matplotlib inline Dataset Structure Files: categories.csv dataset.csv formats.csv places.csv The dataset consists of 5 file, the main dataset.csv file and some extra files. Extra files works as lookup tables for category, author, format and publication place. The reason behind this decision was to prevent data redundancy. ...