Mastodon
outlier-image

Semi-Supervised Fraud Detection

bdd-alt

Book Depository Dataset EDA

kaggle dataset: Book Depository Dataset kaggle notebook: Introduction to Book Depository Dataset github repo: book-depository-dataset Book Depository Dataset EDA Through this notebook we will try to become familiar Book Depository Dataset and extract some usefull insights. The goal of this notebook is to become an introductory step for the dataset. import pandas as pd import os import json from glob import glob import matplotlib.pyplot as plt import seaborn as sns % matplotlib inline Dataset Structure Files: categories.csv dataset.csv formats.csv places.csv The dataset consists of 5 file, the main dataset.csv file and some extra files. Extra files works as lookup tables for category, author, format and publication place. The reason behind this decision was to prevent data redundancy. ...