import pandas as pd
from pandas_profiling import ProfileReport
Automated EDA with Pandas Profiling
Learn how to use Pandas Profiling for automated EDA
Pandas profiling allows us to create a static report of the distributions of all columns in a dataframe, categorical or continuous. It also finds the correlations among these columns. I also deployed this feature as a Flask app a few months ago: https://data-analyzer-hpn4y2dvda-uc.a.run.app/
Import sample data
= pd.read_csv("https://raw.githubusercontent.com/dkedar7/Data-Analyzer/master/Analyzer/titanic.csv")
data_df = data_df.sample(100).reset_index(drop=True).astype(str) data_df
Create the report
= ProfileReport(data_df, title="Pandas Profiling Report") profile
profile
Explore deeper
= ProfileReport(data_df, title="Pandas Profiling Report", explorative=True)
profile profile
Export report
profile.to_widgets()
'report.html') profile.to_file(
Read more
[1] https://github.com/pandas-profiling/pandas-profiling