Exploring the Profile Report Generated

Overview

The Overview section has three tabs: Overview, Alerts, and Reproduction.
The Overview tab shows dataset statistics and variable types. The dataset statistics report the number of variables, duplicate rows, and missing values.

Overview
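To make these statistics concrete, here is a minimal sketch that computes comparable numbers with plain pandas. The DataFrame is a hypothetical toy dataset rather than the one used in this article, and the code only approximates what the report shows; it is not the library's own implementation.

```python
import pandas as pd

# Hypothetical toy data (assumption): not the dataset used in the article.
df = pd.DataFrame({
    "id": [1, 2, 3, 4, 4],
    "grade": ["A", "B", None, "C", "C"],
    "score": [90.0, None, 60.0, 82.0, 82.0],
})

n_variables = df.shape[1]                      # number of columns
n_observations = df.shape[0]                   # number of rows
n_missing_cells = int(df.isna().sum().sum())   # missing cells across all columns
n_duplicate_rows = int(df.duplicated().sum())  # rows that repeat an earlier row
missing_pct = 100 * n_missing_cells / df.size  # share of all cells that are missing

print("variables:", n_variables)
print("observations:", n_observations)
print("missing cells:", n_missing_cells, f"({missing_pct:.1f}%)")
print("duplicate rows:", n_duplicate_rows)
print(df.dtypes.value_counts())                # rough count of variable types
```

The last line gives a rough count of variable types based on pandas dtypes; the report uses its own type labels, so the names will differ.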

Next is the Alerts tab, which flags correlated variables and variables whose values are all unique. The dataset here is small, but for a larger dataset this tab can also flag missing values, skewed distributions, and similar issues.

Alerts
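The following is a rough sketch of how alert-like checks could be reproduced by hand with pandas. The toy data, the 0.9 correlation threshold, and the alert wording are all assumptions made for illustration; the report applies its own rules and thresholds.

```python
import pandas as pd

# Hypothetical toy data (assumption): two height columns that carry the same information.
df = pd.DataFrame({
    "id": [1, 2, 3, 4, 5, 6],
    "height_cm": [150, 160, 165, 170, 180, 190],
    "height_in": [59.1, 63.0, 65.0, 66.9, 70.9, 74.8],
    "grade": ["A", "B", "A", "C", "B", "A"],
})

CORR_THRESHOLD = 0.9  # assumed cut-off for this sketch, not the library's value

alerts = []

# Flag columns in which every value is distinct.
for col in df.columns:
    if df[col].is_unique:
        alerts.append((col, "unique"))

# Flag pairs of numeric columns with a strong absolute Pearson correlation.
corr = df.select_dtypes("number").corr().abs()
cols = list(corr.columns)
for i, a in enumerate(cols):
    for b in cols[i + 1:]:
        if corr.loc[a, b] > CORR_THRESHOLD:
            alerts.append((a, f"high correlation with {b}"))

for column, message in alerts:
    print(f"{column}: {message}")
```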

The Reproduction tab records the start and end time of the report generation, along with the duration, software version, and similar details. The image below shows an example.

Reproduction

Variables

This section describes each variable: its type, the number of distinct and missing values, and the memory it occupies. The two examples below show id, which is a real number, and grade, which is categorical.

ID variable

Grade Variable
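As an illustration, the per-variable figures described above can be approximated with pandas alone. The data below is hypothetical, and the column names `id` and `grade` are borrowed from the article only as labels.

```python
import pandas as pd

# Hypothetical toy data (assumption): a numeric id and a categorical grade.
df = pd.DataFrame({
    "id": [1, 2, 3, 4, 5],
    "grade": ["A", "B", "A", None, "C"],
})

summary = pd.DataFrame({
    "dtype": df.dtypes.astype(str),            # pandas dtype of each column
    "distinct": df.nunique(),                  # distinct values, missing excluded
    "missing": df.isna().sum(),                # count of missing values
    "memory_bytes": df.memory_usage(index=False, deep=True),  # per-column memory
})

print(summary)
```

Passing `deep=True` asks pandas to measure the actual size of string values, which usually gives a more realistic figure for categorical text columns.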

Correlations

Correlation is a statistical measure of the relationship between two variables. This section of the report shows how the variables in the dataset correlate with one another.

Correlation
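A correlation matrix of this kind can be sketched with `DataFrame.corr` in pandas. The columns below are hypothetical, and Pearson and Spearman are chosen here only as two common coefficients, not as a statement of what the report computes.

```python
import pandas as pd

# Hypothetical toy data (assumption): study hours, exam score, and absences.
df = pd.DataFrame({
    "hours": [1, 2, 3, 4, 5],
    "score": [52, 58, 65, 70, 79],
    "absences": [9, 7, 6, 4, 1],
})

# Pearson measures linear association between each pair of columns.
pearson = df.corr(method="pearson")

# Spearman measures monotonic association, using ranks instead of raw values.
spearman = df.corr(method="spearman")

print(pearson.round(3))
print(spearman.round(3))
```

Values close to 1 or -1 indicate a strong positive or negative relationship, and values near 0 indicate little or no relationship of the kind that the coefficient measures.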

Missing Values

The profile report also visualizes the missing values in the data using a bar plot.

Missing Values
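The numbers behind such a bar plot can be computed directly, as in the sketch below. The data is hypothetical, and the choice to report both present and missing counts is an assumption made for illustration.

```python
import pandas as pd

# Hypothetical toy data (assumption) with a few missing entries.
df = pd.DataFrame({
    "id": [1, 2, 3, 4],
    "grade": ["A", None, "B", None],
    "score": [88.0, 92.5, None, 70.0],
})

present_per_column = df.notna().sum()  # non-missing values per column
missing_per_column = df.isna().sum()   # missing values per column
missing_share = df.isna().mean()       # fraction of each column that is missing

print(pd.DataFrame({
    "present": present_per_column,
    "missing": missing_per_column,
    "missing_share": missing_share,
}))
```

Calling `present_per_column.plot.bar()` would draw a comparable chart, provided matplotlib is installed.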

Sample

This displays the first and last 10 rows of the dataset.

Sample dataset
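The same first and last rows can be pulled out with `head` and `tail` in pandas. The 25-row DataFrame below is a hypothetical stand-in for the real dataset.

```python
import pandas as pd

# Hypothetical toy data (assumption): 25 rows with a running id.
df = pd.DataFrame({
    "id": range(25),
    "value": [i * 2 for i in range(25)],
})

first_rows = df.head(10)  # first 10 rows of the dataset
last_rows = df.tail(10)   # last 10 rows of the dataset

print(first_rows)
print(last_rows)
```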

Pandas Profiling in Python

Pandas is a large library that offers many functions for understanding data, but exploring a dataset with numerous features one function at a time is tedious. Pandas Profiling addresses this by generating a comprehensive report for the dataset. These reports can be customized to specific requirements. In this article, we will explore the library's functionality, covering:

  • Installation of Pandas Profiling
  • Importing Pandas Profiling
  • Generating Profile Report
  • Exploring Profile Report Generated
    • Overview
    • Variables
    • Correlations
    • Missing Values
    • Sample
  • Saving the Profile Report

Installation of Pandas Profiling

Pandas Profiling can be easily installed using the following command...

Importing Pandas Profiling

    # importing packages
    import pandas as pd
    from pandas_profiling import ProfileReport
    ...

Generating Profile Report

...

Exploring the Profile Report Generated

...

Saving the Profile Report

To generate the profile report, we simply use ProfileReport from pandas_profiling and pass the DataFrame as input....