How to use the custom data analysis tool in the Engine?

This article outlines the steps to use the custom data analysis tool

Users can use the custom data analysis tool to:

  • View the distribution of all columns

  • Select a column to group the distributions of other columns

  • View the correlation between different columns using scatter plots

Step 1 - Go to the data analysis view page

  1. From the homepage, select the project with the dataset of interest

  2. Click the “Dataset” tab on the left navigation panel to see the list of all datasets in the project

  3. Select the dataset of interest from the list

  4. Click on the “Analysis” tab of the dataset page

HubSpot Video

Example: View the analysis page of “Pokemon” Dataset

 

Step 2 - Select a column to group by

In the “Analysis” tab of the dataset,

  1. Use the dropdown to select a column to group the data by

  2. Click “Run analysis” to start the analyzing process

HubSpot Video

Example: Run analysis with the group-by column “Legendary“

Step 3 - View analysis results

Select the analysis result from the list and view the analysis

Peek 2022-08-23 16-22-thumbExample: Distribution of columns grouped by “Legendary“

Step 4 - View scatter plots of up to 5 columns

  1. Click “Scatter plot” to enter the pair plot view

  2. Type the column name on the search bar

  3. To remove columns, click the “Close” icon on the tag of a column

Note: You can select up to 5 columns for the scatter plot view

HubSpot Video

Example: View the scatter plot, add the column “Total“ and remove the column “Type 1“