You need to know if the data is normally distributed before doing any analysis on it.
Create histograms for every column and compare it to a normal distribution or bell curve. My code outputs this bell curve on the graph.
Method: The code assumes that you have already uploaded the data in Azure ML and imported into Python. More information about how to do this can be found in my tutorial about Descriptive Statistics per Column in Azure ML
It wil output one boxplot, and a histogram with a bell curve
To run:
get_ipython().run_line_magic('matplotlib', 'inline')
plotstats(<datasetname>, '<columnname>')
get_ipython().run_line_magic('matplotlib', 'inline')
plotstats(frame, 'PTrustb')
Note: "frame' is the name of the dataset, and the 'Ptrust' is the name of column.