Missing Data and Outliers Analysis
1. Describe missing data, provide summary of missing data, similar to the analysis in the Chapter 2 (table 3): Count of missing data/percent per variable, type of missing data (NA, null), total percent of missingness per dataset [ – 20pts]
· If you find that there is no missing data, you still have to report your findings. Your reader does not know your data and you have to show how reliable your data is
2. Plot visualization of missing data pattern [ – 20pts]
· If you do not have missing data, you still need to plot it to show it to your reader
3. Describe what type of missing data you have observed: MCAR, MAR, MNAR or no missing data [- 10pts]
· If you do not have missing data, simply state that you did not observe any missing values
4. Select Imputation method that you will be performing and explain why [20pts]:
· list-wise/pair-wise deletions, mean imputation, regression imputation etc. [ – 10pts]
· perform imputation and provide data statistics. For example, if you perform list-wise deletion, how many observations will you use for consequent analysis. If you perform regression imputation, provide statistic summary [-10pts]
· If you do not have any missing data: instead of imputation, perform data normalization, describe which methods you will use, why and perform the normalization. Provide statistical summary of data after the normalization
5. Outlier analysis:
Option 1.
· Continuous Y:
· You will perform outlier detection using Z-Score, a parametric outlier detection method in one dimensional space. [30pts]
· Z-Score Formula:
· If the data points > |3|, these are extreme values (outliers):
· You can use z from the library (outliers) or build your own function:
Measure – mean(Measure)) / sd(Measure)
· You can test for outliers only for your dependent variable Y (continuous). How many outliers do you have?
Option 2.
· See Chapter 13.3 Checking for Outliers. http://daviddalpiaz.github.io/appliedstats/model-diagnostics.html
Delivering a high-quality product at a reasonable price is not enough anymore.
That’s why we have developed 5 beneficial guarantees that will make your experience with our service enjoyable, easy, and safe.
You have to be 100% sure of the quality of your product to give a money-back guarantee. This describes us perfectly. Make sure that this guarantee is totally transparent.
Read moreEach paper is composed from scratch, according to your instructions. It is then checked by our plagiarism-detection software. There is no gap where plagiarism could squeeze in.
Read moreThanks to our free revisions, there is no way for you to be unsatisfied. We will work on your paper until you are completely happy with the result.
Read moreYour email is safe, as we store it according to international data protection rules. Your bank details are secure, as we use only reliable payment systems.
Read moreBy sending us your money, you buy the service we provide. Check out our terms and conditions if you prefer business talks to be laid out in official language.
Read more