You are on page 1of 1

Instruction:

When you start the assignment you may notice that simply running na.omit on the whole data set
removes quite a lot of interesting data. na.omit works by ignoring the whole row IF it has an NA
somewhere in the row.

This isn't great for the Pokemon data set. It is ok in the plants data set though because lots of the
rows with NA have lots of NA values.

You'll have to make a choice about how you handle this in the assignment.

You can choose to:

1. Run with na.omit and explain why you did this

2. Do some data reduction and create a new Pokemon data file with the rows you want to use
then run na.omit

3. Replace the NA values with NULL

4. use na.rm when doing calculations.

Remember - you can use Excel to do data cleaning too - just explain what you did.

You might also like