Professional Documents
Culture Documents
The following links contain the datasets used throughout the module. The instructions to
download the same on your windows machine, Linux/Mac machine and the AWS EC2 instance
have been provided below.
● https://s3.amazonaws.com/pig-dataset/count-words.pig
● https://s3.amazonaws.com/pig-dataset/data-bag.txt
● https://s3.amazonaws.com/pig-dataset/dropbox-policy.txt
● https://s3.amazonaws.com/pig-dataset/u.data
● https://s3.amazonaws.com/pig-dataset/u.item
● https://s3.amazonaws.com/pig-dataset/products.csv
● https://s3.amazonaws.com/pig-dataset/discountCodes.props
● https://s3.amazonaws.com/pig-dataset/sales_code.csv
● https://s3.amazonaws.com/pig-dataset/sales.csv
In case you want to download the datasets on your windows machine, copy the above links
directly in the browser after which the file will directly start downloading.
wget https://s3.amazonaws.com/pig-dataset/data-bag.txt
For AWS EC2 instance:
Use wget command on EC2 terminal to download the files. (Note that wget package must be
installed in your machine. Use
For example,
wget https://s3.amazonaws.com/pig-dataset/data-bag.txt