Professional Documents
Culture Documents
5
Name:Shubham Acharekar
Roll No: 01
Subject: SL-VI
Title: Perform the following operations using R/Python on the Amazon book review and facebook
metrics data sets
1) Create data subsets 2) Merge Data
3) Sort Data 4) Transposing Data
5) Melting Data to long format 6) Casting data to wide format
________________________________________________________________________________
>getwd()
>setwd("/cloud/project/software lab")
>d=read.csv("fb.csv")
>dim(d)
500 19
>nrow(d)
500
>head(d)
>sub1=subset(sub,comment>50)
>sub1
Category comment like share
4 2 58 1572 147
143 2 60 859 90
169 1 144 1622 208
229 2 64 367 25
245 2 372 5172 790
289 1 103 469 33
380 3 51 1998 128
461 3 146 1546 181
481 2 56 360 99
>write.csv(sub,"facebook-sub.csv")
>dataA=read.csv("fb.csv")
>dataB=read.csv("newfb.csv")
>dim(dataA)
500 19
>dim(dataB)
134 19
>newAB=rbind(dataA,dataB)
>dim(newAB)
634 19
>new11=rbind(sub,sub1)
>dim(new11)
509 4
# Sorting the Data
>x= sub[order(-d$share),]
>head(x)
Category comment like share
245 2 372 5172 790
169 1 144 1622 208
461 3 146 1546 181
4 2 58 1572 147
106 1 42 955 139
380 3 51 1998 128
>y=sort(d$share,decreasing = TRUE)
>head(y)
790 208 181 147 139 128
>y
[1] 790 208 181 147 139 128 123 122 121 109 102 99 98 98 97 95 90 90 90 84 83 80
79 78 77 76 76 74
[29] 72 70 70 68 64 63 61 61 60 60 58 58 58 57 57 55 55 54 54 54 53
53 52 51 51 50 49 49
[57] 49 47 47 47 47 47 47 46 45 44 44 44 44 44 44 43 43 43 43 42 42
42 42 41 41 41 41 41
[85] 40 40 40 40 40 40 39 39 39 39 38 38 38 38 38 38 37 37 36 36 36
36 36 36 36 36 36 35
[113] 35 35 34 34 34 34 34 33 33 33 33 33 32 32 32 32 32 32 32 32 32
32 31 31 31 31 31 31
[141] 31 31 30 30 30 30 30 29 29 29 29 29 29 29 28 28 28 28 28 28 28
28 28 28 28 28 27 27
[169] 27 27 27 27 27 26 26 26 26 26 26 26 26 26 26 26 26 26 26 26 26
25 25 25 25 25 25 25
[197] 25 24 24 24 24 24 24 24 23 23 23 23 23 23 22 22 22 22 22 22 22
22 22 21 21 21 21 21
[225] 21 21 21 21 21 21 20 20 20 20 20 20 20 20 20 19 19 19 19 19 19
19 19 19 19 19 19 18
[253] 18 18 18 18 18 18 18 18 18 18 18 18 18 17 17 17 17 17 17 17 17
17 17 17 17 17 17 17
[281] 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 15 15 15 15 15 15
15 15 15 15 15 15 15
[309] 14 14 14 14 14 14 14 14 14 14 14 14 14 14 14 14 14 14 14 14 13
13 13 13 13 13 13 13
[337] 13 13 13 13 13 13 13 13 13 13 13 13 12 12 12 12 12 12 11 11 11
11 11 11 11 11 11 11
[365] 11 11 11 11 11 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10
9 9 9 9 9 9 9
[393] 9 9 9 9 9 8 8 8 8 8 8 8 8 8 8 8 7 7 7 7 7
7 7 7 7 7 7 7
[421] 7 7 6 6 6 6 6 6 6 6 6 6 5 5 5 5 5 5 5 5 5
4 4 4 4 4 4 4
[449] 3 3 3 3 3 3 3 3 3 3 2 2 2 2 2 2 2 2 2 2 2
2 2 2 1 1 1 1
[477] 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0
# Transpose of data
>tran=t(sub)
>head(tran)
[,1] [,2] [,3] [,4] [,5] [,6] [,7] [,8] [,9] [,10] [,11] [,12] [,13] [,14] [,15] [,16]
[,17] [,18] [,19] [,20]
Category 2 2 3 2 2 2 3 3 2 3 2 2 2 2
2 2 3 1 3 3
comment 4 5 0 58 19 1 3 0 0 3 0 0 0 5
2 4 2 15 4 0
[,21] [,22] [,23] [,24] [,25] [,26] [,27] [,28] [,29] [,30] [,31] [,32]
[,33] [,34] [,35] [,36] [,37] [,38]
Category 2 1 1 3 2 2 2 3 2 1 2 2 3
3 1 2 3 1
comment 3 0 0 0 3 0 10 0 36 18 33 1 2
4 2 6 0 16
[,39] [,40] [,41] [,42] [,43] [,44] [,45] [,46] [,47] [,48] [,49] [,50]
[,51] [,52] [,53] [,54] [,55] [,56]
Category 2 1 2 1 1 1 1 1 1 1 1 1 2
1 1 1 1 1
comment 11 1 7 6 7 7 0 4 4 6 0 1 1
24 9 4 4 2
[,57] [,58] [,59] [,60] [,61] [,62] [,63] [,64] [,65] [,66] [,67] [,68]
[,69] [,70] [,71] [,72] [,73] [,74]
Category 1 1 1 1 1 1 1 1 1 1 1 1 1
1 1 1 3 1
comment 2 0 3 4 8 8 10 4 2 19 0 20 0
7 7 17 3 14
[,75] [,76] [,77] [,78] [,79] [,80] [,81] [,82] [,83] [,84] [,85] [,86]
[,87] [,88] [,89] [,90] [,91] [,92]
Category 1 1 1 1 2 1 2 3 1 3 3 1 1
3 1 1 1 2
comment 2 20 0 4 0 2 2 18 5 2 2 10 2
3 3 2 13 0
[,93] [,94] [,95] [,96] [,97] [,98] [,99] [,100] [,101] [,102] [,103] [,104]
[,105] [,106] [,107] [,108]
Category 3 3 2 3 1 2 2 1 1 2 3 1
1 1 3 2
comment 9 2 2 5 3 1 7 12 0 26 2 0
4 42 9 17
[,109] [,110] [,111] [,112] [,113] [,114] [,115] [,116] [,117] [,118] [,119]
[,120] [,121] [,122] [,123]
Category 3 2 1 1 2 2 1 1 1 1 1
1 1 2 1
comment 7 4 2 0 4 2 0 0 1 0 0
0 0 6 1
[,124] [,125] [,126] [,127] [,128] [,129] [,130] [,131] [,132] [,133] [,134]
[,135] [,136] [,137] [,138]
Category 1 1 1 3 1 1 1 1 1 1 1
1 1 1 1
comment 1 0 0 3 0 0 0 0 0 1 0
2 0 0 4
[,139] [,140] [,141] [,142] [,143] [,144] [,145] [,146] [,147] [,148] [,149]
[,150] [,151] [,152] [,153]
Category 2 1 1 3 2 3 1 1 3 3 2
1 1 2 2
comment 0 4 15 2 60 10 3 6 0 0 3
0 24 2 47
[,154] [,155] [,156] [,157] [,158] [,159] [,160] [,161] [,162] [,163] [,164]
[,165] [,166] [,167] [,168]
Category 1 2 2 2 3 2 2 3 2 3 2
1 2 1 3
comment 7 13 0 16 1 30 1 6 6 22 4
0 0 2 8
[,169] [,170] [,171] [,172] [,173] [,174] [,175] [,176] [,177] [,178] [,179]
[,180] [,181] [,182] [,183]
Category 1 2 3 1 2 1 2 2 3 1 2
2 2 1 3
comment 144 6 3 2 38 6 5 2 29 2 7
4 20 4 1
[,184] [,185] [,186] [,187] [,188] [,189] [,190] [,191] [,192] [,193] [,194]
[,195] [,196] [,197] [,198]
Category 1 2 1 2 2 3 2 3 3 2 2
1 2 1 2
comment 6 11 0 5 0 1 3 2 9 0 3
6 9 2 1
[,199] [,200] [,201] [,202] [,203] [,204] [,205] [,206] [,207] [,208] [,209]
[,210] [,211] [,212] [,213]
Category 1 3 2 1 2 3 3 1 1 2 3
2 3 3 2
comment 0 33 2 0 2 4 2 0 4 5 1
2 6 2 3
[,214] [,215] [,216] [,217] [,218] [,219] [,220] [,221] [,222] [,223] [,224]
[,225] [,226] [,227] [,228]
Category 3 3 1 2 2 1 3 2 3 2 2
1 2 3 1
comment 1 4 2 2 41 3 16 2 6 1 18
2 9 3 6
[,229] [,230] [,231] [,232] [,233] [,234] [,235] [,236] [,237] [,238] [,239]
[,240] [,241] [,242] [,243]
Category 2 2 3 3 2 1 2 2 3 2 1
2 3 1 2
comment 64 9 2 0 6 1 1 3 12 4 2
10 6 4 9
[,244] [,245] [,246] [,247] [,248] [,249] [,250] [,251] [,252] [,253] [,254]
[,255] [,256] [,257] [,258]
Category 1 2 1 2 2 3 2 1 1 3 2
3 2 1 1
comment 18 372 4 0 0 1 1 4 6 6 11
10 30 2 4
[,259] [,260] [,261] [,262] [,263] [,264] [,265] [,266] [,267] [,268] [,269]
[,270] [,271] [,272] [,273]
Category 2 3 2 1 2 2 3 3 1 2 1
1 2 1 1
comment 10 9 0 13 2 0 7 3 2 13 26
2 6 7 22
[,274] [,275] [,276] [,277] [,278] [,279] [,280] [,281] [,282] [,283] [,284]
[,285] [,286] [,287] [,288]
Category 3 2 2 3 1 1 1 2 2 2 1
1 2 1 1
comment 1 18 36 0 23 11 7 1 1 0 1
11 4 0 14
[,289] [,290] [,291] [,292] [,293] [,294] [,295] [,296] [,297] [,298] [,299]
[,300] [,301] [,302] [,303]
Category 1 1 1 2 1 2 1 1 3 3 2
1 2 3 1
comment 103 5 0 5 0 1 0 5 5 0 0
0 11 1 9
[,304] [,305] [,306] [,307] [,308] [,309] [,310] [,311] [,312] [,313] [,314]
[,315] [,316] [,317] [,318]
Category 3 1 3 2 3 3 3 1 3 3 3
1 3 3 2
comment 4 0 2 12 4 3 8 0 2 2 3
0 9 0 2
[,319] [,320] [,321] [,322] [,323] [,324] [,325] [,326] [,327] [,328] [,329]
[,330] [,331] [,332] [,333]
Category 3 2 1 3 3 3 3 3 2 3 3
3 3 3 3
comment 2 18 0 2 2 20 2 2 1 3 16
7 20 1 0
[,334] [,335] [,336] [,337] [,338] [,339] [,340] [,341] [,342] [,343] [,344]
[,345] [,346] [,347] [,348]
Category 3 2 2 3 3 3 1 3 3 2 3
1 3 3 3
comment 0 2 1 2 4 25 25 6 2 5 6
3 3 37 1
[,349] [,350] [,351] [,352] [,353] [,354] [,355] [,356] [,357] [,358] [,359]
[,360] [,361] [,362] [,363]
Category 3 3 2 3 1 3 2 1 3 2 2
3 1 3 2
comment 12 45 3 4 6 4 2 12 3 2 1
25 3 0 1
[,364] [,365] [,366] [,367] [,368] [,369] [,370] [,371] [,372] [,373] [,374]
[,375] [,376] [,377] [,378]
Category 3 1 3 3 3 1 2 1 2 1 1
1 3 3 3
comment 1 9 1 4 12 4 2 45 17 7 2
0 0 0 8
[,379] [,380] [,381] [,382] [,383] [,384] [,385] [,386] [,387] [,388] [,389]
[,390] [,391] [,392] [,393]
Category 1 3 1 3 1 3 1 3 1 1 1
1 3 2 1
comment 2 51 8 11 1 2 3 5 0 0 0
4 0 6 1
[,394] [,395] [,396] [,397] [,398] [,399] [,400] [,401] [,402] [,403] [,404]
[,405] [,406] [,407] [,408]
Category 3 3 1 3 1 3 3 3 1 1 1
3 3 3 3
comment 1 3 1 1 5 1 9 5 9 4 1
1 1 2 2
[,409] [,410] [,411] [,412] [,413] [,414] [,415] [,416] [,417] [,418] [,419]
[,420] [,421] [,422] [,423]
Category 1 2 2 3 1 1 1 1 1 1 1
1 1 1 1
comment 1 4 7 1 1 2 3 7 3 0 1
0 0 0 0
[,424] [,425] [,426] [,427] [,428] [,429] [,430] [,431] [,432] [,433] [,434]
[,435] [,436] [,437] [,438]
Category 1 1 3 1 1 1 1 1 1 1 1
1 1 1 1
comment 8 0 2 0 4 0 0 0 0 0 12
1 11 16 0
[,439] [,440] [,441] [,442] [,443] [,444] [,445] [,446] [,447] [,448] [,449]
[,450] [,451] [,452] [,453]
Category 2 1 1 1 3 3 2 1 1 1 1
1 1 1 1
comment 2 1 1 0 10 1 0 1 10 6 2
6 3 0 4
[,454] [,455] [,456] [,457] [,458] [,459] [,460] [,461] [,462] [,463] [,464]
[,465] [,466] [,467] [,468]
Category 3 3 3 1 1 3 2 3 1 3 3
1 3 2 2
comment 3 14 5 1 6 5 2 146 5 19 8
9 4 3 1
[,469] [,470] [,471] [,472] [,473] [,474] [,475] [,476] [,477] [,478] [,479]
[,480] [,481] [,482] [,483]
Category 1 3 1 3 1 3 1 3 1 1 3
3 2 1 3
comment 5 10 4 7 0 4 0 0 7 1 1
7 56 0 3
[,484] [,485] [,486] [,487] [,488] [,489] [,490] [,491] [,492] [,493] [,494]
[,495] [,496] [,497] [,498]
Category 2 3 1 3 3 3 3 3 3 1 3
3 3 2 1
comment 2 2 0 2 1 21 1 1 1 0 17
10 5 0 4
[,499] [,500]
Category 3 2
comment 7 0
[ reached getOption("max.print") -- omitted 2 rows ]
>head(molten.sub)
Category variable value
1 2 comment 4
2 2 comment 5
3 3 comment 0
4 2 comment 58
5 2 comment 19
6 2 comment 1
Roll No: 01