1)Have you ever used any software for statistical data
analysis? If so, which software have you used? Which statistical functions did you use in the software?
Yes, I have used Microsoft Excel for statistical data analysis.
Microsoft excel:, MS Excel does offer a wide variety of tools for data visualization and simple statistics. It’s simple to generate summary metrics and customizable graphics and figures, making it a usable tool for many who want to see the basics of their data. As many individuals and companies both own and know how to use Excel, it also makes it an accessible option for those looking to get started with statistics.
Following are the statistical functions which were used in Ms Excel:
There are a variety of functions available in Excel. Here are some of the most common functions :
● SUM: This function adds all of the values of the cells in
the argument. ● AVERAGE: This function determines the average of the values included in the argument. It calculates the sum of the cells and then divides that value by the number of cells in the argument. ● COUNT: This function counts the number of cells with numerical data in the argument. This function is useful for quickly counting items in a cell range. ● MAX: This function determines the highest cell value included in the argument. ● MIN: This function determines the lowest cell value included in the argument.
2)You have learnt basic workings of SPSS on day 6
and R software on day 8. Give any five major differences between these two software.
Basis for Comparison R SPSS
User Interface R has the less SPSS has more
interactive analytical interactive and tool but editors are user-friendly interface. available for providing SPSS displays data in a GUI support for spreadsheet-like programming in R. for fashion learning and practicing hands-on analytics R us best tool as it really helps the analyst to master the various analytics steps and commands.
Decision Making For decision trees, R For Decision trees, IBM
does not offer many SPSS is better than R algorithms and most of because R does not the packages of R can offer many tree only implement CART algorithms. For decision (Classification and trees, SPSS interface is Regression Tree) and very user-friendly and their interface is not as understandable. user-friendly.
Data Management A major drawback of R In terms of data is that most of its management, IBM SPSS functions have to load is more or less similar all the data into memory to R. it provides data before execution, which management functions set a limit on the such as sorting, volumes that can be aggregation, handled. transposition and for merging of the table.
Documentation In terms of While SPSS is lag
documentation R has behind in this feature. easily available explain SPSS lack this feature documentation files. R due to its limited use. community, however, is one of the strongest open source communities.
Platform R is written in C and SPSS graphical user
Fortran. R has stronger interface (GUI) is object-oriented written in Java. It uses programming facilities for interactive and than most statistical statistical Analysis computing languages. mainly.
Cost R is open source free IBM SPSS is not free if
software, where R someone wants to learn community is very fast SPSS then it has to use for software update trial version first. adding new libraries.
Visualizations R offer much more The graphical opportunities to capabilities of SPSS are customize and optimize purely functional graphs due to a wide although it is possible range of modules that to make minor changes are available. The most to the graph, to fully widely used module in R customize your graph is ggplot2. These and visualizations in graphs are also easily SPSS can be very made interactive, which cumbersome. allow users to play with data.