DR HJH MADIHAH KHALID
MATHEMATICS AS A LANGUAGE
Mathematicians need to be clear and concise when they communicate. The language of mathematics is better at communicating quantitative information than day-to-day language. How best do I communicate my work? The answer is to use a combination of written phonetic words, graphical representation of information, and certain symbolic conventions of mathematics. The challenge for the mathematician is not simply to think up harder and harder proofs, but to find ways to communicate information.
WHAT IS STATISTICS?
What do you think of when you hear the word statistics? Think of a general question that could be answered with statistics. How would you carry out the process in order to answer your question? Be as specific as possible. What is a random event? Give an example of something that happens randomly and something that does not.
HISTORY OF STATISTICS
The history of statistics can be said to start around 1749. Over time, there have been changes to the interpretation of what the word statistics means. In early times, the meaning was restricted to information about states. This was later extended to include all collections of information of all types, and later still it was extended to include the analysis and interpretation of such data. In modern terms, "statistics" means both sets of collected information, as in income distribution and temperature records, and analytical work which requires statistical inference.
4. Medical Studies: Scientists must show a statistically valid rate of effectiveness before any drug can be prescribed. Statistics are behind every medical study you hear about.
5. Political Campaigns: Whenever there is an election, news organizations consult their models to try to predict who the winner will be. Candidates consult voter polls to determine where and how they campaign. Statistics play a part in who your elected government officials will be.
6. Insurance: In order to drive your car, you are required by law to have car insurance. If you have a mortgage on your house, you must have it insured as well. The rate that an insurance company charges you is based upon statistics from all drivers or homeowners in your area.
7. Stock Market: Another topic that you hear a lot about in the news is the stock market. Stock analysts also use statistical computer models to forecast what is happening in the economy.
Note: Think about where YOU encounter statistics in YOUR life.
EXAMPLE
Ask a question: Are men typically taller than women? Do men typically have longer arm spans than women?
a. Examine the 24 measurements for height and arm span. You'll notice that they are not all the same. What is the source of this variation? Can you explain why there are differences?
b. Suppose your goal was to prove that men are typically taller than women. Do these data prove that conclusion? Why or why not? Talk about error and bias. What can you do to reduce these? Would sampling help?
STATISTICAL REASONING
Statistical reasoning is the way people reason with statistical ideas and make sense of statistical information.
All students should be able to do the following:
- Formulate questions that can be addressed with data, and collect, organize, and display relevant data to answer them
- Select and use appropriate statistical methods to analyze data
- Develop and evaluate inferences and predictions that are based on data
- Understand and apply basic concepts of probability
For data analysis and statistics, students are expected to do the following:
- Formulate questions, design studies, and collect data about a characteristic shared by two populations or different characteristics within one population
- Select, create, and use appropriate graphical representations of data
- Find, use, and interpret measures of center and spread, including mean and interquartile range
- Discuss and understand the correspondence between data sets and their graphical representations, especially histograms, stem and leaf plots, box plots, and scatter plots
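The measures of center and spread named above can be illustrated with a short sketch. It uses only Python's standard `statistics` module; the height values are hypothetical, made up purely for illustration, and the "inclusive" quartile method is one of several conventions in use.

```python
import statistics

# Hypothetical heights in centimetres, invented for illustration only.
heights_cm = [150, 155, 160, 162, 165, 168, 170, 172, 175, 180, 185, 190]

# Measures of center.
mean_height = statistics.mean(heights_cm)
median_height = statistics.median(heights_cm)

# Measure of spread: quantiles(n=4) returns the three quartile cut points.
# The "inclusive" method treats the data as the whole population of interest.
q1, q2, q3 = statistics.quantiles(heights_cm, n=4, method="inclusive")
iqr = q3 - q1  # interquartile range: the width of the middle half of the data

print(f"mean={mean_height:.2f} median={median_height} IQR={iqr}")
```

The mean is pulled toward extreme values, while the median and interquartile range are not, which is one reason to report more than one measure.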
PRE-TEST
Consider topics of interest to you that involve collecting data about a characteristic shared by two populations. Formulate five questions that involve collecting qualitative (categorical) data and five questions that involve collecting quantitative (numerical) data. For each question, identify the type of data that will be collected and an appropriate way to display the data (e.g., line graph, bar graph, histogram, pie chart, stem and leaf plot, box plot).
PRESENTATION OF DATA
Statistical representation is the science and art of using data to describe the world around us. There are numerous ways of constructing statistical representations. The proper representation depends upon the nature of the data and the particular issues being addressed. A combination of methods is often appropriate, e.g., tables, charts and graphs. Statistical representations include pictograms, bar graphs, line graphs, pie charts, histograms and box plots.
Determine the message(s) to be transmitted. Ask yourself the following questions to figure out what your message is and why it is important:
- What do the data show?
- Is there more than one main message?
- What aspect of the message(s) should be highlighted?
- Can all of the message(s) be displayed on the same graphic?
Determine the nature of the message(s). Consider the following instructions and their appropriate terms when labelling the graph or describing features of it in accompanying text:
If your graph will... / Use the following terms...
- describe components: share of, percent of the, smallest, the majority of
- compare items: ranking, larger than, smaller than, equal to
- establish a time series: change, rise, growth, increase, decrease, decline, fluctuation
- determine a frequency: range, concentration, most of, distribution of x and y by age
- analyse relationships in data: increase with, decrease with, vary with, despite, correspond to, relate to
- do any combination of the above actions: e.g., 'percentage of dropouts among the 15 to 24 age group has increased because of....'
Experiment with different types of graphs and select the most appropriate.
- pie chart (description of components)
- horizontal bar graph (comparison of items and relationships, time series)
- vertical bar graph (comparison of items and relationships, time series, frequency distribution)
- line graph (time series and frequency distribution)
- scatterplot (analysis of relationships)
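The pairing of graph types with kinds of message listed above can be written down as a small lookup, which may help when experimenting. This is only a sketch: the dictionary copies the list above, and the name `suggest_charts` is hypothetical.

```python
# Graph type -> kinds of message it suits, copied from the list above.
CHART_USES = {
    "pie chart": {"description of components"},
    "horizontal bar graph": {"comparison of items and relationships", "time series"},
    "vertical bar graph": {
        "comparison of items and relationships",
        "time series",
        "frequency distribution",
    },
    "line graph": {"time series", "frequency distribution"},
    "scatterplot": {"analysis of relationships"},
}


def suggest_charts(message_type):
    """Return the graph types listed for this kind of message, in alphabetical order."""
    return sorted(chart for chart, uses in CHART_USES.items() if message_type in uses)


print(suggest_charts("time series"))
```

A lookup like this only narrows the candidates; choosing among them still depends on the data and the audience.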
Pictographs
A pictograph uses picture symbols to convey the meaning of statistical information. Pictographs should be used carefully because the graphs may, either accidentally or deliberately, misrepresent the data. This is why a graph should be visually accurate.
Pie charts
A pie chart is a way of summarizing a set of categorical data or displaying the different values of a given variable (e.g., percentage distribution). This type of chart is a circle divided into a series of segments. Each segment represents a particular category. The area of each segment is the same proportion of the circle as the category is of the total data set. Pie charts usually show the component parts of a whole. Often you will see a segment of the drawing separated from the rest of the pie in order to emphasize an important piece of information.
Figure 1. Student and faculty response to the poll 'Should Avenue High School adopt student uniforms?'
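Because each pie segment takes the same share of the circle as its category takes of the total, the segment angles follow directly from the counts. The sketch below shows that arithmetic; the response counts are hypothetical and are not the data behind Figure 1, and `pie_segments` is a name chosen here for illustration.

```python
def pie_segments(counts):
    """Map each category to (percent of total, angle in degrees) for a pie chart."""
    total = sum(counts.values())
    if total <= 0:
        raise ValueError("a pie chart needs a positive total")
    return {
        category: (count / total * 100, count / total * 360)
        for category, count in counts.items()
    }


# Hypothetical poll responses, invented for illustration only.
responses = {"Yes": 45, "No": 30, "Undecided": 15}
segments = pie_segments(responses)

for category, (percent, angle) in segments.items():
    print(f"{category}: {percent:.1f}% of responses, {angle:.0f} degrees")
```

The angles always sum to 360 degrees, which is a quick check that no category has been left out.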
Bar Chart
A bar graph may be either horizontal or vertical. The important point to note about bar graphs is their bar length or height: the greater their length or height, the greater their value. Bar graphs are one of the many techniques used to present data in a visual form so that the reader may readily recognize patterns or trends. Bar graphs usually present categorical and numeric variables grouped in class intervals. They consist of an axis and a series of labeled horizontal or vertical bars. The bars depict frequencies of different values of a variable or simply the different values themselves. The numbers on the x-axis of a bar graph or the y-axis of a column graph are called the scale.
When developing bar graphs, draw a vertical or horizontal bar for each category or value. The height or length of the bar will represent the number of units or observations in that category (frequency) or simply the value of the variable. Select an arbitrary but consistent width for each bar as well.
There are three types of graphs used to display time series data: horizontal bar graphs, vertical bar graphs and line graphs. All three of these types of graphs work well when you need to compare values. However, in general, data comparisons are best represented vertically.
Number of students at Diversity College who are immigrants, by last country of permanent residence
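The rule that bar length represents frequency can be sketched with a plain-text horizontal bar graph. The observations below are hypothetical, and `text_bar_chart` is an illustrative helper rather than a standard function.

```python
from collections import Counter


def text_bar_chart(observations, symbol="#"):
    """Draw one horizontal bar per category; bar length equals the category's frequency."""
    counts = Counter(observations)
    if not counts:
        return ""
    label_width = max(len(str(label)) for label in counts)
    lines = []
    # Longest bars first; ties are broken alphabetically so the output is stable.
    for label, frequency in sorted(counts.items(), key=lambda item: (-item[1], str(item[0]))):
        lines.append(f"{str(label):<{label_width}} | {symbol * frequency} {frequency}")
    return "\n".join(lines)


# Hypothetical survey answers, invented for illustration only.
snacks = ["chips", "fruit", "chips", "popcorn", "fruit", "chips"]
print(text_bar_chart(snacks))
```

Every bar uses the same symbol and the same row height, which mirrors the advice to keep bar width arbitrary but consistent.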
Example: Comparing several places or items
Figure 5 is an example of a double horizontal bar graph. Hillary sampled an equal number of boys and girls at her high school and asked them to pick the one snack food they liked the most from the following list: popcorn, chips, chocolate bars, crackers, pretzels, cookies, ice cream, fruit, candy, vegetables. She created a graph to display the results of her survey. Examine Figure 5, and answer the following questions:
- What comparison does this graph show?
- Which snack food was least preferred by girls?
- Which snack food was preferred by substantially more boys than girls?
- Which snack foods were preferred by more girls than boys?
- Which snack food was preferred equally by both boys and girls?
Example: Inappropriate use of bar graphs
Vertical bar graphs are an excellent choice to emphasize a change in magnitude. The best information for a vertical bar graph is data dealing with the description of components, frequency distribution and time-series statistics. A horizontal bar graph may be more effective than a line graph when there are fewer time periods or segments of data. If you want to compare more than 9 or 10 items, use a line graph instead. Figure 6 is an example of when a line graph should be used instead of a horizontal bar graph.
Stacked Bar Chart
Campbell High Triathlon, percentage of time spent on each event, by competitor
Smoking frequency of [age missing]-year-olds on the [P]arkview Secondary School track and field team
Line graphs
Line graphs are very popular because their visual characteristics reveal data trends clearly and these graphs are easy to create. Line graphs, especially useful in the fields of statistics and science, are one of the most common tools used to present data.
A line graph is a visual comparison of how two variables, shown on the x- and y-axes, are related or vary with each other. It shows related information by drawing a continuous line between all the points on a grid. Line graphs compare two variables: one is plotted along the x-axis (horizontal) and the other along the y-axis (vertical). The y-axis in a line graph usually indicates quantity (e.g., dollars, litres) or percentage, while the horizontal x-axis often measures units of time. As a result, the line graph is often viewed as a time series graph. For example, if you wanted to graph the height of a baseball pitch over time, you could measure the time variable along the x-axis, and the height along the y-axis.
Although they do not present specific data as well as tables do, line graphs are able to show relationships more clearly than tables do. Line graphs can also depict multiple series, and they are usually the best candidate for time series data and frequency distributions.
Bar and column graphs and line graphs share a similar purpose. The column graph, however, reveals a change in magnitude, whereas the line graph is used to show a change in direction.
In summary, line graphs
- show specific values of data well
- reveal trends and relationships between data
- compare trends in different groups of a variable
Graphs can give a distorted image of the data. If inconsistent scales on the axes of a line graph force data to appear in a certain way, then a graph can even reveal a trend that is entirely different from the one intended.
This means that the intervals between adjacent points along the axis may be dissimilar, or that the same data charted in two graphs using different scales will appear different.
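The effect of scale on appearance can be made concrete with a little arithmetic: the same change in the data fills a very different share of the plot depending on the range chosen for the y-axis. The sketch below assumes hypothetical values, and `visual_rise` is a name introduced here for illustration.

```python
def visual_rise(values, axis_min, axis_max):
    """Fraction of the plot height spanned by the data on a y-axis from axis_min to axis_max."""
    if axis_max <= axis_min:
        raise ValueError("axis_max must be greater than axis_min")
    return (max(values) - min(values)) / (axis_max - axis_min)


# Hypothetical yearly values, invented for illustration only.
sales = [100, 102, 104, 106]

full_axis = visual_rise(sales, 0, 120)      # axis starts at zero
cropped_axis = visual_rise(sales, 99, 107)  # axis cropped tightly around the data

print(f"axis from 0 to 120: the change fills {full_axis:.0%} of the height")
print(f"axis from 99 to 107: the change fills {cropped_axis:.0%} of the height")
```

The data are identical in both cases; only the axis range changes, yet the second graph would suggest a far steeper rise.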
Histograms
The histogram is a popular graphing tool. It is used to summarize discrete or continuous data that are measured on an interval scale. It is often used to illustrate the major features of the distribution of the data in a convenient form. A histogram divides up the range of possible values in a data set into classes or groups. For each group, a rectangle is constructed with a base length equal to the range of values in that specific group, and an area proportional to the number of observations falling into that group. This means that the rectangles will generally be of non-uniform height.
A histogram has an appearance similar to a vertical bar graph, but when the variables are continuous, there are no gaps between the bars. When the variables are discrete, however, gaps should be left between the bars. Figure 1 is a good example of a histogram.
A vertical bar graph and a histogram differ in this way: in a histogram, frequency is measured by the area of the column, while in a vertical bar graph, frequency is measured by the height of the bar.
Histogram characteristics
Generally, a histogram will have bars of equal width, although this is not the case when class intervals vary in size. Choosing the appropriate width of the bars for a histogram is very important. As you can see in the example above, the histogram consists simply of a set of vertical bars. Values of the variable being studied are measured on an arithmetic scale along the horizontal x-axis. When the class intervals are equal, the bars are of equal width and the height of each bar corresponds to the frequency of the class it represents.
The histogram is used for variables whose values are numerical and measured on an interval scale. It is generally used when dealing with large data sets (greater than 100 observations). A histogram can also help detect any unusual observations (outliers) or any gaps in the data.
Frequency polygon
Cumulative frequency polygon (S-curve, ogive)
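The idea that a histogram measures frequency by area can be sketched as a small table of classes. With unequal class widths, the bar height is taken here as frequency divided by class width, so that area equals frequency; running totals of the frequencies give the points of a cumulative frequency polygon. The scores and class edges are hypothetical, and `histogram_table` is an illustrative helper.

```python
from bisect import bisect_right
from itertools import accumulate


def histogram_table(values, edges):
    """Return (lower, upper, frequency, height) for each class [lower, upper).

    The last class also includes its upper edge. Height is frequency / width,
    so the area of each rectangle equals the frequency of its class.
    """
    counts = [0] * (len(edges) - 1)
    for value in values:
        if value < edges[0] or value > edges[-1]:
            raise ValueError(f"{value} lies outside the class edges")
        index = min(bisect_right(edges, value) - 1, len(counts) - 1)
        counts[index] += 1
    return [
        (lower, upper, frequency, frequency / (upper - lower))
        for lower, upper, frequency in zip(edges, edges[1:], counts)
    ]


# Hypothetical test scores and class edges, invented for illustration only.
scores = [52, 55, 61, 64, 66, 68, 71, 73, 75, 78, 82, 95]
edges = [50, 60, 70, 80, 100]  # the last class is twice as wide as the others

rows = histogram_table(scores, edges)
cumulative = list(accumulate(frequency for _, _, frequency, _ in rows))

for (lower, upper, frequency, height), running_total in zip(rows, cumulative):
    print(f"{lower}-{upper}: frequency={frequency} height={height:.2f} cumulative={running_total}")
```

In this sketch the first and last classes both hold two observations, but the last bar is half as tall because its class is twice as wide.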