You are on page 1of 18
Grouping and Displaying Data ys to Convey Meaning: Tables and Graphs pn OBIECTIVES ey reading this chapter, you can Understand: how ihe diference between samples and © To graph frequency —¢ oe pp latiONs histograms, eo Prete aw dato useful information mitra: + Polygons, and ogives es roconsiruct and use data arrays decisions construct and use frequency distributions How Can We Arrange Data? 14 = Statistics at Work 58 Examples of Raw Data 17 ® Terms Introduced in Chapter 2 59 Arranging Data Using the Data Array ® Equations Introduced in Chapter 2 60 ‘elie Frequency Distribution 18 {Review nd Applicaton Exercises 60 Conttucting a F Chart: Arranging DatatoConvey fie Meaning 72 5. mith) igpalelenaeaia P aatt eae itae 2G; peas . Grouping and Displayi isplaying Data to Convey Meoni ing: Tables and Grophs RCISES 2.1 oe jo I tions SSS When asked what they would use i f they wer 1 pain reliever, more doctors chose Bayer thay get on an island with choice Jdrawn from a sample of a Mulan than Tylenol, Bufferin, or Advil. 1 wane, anion Twenty-five percent of the cars sol 1th conclusion dawn fom usaiple os porn 2° mee mated in An electronics firm recently introduced a aeipecrrrg ue of these have been sold so far. The presi complaint about the new amplifiers, informed production manager would be implemented immediately to poneiabapee ce would not appear again. Comment on the president's reaction from the standpoi given “Germany will remain ever divided” oa Wall tacherlogboed timid racitt ee Stated Walter Ulbricht after construction of the Berlin fail in lowever, toward the end of 1969, the communists of East Germany began. free ' pet en the east and west, and twenty years after that, the wall was completely ved. Give some reasons for Ulbricht’s incorrect i yed. C prediction. Discuss ‘e data given in the chapter-opening problem in terms of the five tests for data given on page 15. EXAMPLES OF RAW DATA tion before it is arranged and analyzed is called raw data. It is “raw” because it is unprocess tistical methods. carpet-loom data in the chapter-opening problem was one Problem facing admissions iple of raw data. Consider a second. Suppose that the admis- staff staff of «university, consemed with the sucoss of he east ERR = selects for admission, wishes to compare the students’ college per- mances ‘vith other achievements, such as high school grades, test scores, and extracurricular a Ther than study every student from every year, the staf can draw a sample ofthe popula snudents in a given time period and study only that group to conclude what charac\etshe® i seen. For example, the staff can compar high school grades with college erade-PN en designing a bridge, engineers are cone uch as concrete, at 4 hat a given materi st every cubfe inch of conerete to d neers take a sample of the concrete, test Sf conerete can withstand, Table 2-2 summarize concrete (o be used in constructing a Data are not necessarily decisions. The goal is to graduates OFN~" rather” covers: ated categories ever our list does wi not specifically Hist ‘our characterintic sibilities. For example, if mur in amy month of the year, a complete Tint ‘clude 12 categories. But if we wish to list ‘¢ months from January through August, we can use the term sober to account for OUF DSS ints of September, g the 4 mo fcr, and December. Although out Trcally list all the possibilities, it “other” is called an open- ‘ws either the upper or the ‘¢ classification scheme ye Inst class in Table 2-11 (72 vations durin October, Nove jist does not spect Jusive, This js all-ine sss when it al 10 be limitless. The ye either quantita- tive of qu continu- ous. Discrete ¢ lass not progress from one break. Such classes as number of trucks owned by aduates are es are Sepal class to the next without a the number of children in moving companies, tions of Central College gr College graduates can comething in between. The closing price of AT&T st 5 or 27 points (but not by 17.6 points). number of values, Central basketball team can win by of cans of to to fit 7 when 2 OM etnies. ~cem wi Tameg anny shes SiDents) “08S OF Canney oR com iG) eeney ima — py — Kiwis oa . aN ; oo a ws ‘Wy Hawk Mae i, 9453 ony 401047 nga ial 4810 55 om ine, 56 10 63 1p 0m, 641071 7203 Sony, Tand older —_6,192 Ooms 89.592 Soo each family, the Discrete and the occupa- sont whole numbers. Continoous data can be expressed jn either fractions or discrete. Discrete data are data that can take be -Tassified aa cithes dotiaasie: eau tock can be 39% or 39 % (but not 39.43), ct tinuous classes Continuous data do pro, a break. involve numerical uremel i . bes, the pounds of pressure on concrele.-0 the high school GPAs gS of college y t data, Constructing a data array in either di di ag | There are many ways to presen ‘Showing how many times a value appears by using a equ frequencies to decimals ich 7 order is a good place to start. * distribution is even more effective, and converting these ‘ . if relative frequencies) can help even more. Hint: We should things that can be counted but continuous variables are things that appear at some! EXERCISES 2.3 WV Self-Check Exercises : SC 2-1 Here are the ages of 50 members of a 7 a serie om: eens =a rT 22 24 | statssies for 212 Be 214 Monogament swith intes distribution ae wey icy of i believes that mere or Consiroct fea can you reach at Fix-ft mannagement rived about the mance, does have a major OF NEY on of 45 motores: County Transportatio Here are the Spee The Orange County Tor ain igh He wn driving on = 6 42 . ‘ eas 8 eS - § ‘a a ae 6 age o aT ee @ 7 9 65 te 6 5S 4 RE Pa sixtibutions using $ equal intervals Use these data to construct relative frequency distr that, nationally, no mo quel intervals. The U.S, Department of Transportation reports that, 10 percent of the motorists exceed $5 mph, bs sae Doane County motorists follow the U.S. DOT's report about national driving (>) Which distribution did you use to answer part (a)? 2h) (©) The U.S. DOT has determined that the safest speed for this highway is more {ess than 59 mph. What proportion of the motorists drive within this range? bution helped you answer this question? ‘Arrange the data in Table 2-2 on page 12 in an array from highest to lowest, (2) Suppose that state law requires bridge concrete to withstand Ib/sq in, at many samples would fail this test? bei (0) How many samples could withstand a 0 Pressure of at ‘i withstand a pressure greater than 2,506 fsg in.) at 497 thls in, (c) As you examine the array, you should notice that some Amounts of pressure, List pepantite st these pressures and the number of samp! Arecent i i sve sedy conceming the habits of US. cable television ¢ Grouping and Displaying Dota to Convey Meaning Jobles ond Graph | 2 ] The Environmental Protection Agency took water samples from 12 different rivers and sm ns that feed into Lake Erie. These samples were tested in the EPA lahoralory and rated 8 te the amount of solid pollution suspended in cach sample. The results of the testing are wived in the following table ! 2 3 4 s 6 Sample Pollution Rating (ppm) 372 31.7 a4 $42 499 ma Sample 7 ’ 9 10 " 12 198 $2.7 0.0 461 3a ot Pollution Rating (ppm) Ge) Arrange the data into an array from highest to lowest (b) Determine the number of samples having a pollution content betw and 49.9, 50.0 and $9.9, and 60.0 and 69.9 (c) 1£45.0 is the number used by the EPA to indicate excessive pollution, how many samples would be rated as having excessive pollution? (d) What is the largest distance between any two consecutive samples? Suppose that the admissions staff mentioned in the discussion of Table 2-1 on page 12 wishes * to examine the relationship between a student's differential on the college SAT examination the difference between actual and expected score based on the student's high school GPA) and the spread between the student’s bigh school and college GPA (the difference between the ge and high schoo! GPA). The admissions staff will use the following data: ee TLS. GPA College GPA SAT Score a4 36 940 28 30 950 9 40 33 cen 30.0 and 39.9, 40.0 a9 a7 1310 26 at SS In addition, the admissions staff has received the following information ftom the Testing Service ees NS.GPA___ Avg SAT Score ‘ 1340 39 1.010 is 1.280 32 1250 1.220 ee ment 26 | Statistics for Manoge (Consider an inte sat to lowest . array of spreads from highe® ease in college OPA Be ‘ ate SAT differeniy { the appropt! ff J above expec ted as positive.) ‘Arrange these data into an high c in Gales GPA over high school GPA as positive and high school GPA as negative.) Include with each sprené below expected as negative and dl? is the most common SAT (Consider an SAT score (b) What is the most common spre: differential? (c) For this spread in part (b), wha (d) From the analysis you have done, what do you conclude? NWorked-Out Answers to Self-Check Exercises C2-1 7 Intervals 13 Intervals : ee aoe Relative Relative requency Class Frequency Class —_-Frequeney 30-39 0.02 a iS - —_ 35-39 0.02 70-74 0.10 49 0.06 50-59 40-44 0.04 75-79 0.10 0.16 r 60-69 032 4 0.02 80-84 0.12 a5 50-54 0-79 0.20 pa 0.08 85-89 0.04 ; 55-59 80-89 0.16 0.08 90-94 02 90-99 60-64 0.10 Be 0 Z = 65-69 0.22 ee O04 1.00 (4) Ascan be seen from cither distribut 50, so the program is (b) In this case, both a ot compli ion, about 90 Percent ees lance, 4 are equally easy to use. High School GPA College GPA a 4.0, 39 40 i 39 4 39 a Bi. 32 3.6 3.0 36 25 Grouping end Displaying Date to Convey Meaning: Tables ond Grophs | 33 EXERCISES 2.4 gelf-Check Exercises (a High Performance Bicycle Products Company in Chapel Hill, North Carolina, sampled chipping records for a certain day with these results: e sco Eee ‘Time from Receipt of Order to Delivery (in Days) z a ' « n 6 7 B B TW Construct a frequency distribution for these data and a relative frequency distribution. Use intervals of 6 days. (q) What statement can you make about the effectiveness of order processing from the frequency distribution? «s) Ifthe company wants to ensure that half of its deliveries are made in 10 or fewer days, can you determine from the frequency distribution whether they have reached this goal? - =) What does having a relative frequency distribution permit you to do with the data that is difficult to do with only a frequency distribution? sC14 Mr. Frank, a safety engineer for the Mars Point Nuclear Power Generating Station, has sted the peak reactor temperature each day for the past year and has prepared the following ency distribution: __Temperatures in°C_Frequeney_= Below 500 501-510 7 bs 511-520 32 re 521-$30 39 530-540 82 380-560 65 . 561-570 3 571-580 28 ‘ 580-590 27 591-600 Total List and explain any errors you can find in Mr. Franks’s distribution. Pplications 7 Universal unger iconcina ss aE ———— Spienees so a elative FEUEDEY CIPS pation for these dat Constrvet a frequency dist! aed scons of shifts to lbave 90 GET “sistribution whether this fonervals of 5 barge a oS ers ner piven Burges esl gn the fromm Ore an want, Cm 1 ore has been achieved? (b) What percentage of shits BaY waste of 21 ie to determine your ans™ naar to Table 2-2 on page 18 and construct & ae toutiod? ‘of 4.0 Ib’sq in. What do you conclude from this 22h ce y atthe beginning and end of AU (CPI) has changed during Augus lows: in ench community ithe Consumer Price Index for the 30 communities are as foll 03 07 “00 “OAs a1. 05 02 03, Ain 03 Ge 02 0S Ot 0s 04s) 00. 02n” OSme nme 04 o7 04 03 02 4 01 21 05 -03 01 (a) Arrange the data in an array from lowest to highest. (b) Using the following four equal-sized classes, create & 0.1 10 0.2, 0.3 to 0.6, and 0.7 to 1.0. 4 (c) How many communities had prices that cither did not change or that increased less 1,0 percent? (4) Are these data discrete or continuous? Sarah Anne Rapp, the president of Baggit, Inc., has just obtained some raw data from a me survey = her company recently conducted. The survey was taken to determine effectiveness of the new company slogan, “When you've giv Baggit!” Se eae et acon cpa ee tow many boxes of Luncheon Baggits per month they bought before and after the used in the advertising campaign. The results were as follows: sm frequency distribution: ~0.5 to 0 220 __ Before/After BeforeiAn i 4 6 6 a 1 5 6 E 3 7 $ ‘ 3 ca (a) Create both : frenccey a cen using as classes | ~ relative frequency r Work pr) for ng ao ad 9-10, Give the most basie responses. “Before” and “An Groupiog ond Displaying Data te Convey Meaning: Tobles and Grophs \ 3s create frequency and relative frequency distributi -3 10-2, 1100, 1 0 2, 3 to4, and $06, (e) Based on your analysis, state whether the new slo, two reasons f support your conclusion, Here are the ages of 30 people who bought video recorders at Symphony Music Shop last ns for “Change” using classes ~$ to —4, gan has helped sales, and give one or week: 5 3740 18 a a ee ” 20 2 Is 27 46 44 62 8 30 42 2 26 44 at 34 $5 50 63 29 n {«) From looking atthe data just as they are, what conclusions can you come to quickly about ‘Symphony's market? ree a oy () Construct a 6-category closed classification, Does having this enable you to conclude anything more about Symphony's market? 22 Use the data from Exercise 2-21, (a) Construct a S-category open-ended classification. Docs having this enable you to con- clude anything more about Symphony’s market? (b) Now construct a relative frequency distribution to go with the S-category open-ended classification. Does having this provide Symphony with additional information useful in its marketing? Why? 23 John Lyon, owner of Fowler's Food Store in Chapel Hill, North Carolina, has arranged his customers" purchase amounts last week into this frequency distribution: SSpent Frequency SSpent Frequency SSpent Frequency 0.00-0.99 50 16.00-18.99 4,150 34,00-36.99 610 1.00-3.99 240 19,00-21.99 980 37.00-39.99 420 4.00-6.99 300 22.00-24.99 830 40.00-42,99 280 7.00-9.99 460 25.00-27.99 780 43.00-45.99 100 10.00-12.99 900 28.00-30.99 760 46.00-48.99 90 13.00-15.99 1,050 31.00-33.99 720 John says that having 17 intervals each defined by 2 numbers is cumbersome, Can you help him simplify the data he has without losing too much of their value? Here are the midpoints of the intervals for a distribution representing minutes it took the bers of a university track team to complete a 5-mile cross-country run. 4 stays a & @ Would you say that the team coach can get enough information from these m 4 2 Me es 6 | Stotities for Management! a Pounds Frequency sirbnch Fry Waste ar . 0039 3 Z 40-79 = 80-119 s 120-169 2 170-259 a 0-409 23 26, Fe ‘sg distribution. sstand explain any errors you can find in BAMSY © nas Hadar duro, ood classiteation For + sesible responses i Tosment application, Also, construct ® 3-categorys the high and low bids, the ele portion of an emp sation for the same responses. i ee the com day’s closing price. Here's an example: Low Bid Closing. Change ‘took exchange listings usually contain price, and the change from the previous ‘Name High Bid 27 ~ System Associates uy 10% uh 4% Is a distribution of all (a) stocks on the New York Stock Exchange by industry, (b) cl from the previous day prices on a given day, and (c) changes in prices (1) Quantitative or qualitative? (2) Continuous or discrete? (3) Open-ended or closed? ‘3 Would your answer to part (c) be different if the change were expressed simply “lower,” or “unchanged”? The noise level in decibels of aircraft departing Westchester County Airport the nearest decibel and grouped in a frequency distribution having intervals: 100 and 130, Under 100 decibels is not considered loud at all, and anything, is almost deafening. If Residents for a Quieter Neighborhood is gathering « against the airport, is this distribution adequate for its purpose? 29 Use the data from Exercise 2-28. Ifthe lawyer defending the airport tory to going to trial, would she approve of the midpoints of the int 4 ae eeee a9 i president of Ocean Airlines is trying to estimate when the Fe GFAA) is mos ial orale ont Stnpeny's abhi fc and Nashville, Assistants to the president have assembled applications filed during the past year, The data n until an FAA ruling, eee 34 2-28 Grouping and Displaying Data to Convey Meaning: Tables and Graphs | @7 4) Construct a frequency distribution using 10 clos2d intervals, equally spaced. Which inter- (val contains the most data points? % aren eet (p) Construct a frequency distribution using $ closed intervals, equally spaced, Which inter. val contains the most data points? (¢) ifthe president of Ocean Airlines had a relative frequency distribution for either (a) or (b), would that help him estimate the answer he needs? gor ne purpose of performance evaluation and quota adjustment, Ralph Williams monitored Far fu ales of his 40 salespeople, Over I-month period, they sold the following number of cars: qs S08 0 see ee 10° if 6 SO Os Se $12 °° 8 8 10. 1S tn cee $ 6 On Lee ee (@) Based on frequency, what would be the desired class marks (midpoints of the intervals)? {b) Construct a frequency and relative frequency distribution having as many of these marks as possible, Make your intervals evenly spaced and at least two cars wide. (¢) If sales fewer than seven cars @ month is considered unacceptable performance, which of the two answers, (a) or (b), helps you more in identifying the unsatisfactory group of salespeople? Kesler’ Ice Cream Delight attempts to keep all of its $5 flavors of ice cream in stock at each tits sores, Their marketing-research director suggests that keeping. better records for each store is the key to preventing stockouts. Don Martin, director of store operations, collects data tothe nearest half gallon on the daily amount of each flavor of ice cream that is sold. No more than 20 gallons of any flavor are ever used on one day. (a) Is the flavor classification discrete or continuous? Open or closed? (b) Is the “amount of ice cream” classit ication discrete or continuous? Open or closed? (c) Are the data qualitative or quantitative? @ What would you suggest Martin do to generate better data for market-research purpo Doug Atkinson is the owner and ticket collector for a ferry that transports people and from Long Island to Connecticut. Doug has data indicating the number of people, as well the number of cars, that have ridden the ferry during the past 2 months. For example, _ JULY3 NUMBER OF PEOPLE, 173 NUMBER OF CARS, 32 might be a typical daily entry for Doug, Doug has set up six the daily number of people, and the class marks are 84.5, 184.5. Doug's six equally spaced classes for the daily number of cars! 345, 42.5, 50.5, 58.5, and 66.5. (The class marks are (a) What are the upper and lower boundaries of the classes for i (b) What are the upper and lower boundaries of x Grouping and Displaying Data to Convey Meaning: Tables and Graphs 13 EX ISES 2.5. TT Self-Check Exercises 5C2.5. Here isa frequency distribution of the weight of 150 people who used a ski lift certain day. F Construct a histogram for these data. Class Frequency Class ‘Frequency 75-89 10 150-164 23 90-104 it 165-179 9 105-119 23 180-194 - ‘ 120-134 26 195-209 6 135-149 EL 210-224 2 (a) What can you see from the histogram about the data that was not immediately apparent from the frequency distribution? (b) If each ski lift chair holds two people but is limited in tetal safe weight capacity to 400 pounds, what can the operator do to maxjmize the people capacity of the ski lift without exceeding the safe weight capacity of a chair? Do the data support your proposal? 2-6 Central Carolina Hospital has the following data representing weight in pounds at birth of 200 premature babies. Class ‘Frequency Class Frequency 05-09 10 25-29 29 10-14 19 3.0-3.4 34 15-19 4 353.9 40 20-24 27 4.0-4.4 17 Construct an ogive that will help you enswer these questions: (a) What was the approximate middle value in the original data set? (>) If premature babies under 3.0 pounds are normally kept in an ‘4 precaution, about what percentage of Central’s pr Pplications 80 18 61 3; oe oO yee 2 AS 58 Rave at will help YOu answer these questions: etal wil Pe sidents would Be Migible foro fee? ‘ould the board have 10 ‘charge to the remaining Total cost of running the ‘clubhouse? tain from Salter Path, North Carolina, beli Roughly Sy Apprasimatcly what fee sto cover the same x sa fishing boat cap - ue poats is $,000 pounds per trip. Here ‘are data on a sample of boats have made recently: 500 6700 «3,400 3,600 2,000 zoo 5,600 = 4800 -8,000 5,000. ekaat 4600 8,100, 6,500, 9,000 4,200 400 7,000 «7,800 6,000 5,400 ot an ogive that will help you answer these questions: Oe what proportion of the trips breaks even for Homer? ( iat is the approximate middle value in the data = (c) What catch do Homer's boats exceed #0 percent ear pha The Massaclusetts Friends of Fish has the i resentil lutants million) at 150 sites in the state: Be one ae & (a) Raa __ Pollutants (in ppm) Frequency ee ee ornney 0-89 14 = wee 6 .0-28.9 0-1 a o 8 29.0-32.9 17.0-209, s aes 210-249 re 3 2-39 Grouping ond Displaying Date to Convay Meaning: Tables and Graphs | 5s Before constructing a dam on the Colorada River, the U.S Army Corps of Engineers per formed a series of tests to measure the water flow past the proposed location of the dam. The results of the testing were used to construct the fallowing frequency distribution: River and ogive. (b) Use the data given in the table to construct a “less-than” cumulative frequency distribution and ogive, (c) Use your ogive to estimate what proportion of the flow occurs at less than 1,300 thousands of gallons per minute, 9 Pamela Mason, a consultant for a small local brokerage firm, was attempting to design invest: ment programs attractive to senior citizens, She knew that if potential customers could obtain a certain level of return, they would be willing to risk an investment, but below a certain level, they would be reluctant. From a group of $0 subjects, she obtained the following data regard- ing the various levels of return required for each subject to, invest $1,000: Indifference Point Frequency Indifference Point Frequency $70-74 2 S 90- 94 W 75-79 5 9s- 99 3 80-84 10 100-104 3 = 85-89 is 105-109 2 (a) Construct both “more-than” and “less-than” cumulative relative (>) Graph the 2 distributions in part (a) into relative frequency ogives. — At a newspaper office, the time required to set the entire Sociales 50 days. The data, to the nearest tenth of a minute, are given 228 219 20.7 — be 20.7 225 i CS A 15,001-$20,000 20,001-$30,000 1530,001-$40,000 19 $40,001 -$50,000 9 Over $50,000 . . we that will help you answer these questions. wk proportion ofthe salespeople ears more than $25,000? vt does the “middle” salesperson in the sample earn? Approximately how much could a real estate salesperson whose performance was 25 percent from the fop expect to earn annually’? 243, Springfield is a college town with the usual parking problems. The city allows people have received tickets for illegally parked cars to come in and make their case to an admin tive officer and have the ticket voided. The town’s administrative officer collected the ing frequency distribution for the time spent on each appeal; Minutes Spent on Appeal Frequency Minutes Spent on Appeal Less than 2 30 8-9 70 aa ae 40 10-11 50 40 12413 50 $$ 145 30 400 Grouping and Displaying Data to Convey Meaning: Tables and Graphs | 57 (a) Construct a “less-than” cumulative frequency distribution, (b) Construct an ogive based on part (a) (c) The town administrator will consider streamlining the paperwork for the appeal process if more than SO percent of appeals take longer than 4 minutes, What is the percentage taking more than 4 minutes? What is the approximate time for the 200th (midpoint) appeal? Worked-Out Answers to Self-Check Exercises 3 sci-s % Frequency 825 97.5 1128 127.5 142.5 157.5 17251875 2028 217.5 Weight (pounds) (a) The lower tail of the distribution is fatter (has more observations in it) than the upper! (b) Because there are so few people who weigh 180 pounds or more, the operator can afl to pair each person who appears to be heavy with a lighter person. This can be done w out greatly delaying any individual's turn at the lift, C14 Cumulative Class Relative Frequency Class 05-09 0.050 2.5-2.9 1.0-1.4 0.145 3.0-3.4 0.265 0.400 relative frequency Eeeceess ss of ‘Cumulative o 05 {@) The middle value was

You might also like