7. (a) Given the following measurement for the variable age:
18, 22, 25, 42, 28, 43, 33, 35, 56, 28
Standardize the variable by the following:
8. An e-mail database is a database that stores a large number of electronic mail messages. It can be viewed as a semistructured database consisting mainly of text data. Discuss the following.
(a) How can such an e-mail database be structured so as to facilitate multi- dimensional search, such as by sender, by receiver, by subject, by time, and so on?
(c) suppose you have roughly classi\ufb01ed a set of your previous e-mail messages as junk, unimportant, normal, or important. Describe howa data mining system may take this as the training set to automatically classify new e-mail messages or unclassi\ufb01ed ones.
8. Suppose that a city transportation department would like to perform data analysis on highway tra\ufb03c for the planning of highway construction based on the city tra\ufb03c data collected at di\ufb00erent hours every day.
(a) Design a spatial data warehouse that stores the highway tra\ufb03c information so that people can easily see the average and peak time tra\ufb03c \ufb02ow by highway, by time of day, and by weekdays, and the tra\ufb03c situation when a major accident occurs.
(c) This data warehouse contains both spatial and temporal data. Propose one mining technique that can e\ufb03ciently mine interesting patterns from such a spatio-temporal data warehouse.
This action might not be possible to undo. Are you sure you want to continue?