on it: finding the maximum temperature for each year.

The map function is also a
good place to drop bad records: here we filter out temperatures that are missin
g, suspect, or erroneous. To visualize the way the map works, consider the follo
wing sample lines of input data (some unused columns have been dropped to fit th
e page, indicated by ellipses):
0067011990999991950051507004...9999999N9+00001+99999999999... 004301199099999195
0051512004...9999999N9+00221+99999999999... 0043011990999991950051518004...99999
99N9-00111+99999999999... 0043012650999991949032412004...0500001N9+01111+9999999
9999... 0043012650999991949032418004...0500001N9+00781+99999999999... These line
s are presented to the map function as the key-value pairs:
(0, 0067011990999991950051507004...9999999N9+00001+99999999999...) (106, 0043011
990999991950051512004...9999999N9+00221+99999999999...) (212, 004301199099999195
0051518004...9999999N9-00111+99999999999...) (318, 0043012650999991949032412004.
..0500001N9+01111+99999999999...) (424, 0043012650999991949032418004...0500001N9
+00781+99999999999...) The keys are the line offsets within the file, which we i
gnore in our map function. The map function merely extracts the year and the air
temperature (indicated in bold text), and emits them as its output (the tempera
ture values have been interpreted as integers):
(1950, 0) (1950, 22) (1950, -11) (1949, 111) (1949, 78) The output from the map
function is processed by the MapReduce framework before being sent to the reduce
function. This processing sorts and groups the key-value pairs by key. So, cont
inuing the example, our reduce function sees the following input:
(1949, [111, 78]) (1950, [0, 22, -11]) Each year appears with a list of all its
air temperature readings. All the reduce function has to do now is iterate throu
gh on it: finding the maximum temperature for each year. The map function is als
o a good place to drop bad records: here we filter out temperatures that are mis
sing, suspect, or erroneous. To visualize the way the map works, consider the fo
llowing sample lines of input data (some unused columns have been dropped to fit
the page, indicated by ellipses):
0067011990999991950051507004...9999999N9+00001+99999999999... 004301199099999195
0051512004...9999999N9+00221+99999999999... 0043011990999991950051518004...99999
99N9-00111+99999999999... 0043012650999991949032412004...0500001N9+01111+9999999
9999... 0043012650999991949032418004...0500001N9+00781+99999999999... These line
s are presented to the map function as the key-value pairs:
(0, 0067011990999991950051507004...9999999N9+00001+99999999999...) (106, 0043011
990999991950051512004...9999999N9+00221+99999999999...) (212, 004301199099999195
0051518004...9999999N9-00111+99999999999...) (318, 0043012650999991949032412004.
..0500001N9+01111+99999999999...) (424, 0043012650999991949032418004...0500001N9
+00781+99999999999...) The keys are the line offsets within the file, which we i
gnore in our map function. The map function merely extracts the year and the air
temperature (indicated in bold text), and emits them as its output (the tempera
ture values have been interpreted as integers):
(1950, 0) (1950, 22) (1950, -11) (1949, 111) (1949, 78) The output from the map
function is processed by the MapReduce framework before being sent to the reduce
function. This processing sorts and groups the key-value pairs by key. So, cont
inuing the example, our reduce function sees the following input:
(1949, [111, 78]) (1950, [0, 22, -11]) Each year appears with a list of all its
air temperature readings. All the reduce function has to do now is iterate throu
gh on it: finding the maximum temperature for each year. The map function is als
o a good place to drop bad records: here we filter out temperatures that are mis
sing, suspect, or erroneous. To visualize the way the map works, consider the fo
llowing sample lines of input data (some unused columns have been dropped to fit
the page, indicated by ellipses):
0067011990999991950051507004...9999999N9+00001+99999999999... 004301199099999195
0051512004...9999999N9+00221+99999999999... 0043011990999991950051518004...99999
99N9-00111+99999999999... 0043012650999991949032412004...0500001N9+01111+9999999
9999... 0043012650999991949032418004...0500001N9+00781+99999999999... These line
s are presented to the map function as the key-value pairs:
(0, 0067011990999991950051507004...9999999N9+00001+99999999999...) (106, 0043011
990999991950051512004...9999999N9+00221+99999999999...) (212, 004301199099999195

0051518004...9999999N9-00111+99999999999...) (318, 0043012650999991949032412004.
..0500001N9+01111+99999999999...) (424, 0043012650999991949032418004...0500001N9
+00781+99999999999...) The keys are the line offsets within the file, which we i
gnore in our map function. The map function merely extracts the year and the air
temperature (indicated in bold text), and emits them as its output (the tempera
ture values have been interpreted as integers):
(1950, 0) (1950, 22) (1950, -11) (1949, 111) (1949, 78) The output from the map
function is processed by the MapReduce framework before being sent to the reduce
function. This processing sorts and groups the key-value pairs by key. So, cont
inuing the example, our reduce function sees the following input:
(1949, [111, 78]) (1950, [0, 22, -11]) Each year appears with a list of all its
air temperature readings. All the reduce function has to do now is iterate throu
gh on it: finding the maximum temperature for each year. The map function is als
o a good place to drop bad records: here we filter out temperatures that are mis
sing, suspect, or erroneous. To visualize the way the map works, consider the fo
llowing sample lines of input data (some unused columns have been dropped to fit
the page, indicated by ellipses):
0067011990999991950051507004...9999999N9+00001+99999999999... 004301199099999195
0051512004...9999999N9+00221+99999999999... 0043011990999991950051518004...99999
99N9-00111+99999999999... 0043012650999991949032412004...0500001N9+01111+9999999
9999... 0043012650999991949032418004...0500001N9+00781+99999999999... These line
s are presented to the map function as the key-value pairs:
(0, 0067011990999991950051507004...9999999N9+00001+99999999999...) (106, 0043011
990999991950051512004...9999999N9+00221+99999999999...) (212, 004301199099999195
0051518004...9999999N9-00111+99999999999...) (318, 0043012650999991949032412004.
..0500001N9+01111+99999999999...) (424, 0043012650999991949032418004...0500001N9
+00781+99999999999...) The keys are the line offsets within the file, which we i
gnore in our map function. The map function merely extracts the year and the air
temperature (indicated in bold text), and emits them as its output (the tempera
ture values have been interpreted as integers):
(1950, 0) (1950, 22) (1950, -11) (1949, 111) (1949, 78) The output from the map
function is processed by the MapReduce framework before being sent to the reduce
function. This processing sorts and groups the key-value pairs by key. So, cont
inuing the example, our reduce function sees the following input:
(1949, [111, 78]) (1950, [0, 22, -11]) Each year appears with a list of all its
air temperature readings. All the reduce function has to do now is iterate throu
gh on it: finding the maximum temperature for each year. The map function is als
o a good place to drop bad records: here we filter out temperatures that are mis
sing, suspect, or erroneous. To visualize the way the map works, consider the fo
llowing sample lines of input data (some unused columns have been dropped to fit
the page, indicated by ellipses):
0067011990999991950051507004...9999999N9+00001+99999999999... 004301199099999195
0051512004...9999999N9+00221+99999999999... 0043011990999991950051518004...99999
99N9-00111+99999999999... 0043012650999991949032412004...0500001N9+01111+9999999
9999... 0043012650999991949032418004...0500001N9+00781+99999999999... These line
s are presented to the map function as the key-value pairs:
(0, 0067011990999991950051507004...9999999N9+00001+99999999999...) (106, 0043011
990999991950051512004...9999999N9+00221+99999999999...) (212, 004301199099999195
0051518004...9999999N9-00111+99999999999...) (318, 0043012650999991949032412004.
..0500001N9+01111+99999999999...) (424, 0043012650999991949032418004...0500001N9
+00781+99999999999...) The keys are the line offsets within the file, which we i
gnore in our map function. The map function merely extracts the year and the air
temperature (indicated in bold text), and emits them as its output (the tempera
ture values have been interpreted as integers):
(1950, 0) (1950, 22) (1950, -11) (1949, 111) (1949, 78) The output from the map
function is processed by the MapReduce framework before being sent to the reduce
function. This processing sorts and groups the key-value pairs by key. So, cont
inuing the example, our reduce function sees the following input:
(1949, [111, 78]) (1950, [0, 22, -11]) Each year appears with a list of all its
air temperature readings. All the reduce function has to do now is iterate throu

gh on it: finding the maximum temperature for each year. The map function is als
o a good place to drop bad records: here we filter out temperatures that are mis
sing, suspect, or erroneous. To visualize the way the map works, consider the fo
llowing sample lines of input data (some unused columns have been dropped to fit
the page, indicated by ellipses):
0067011990999991950051507004...9999999N9+00001+99999999999... 004301199099999195
0051512004...9999999N9+00221+99999999999... 0043011990999991950051518004...99999
99N9-00111+99999999999... 0043012650999991949032412004...0500001N9+01111+9999999
9999... 0043012650999991949032418004...0500001N9+00781+99999999999... These line
s are presented to the map function as the key-value pairs:
(0, 0067011990999991950051507004...9999999N9+00001+99999999999...) (106, 0043011
990999991950051512004...9999999N9+00221+99999999999...) (212, 004301199099999195
0051518004...9999999N9-00111+99999999999...) (318, 0043012650999991949032412004.
..0500001N9+01111+99999999999...) (424, 0043012650999991949032418004...0500001N9
+00781+99999999999...) The keys are the line offsets within the file, which we i
gnore in our map function. The map function merely extracts the year and the air
temperature (indicated in bold text), and emits them as its output (the tempera
ture values have been interpreted as integers):
(1950, 0) (1950, 22) (1950, -11) (1949, 111) (1949, 78) The output from the map
function is processed by the MapReduce framework before being sent to the reduce
function. This processing sorts and groups the key-value pairs by key. So, cont
inuing the example, our reduce function sees the following input:
(1949, [111, 78]) (1950, [0, 22, -11]) Each year appears with a list of all its
air temperature readings. All the reduce function has to do now is iterate throu
gh on it: finding the maximum temperature for each year. The map function is als
o a good place to drop bad records: here we filter out temperatures that are mis
sing, suspect, or erroneous. To visualize the way the map works, consider the fo
llowing sample lines of input data (some unused columns have been dropped to fit
the page, indicated by ellipses):
0067011990999991950051507004...9999999N9+00001+99999999999... 004301199099999195
0051512004...9999999N9+00221+99999999999... 0043011990999991950051518004...99999
99N9-00111+99999999999... 0043012650999991949032412004...0500001N9+01111+9999999
9999... 0043012650999991949032418004...0500001N9+00781+99999999999... These line
s are presented to the map function as the key-value pairs:
(0, 0067011990999991950051507004...9999999N9+00001+99999999999...) (106, 0043011
990999991950051512004...9999999N9+00221+99999999999...) (212, 004301199099999195
0051518004...9999999N9-00111+99999999999...) (318, 0043012650999991949032412004.
..0500001N9+01111+99999999999...) (424, 0043012650999991949032418004...0500001N9
+00781+99999999999...) The keys are the line offsets within the file, which we i
gnore in our map function. The map function merely extracts the year and the air
temperature (indicated in bold text), and emits them as its output (the tempera
ture values have been interpreted as integers):
(1950, 0) (1950, 22) (1950, -11) (1949, 111) (1949, 78) The output from the map
function is processed by the MapReduce framework before being sent to the reduce
function. This processing sorts and groups the key-value pairs by key. So, cont
inuing the example, our reduce function sees the following input:
(1949, [111, 78]) (1950, [0, 22, -11]) Each year appears with a list of all its
air temperature readings. All the reduce function has to do now is iterate throu
gh on it: finding the maximum temperature for each year. The map function is als
o a good place to drop bad records: here we filter out temperatures that are mis
sing, suspect, or erroneous. To visualize the way the map works, consider the fo
llowing sample lines of input data (some unused columns have been dropped to fit
the page, indicated by ellipses):
0067011990999991950051507004...9999999N9+00001+99999999999... 004301199099999195
0051512004...9999999N9+00221+99999999999... 0043011990999991950051518004...99999
99N9-00111+99999999999... 0043012650999991949032412004...0500001N9+01111+9999999
9999... 0043012650999991949032418004...0500001N9+00781+99999999999... These line
s are presented to the map function as the key-value pairs:
(0, 0067011990999991950051507004...9999999N9+00001+99999999999...) (106, 0043011
990999991950051512004...9999999N9+00221+99999999999...) (212, 004301199099999195

0051518004...9999999N9-00111+99999999999...) (318, 0043012650999991949032412004.
..0500001N9+01111+99999999999...) (424, 0043012650999991949032418004...0500001N9
+00781+99999999999...) The keys are the line offsets within the file, which we i
gnore in our map function. The map function merely extracts the year and the air
temperature (indicated in bold text), and emits them as its output (the tempera
ture values have been interpreted as integers):
(1950, 0) (1950, 22) (1950, -11) (1949, 111) (1949, 78) The output from the map
function is processed by the MapReduce framework before being sent to the reduce
function. This processing sorts and groups the key-value pairs by key. So, cont
inuing the example, our reduce function sees the following input:
(1949, [111, 78]) (1950, [0, 22, -11]) Each year appears with a list of all its
air temperature readings. All the reduce function has to do now is iterate throu
gh on it: finding the maximum temperature for each year. The map function is als
o a good place to drop bad records: here we filter out temperatures that are mis
sing, suspect, or erroneous. To visualize the way the map works, consider the fo
llowing sample lines of input data (some unused columns have been dropped to fit
the page, indicated by ellipses):
0067011990999991950051507004...9999999N9+00001+99999999999... 004301199099999195
0051512004...9999999N9+00221+99999999999... 0043011990999991950051518004...99999
99N9-00111+99999999999... 0043012650999991949032412004...0500001N9+01111+9999999
9999... 0043012650999991949032418004...0500001N9+00781+99999999999... These line
s are presented to the map function as the key-value pairs:
(0, 0067011990999991950051507004...9999999N9+00001+99999999999...) (106, 0043011
990999991950051512004...9999999N9+00221+99999999999...) (212, 004301199099999195
0051518004...9999999N9-00111+99999999999...) (318, 0043012650999991949032412004.
..0500001N9+01111+99999999999...) (424, 0043012650999991949032418004...0500001N9
+00781+99999999999...) The keys are the line offsets within the file, which we i
gnore in our map function. The map function merely extracts the year and the air
temperature (indicated in bold text), and emits them as its output (the tempera
ture values have been interpreted as integers):
(1950, 0) (1950, 22) (1950, -11) (1949, 111) (1949, 78) The output from the map
function is processed by the MapReduce framework before being sent to the reduce
function. This processing sorts and groups the key-value pairs by key. So, cont
inuing the example, our reduce function sees the following input:
(1949, [111, 78]) (1950, [0, 22, -11]) Each year appears with a list of all its
air temperature readings. All the reduce function has to do now is iterate throu
gh on it: finding the maximum temperature for each year. The map function is als
o a good place to drop bad records: here we filter out temperatures that are mis
sing, suspect, or erroneous. To visualize the way the map works, consider the fo
llowing sample lines of input data (some unused columns have been dropped to fit
the page, indicated by ellipses):
0067011990999991950051507004...9999999N9+00001+99999999999... 004301199099999195
0051512004...9999999N9+00221+99999999999... 0043011990999991950051518004...99999
99N9-00111+99999999999... 0043012650999991949032412004...0500001N9+01111+9999999
9999... 0043012650999991949032418004...0500001N9+00781+99999999999... These line
s are presented to the map function as the key-value pairs:
(0, 0067011990999991950051507004...9999999N9+00001+99999999999...) (106, 0043011
990999991950051512004...9999999N9+00221+99999999999...) (212, 004301199099999195
0051518004...9999999N9-00111+99999999999...) (318, 0043012650999991949032412004.
..0500001N9+01111+99999999999...) (424, 0043012650999991949032418004...0500001N9
+00781+99999999999...) The keys are the line offsets within the file, which we i
gnore in our map function. The map function merely extracts the year and the air
temperature (indicated in bold text), and emits them as its output (the tempera
ture values have been interpreted as integers):
(1950, 0) (1950, 22) (1950, -11) (1949, 111) (1949, 78) The output from the map
function is processed by the MapReduce framework before being sent to the reduce
function. This processing sorts and groups the key-value pairs by key. So, cont
inuing the example, our reduce function sees the following input:
(1949, [111, 78]) (1950, [0, 22, -11]) Each year appears with a list of all its
air temperature readings. All the reduce function has to do now is iterate throu

... 0067011990999991950051507004. our reduce function sees the following input: (1949.0500001N9+01111+9999999 9999. So. 0043012650999991949032418004. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year. cont inuing the example.0500001N9 +00781+99999999999. The map function merely extracts the year and the air temperature (indicated in bold text).. [0. 0043012650999991949032418004... 004301199099999195 0051512004.99999 99N9-00111+99999999999.... The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.....0500001N9+01111+9999999 9999.. -11]) Each year appears with a list of all its air temperature readings... -11) (1949.) (212.....) (318.9999999N9+00001+99999999999.9999999N9+00221+99999999999.0500001N9+00781+99999999999... which we i gnore in our map function.. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. suspect..0500001N9+00781+99999999999.) (106..0500001N9+01111+9999999 9999. These line s are presented to the map function as the key-value pairs: (0. 0043012650999991949032412004. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.... [111. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function... To visualize the way the map works.... .99999 99N9-00111+99999999999. 78]) (1950..9999999N9-00111+99999999999. 004301199099999195 0051518004.. 0043011990999991950051518004.) (424...99999 99N9-00111+99999999999. 004301199099999195 0051512004.. 22) (1950. 22) (1950.. The map function merely extracts the year and the air temperature (indicated in bold text)..) (106... consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. 0043012650999991949032412004..... 0043012650999991949032418004..... .. 0043011 990999991950051512004. To visualize the way the map works... These line s are presented to the map function as the key-value pairs: (0.gh on it: finding the maximum temperature for each year.) (424. 0043012650999991949032412004.9999999N9+00001+99999999999.) (106. [0. 004301199099999195 .0500001N9 +00781+99999999999.... 0043011990999991950051518004..) (212. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.0500001N9+01111+99999999999.. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.. indicated by ellipses): 0067011990999991950051507004. -11) (1949. These line s are presented to the map function as the key-value pairs: (0. 0) (1950.. or erroneous. indicated by ellipses): 0067011990999991950051507004... 0043012650999991949032412004. 22. 78]) (1950..0500001N9+01111+99999999999. which we i gnore in our map function. or erroneous... 004301199099999195 0051518004...9999999N9+00221+99999999999.. This processing sorts and groups the key-value pairs by key. indicated by ellipses): 0067011990999991950051507004.. 0043012650999991949032418004.. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. 0043011 990999991950051512004..9999999N9+00221+99999999999..9999999N9+00221+99999999999..0500001N9+00781+99999999999. This processing sorts and groups the key-value pairs by key. To visualize the way the map works. 111) (1949. suspect..9999999N9+00221+99999999999.. 0043012650999991949032418004. 0067011990999991950051507004.. and emits them as its output (the tempera ture values have been interpreted as integers): (1950..) (318.. 0043012650999991949032412004.....9999999N9+00221+99999999999. So. [111. -11]) Each year appears with a list of all its air temperature readings.. suspect. 0) (1950....9999999N9+00001+99999999999.. 0043011 990999991950051512004....9999999N9+00001+99999999999. 004301199099999195 0051512004...) The keys are the line offsets within the file..... 111) (1949. cont inuing the example..9999999N9-00111+99999999999.) The keys are the line offsets within the file..9999999N9+00001+99999999999.. or erroneous. 0067011990999991950051507004. 22..... 0043011990999991950051518004..) (212.. our reduce function sees the following input: (1949.9999999N9+00001+99999999999.

.. These line s are presented to the map function as the key-value pairs: (0..9999999N9+00001+99999999999. [111. which we i gnore in our map function. 22) (1950. 0043012650999991949032412004. 0043012650999991949032418004.0500001N9 +00781+99999999999.....) (106.) (212. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.. ..99999 99N9-00111+99999999999. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function..0500001N9+01111+9999999 9999..) (318.0500001N9+00781+99999999999.9999999N9+00221+99999999999.... 22.. indicated by ellipses): 0067011990999991950051507004. suspect. So.. 78]) (1950. [0. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.) (318. 0043012650999991949032418004.. 004301199099999195 0051518004.) The keys are the line offsets within the file.9999999N9-00111+99999999999. or erroneous.....0500001N9+00781+99999999999.. cont inuing the example.) (212... 0043012650999991949032412004...9999999N9+00221+99999999999. So.... To visualize the way the map works..) The keys are the line offsets within the file..0051518004. The map function merely extracts the year and the air temperature (indicated in bold text).0500001N9+01111+99999999999. 0043012650999991949032418004... The map function merely extracts the year and the air temperature (indicated in bold text)...0500001N9+01111+9999999 9999. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. 22. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. 78]) (1950.. 004301199099999195 0051518004.. which we i gnore in our map function.9999999N9+00001+99999999999. 22) (1950. -11) (1949..) The keys are the line offsets within the file.) (424.9999999N9+00001+99999999999. 004301199099999195 0051512004..9999999N9+00001+99999999999..) (106.. This processing sorts and groups the key-value pairs by key.. our reduce function sees the following input: (1949.. 0043011990999991950051518004.0500001N9 +00781+99999999999. [0. The map function merely extracts the year and the air temperature (indicated in bold text).. -11) (1949..) (318. indicated by ellipses): 0067011990999991950051507004.) (424.. 0043011 990999991950051512004.0500001N9+01111+99999999999. 111) (1949. 22) (1950..9999999N9-00111+99999999999. 004301199099999195 0051512004. our reduce function sees the following input: (1949.. 0043012650999991949032418004... This processing sorts and groups the key-value pairs by key... These line s are presented to the map function as the key-value pairs: (0.. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year... So.. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.. 0067011990999991950051507004.. 0067011990999991950051507004. To visualize the way the map works. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.. 0043011990999991950051518004.. 0) (1950. This processing sorts and groups the key-value pairs by key..... 0) (1950.) (424.9999999N9+00221+99999999999. suspect. . 0) (1950.. 111) (1949..0500001N9 +00781+99999999999. 78]) (1950. [111. 22. -11]) Each year appears with a list of all its air temperature readings. and emits them as its output (the tempera ture values have been interpreted as integers): (1950... or erroneous. 0043012650999991949032412004.. [0. cont inuing the example. ...9999999N9+00221+99999999999. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.... our reduce function sees the following input: (1949. -11]) Each year appears with a list of all its air temperature readings. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. 0043012650999991949032412004.... 0043011 990999991950051512004. which we i gnore in our map function.0500001N9+01111+99999999999.. -11) (1949.. cont inuing the example. 0043012650999991949032418004.9999999N9-00111+99999999999.99999 99N9-00111+99999999999.... All the reduce function has to do now is iterate throu . 111) (1949.. [111... 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function... -11]) Each year appears with a list of all its air temperature readings. 0043012650999991949032412004...

) The keys are the line offsets within the file.. cont inuing the example. 0043011990999991950051518004.) The keys are the line offsets within the file.. -11]) Each year appears with a list of all its air temperature readings. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page...... 0043012650999991949032412004.... 0043012650999991949032412004... our reduce function sees the following input: (1949...9999999N9-00111+99999999999.. -11]) Each year appears with a list of all its air temperature readings..0500001N9+00781+99999999999. our reduce function sees the following input: (1949. cont inuing the example.gh on it: finding the maximum temperature for each year..9999999N9+00221+99999999999.. 004301199099999195 0051512004... 22. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.. 0043012650999991949032412004. suspect. 0043011990999991950051518004. 0043011 990999991950051512004. [0. 004301199099999195 0051512004.0500001N9+01111+9999999 9999.) (106.. So..... 004301199099999195 0051518004..9999999N9-00111+99999999999. 0043012650999991949032418004... 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.0500001N9 +00781+99999999999..) (318... [111.. 0067011990999991950051507004.) (318. To visualize the way the map works... 22. 0) (1950..... 0067011990999991950051507004. 0043012650999991949032418004..9999999N9+00001+99999999999.. 0043011 990999991950051512004.0500001N9+00781+99999999999.99999 99N9-00111+99999999999. 004301199099999195 0051512004.9999999N9+00001+99999999999... [0. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year. indicated by ellipses): 0067011990999991950051507004....9999999N9+00221+99999999999.) (212..) (424.) (212.9999999N9+00221+99999999999. . or erroneous.) (424.....) (106.... The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing...... 004301199099999195 0051518004..0500001N9+01111+9999999 9999. 111) (1949.. 22) (1950.99999 99N9-00111+99999999999.0500001N9+01111+9999999 9999... This processing sorts and groups the key-value pairs by key. 111) (1949. -11) (1949.. 0043012650999991949032418004..0500001N9 +00781+99999999999....9999999N9+00221+99999999999. 0043012650999991949032418004. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.9999999N9+00001+99999999999. This processing sorts and groups the key-value pairs by key. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.. suspect..) (212...9999999N9+00221+99999999999. These line s are presented to the map function as the key-value pairs: (0.0500001N9+00781+99999999999. -11) (1949.. 0043011990999991950051518004.. 0043011 990999991950051512004. 78]) (1950.... [111...) (106. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.99999 99N9-00111+99999999999. .. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.. 0) (1950. To visualize the way the map works... consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.. The map function merely extracts the year and the air temperature (indicated in bold text).0500001N9+01111+99999999999. So. which we i gnore in our map function. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page...... suspect.. The map function merely extracts the year and the air temperature (indicated in bold text).. indicated by ellipses): 0067011990999991950051507004. These line s are presented to the map function as the key-value pairs: (0. To visualize the way the map works.. or erroneous. which we i gnore in our map function.. 78]) (1950. or erroneous..9999999N9+00001+99999999999.. 0043012650999991949032418004. 0067011990999991950051507004. 0043012650999991949032412004.0500001N9+01111+99999999999. These line s are presented to the map function as the key-value pairs: (0.... 0043012650999991949032412004.. 004301199099999195 ..9999999N9+00001+99999999999. 22) (1950..9999999N9+00221+99999999999.. indicated by ellipses): 0067011990999991950051507004..9999999N9+00001+99999999999....

.. 0043012650999991949032412004. So. [0.. suspect. -11) (1949.0500001N9+00781+99999999999.9999999N9+00221+99999999999.. 22) (1950. 0043012650999991949032412004. our reduce function sees the following input: (1949. 0043012650999991949032412004. indicated by ellipses): 0067011990999991950051507004.. 78]) (1950... 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function..... 004301199099999195 0051512004.. 0067011990999991950051507004.0051518004. The map function merely extracts the year and the air temperature (indicated in bold text). The map function merely extracts the year and the air temperature (indicated in bold text). These line s are presented to the map function as the key-value pairs: (0... To visualize the way the map works...... [111. So. [0..) (424. -11) (1949. To visualize the way the map works. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.... 0043012650999991949032418004. 111) (1949. 22. [111..9999999N9-00111+99999999999...9999999N9+00221+99999999999.0500001N9 +00781+99999999999. [111. This processing sorts and groups the key-value pairs by key.) (318. All the reduce function has to do now is iterate throu ..0500001N9+01111+9999999 9999.) (318. 004301199099999195 0051512004...) The keys are the line offsets within the file.) (212...) (424.0500001N9 +00781+99999999999. 22. 78]) (1950. The map function merely extracts the year and the air temperature (indicated in bold text). 004301199099999195 0051518004..9999999N9+00221+99999999999. -11]) Each year appears with a list of all its air temperature readings... 004301199099999195 0051518004. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. 0043012650999991949032418004... and emits them as its output (the tempera ture values have been interpreted as integers): (1950.) The keys are the line offsets within the file. indicated by ellipses): 0067011990999991950051507004. which we i gnore in our map function... 0043012650999991949032418004. .0500001N9+00781+99999999999. which we i gnore in our map function..) (106....9999999N9+00001+99999999999.. This processing sorts and groups the key-value pairs by key. cont inuing the example.. 22) (1950. So.. 0) (1950....9999999N9+00001+99999999999... cont inuing the example.. -11]) Each year appears with a list of all its air temperature readings.) (424.. 111) (1949..0500001N9+01111+99999999999. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. 22... This processing sorts and groups the key-value pairs by key. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.0500001N9+01111+9999999 9999.9999999N9-00111+99999999999.. -11]) Each year appears with a list of all its air temperature readings. . consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.. [0.... our reduce function sees the following input: (1949.9999999N9+00221+99999999999.. 0043012650999991949032412004.. suspect..9999999N9+00001+99999999999.... cont inuing the example.9999999N9-00111+99999999999... The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. our reduce function sees the following input: (1949.0500001N9 +00781+99999999999..0500001N9+01111+99999999999.. 111) (1949. 0) (1950..) The keys are the line offsets within the file.99999 99N9-00111+99999999999.. 0043011 990999991950051512004. 78]) (1950. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.. -11) (1949. 0043011 990999991950051512004. 0067011990999991950051507004.. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing...0500001N9+01111+99999999999.) (318...) (106. 22) (1950. .. or erroneous. 0043012650999991949032418004. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.. 0) (1950.) (212..9999999N9+00001+99999999999. or erroneous.. which we i gnore in our map function.99999 99N9-00111+99999999999... These line s are presented to the map function as the key-value pairs: (0.. 0043011990999991950051518004. 0043012650999991949032412004.. 0043011990999991950051518004... 0043012650999991949032418004.

which we i gnore in our map function..9999999N9-00111+99999999999.0500001N9 +00781+99999999999.9999999N9-00111+99999999999... consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. -11]) Each year appears with a list of all its air temperature readings.. suspect...0500001N9+01111+99999999999.) (318.) (212....... 78]) (1950. cont inuing the example...9999999N9+00221+99999999999. The map function merely extracts the year and the air temperature (indicated in bold text). -11) (1949. These line s are presented to the map function as the key-value pairs: (0.. 0067011990999991950051507004. 111) (1949. cont inuing the example....) (424..9999999N9+00001+99999999999. 22. 004301199099999195 0051512004. 0043012650999991949032412004.0500001N9+01111+9999999 9999.. our reduce function sees the following input: (1949. 22) (1950. To visualize the way the map works...... 22. indicated by ellipses): 0067011990999991950051507004. This processing sorts and groups the key-value pairs by key. 0043012650999991949032412004... 0043012650999991949032412004...0500001N9+01111+9999999 9999.. 004301199099999195 0051518004. So.. [0. 0043012650999991949032412004. 0043011990999991950051518004.9999999N9+00221+99999999999. 0043012650999991949032418004.9999999N9+00001+99999999999. 0043012650999991949032418004. 004301199099999195 0051518004.. indicated by ellipses): 0067011990999991950051507004. and emits them as its output (the tempera ture values have been interpreted as integers): (1950..... 111) (1949...) (212.. 0) (1950... . indicated by ellipses): 0067011990999991950051507004.) The keys are the line offsets within the file.99999 99N9-00111+99999999999.. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page...) (424. 0043012650999991949032418004. These line s are presented to the map function as the key-value pairs: (0..9999999N9+00221+99999999999.99999 99N9-00111+99999999999. . [111. 78]) (1950. 0043011 990999991950051512004.....9999999N9+00221+99999999999. The map function merely extracts the year and the air temperature (indicated in bold text).. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.) (106..) The keys are the line offsets within the file.. our reduce function sees the following input: (1949.9999999N9+00001+99999999999...... The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing......... 0067011990999991950051507004. To visualize the way the map works.0500001N9 +00781+99999999999... 0043012650999991949032412004... 0043012650999991949032418004....99999 99N9-00111+99999999999..0500001N9+00781+99999999999..) (318..gh on it: finding the maximum temperature for each year..9999999N9+00221+99999999999.. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.. -11]) Each year appears with a list of all its air temperature readings... 004301199099999195 . 0043012650999991949032418004..0500001N9+01111+99999999999. suspect.0500001N9+00781+99999999999. or erroneous.9999999N9+00001+99999999999.) (106... These line s are presented to the map function as the key-value pairs: (0. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function....9999999N9+00001+99999999999.0500001N9+00781+99999999999. 004301199099999195 0051512004..... or erroneous. So. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year. This processing sorts and groups the key-value pairs by key. 0043011 990999991950051512004...... [0. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. 0) (1950...) (106. 0043011 990999991950051512004. 0043011990999991950051518004. [111....9999999N9+00221+99999999999.. which we i gnore in our map function... suspect. -11) (1949. 004301199099999195 0051512004. 0067011990999991950051507004. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year. To visualize the way the map works.. 22) (1950.. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. or erroneous..) (212.9999999N9+00001+99999999999.0500001N9+01111+9999999 9999. 0043011990999991950051518004.

22) (1950.. 004301199099999195 0051512004. This processing sorts and groups the key-value pairs by key. 0043012650999991949032418004...0500001N9 +00781+99999999999. So. [111.9999999N9+00221+99999999999..9999999N9+00001+99999999999.. To visualize the way the map works.9999999N9+00221+99999999999.0500001N9+01111+9999999 9999.) The keys are the line offsets within the file. 004301199099999195 0051512004...0051518004. 111) (1949.. suspect. So. 22.. 0) (1950.. 0043011990999991950051518004.. cont inuing the example.) (318. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. These line s are presented to the map function as the key-value pairs: (0... 0067011990999991950051507004..9999999N9-00111+99999999999..... 0043012650999991949032412004. .9999999N9-00111+99999999999. -11]) Each year appears with a list of all its air temperature readings.9999999N9+00221+99999999999..... These line s are presented to the map function as the key-value pairs: (0.0500001N9+00781+99999999999. 0) (1950... . which we i gnore in our map function.) The keys are the line offsets within the file. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. or erroneous.. 0043012650999991949032418004. This processing sorts and groups the key-value pairs by key.. 22. our reduce function sees the following input: (1949..) (424.99999 99N9-00111+99999999999.) (318.) (106. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year..9999999N9+00001+99999999999. 22) (1950. -11]) Each year appears with a list of all its air temperature readings... 111) (1949... 78]) (1950...) (318.0500001N9+01111+9999999 9999. To visualize the way the map works..0500001N9+00781+99999999999.. cont inuing the example. 0043012650999991949032412004. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. -11) (1949.. 78]) (1950.. 0043012650999991949032412004.. [111.. 0043011990999991950051518004. 78]) (1950. 0043011 990999991950051512004. or erroneous. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.. . which we i gnore in our map function. [111.. 22.. 22) (1950..) (212... 004301199099999195 0051518004.) The keys are the line offsets within the file... [0.) (424.) (106..... -11) (1949..0500001N9 +00781+99999999999. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.. The map function merely extracts the year and the air temperature (indicated in bold text).. So. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year. 004301199099999195 0051518004... All the reduce function has to do now is iterate throu .. our reduce function sees the following input: (1949.... 0043011 990999991950051512004. 0043012650999991949032412004. The map function merely extracts the year and the air temperature (indicated in bold text). [0. indicated by ellipses): 0067011990999991950051507004.) (212...0500001N9+01111+99999999999. 0) (1950.9999999N9+00001+99999999999. suspect. The map function merely extracts the year and the air temperature (indicated in bold text).9999999N9+00221+99999999999. our reduce function sees the following input: (1949..... 0043012650999991949032418004. 0043012650999991949032418004. -11) (1949... 0043012650999991949032418004.9999999N9+00001+99999999999. 111) (1949.. which we i gnore in our map function...99999 99N9-00111+99999999999.0500001N9+01111+99999999999.9999999N9-00111+99999999999. 0067011990999991950051507004. This processing sorts and groups the key-value pairs by key.... [0.. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.0500001N9+01111+99999999999. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.0500001N9 +00781+99999999999.......) (424... -11]) Each year appears with a list of all its air temperature readings. 0043012650999991949032412004.. indicated by ellipses): 0067011990999991950051507004.. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing... cont inuing the example.. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.

0043012650999991949032412004..99999 99N9-00111+99999999999.0500001N9+00781+99999999999. 0043012650999991949032412004..) The keys are the line offsets within the file.0500001N9 +00781+99999999999. [0.. -11) (1949.. [111. 0067011990999991950051507004.) (318.9999999N9+00221+99999999999..0500001N9+01111+9999999 9999. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. 0043011 990999991950051512004. or erroneous.. . which we i gnore in our map function.) (212.) (106. indicated by ellipses): 0067011990999991950051507004. or erroneous.......0500001N9+01111+9999999 9999. To visualize the way the map works. 0043012650999991949032418004.. 78]) (1950. -11]) Each year appears with a list of all its air temperature readings.9999999N9+00221+99999999999. [0.. 0043011 990999991950051512004.. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.) (212..gh on it: finding the maximum temperature for each year. So. To visualize the way the map works... The map function merely extracts the year and the air temperature (indicated in bold text). 0043012650999991949032418004...) The keys are the line offsets within the file...... our reduce function sees the following input: (1949. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year. cont inuing the example. 22) (1950..9999999N9+00001+99999999999. 004301199099999195 0051518004..... 0043011 990999991950051512004.. This processing sorts and groups the key-value pairs by key..0500001N9+01111+99999999999....) (212. -11) (1949. which we i gnore in our map function...99999 99N9-00111+99999999999..9999999N9+00001+99999999999. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. suspect. 0043012650999991949032418004.) (424. 004301199099999195 0051512004.9999999N9+00001+99999999999. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.. This processing sorts and groups the key-value pairs by key...9999999N9+00221+99999999999... consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.) (424.... 0067011990999991950051507004. 22. These line s are presented to the map function as the key-value pairs: (0.0500001N9+01111+99999999999........ 004301199099999195 0051512004. These line s are presented to the map function as the key-value pairs: (0. 0043011990999991950051518004.. 0043011990999991950051518004....9999999N9+00001+99999999999.0500001N9 +00781+99999999999. or erroneous. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.... These line s are presented to the map function as the key-value pairs: (0.. -11]) Each year appears with a list of all its air temperature readings....0500001N9+00781+99999999999. So. [111. suspect. indicated by ellipses): 0067011990999991950051507004. 004301199099999195 .. 004301199099999195 0051512004.) (318. 0043012650999991949032418004.. indicated by ellipses): 0067011990999991950051507004... 78]) (1950.) (106..99999 99N9-00111+99999999999. 0) (1950.... cont inuing the example.9999999N9-00111+99999999999. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page..... To visualize the way the map works. 22... The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.... 111) (1949... 0043012650999991949032412004...) (106. . 0043012650999991949032412004... 0043012650999991949032412004. 004301199099999195 0051518004....9999999N9+00221+99999999999. 0043012650999991949032418004. suspect.. 0) (1950.... our reduce function sees the following input: (1949. The map function merely extracts the year and the air temperature (indicated in bold text). 0043011990999991950051518004.9999999N9+00001+99999999999.. 111) (1949.....9999999N9+00221+99999999999... 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.9999999N9+00001+99999999999. 0067011990999991950051507004.. 22) (1950. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.0500001N9+01111+9999999 9999.0500001N9+00781+99999999999.9999999N9+00221+99999999999.....9999999N9-00111+99999999999.

. cont inuing the example.) (318.. 78]) (1950.99999 99N9-00111+99999999999.. 004301199099999195 0051512004.. 0043012650999991949032412004. 0043012650999991949032418004..99999 99N9-00111+99999999999. The map function merely extracts the year and the air temperature (indicated in bold text). 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function..0500001N9+01111+99999999999.) The keys are the line offsets within the file.. .0500001N9+01111+99999999999. This processing sorts and groups the key-value pairs by key. -11]) Each year appears with a list of all its air temperature readings..) The keys are the line offsets within the file...0500001N9+01111+9999999 9999...... [111. 22.. The map function merely extracts the year and the air temperature (indicated in bold text).. [0. which we i gnore in our map function.. 78]) (1950..0051518004.... . So. cont inuing the example. This processing sorts and groups the key-value pairs by key... or erroneous. These line s are presented to the map function as the key-value pairs: (0... -11]) Each year appears with a list of all its air temperature readings...0500001N9 +00781+99999999999.9999999N9+00001+99999999999.. 22.) (424... 22) (1950. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. 111) (1949.. -11) (1949..) (212. 78]) (1950. 0067011990999991950051507004. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.0500001N9+00781+99999999999. 111) (1949.9999999N9+00221+99999999999..) (106.. our reduce function sees the following input: (1949. 22) (1950. To visualize the way the map works. [111.. 0) (1950. suspect... consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. The map function merely extracts the year and the air temperature (indicated in bold text). 0) (1950..9999999N9+00221+99999999999.) (318.9999999N9+00001+99999999999. indicated by ellipses): 0067011990999991950051507004... 0043012650999991949032418004. All the reduce function has to do now is iterate throu .) (424. So..9999999N9+00001+99999999999... [0.9999999N9-00111+99999999999. 004301199099999195 0051518004. 22) (1950..0500001N9 +00781+99999999999.. cont inuing the example. [111... 004301199099999195 0051518004. indicated by ellipses): 0067011990999991950051507004..) (318.. This processing sorts and groups the key-value pairs by key. 0043012650999991949032418004. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.. -11]) Each year appears with a list of all its air temperature readings.9999999N9+00221+99999999999. These line s are presented to the map function as the key-value pairs: (0.9999999N9-00111+99999999999... . 0043012650999991949032418004.9999999N9-00111+99999999999.... and emits them as its output (the tempera ture values have been interpreted as integers): (1950. 0043012650999991949032412004. 0043012650999991949032418004..0500001N9 +00781+99999999999. 0043012650999991949032412004. or erroneous. So.....0500001N9+01111+99999999999.. 0043011990999991950051518004... 0043011990999991950051518004. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.0500001N9+00781+99999999999. 0043012650999991949032412004... suspect. our reduce function sees the following input: (1949.. 0043011 990999991950051512004...) The keys are the line offsets within the file. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year....) (424. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. 0043011 990999991950051512004... which we i gnore in our map function.. 0043012650999991949032412004.....9999999N9+00001+99999999999. our reduce function sees the following input: (1949..9999999N9+00221+99999999999.) (106. [0. which we i gnore in our map function.. -11) (1949. 22.... 0) (1950. To visualize the way the map works. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. 111) (1949.. 0067011990999991950051507004. 004301199099999195 0051512004.0500001N9+01111+9999999 9999..) (212. -11) (1949...

..9999999N9+00221+99999999999... 004301199099999195 0051512004.9999999N9-00111+99999999999... consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. 0067011990999991950051507004..0500001N9+00781+99999999999.. 0043012650999991949032418004.9999999N9+00001+99999999999.. 0043012650999991949032412004. 0043011990999991950051518004... 22.. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.........9999999N9+00221+99999999999. .. suspect..gh on it: finding the maximum temperature for each year. To visualize the way the map works.) (106... So.) (106. 78]) (1950.99999 99N9-00111+99999999999. -11]) Each year appears with a list of all its air temperature readings. or erroneous. These line s are presented to the map function as the key-value pairs: (0. [0. 0043011 990999991950051512004. 0043011990999991950051518004. 0) (1950... consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. 0067011990999991950051507004. This processing sorts and groups the key-value pairs by key.....9999999N9+00001+99999999999.. indicated by ellipses): 0067011990999991950051507004.9999999N9+00221+99999999999.. suspect. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.0500001N9 +00781+99999999999. 004301199099999195 .. [111. cont inuing the example.) (106. So. cont inuing the example.0500001N9+01111+9999999 9999.. 0043012650999991949032412004.. This processing sorts and groups the key-value pairs by key... which we i gnore in our map function.) (212. -11) (1949... The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing..) (318. suspect. 22. To visualize the way the map works.. ... 0043012650999991949032418004. 0043011 990999991950051512004.0500001N9+01111+9999999 9999.9999999N9-00111+99999999999..9999999N9+00001+99999999999.9999999N9+00221+99999999999. To visualize the way the map works..) (212. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.. 22) (1950. or erroneous. 78]) (1950. 0043012650999991949032418004.. 0067011990999991950051507004..9999999N9+00221+99999999999...0500001N9+01111+99999999999. indicated by ellipses): 0067011990999991950051507004. The map function merely extracts the year and the air temperature (indicated in bold text).. 0043012650999991949032412004.... -11) (1949.. 111) (1949.. [111..) (212.. 0043011990999991950051518004.0500001N9 +00781+99999999999.........0500001N9+00781+99999999999.) (424. -11]) Each year appears with a list of all its air temperature readings.. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.9999999N9+00001+99999999999.) The keys are the line offsets within the file. 0043012650999991949032418004. 0043012650999991949032412004. These line s are presented to the map function as the key-value pairs: (0..) (424..0500001N9+01111+9999999 9999.. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.. indicated by ellipses): 0067011990999991950051507004... which we i gnore in our map function....9999999N9+00001+99999999999.. 0043011 990999991950051512004.. [0. 004301199099999195 0051512004. The map function merely extracts the year and the air temperature (indicated in bold text)...) (318. our reduce function sees the following input: (1949.. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function..) The keys are the line offsets within the file. 004301199099999195 0051518004.. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.. 004301199099999195 0051512004. 004301199099999195 0051518004..9999999N9+00221+99999999999. 0043012650999991949032412004......0500001N9+00781+99999999999.0500001N9+01111+99999999999... our reduce function sees the following input: (1949. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. 111) (1949.. or erroneous. 0) (1950..9999999N9+00001+99999999999. These line s are presented to the map function as the key-value pairs: (0. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.... 22) (1950........99999 99N9-00111+99999999999..99999 99N9-00111+99999999999.. 0043012650999991949032418004.

) The keys are the line offsets within the file..0500001N9+01111+99999999999. our reduce function sees the following input: (1949.0500001N9+01111+99999999999.9999999N9+00221+99999999999...9999999N9+00001+99999999999.... and emits them as its output (the tempera ture values have been interpreted as integers): (1950. 004301199099999195 0051518004.. 22) (1950... 78]) (1950. 0043012650999991949032412004. 111) (1949..) (106....0500001N9+01111+99999999999..) (424.0500001N9+01111+9999999 9999.. ....) (106...... 0043012650999991949032418004. 0043011990999991950051518004. indicated by ellipses): 0067011990999991950051507004... 0) (1950. The map function merely extracts the year and the air temperature (indicated in bold text).99999 99N9-00111+99999999999.0500001N9+01111+9999999 9999... So. [0. 78]) (1950. -11) (1949.0500001N9+00781+99999999999.. These line s are presented to the map function as the key-value pairs: (0. 22. This processing sorts and groups the key-value pairs by key. 0043012650999991949032412004.9999999N9+00001+99999999999..) (318. These line s are presented to the map function as the key-value pairs: (0.) (318. 0043012650999991949032418004. To visualize the way the map works. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. -11) (1949.. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.0051518004..0500001N9 +00781+99999999999.) (212. 0067011990999991950051507004. [111. 0) (1950. which we i gnore in our map function..... The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing... All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year..0500001N9 +00781+99999999999...0500001N9 +00781+99999999999. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.... The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. 0043011 990999991950051512004. 004301199099999195 0051512004..) (424. So.. -11]) Each year appears with a list of all its air temperature readings... [111.9999999N9-00111+99999999999.. [0. 22) (1950.0500001N9+00781+99999999999.. our reduce function sees the following input: (1949. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. suspect.. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. which we i gnore in our map function.... 0043012650999991949032412004. To visualize the way the map works..9999999N9+00221+99999999999..9999999N9-00111+99999999999.9999999N9+00221+99999999999....99999 99N9-00111+99999999999.. The map function merely extracts the year and the air temperature (indicated in bold text). 0) (1950.. cont inuing the example.9999999N9-00111+99999999999... -11]) Each year appears with a list of all its air temperature readings.. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. 22.. 004301199099999195 0051512004. 0043012650999991949032412004.. or erroneous. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. All the reduce function has to do now is iterate throu . 0067011990999991950051507004... which we i gnore in our map function. 111) (1949.9999999N9+00221+99999999999. -11]) Each year appears with a list of all its air temperature readings. our reduce function sees the following input: (1949. [0. 0043012650999991949032418004.. . 0043012650999991949032412004.. cont inuing the example. 22.. 22) (1950.. -11) (1949... cont inuing the example. So.) (424.. The map function merely extracts the year and the air temperature (indicated in bold text). or erroneous.) (212..) (318.9999999N9+00001+99999999999.. 78]) (1950..9999999N9+00001+99999999999. 0043012650999991949032418004... .. [111.. This processing sorts and groups the key-value pairs by key. This processing sorts and groups the key-value pairs by key. 0043012650999991949032418004.) The keys are the line offsets within the file. indicated by ellipses): 0067011990999991950051507004.. 0043011 990999991950051512004.. 0043011990999991950051518004.... 004301199099999195 0051518004.) The keys are the line offsets within the file... suspect. 111) (1949.

0) (1950....9999999N9+00001+99999999999.. or erroneous.. 0043012650999991949032412004.) (424. [111..gh on it: finding the maximum temperature for each year. 22. So. 0043011990999991950051518004.0500001N9 +00781+99999999999... or erroneous.9999999N9-00111+99999999999.. suspect.. 22) (1950. 0043011990999991950051518004. 004301199099999195 0051512004... 22) (1950. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.) (106. [111. The map function merely extracts the year and the air temperature (indicated in bold text)....99999 99N9-00111+99999999999. . This processing sorts and groups the key-value pairs by key. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page... 004301199099999195 .0500001N9+01111+9999999 9999..) (424.. -11) (1949... -11]) Each year appears with a list of all its air temperature readings.) (212...0500001N9 +00781+99999999999..0500001N9+01111+99999999999...9999999N9+00001+99999999999. cont inuing the example. which we i gnore in our map function.. [0..) (106.. which we i gnore in our map function. 78]) (1950. indicated by ellipses): 0067011990999991950051507004.. cont inuing the example... 004301199099999195 0051518004..) (318. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page...9999999N9+00221+99999999999... 0043011 990999991950051512004....0500001N9+01111+99999999999. 0043012650999991949032418004.. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page... To visualize the way the map works... This processing sorts and groups the key-value pairs by key.. These line s are presented to the map function as the key-value pairs: (0. . 111) (1949..99999 99N9-00111+99999999999.... 0043012650999991949032412004.. [0. 0043012650999991949032418004..0500001N9+01111+9999999 9999. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. our reduce function sees the following input: (1949.... 0067011990999991950051507004.... 0043012650999991949032418004... 004301199099999195 0051512004. -11]) Each year appears with a list of all its air temperature readings. 0) (1950.. 22. or erroneous.) (106. These line s are presented to the map function as the key-value pairs: (0.9999999N9+00221+99999999999....) The keys are the line offsets within the file..0500001N9+00781+99999999999. 0067011990999991950051507004. 0043012650999991949032418004..9999999N9-00111+99999999999.0500001N9+00781+99999999999. 111) (1949. So. our reduce function sees the following input: (1949.9999999N9+00221+99999999999..9999999N9+00001+99999999999... indicated by ellipses): 0067011990999991950051507004..9999999N9+00221+99999999999. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.... -11) (1949....) (212..... 0043012650999991949032418004.) The keys are the line offsets within the file. To visualize the way the map works.. 0043012650999991949032412004.. The map function merely extracts the year and the air temperature (indicated in bold text).) (212.... and emits them as its output (the tempera ture values have been interpreted as integers): (1950. suspect.... 0043012650999991949032412004.9999999N9+00001+99999999999. 0043011 990999991950051512004.. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.. 004301199099999195 0051518004..9999999N9+00001+99999999999.) (318. To visualize the way the map works. These line s are presented to the map function as the key-value pairs: (0.. suspect. 78]) (1950..9999999N9+00001+99999999999... indicated by ellipses): 0067011990999991950051507004..9999999N9+00221+99999999999.0500001N9+01111+9999999 9999.0500001N9+00781+99999999999. 0043012650999991949032412004.. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.. 0043011 990999991950051512004.... The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing... 0067011990999991950051507004. 0043011990999991950051518004.99999 99N9-00111+99999999999.9999999N9+00221+99999999999. 004301199099999195 0051512004... All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year..

. 0043011990999991950051518004. 22) (1950....0500001N9+01111+99999999999.9999999N9-00111+99999999999.... 004301199099999195 0051518004...9999999N9+00001+99999999999... 0043012650999991949032418004. or erroneous.... -11) (1949. This processing sorts and groups the key-value pairs by key. 111) (1949... 22. 0) (1950. These line s are presented to the map function as the key-value pairs: (0... 0043012650999991949032418004.) (424. suspect. [111.9999999N9+00221+99999999999. [0. suspect. The map function merely extracts the year and the air temperature (indicated in bold text)... and emits them as its output (the tempera ture values have been interpreted as integers): (1950..99999 99N9-00111+99999999999.. 0043011 990999991950051512004.) (318.. 0043012650999991949032412004. our reduce function sees the following input: (1949.. cont inuing the example... 0) (1950..... and emits them as its output (the tempera ture values have been interpreted as integers): (1950.9999999N9-00111+99999999999. All the reduce function has to do now is iterate throu .0500001N9+01111+99999999999. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.. our reduce function sees the following input: (1949.) The keys are the line offsets within the file... 0043012650999991949032412004.9999999N9+00221+99999999999.) (106. which we i gnore in our map function. 0043012650999991949032418004. -11) (1949....... 0043011990999991950051518004. 0043012650999991949032418004. which we i gnore in our map function.. 0067011990999991950051507004... To visualize the way the map works...0500001N9 +00781+99999999999.. our reduce function sees the following input: (1949.. [0. 22. 0043012650999991949032412004. indicated by ellipses): 0067011990999991950051507004..) (212.9999999N9+00001+99999999999.. 78]) (1950.. So. 111) (1949..0500001N9+01111+9999999 9999... cont inuing the example.0500001N9+01111+9999999 9999. 004301199099999195 0051512004.. So.) The keys are the line offsets within the file.0500001N9 +00781+99999999999. -11]) Each year appears with a list of all its air temperature readings. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. and emits them as its output (the tempera ture values have been interpreted as integers): (1950...) The keys are the line offsets within the file. 0067011990999991950051507004.. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing..9999999N9+00001+99999999999. So.) (424. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. ... 0043012650999991949032412004..0500001N9+00781+99999999999..) (318. These line s are presented to the map function as the key-value pairs: (0.. This processing sorts and groups the key-value pairs by key..) (424.. which we i gnore in our map function. -11]) Each year appears with a list of all its air temperature readings.9999999N9+00001+99999999999. or erroneous. -11) (1949.9999999N9+00221+99999999999. 22) (1950... This processing sorts and groups the key-value pairs by key.. .. 004301199099999195 0051512004. 78]) (1950. indicated by ellipses): 0067011990999991950051507004. 0043012650999991949032412004. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing..99999 99N9-00111+99999999999.9999999N9+00221+99999999999.0500001N9+01111+99999999999.0500001N9 +00781+99999999999.0500001N9+00781+99999999999. 22) (1950... 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.. -11]) Each year appears with a list of all its air temperature readings.. 004301199099999195 0051518004. 0043011 990999991950051512004. cont inuing the example.) (318. 0) (1950.. 78]) (1950. To visualize the way the map works... The map function merely extracts the year and the air temperature (indicated in bold text). 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.0051518004.. [0.. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.... The map function merely extracts the year and the air temperature (indicated in bold text)... 22.. . [111. 0043012650999991949032418004. 111) (1949.) (106..) (212.... [111.9999999N9-00111+99999999999.. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.

) (424. cont inuing the example..0500001N9+00781+99999999999.. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.) The keys are the line offsets within the file..99999 99N9-00111+99999999999.. 004301199099999195 0051518004..) (318.9999999N9+00001+99999999999. or erroneous.. 0043012650999991949032418004.) (106. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing..0500001N9 +00781+99999999999...9999999N9+00221+99999999999... or erroneous.. 0043011 990999991950051512004.. 22) (1950.. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page...) (106. . 004301199099999195 . or erroneous. 0043012650999991949032418004..9999999N9+00221+99999999999.. 111) (1949... To visualize the way the map works. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.. 0067011990999991950051507004. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. -11]) Each year appears with a list of all its air temperature readings. 004301199099999195 0051512004....... 22. 004301199099999195 0051512004. To visualize the way the map works. suspect. The map function merely extracts the year and the air temperature (indicated in bold text).. 0043012650999991949032418004..0500001N9+00781+99999999999. 0043011990999991950051518004.... 78]) (1950. our reduce function sees the following input: (1949.. suspect.9999999N9+00001+99999999999.....9999999N9+00221+99999999999.. These line s are presented to the map function as the key-value pairs: (0....) The keys are the line offsets within the file. 0043011990999991950051518004.9999999N9-00111+99999999999. -11) (1949. This processing sorts and groups the key-value pairs by key...) (212. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.99999 99N9-00111+99999999999..... 004301199099999195 0051512004. 0) (1950. 0043011 990999991950051512004.9999999N9+00221+99999999999. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function..) (318..0500001N9+01111+99999999999..9999999N9+00221+99999999999..9999999N9+00001+99999999999. 0043012650999991949032418004..... 0067011990999991950051507004.. 111) (1949. 22) (1950...) (212.9999999N9+00001+99999999999. [0.. 22.. indicated by ellipses): 0067011990999991950051507004.) (106.... So...... 004301199099999195 0051518004.) (212. which we i gnore in our map function. These line s are presented to the map function as the key-value pairs: (0. 0043012650999991949032412004. These line s are presented to the map function as the key-value pairs: (0. which we i gnore in our map function. So. cont inuing the example...0500001N9+01111+99999999999.) (424.... 0043011990999991950051518004. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.0500001N9 +00781+99999999999.0500001N9+01111+9999999 9999.9999999N9+00221+99999999999...9999999N9-00111+99999999999. indicated by ellipses): 0067011990999991950051507004.. This processing sorts and groups the key-value pairs by key.gh on it: finding the maximum temperature for each year...9999999N9+00001+99999999999... [111..... 0043012650999991949032412004..... To visualize the way the map works... [111.9999999N9+00001+99999999999. -11]) Each year appears with a list of all its air temperature readings.99999 99N9-00111+99999999999.... our reduce function sees the following input: (1949. 0043011 990999991950051512004.. suspect. 0043012650999991949032412004.. 0) (1950...0500001N9+00781+99999999999. 0043012650999991949032412004. [0. 78]) (1950. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.0500001N9+01111+9999999 9999. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.. 0043012650999991949032412004. indicated by ellipses): 0067011990999991950051507004.0500001N9+01111+9999999 9999. The map function merely extracts the year and the air temperature (indicated in bold text). 0067011990999991950051507004.... 0043012650999991949032418004. -11) (1949..... .

which we i gnore in our map function. 111) (1949. 0043012650999991949032412004.0500001N9 +00781+99999999999. 0) (1950. [111.. .9999999N9-00111+99999999999.. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. To visualize the way the map works... 0043011 990999991950051512004.. 0043012650999991949032418004...) (318.. 004301199099999195 0051518004. So. 111) (1949...9999999N9+00221+99999999999.0500001N9+01111+99999999999. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.. 0) (1950. The map function merely extracts the year and the air temperature (indicated in bold text). 22.. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year. 22) (1950..0051518004.. The map function merely extracts the year and the air temperature (indicated in bold text).. 0043012650999991949032412004. 0043011990999991950051518004.. 78]) (1950.) The keys are the line offsets within the file..) (424. 0043012650999991949032418004..... 78]) (1950. and emits them as its output (the tempera ture values have been interpreted as integers): (1950...0500001N9+01111+99999999999.) (212.) (212.... -11]) Each year appears with a list of all its air temperature readings.0500001N9+00781+99999999999.) (106.. [0.. -11) (1949...... our reduce function sees the following input: (1949.) (424. which we i gnore in our map function.9999999N9+00001+99999999999. 0067011990999991950051507004.. -11]) Each year appears with a list of all its air temperature readings.) The keys are the line offsets within the file... 0043012650999991949032412004. 0043011 990999991950051512004. This processing sorts and groups the key-value pairs by key.. These line s are presented to the map function as the key-value pairs: (0..9999999N9+00001+99999999999.. So.0500001N9 +00781+99999999999. our reduce function sees the following input: (1949... cont inuing the example.. 0043012650999991949032418004. 0043011990999991950051518004.) (318.. or erroneous.. 22) (1950. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.. 111) (1949..9999999N9+00221+99999999999.. The map function merely extracts the year and the air temperature (indicated in bold text). or erroneous. To visualize the way the map works... indicated by ellipses): 0067011990999991950051507004.. 0) (1950..... 004301199099999195 0051512004.99999 99N9-00111+99999999999.. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. All the reduce function has to do now is iterate throu ..... -11) (1949.. . [0.. 22....0500001N9+01111+99999999999. [111.) (424. These line s are presented to the map function as the key-value pairs: (0.... suspect. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. -11]) Each year appears with a list of all its air temperature readings.99999 99N9-00111+99999999999... indicated by ellipses): 0067011990999991950051507004.0500001N9+01111+9999999 9999. [0. 22. suspect. This processing sorts and groups the key-value pairs by key. [111. cont inuing the example. which we i gnore in our map function.. So. 0043012650999991949032418004. 0043012650999991949032412004.0500001N9+01111+9999999 9999.. This processing sorts and groups the key-value pairs by key.. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.9999999N9-00111+99999999999.. .) (318... The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing....... All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.9999999N9+00221+99999999999. -11) (1949. 0043012650999991949032412004.) The keys are the line offsets within the file. 0067011990999991950051507004.. 004301199099999195 0051518004. 0043012650999991949032418004. cont inuing the example.... 22) (1950.9999999N9+00001+99999999999.0500001N9+00781+99999999999. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.9999999N9+00221+99999999999.) (106.0500001N9 +00781+99999999999..9999999N9-00111+99999999999. our reduce function sees the following input: (1949... 78]) (1950.. 004301199099999195 0051512004.9999999N9+00001+99999999999.

0043012650999991949032412004. The map function merely extracts the year and the air temperature (indicated in bold text)..) (106.9999999N9+00221+99999999999..99999 99N9-00111+99999999999........ So..9999999N9+00001+99999999999.. 0043012650999991949032412004.. 0) (1950.. 22) (1950.9999999N9+00221+99999999999.0500001N9+00781+99999999999. -11) (1949... 0) (1950. cont inuing the example. 0043012650999991949032418004..0500001N9+01111+99999999999.) (106. or erroneous.. 0043012650999991949032412004.0500001N9 +00781+99999999999.0500001N9+01111+9999999 9999..) The keys are the line offsets within the file. 004301199099999195 ... 004301199099999195 0051512004... These line s are presented to the map function as the key-value pairs: (0. To visualize the way the map works. This processing sorts and groups the key-value pairs by key. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.. our reduce function sees the following input: (1949. To visualize the way the map works..0500001N9+01111+9999999 9999.... 0067011990999991950051507004. -11]) Each year appears with a list of all its air temperature readings. This processing sorts and groups the key-value pairs by key..... 0043011990999991950051518004.. These line s are presented to the map function as the key-value pairs: (0.. 004301199099999195 0051512004. cont inuing the example.) (212...... or erroneous. 22) (1950. So.... The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. indicated by ellipses): 0067011990999991950051507004.9999999N9+00001+99999999999.. our reduce function sees the following input: (1949.... consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.) (212. 0043012650999991949032418004.. indicated by ellipses): 0067011990999991950051507004. 0067011990999991950051507004. or erroneous.9999999N9-00111+99999999999. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page..0500001N9 +00781+99999999999...9999999N9-00111+99999999999... 0043011 990999991950051512004. .) (318.. 111) (1949. [111. 004301199099999195 0051518004.. 78]) (1950....9999999N9+00221+99999999999..) (424.) (318.. To visualize the way the map works.9999999N9+00001+99999999999...0500001N9+01111+9999999 9999..... [0.. and emits them as its output (the tempera ture values have been interpreted as integers): (1950... suspect.. 004301199099999195 0051512004.... [111.gh on it: finding the maximum temperature for each year. suspect.9999999N9+00221+99999999999. 0067011990999991950051507004. 0043012650999991949032418004. [0.99999 99N9-00111+99999999999. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. 0043011990999991950051518004. which we i gnore in our map function.9999999N9+00001+99999999999.0500001N9+00781+99999999999. 004301199099999195 0051518004. The map function merely extracts the year and the air temperature (indicated in bold text).. 0043011990999991950051518004. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function....9999999N9+00001+99999999999.0500001N9+01111+99999999999.. -11) (1949. 0043012650999991949032412004...99999 99N9-00111+99999999999. 0043011 990999991950051512004....9999999N9+00221+99999999999... 0043012650999991949032418004...0500001N9+00781+99999999999. 0043012650999991949032412004.. 78]) (1950..) (212. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.. indicated by ellipses): 0067011990999991950051507004.. 22...... 111) (1949...) (424. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year...9999999N9+00221+99999999999.) The keys are the line offsets within the file. which we i gnore in our map function.. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.. These line s are presented to the map function as the key-value pairs: (0. 0043012650999991949032418004..) (106. ...9999999N9+00001+99999999999.. -11]) Each year appears with a list of all its air temperature readings... 0043011 990999991950051512004. suspect. 22.

...) (106. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.. The map function merely extracts the year and the air temperature (indicated in bold text). 78]) (1950....) (212.....0500001N9 +00781+99999999999.. 0043012650999991949032412004. [111.. which we i gnore in our map function. ..0051518004. The map function merely extracts the year and the air temperature (indicated in bold text).) The keys are the line offsets within the file. 22) (1950. -11]) Each year appears with a list of all its air temperature readings..0500001N9+01111+9999999 9999.. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.. [0.0500001N9+01111+99999999999..9999999N9-00111+99999999999.0500001N9 +00781+99999999999.0500001N9+01111+99999999999. indicated by ellipses): 0067011990999991950051507004. or erroneous. To visualize the way the map works.) (424.. 22.0500001N9+00781+99999999999.. 0043012650999991949032412004.. 0067011990999991950051507004..... consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page... All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.... [111.. All the reduce function has to do now is iterate throu ..) (318.9999999N9+00221+99999999999. our reduce function sees the following input: (1949. 22. 22) (1950. which we i gnore in our map function.. -11]) Each year appears with a list of all its air temperature readings. [111.. These line s are presented to the map function as the key-value pairs: (0..9999999N9+00001+99999999999. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year. 0043012650999991949032412004... 78]) (1950. which we i gnore in our map function... 22.. 0043011990999991950051518004... . indicated by ellipses): 0067011990999991950051507004. our reduce function sees the following input: (1949. [0. cont inuing the example.. This processing sorts and groups the key-value pairs by key... cont inuing the example. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.. 0043012650999991949032418004.) (318. To visualize the way the map works.... consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. or erroneous.0500001N9+01111+9999999 9999. -11) (1949... So. suspect..) (424..) The keys are the line offsets within the file. .. 004301199099999195 0051518004.. 0043011 990999991950051512004. The map function merely extracts the year and the air temperature (indicated in bold text). These line s are presented to the map function as the key-value pairs: (0. This processing sorts and groups the key-value pairs by key.99999 99N9-00111+99999999999. 0043012650999991949032418004.0500001N9+00781+99999999999. 0043012650999991949032418004.. 0043012650999991949032418004...9999999N9+00001+99999999999.. cont inuing the example.. 111) (1949... 0) (1950.. 004301199099999195 0051518004.. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. 0043011 990999991950051512004.. 78]) (1950. 0043011990999991950051518004.. 111) (1949.9999999N9+00221+99999999999.. 0067011990999991950051507004. our reduce function sees the following input: (1949..0500001N9+01111+99999999999.. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.... 004301199099999195 0051512004. [0.. This processing sorts and groups the key-value pairs by key.) (106. So.99999 99N9-00111+99999999999.. suspect. 0) (1950.9999999N9-00111+99999999999.. -11]) Each year appears with a list of all its air temperature readings. 0043012650999991949032418004. 004301199099999195 0051512004.. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function....9999999N9+00001+99999999999..) (212.) (424. 0043012650999991949032412004.0500001N9 +00781+99999999999. -11) (1949... -11) (1949. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.. 0) (1950. 22) (1950..9999999N9-00111+99999999999. 0043012650999991949032412004.) (318.) The keys are the line offsets within the file.9999999N9+00221+99999999999... 111) (1949..9999999N9+00001+99999999999..9999999N9+00221+99999999999. So..

0043012650999991949032418004. 0) (1950... or erroneous. 22) (1950. So...0500001N9+00781+99999999999...9999999N9+00221+99999999999.. -11]) Each year appears with a list of all its air temperature readings. -11) (1949. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.99999 99N9-00111+99999999999. 0067011990999991950051507004. and emits them as its output (the tempera ture values have been interpreted as integers): (1950...0500001N9+00781+99999999999..) (424.... indicated by ellipses): 0067011990999991950051507004. or erroneous. To visualize the way the map works... 0043012650999991949032418004.. suspect... So... suspect. 0043011990999991950051518004.. These line s are presented to the map function as the key-value pairs: (0...99999 99N9-00111+99999999999.0500001N9+00781+99999999999. our reduce function sees the following input: (1949. 78]) (1950. [0.... [0..... 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.. indicated by ellipses): 0067011990999991950051507004.....0500001N9+01111+99999999999. which we i gnore in our map function.. cont inuing the example. 0043012650999991949032418004.... All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year. 0043011990999991950051518004.) (212.. suspect. 111) (1949..9999999N9+00221+99999999999.9999999N9+00001+99999999999. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. 004301199099999195 0051518004.. 004301199099999195 0051512004.. 004301199099999195 0051512004.. The map function merely extracts the year and the air temperature (indicated in bold text).. 0043012650999991949032412004. 0043011990999991950051518004... This processing sorts and groups the key-value pairs by key. .) (106.. 78]) (1950.9999999N9+00001+99999999999.. 0) (1950. our reduce function sees the following input: (1949.. 0043012650999991949032412004...0500001N9+01111+99999999999.. 0043012650999991949032412004.... consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page..) (424.. These line s are presented to the map function as the key-value pairs: (0.) (212.....9999999N9+00221+99999999999. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page..0500001N9+01111+9999999 9999. .0500001N9+01111+9999999 9999. 0067011990999991950051507004.. 0043012650999991949032418004.) (318..9999999N9-00111+99999999999.. -11]) Each year appears with a list of all its air temperature readings. which we i gnore in our map function... 0043011 990999991950051512004. 22. 0043011 990999991950051512004...0500001N9 +00781+99999999999. -11) (1949. 004301199099999195 0051518004.. 22.9999999N9+00221+99999999999.......9999999N9+00221+99999999999..9999999N9-00111+99999999999...9999999N9+00001+99999999999...9999999N9+00001+99999999999.. 004301199099999195 0051512004. 111) (1949... The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.) The keys are the line offsets within the file... This processing sorts and groups the key-value pairs by key.. 0043012650999991949032418004.0500001N9+01111+9999999 9999.. 0067011990999991950051507004. To visualize the way the map works..) (318. [111.....) (106.) The keys are the line offsets within the file. or erroneous..9999999N9+00001+99999999999. [111. indicated by ellipses): 0067011990999991950051507004.. 0043011 990999991950051512004..gh on it: finding the maximum temperature for each year.9999999N9+00221+99999999999..... 0043012650999991949032412004... All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year..) (106.) (212.0500001N9 +00781+99999999999.9999999N9+00001+99999999999. These line s are presented to the map function as the key-value pairs: (0. To visualize the way the map works. cont inuing the example.99999 99N9-00111+99999999999.. 0043012650999991949032412004. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. 22) (1950. 004301199099999195 .. The map function merely extracts the year and the air temperature (indicated in bold text).

9999999N9+00001+99999999999... [111. 004301199099999195 0051518004.. [111. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.99999 99N9-00111+99999999999... our reduce function sees the following input: (1949..) (318. 22. indicated by ellipses): 0067011990999991950051507004. 0043012650999991949032412004. 0) (1950.9999999N9+00001+99999999999... [0..9999999N9+00221+99999999999.) (424. and emits them as its output (the tempera ture values have been interpreted as integers): (1950....) (424. This processing sorts and groups the key-value pairs by key.) The keys are the line offsets within the file.... -11]) Each year appears with a list of all its air temperature readings. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. 0043012650999991949032412004. The map function merely extracts the year and the air temperature (indicated in bold text). 0067011990999991950051507004. To visualize the way the map works. 0043012650999991949032412004.. All the reduce function has to do now is iterate throu .0500001N9+00781+99999999999. 0043012650999991949032412004.) The keys are the line offsets within the file.) (106.. 111) (1949. 004301199099999195 0051518004.) (212. or erroneous. 0043012650999991949032418004..... 22) (1950. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.... 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. So..) (424.. suspect. This processing sorts and groups the key-value pairs by key.9999999N9+00221+99999999999....... 0043011 990999991950051512004. 004301199099999195 0051512004. . cont inuing the example.) (212..) (318. 0043012650999991949032418004. 0043011990999991950051518004. which we i gnore in our map function.. -11) (1949. These line s are presented to the map function as the key-value pairs: (0..0500001N9+00781+99999999999. 22) (1950.. 22... So.0500001N9+01111+9999999 9999. -11]) Each year appears with a list of all its air temperature readings. cont inuing the example. These line s are presented to the map function as the key-value pairs: (0..0500001N9+01111+9999999 9999.. our reduce function sees the following input: (1949.... 0043012650999991949032418004.....0500001N9+01111+99999999999.. [111. -11) (1949..9999999N9+00221+99999999999. our reduce function sees the following input: (1949..9999999N9+00001+99999999999. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. or erroneous. which we i gnore in our map function. 111) (1949.0500001N9 +00781+99999999999.9999999N9+00001+99999999999.. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. 0043012650999991949032418004..9999999N9-00111+99999999999.. 22.. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.. suspect.. 0043011990999991950051518004. indicated by ellipses): 0067011990999991950051507004. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.0500001N9 +00781+99999999999.. [0..99999 99N9-00111+99999999999..... 004301199099999195 0051512004.. 111) (1949. 0043012650999991949032412004. -11) (1949.9999999N9-00111+99999999999.. -11]) Each year appears with a list of all its air temperature readings. To visualize the way the map works. .0500001N9+01111+99999999999. cont inuing the example. which we i gnore in our map function. 22) (1950.. This processing sorts and groups the key-value pairs by key..) The keys are the line offsets within the file.. .. The map function merely extracts the year and the air temperature (indicated in bold text). 0043011 990999991950051512004.. 0067011990999991950051507004...0051518004. 0) (1950..... 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. 0043012650999991949032418004.. The map function merely extracts the year and the air temperature (indicated in bold text)...) (106. [0. 78]) (1950.. 0) (1950..9999999N9-00111+99999999999..9999999N9+00221+99999999999.0500001N9 +00781+99999999999... The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.) (318.. 78]) (1950. 78]) (1950..0500001N9+01111+99999999999.. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.. So...

...9999999N9+00221+99999999999... -11) (1949... 004301199099999195 0051512004. 0043012650999991949032418004. 004301199099999195 0051518004...) (106. suspect. The map function merely extracts the year and the air temperature (indicated in bold text). 004301199099999195 .. [0..9999999N9+00221+99999999999.9999999N9+00221+99999999999.9999999N9+00221+99999999999. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.9999999N9+00001+99999999999..... consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.9999999N9+00001+99999999999.... cont inuing the example.....) The keys are the line offsets within the file.0500001N9+01111+9999999 9999. -11]) Each year appears with a list of all its air temperature readings.. 0067011990999991950051507004..0500001N9+00781+99999999999..... 78]) (1950..) (212.. suspect.. The map function merely extracts the year and the air temperature (indicated in bold text)... To visualize the way the map works. [111. 0043012650999991949032418004. 0) (1950.... consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. 0043012650999991949032412004....) (212. 0043011 990999991950051512004. 0043011 990999991950051512004. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year. which we i gnore in our map function. indicated by ellipses): 0067011990999991950051507004.. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function..0500001N9+00781+99999999999. 0067011990999991950051507004.. 0043012650999991949032412004. So... . 111) (1949. 0043011990999991950051518004... [0..0500001N9+01111+9999999 9999. 78]) (1950. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.9999999N9-00111+99999999999.9999999N9+00221+99999999999. 004301199099999195 0051518004.99999 99N9-00111+99999999999. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. indicated by ellipses): 0067011990999991950051507004.0500001N9+01111+9999999 9999.) (212. To visualize the way the map works... The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.....99999 99N9-00111+99999999999. .) (318..0500001N9 +00781+99999999999...9999999N9+00221+99999999999.) (106.. 004301199099999195 0051512004.9999999N9+00001+99999999999.) The keys are the line offsets within the file. 0043012650999991949032418004. cont inuing the example.0500001N9 +00781+99999999999. 0043012650999991949032412004. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing... 22) (1950.. 0043011990999991950051518004.. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. 22. So....) (318. our reduce function sees the following input: (1949..9999999N9+00001+99999999999.. which we i gnore in our map function.0500001N9+00781+99999999999.. These line s are presented to the map function as the key-value pairs: (0.... 004301199099999195 0051512004.. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. 0043012650999991949032418004. 0043012650999991949032412004. indicated by ellipses): 0067011990999991950051507004. 0043012650999991949032418004...9999999N9+00001+99999999999.. 0043012650999991949032412004.. 0067011990999991950051507004. 0) (1950. suspect... 22. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.0500001N9+01111+99999999999..99999 99N9-00111+99999999999...) (106. our reduce function sees the following input: (1949.9999999N9-00111+99999999999..9999999N9+00001+99999999999. This processing sorts and groups the key-value pairs by key.. These line s are presented to the map function as the key-value pairs: (0.. or erroneous. 111) (1949.. To visualize the way the map works..0500001N9+01111+99999999999.. 22) (1950. These line s are presented to the map function as the key-value pairs: (0...... -11]) Each year appears with a list of all its air temperature readings.... [111.) (424.... -11) (1949. 0043011 990999991950051512004. or erroneous.) (424.. or erroneous...... This processing sorts and groups the key-value pairs by key. 0043011990999991950051518004.gh on it: finding the maximum temperature for each year....

) The keys are the line offsets within the file.. 0043011 990999991950051512004... To visualize the way the map works. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. -11]) Each year appears with a list of all its air temperature readings.. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. 22) (1950. To visualize the way the map works.) (318. 111) (1949...9999999N9-00111+99999999999.. -11) (1949. [111. -11]) Each year appears with a list of all its air temperature readings. [0. [111. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. 0043012650999991949032418004. which we i gnore in our map function. our reduce function sees the following input: (1949. cont inuing the example. 0067011990999991950051507004. cont inuing the example... [0.. 0043012650999991949032412004.) (318.) (424... The map function merely extracts the year and the air temperature (indicated in bold text)..9999999N9+00001+99999999999..) (424. These line s are presented to the map function as the key-value pairs: (0. This processing sorts and groups the key-value pairs by key... 0) (1950... 0043012650999991949032418004. 0043012650999991949032412004. 0) (1950.0500001N9+01111+99999999999. .) (106.. 004301199099999195 0051518004.9999999N9+00221+99999999999.. cont inuing the example.. 004301199099999195 0051512004. 0043012650999991949032418004.. 22) (1950..9999999N9-00111+99999999999.0500001N9+01111+99999999999. . which we i gnore in our map function.) The keys are the line offsets within the file. 0043012650999991949032412004. 78]) (1950. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function..) (106. This processing sorts and groups the key-value pairs by key.. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year....... The map function merely extracts the year and the air temperature (indicated in bold text). 004301199099999195 0051512004..9999999N9+00001+99999999999..9999999N9+00221+99999999999. 22. suspect... These line s are presented to the map function as the key-value pairs: (0...0500001N9+01111+9999999 9999....9999999N9+00001+99999999999.0500001N9+00781+99999999999.) (318. 22.... The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.0500001N9+00781+99999999999... 0043012650999991949032418004. 0043012650999991949032412004.) The keys are the line offsets within the file. 78]) (1950. ..... 22) (1950.) (212. -11]) Each year appears with a list of all its air temperature readings.. which we i gnore in our map function.... 0043011 990999991950051512004... -11) (1949.. 111) (1949...0500001N9 +00781+99999999999... our reduce function sees the following input: (1949..) (212.. our reduce function sees the following input: (1949. 0043012650999991949032412004. [0. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. 0043011990999991950051518004. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.. 111) (1949.. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. 78]) (1950. 0) (1950..9999999N9+00221+99999999999. The map function merely extracts the year and the air temperature (indicated in bold text). indicated by ellipses): 0067011990999991950051507004.. or erroneous. indicated by ellipses): 0067011990999991950051507004.. 0043011990999991950051518004..0500001N9 +00781+99999999999.0051518004......9999999N9+00221+99999999999..0500001N9+01111+99999999999. 22...0500001N9 +00781+99999999999.99999 99N9-00111+99999999999. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. All the reduce function has to do now is iterate throu . So.9999999N9-00111+99999999999. -11) (1949.. 004301199099999195 0051518004. 0043012650999991949032418004. So..9999999N9+00001+99999999999. This processing sorts and groups the key-value pairs by key..) (424.... and emits them as its output (the tempera ture values have been interpreted as integers): (1950. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.. [111. or erroneous. suspect... So.0500001N9+01111+9999999 9999. 0067011990999991950051507004.99999 99N9-00111+99999999999.....

. 0067011990999991950051507004.0500001N9+01111+99999999999... 004301199099999195 0051518004... 0043012650999991949032412004.) (212. suspect.) (424... 0043012650999991949032412004... 22. This processing sorts and groups the key-value pairs by key.9999999N9+00001+99999999999. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.) (106. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing....gh on it: finding the maximum temperature for each year.. 0043012650999991949032418004... our reduce function sees the following input: (1949. This processing sorts and groups the key-value pairs by key... All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.... indicated by ellipses): 0067011990999991950051507004. our reduce function sees the following input: (1949. 004301199099999195 0051512004. 22) (1950. 0043011 990999991950051512004.9999999N9+00221+99999999999. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year..... [0. 0043012650999991949032418004.0500001N9 +00781+99999999999. 0067011990999991950051507004... 0043011990999991950051518004.. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. So.0500001N9+00781+99999999999..0500001N9+01111+9999999 9999. 0067011990999991950051507004.. which we i gnore in our map function. 004301199099999195 0051512004.9999999N9+00001+99999999999... 0043012650999991949032412004..) (212. 0043011 990999991950051512004..0500001N9+00781+99999999999.0500001N9+00781+99999999999.. . -11]) Each year appears with a list of all its air temperature readings..... indicated by ellipses): 0067011990999991950051507004.9999999N9+00001+99999999999. So.9999999N9+00221+99999999999.... 0) (1950.. To visualize the way the map works.. 0043011 990999991950051512004.... or erroneous.. -11) (1949. [0... The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. 0043012650999991949032412004.... indicated by ellipses): 0067011990999991950051507004..9999999N9-00111+99999999999...) (106...... cont inuing the example.0500001N9+01111+9999999 9999..9999999N9+00001+99999999999...99999 99N9-00111+99999999999.. 78]) (1950....9999999N9+00001+99999999999..99999 99N9-00111+99999999999. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function..9999999N9+00221+99999999999. 0043012650999991949032412004. These line s are presented to the map function as the key-value pairs: (0. 0043012650999991949032418004... These line s are presented to the map function as the key-value pairs: (0. cont inuing the example.) (424. 78]) (1950.) (106. These line s are presented to the map function as the key-value pairs: (0. 0) (1950. . To visualize the way the map works. [111. 0043011990999991950051518004..) (212.. The map function merely extracts the year and the air temperature (indicated in bold text)..99999 99N9-00111+99999999999. 111) (1949...9999999N9+00221+99999999999. 111) (1949. 0043012650999991949032418004..) The keys are the line offsets within the file. -11) (1949. or erroneous.9999999N9-00111+99999999999.9999999N9+00221+99999999999. 0043011990999991950051518004.. 004301199099999195 0051512004... 004301199099999195 0051518004...0500001N9+01111+99999999999. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. 004301199099999195 . -11]) Each year appears with a list of all its air temperature readings.9999999N9+00221+99999999999.. The map function merely extracts the year and the air temperature (indicated in bold text). and emits them as its output (the tempera ture values have been interpreted as integers): (1950. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.) (318... To visualize the way the map works...) (318.) The keys are the line offsets within the file. which we i gnore in our map function.. 0043012650999991949032418004.9999999N9+00001+99999999999. suspect. or erroneous. suspect... 22... consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page....0500001N9+01111+9999999 9999. [111....0500001N9 +00781+99999999999........ 22) (1950..

. cont inuing the example. 0043012650999991949032412004.. 004301199099999195 0051518004. 78]) (1950.. 22. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.... suspect... or erroneous... [0. So..9999999N9-00111+99999999999. 22) (1950. 0067011990999991950051507004. . 0043011 990999991950051512004. -11]) Each year appears with a list of all its air temperature readings.. -11) (1949..) (424. 004301199099999195 0051512004.0500001N9+00781+99999999999. 0) (1950... These line s are presented to the map function as the key-value pairs: (0. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.) (212.0051518004. The map function merely extracts the year and the air temperature (indicated in bold text)..) (106.. -11) (1949. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.9999999N9-00111+99999999999. 78]) (1950..0500001N9 +00781+99999999999.) The keys are the line offsets within the file.. 0) (1950... and emits them as its output (the tempera ture values have been interpreted as integers): (1950... To visualize the way the map works. suspect...9999999N9+00001+99999999999. These line s are presented to the map function as the key-value pairs: (0. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. 78]) (1950. 111) (1949. 0) (1950.. which we i gnore in our map function. which we i gnore in our map function... 0043012650999991949032418004. -11]) Each year appears with a list of all its air temperature readings.0500001N9 +00781+99999999999.. 111) (1949. -11]) Each year appears with a list of all its air temperature readings..) (318. indicated by ellipses): 0067011990999991950051507004.. our reduce function sees the following input: (1949...9999999N9+00001+99999999999. -11) (1949. 0043012650999991949032418004. .. 22.0500001N9+00781+99999999999. 004301199099999195 0051512004.. The map function merely extracts the year and the air temperature (indicated in bold text). or erroneous..0500001N9+01111+99999999999....0500001N9+01111+9999999 9999. 0043012650999991949032412004.) (318.9999999N9+00001+99999999999....... This processing sorts and groups the key-value pairs by key. [111. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. indicated by ellipses): 0067011990999991950051507004.. 22. which we i gnore in our map function.. 0043011990999991950051518004.. To visualize the way the map works. 0043011 990999991950051512004. 0067011990999991950051507004. 004301199099999195 0051518004.. 0043012650999991949032418004...0500001N9+01111+9999999 9999.. So. cont inuing the example. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. 22) (1950.. ..) (212. cont inuing the example. This processing sorts and groups the key-value pairs by key.. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year. 0043012650999991949032412004. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function... our reduce function sees the following input: (1949.. So.. The map function merely extracts the year and the air temperature (indicated in bold text).) (106.0500001N9 +00781+99999999999.) The keys are the line offsets within the file. 0043012650999991949032418004..9999999N9+00001+99999999999... [111. 0043012650999991949032412004. 0043012650999991949032418004..... [0.) (424..) (424.. [111. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page... 111) (1949. 0043012650999991949032412004.9999999N9+00221+99999999999..99999 99N9-00111+99999999999. our reduce function sees the following input: (1949.) (318..9999999N9-00111+99999999999.. All the reduce function has to do now is iterate throu .... 0043011990999991950051518004.9999999N9+00221+99999999999..9999999N9+00221+99999999999........ All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.0500001N9+01111+99999999999..0500001N9+01111+99999999999.9999999N9+00221+99999999999.. 22) (1950..99999 99N9-00111+99999999999... This processing sorts and groups the key-value pairs by key. [0. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.) The keys are the line offsets within the file.

22) (1950. suspect...9999999N9+00001+99999999999.. our reduce function sees the following input: (1949..9999999N9+00221+99999999999.. which we i gnore in our map function. 0043012650999991949032418004....) (424... 0043012650999991949032418004. suspect..... 0) (1950.9999999N9+00001+99999999999.9999999N9-00111+99999999999. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.. . To visualize the way the map works... 0043011990999991950051518004. indicated by ellipses): 0067011990999991950051507004.. These line s are presented to the map function as the key-value pairs: (0.. -11]) Each year appears with a list of all its air temperature readings. indicated by ellipses): 0067011990999991950051507004.0500001N9+00781+99999999999. 0043012650999991949032412004......) The keys are the line offsets within the file... 0043012650999991949032412004. 0043012650999991949032418004... [0.. This processing sorts and groups the key-value pairs by key.... 0043011 990999991950051512004. To visualize the way the map works.. -11) (1949..0500001N9+01111+99999999999. 004301199099999195 0051518004.. 004301199099999195 0051518004.. 0043011 990999991950051512004. 0043012650999991949032412004.... [111. our reduce function sees the following input: (1949..) (212. which we i gnore in our map function.... The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing..9999999N9+00221+99999999999.. 22.) The keys are the line offsets within the file.0500001N9+01111+9999999 9999.0500001N9+00781+99999999999... [0.. To visualize the way the map works. 0043011990999991950051518004.0500001N9+01111+9999999 9999. 004301199099999195 0051512004. 0067011990999991950051507004. These line s are presented to the map function as the key-value pairs: (0. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.....99999 99N9-00111+99999999999.) (106...gh on it: finding the maximum temperature for each year..9999999N9-00111+99999999999.) (106. [111. The map function merely extracts the year and the air temperature (indicated in bold text)....) (318. 78]) (1950. and emits them as its output (the tempera ture values have been interpreted as integers): (1950..9999999N9+00001+99999999999. -11]) Each year appears with a list of all its air temperature readings..9999999N9+00001+99999999999...... and emits them as its output (the tempera ture values have been interpreted as integers): (1950.. or erroneous. or erroneous. cont inuing the example.. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year... 0) (1950.) (212.....) (424.99999 99N9-00111+99999999999. 0043012650999991949032412004. 78]) (1950..) (318.9999999N9+00001+99999999999. 004301199099999195 0051512004..0500001N9+01111+99999999999.. So... 0043011 990999991950051512004. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing...0500001N9+01111+9999999 9999. 004301199099999195 . consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. . or erroneous.. 111) (1949. 22. This processing sorts and groups the key-value pairs by key.0500001N9 +00781+99999999999. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.9999999N9+00221+99999999999.. These line s are presented to the map function as the key-value pairs: (0.. 0067011990999991950051507004.. 004301199099999195 0051512004.. 0043012650999991949032412004. indicated by ellipses): 0067011990999991950051507004....9999999N9+00221+99999999999. suspect.) (106. -11) (1949.....) (212. 22) (1950.. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function... So. 111) (1949. 0043012650999991949032418004.....9999999N9+00001+99999999999...9999999N9+00221+99999999999..0500001N9 +00781+99999999999. 0067011990999991950051507004.99999 99N9-00111+99999999999. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. The map function merely extracts the year and the air temperature (indicated in bold text).0500001N9+00781+99999999999.. 0043011990999991950051518004. cont inuing the example. 0043012650999991949032418004....9999999N9+00221+99999999999.

) (424... suspect...0500001N9+01111+9999999 9999..... 0043012650999991949032418004.. 0043011990999991950051518004. -11) (1949. suspect. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.. indicated by ellipses): 0067011990999991950051507004. 0043012650999991949032418004.. cont inuing the example.) (318.... 22) (1950. So. 004301199099999195 0051512004.9999999N9+00001+99999999999.. The map function merely extracts the year and the air temperature (indicated in bold text). [111.. -11]) Each year appears with a list of all its air temperature readings. 004301199099999195 0051512004. 0043012650999991949032418004.9999999N9+00001+99999999999. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.0500001N9+01111+99999999999...) (318.9999999N9-00111+99999999999.9999999N9-00111+99999999999.9999999N9+00221+99999999999.) The keys are the line offsets within the file. 0043012650999991949032412004.9999999N9+00001+99999999999. 0067011990999991950051507004.) The keys are the line offsets within the file.) (424. 0043012650999991949032412004. 22. To visualize the way the map works.. indicated by ellipses): 0067011990999991950051507004... 22.. 0043012650999991949032418004. These line s are presented to the map function as the key-value pairs: (0. 22) (1950... consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. 78]) (1950. [0...) (424. our reduce function sees the following input: (1949.. 004301199099999195 0051518004.0500001N9+00781+99999999999. 0043011 990999991950051512004..0500001N9 +00781+99999999999.9999999N9+00001+99999999999.0051518004. [111. or erroneous.0500001N9+01111+99999999999. our reduce function sees the following input: (1949. 004301199099999195 0051518004.0500001N9 +00781+99999999999.... 22... 0067011990999991950051507004..) The keys are the line offsets within the file.0500001N9+01111+9999999 9999. -11) (1949. [0. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. which we i gnore in our map function... The map function merely extracts the year and the air temperature (indicated in bold text). 0043011 990999991950051512004. 0043011990999991950051518004.. 22) (1950. This processing sorts and groups the key-value pairs by key.....) (318. 0043012650999991949032418004..0500001N9+01111+99999999999..0500001N9 +00781+99999999999.9999999N9+00221+99999999999. To visualize the way the map works. .... and emits them as its output (the tempera ture values have been interpreted as integers): (1950. 111) (1949.) (106...) (212.. 78]) (1950.. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.9999999N9-00111+99999999999.0500001N9+00781+99999999999. 111) (1949.. So.. 0043012650999991949032412004.. This processing sorts and groups the key-value pairs by key.... -11) (1949..... . 111) (1949.. -11]) Each year appears with a list of all its air temperature readings. So. which we i gnore in our map function. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.9999999N9+00221+99999999999..) (212.. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year. 78]) (1950. and emits them as its output (the tempera ture values have been interpreted as integers): (1950... our reduce function sees the following input: (1949..... All the reduce function has to do now is iterate throu .. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. This processing sorts and groups the key-value pairs by key. 0) (1950. 0043012650999991949032412004. cont inuing the example.. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. .. 0) (1950. or erroneous. [0.... -11]) Each year appears with a list of all its air temperature readings. 0) (1950. cont inuing the example.) (106. The map function merely extracts the year and the air temperature (indicated in bold text).. 0043012650999991949032412004. which we i gnore in our map function.99999 99N9-00111+99999999999.9999999N9+00221+99999999999.. [111..........99999 99N9-00111+99999999999.. These line s are presented to the map function as the key-value pairs: (0..

... suspect..... 004301199099999195 0051518004. suspect.. The map function merely extracts the year and the air temperature (indicated in bold text). consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page......0500001N9+01111+99999999999.9999999N9+00001+99999999999..9999999N9+00001+99999999999. indicated by ellipses): 0067011990999991950051507004... The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.....9999999N9+00001+99999999999. So. which we i gnore in our map function.) (318.0500001N9+01111+99999999999. 0043012650999991949032412004.0500001N9 +00781+99999999999...) The keys are the line offsets within the file.. -11) (1949..9999999N9+00221+99999999999.. 004301199099999195 0051512004.. our reduce function sees the following input: (1949. -11]) Each year appears with a list of all its air temperature readings.99999 99N9-00111+99999999999...9999999N9+00221+99999999999. 0043011 990999991950051512004...) (106.9999999N9+00001+99999999999..9999999N9-00111+99999999999... These line s are presented to the map function as the key-value pairs: (0.... 78]) (1950. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.99999 99N9-00111+99999999999... 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.. [0. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. 111) (1949.0500001N9+00781+99999999999... 0043012650999991949032418004..... and emits them as its output (the tempera ture values have been interpreted as integers): (1950..... 0043012650999991949032418004..) (424. 0043012650999991949032412004... The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. 78]) (1950...9999999N9+00001+99999999999.. 0043011 990999991950051512004. 0067011990999991950051507004..) (212.. -11) (1949.9999999N9+00221+99999999999.0500001N9+01111+9999999 9999.. 0067011990999991950051507004.. 0043012650999991949032418004...9999999N9+00001+99999999999. 004301199099999195 .. . The map function merely extracts the year and the air temperature (indicated in bold text).. This processing sorts and groups the key-value pairs by key. our reduce function sees the following input: (1949. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.9999999N9-00111+99999999999... 0) (1950..... 0043012650999991949032412004.9999999N9+00221+99999999999.. To visualize the way the map works. indicated by ellipses): 0067011990999991950051507004.) The keys are the line offsets within the file. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year. 0043011990999991950051518004. 22) (1950.. 004301199099999195 0051512004. 22) (1950. This processing sorts and groups the key-value pairs by key...) (106.. or erroneous..0500001N9+00781+99999999999..) (212.0500001N9 +00781+99999999999...0500001N9+01111+9999999 9999..) (318... [111. . cont inuing the example. 0043011990999991950051518004. 0043012650999991949032418004... which we i gnore in our map function.9999999N9+00221+99999999999.) (106... consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. cont inuing the example.0500001N9+01111+9999999 9999.9999999N9+00221+99999999999..99999 99N9-00111+99999999999..... 0043011 990999991950051512004. 0043012650999991949032412004. 0043011990999991950051518004. 22... or erroneous.0500001N9+00781+99999999999. -11]) Each year appears with a list of all its air temperature readings..gh on it: finding the maximum temperature for each year. 22. 111) (1949. 0) (1950. suspect. [111.. 004301199099999195 0051518004. or erroneous. 0067011990999991950051507004. So. [0.. indicated by ellipses): 0067011990999991950051507004.. 0043012650999991949032412004.. 0043012650999991949032418004. These line s are presented to the map function as the key-value pairs: (0... 004301199099999195 0051512004. To visualize the way the map works.....) (424.) (212.. To visualize the way the map works. These line s are presented to the map function as the key-value pairs: (0..

0500001N9+01111+99999999999. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.....0051518004. 0067011990999991950051507004.9999999N9+00221+99999999999.. [0..) The keys are the line offsets within the file... -11]) Each year appears with a list of all its air temperature readings. [111.... our reduce function sees the following input: (1949...9999999N9-00111+99999999999.. 0) (1950. 0) (1950. -11]) Each year appears with a list of all its air temperature readings... our reduce function sees the following input: (1949. 111) (1949. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page..) (318. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.99999 99N9-00111+99999999999. 004301199099999195 0051518004.0500001N9 +00781+99999999999. or erroneous. This processing sorts and groups the key-value pairs by key. 0) (1950.. .. -11) (1949..9999999N9-00111+99999999999..) (424.. .. [111. 78]) (1950...) (106.. 0043011990999991950051518004. 22. The map function merely extracts the year and the air temperature (indicated in bold text). 111) (1949..) (318..0500001N9+00781+99999999999. 0043012650999991949032418004.. 22.. 78]) (1950. -11) (1949. which we i gnore in our map function. 0043012650999991949032412004.) The keys are the line offsets within the file. . The map function merely extracts the year and the air temperature (indicated in bold text).0500001N9+01111+99999999999. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.. 78]) (1950...... All the reduce function has to do now is iterate throu . [111. 22) (1950.. 004301199099999195 0051512004. So.0500001N9+00781+99999999999.9999999N9+00001+99999999999.. 0043012650999991949032412004..9999999N9+00001+99999999999. The map function merely extracts the year and the air temperature (indicated in bold text).. indicated by ellipses): 0067011990999991950051507004. 22.) The keys are the line offsets within the file..0500001N9+01111+9999999 9999. 0043012650999991949032418004. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. This processing sorts and groups the key-value pairs by key. or erroneous.. So... 0067011990999991950051507004... consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page..) (318.9999999N9+00001+99999999999. and emits them as its output (the tempera ture values have been interpreted as integers): (1950..) (212. which we i gnore in our map function... This processing sorts and groups the key-value pairs by key. which we i gnore in our map function..) (212..) (424... 004301199099999195 0051512004.....9999999N9-00111+99999999999.. 0043012650999991949032412004. 0043011 990999991950051512004.9999999N9+00001+99999999999... 0043011990999991950051518004. suspect..9999999N9+00221+99999999999.. To visualize the way the map works.. -11) (1949..) (424... 22) (1950. cont inuing the example. [0. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.9999999N9+00221+99999999999. These line s are presented to the map function as the key-value pairs: (0...) (106. cont inuing the example.. So. 0043012650999991949032418004..... indicated by ellipses): 0067011990999991950051507004. 111) (1949.0500001N9+01111+9999999 9999.. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.99999 99N9-00111+99999999999....0500001N9+01111+99999999999.. 0043012650999991949032412004.. 0043012650999991949032418004. suspect. 22) (1950.9999999N9+00221+99999999999. 0043011 990999991950051512004.0500001N9 +00781+99999999999. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function..0500001N9 +00781+99999999999.. -11]) Each year appears with a list of all its air temperature readings. To visualize the way the map works. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.... 004301199099999195 0051518004... our reduce function sees the following input: (1949. 0043012650999991949032418004. [0. cont inuing the example.. These line s are presented to the map function as the key-value pairs: (0. 0043012650999991949032412004..

9999999N9+00001+99999999999.. which we i gnore in our map function.. 0043011990999991950051518004... The map function merely extracts the year and the air temperature (indicated in bold text). 0043012650999991949032412004.. 0043012650999991949032418004.) (212. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing... which we i gnore in our map function..gh on it: finding the maximum temperature for each year. cont inuing the example.. indicated by ellipses): 0067011990999991950051507004... 004301199099999195 0051512004..99999 99N9-00111+99999999999. 0043012650999991949032418004. 0043012650999991949032418004... consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. These line s are presented to the map function as the key-value pairs: (0..... 0043011990999991950051518004.....) (106.9999999N9+00221+99999999999...) (318.. 111) (1949..... [0. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. 22) (1950. These line s are presented to the map function as the key-value pairs: (0.) (318. 0043012650999991949032418004.........9999999N9+00221+99999999999.0500001N9 +00781+99999999999.. 22) (1950.) The keys are the line offsets within the file..) (212.0500001N9+01111+9999999 9999.0500001N9+01111+9999999 9999. These line s are presented to the map function as the key-value pairs: (0. suspect.9999999N9+00221+99999999999. So... 0043011990999991950051518004. or erroneous. 22. 0067011990999991950051507004. . consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.9999999N9+00001+99999999999...) (106.. indicated by ellipses): 0067011990999991950051507004.. 004301199099999195 0051512004.... 0043012650999991949032412004. 0043012650999991949032412004. .. our reduce function sees the following input: (1949.. [111. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.) (424...9999999N9-00111+99999999999..) (106. and emits them as its output (the tempera ture values have been interpreted as integers): (1950..... -11) (1949. 22. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.0500001N9+00781+99999999999. 0043012650999991949032412004.0500001N9+01111+99999999999.. -11]) Each year appears with a list of all its air temperature readings.. 0) (1950. So. [0. 0067011990999991950051507004.... This processing sorts and groups the key-value pairs by key. -11) (1949.. 004301199099999195 .. 004301199099999195 0051518004. suspect.. cont inuing the example. [111. The map function merely extracts the year and the air temperature (indicated in bold text).9999999N9+00221+99999999999. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year... 0043011 990999991950051512004.. 0043012650999991949032418004...9999999N9+00001+99999999999. or erroneous.0500001N9 +00781+99999999999.0500001N9+00781+99999999999.99999 99N9-00111+99999999999..99999 99N9-00111+99999999999... -11]) Each year appears with a list of all its air temperature readings.9999999N9+00001+99999999999. To visualize the way the map works.9999999N9+00221+99999999999.. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year..0500001N9+00781+99999999999..9999999N9+00001+99999999999...9999999N9+00221+99999999999. 004301199099999195 0051518004.. This processing sorts and groups the key-value pairs by key... our reduce function sees the following input: (1949.. or erroneous..0500001N9+01111+9999999 9999.. 0043011 990999991950051512004. 0043012650999991949032412004.9999999N9-00111+99999999999..... 0043011 990999991950051512004.) (212...0500001N9+01111+99999999999.... 004301199099999195 0051512004....9999999N9+00001+99999999999.. suspect.. 0067011990999991950051507004... To visualize the way the map works.) (424. 0) (1950. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. 111) (1949. 78]) (1950.. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. To visualize the way the map works. indicated by ellipses): 0067011990999991950051507004..) The keys are the line offsets within the file.. 78]) (1950....

[111. To visualize the way the map works..9999999N9+00221+99999999999. 0043012650999991949032412004..0500001N9 +00781+99999999999... -11) (1949. .. So.) (424.. 111) (1949..9999999N9-00111+99999999999. 0043012650999991949032418004. 0043012650999991949032412004. These line s are presented to the map function as the key-value pairs: (0.) (318..99999 99N9-00111+99999999999. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. 0043012650999991949032418004. indicated by ellipses): 0067011990999991950051507004. 0043011 990999991950051512004... 004301199099999195 0051512004.. indicated by ellipses): 0067011990999991950051507004.0500001N9+01111+99999999999.. These line s are presented to the map function as the key-value pairs: (0.... [0.. 22.. cont inuing the example. our reduce function sees the following input: (1949..) (318..0500001N9+01111+9999999 9999.9999999N9+00221+99999999999.9999999N9+00221+99999999999..9999999N9-00111+99999999999. cont inuing the example. 0) (1950. .) (212. 78]) (1950.0500001N9 +00781+99999999999.9999999N9+00221+99999999999.9999999N9+00001+99999999999.0500001N9 +00781+99999999999..9999999N9+00001+99999999999. 22.. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year. 0043012650999991949032412004.0500001N9+01111+99999999999. 0043012650999991949032418004.. cont inuing the example.. which we i gnore in our map function..... -11]) Each year appears with a list of all its air temperature readings.... 22.. This processing sorts and groups the key-value pairs by key. -11]) Each year appears with a list of all its air temperature readings. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing... 111) (1949.) The keys are the line offsets within the file... consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. 0067011990999991950051507004..0500001N9+00781+99999999999.0500001N9+01111+9999999 9999.. 22) (1950. This processing sorts and groups the key-value pairs by key. 0043011990999991950051518004.) The keys are the line offsets within the file... suspect....0500001N9+01111+99999999999..) The keys are the line offsets within the file. [0. 004301199099999195 0051518004.9999999N9-00111+99999999999.... To visualize the way the map works.) (106.. [0. 0043012650999991949032418004.. 78]) (1950.) (106. 22) (1950.. 0043012650999991949032412004... or erroneous. 22) (1950.... or erroneous. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.. This processing sorts and groups the key-value pairs by key.) (424.) (318.... .... and emits them as its output (the tempera ture values have been interpreted as integers): (1950. 0043011 990999991950051512004. 0) (1950. which we i gnore in our map function.. So. 0067011990999991950051507004. 0) (1950.... 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. -11) (1949.99999 99N9-00111+99999999999. 0043012650999991949032418004. The map function merely extracts the year and the air temperature (indicated in bold text). 004301199099999195 0051518004.9999999N9+00001+99999999999.. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. -11) (1949.) (424. 0043012650999991949032412004.) (212.. [111.. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. which we i gnore in our map function... 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function..0051518004.. All the reduce function has to do now is iterate throu .. [111. So. 004301199099999195 0051512004. The map function merely extracts the year and the air temperature (indicated in bold text)... The map function merely extracts the year and the air temperature (indicated in bold text)... our reduce function sees the following input: (1949... suspect.. 0043011990999991950051518004.. 111) (1949...9999999N9+00001+99999999999... 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. 78]) (1950. our reduce function sees the following input: (1949.0500001N9+00781+99999999999... -11]) Each year appears with a list of all its air temperature readings.

0043012650999991949032412004.. which we i gnore in our map function. 004301199099999195 0051518004.9999999N9+00221+99999999999..99999 99N9-00111+99999999999.....9999999N9+00001+99999999999..0500001N9 +00781+99999999999.. 0043012650999991949032412004. The map function merely extracts the year and the air temperature (indicated in bold text). -11) (1949.. [0. 004301199099999195 0051512004.0500001N9+01111+99999999999..9999999N9+00221+99999999999...0500001N9+00781+99999999999. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.. 0043011 990999991950051512004.) (424. The map function merely extracts the year and the air temperature (indicated in bold text).. 22) (1950.. 0067011990999991950051507004..9999999N9+00001+99999999999.... indicated by ellipses): 0067011990999991950051507004.. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. 0067011990999991950051507004. suspect. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. 0043011 990999991950051512004...) The keys are the line offsets within the file. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.... consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.0500001N9+01111+9999999 9999... To visualize the way the map works. 004301199099999195 0051512004.) (212. 0043012650999991949032418004. -11]) Each year appears with a list of all its air temperature readings.. our reduce function sees the following input: (1949. 0) (1950... 0043011 990999991950051512004.9999999N9+00001+99999999999.99999 99N9-00111+99999999999. 0043012650999991949032412004... 004301199099999195 ... 22......... These line s are presented to the map function as the key-value pairs: (0..) (106. To visualize the way the map works..0500001N9+00781+99999999999.) (212..9999999N9+00221+99999999999.. 004301199099999195 0051518004. -11) (1949. 0043011990999991950051518004. 78]) (1950.9999999N9-00111+99999999999.. So. 0043012650999991949032418004. [111. suspect. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. cont inuing the example.9999999N9+00001+99999999999. our reduce function sees the following input: (1949. 0043012650999991949032412004. . All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year. 004301199099999195 0051512004.0500001N9+01111+9999999 9999...0500001N9+01111+9999999 9999. [0. 0043012650999991949032418004. This processing sorts and groups the key-value pairs by key. [111. This processing sorts and groups the key-value pairs by key.. 0043012650999991949032418004.. or erroneous.. or erroneous.. So.. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.. 111) (1949... which we i gnore in our map function.) (318....) (106. These line s are presented to the map function as the key-value pairs: (0.) (106... consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page...... 22) (1950... and emits them as its output (the tempera ture values have been interpreted as integers): (1950.0500001N9+00781+99999999999.. ..gh on it: finding the maximum temperature for each year.9999999N9-00111+99999999999. 0) (1950. 0043011990999991950051518004. 0067011990999991950051507004.9999999N9+00001+99999999999. or erroneous.. These line s are presented to the map function as the key-value pairs: (0.. 0043011990999991950051518004..9999999N9+00221+99999999999... -11]) Each year appears with a list of all its air temperature readings..... To visualize the way the map works...0500001N9+01111+99999999999..... cont inuing the example.9999999N9+00221+99999999999.) (318...0500001N9 +00781+99999999999. indicated by ellipses): 0067011990999991950051507004.9999999N9+00221+99999999999.. 78]) (1950.) (212..... suspect.... 0043012650999991949032418004. indicated by ellipses): 0067011990999991950051507004.. 111) (1949.......99999 99N9-00111+99999999999..9999999N9+00001+99999999999..) (424. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. 0043012650999991949032412004.) The keys are the line offsets within the file. 22..

consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page... indicated by ellipses): 0067011990999991950051507004.. 0043012650999991949032412004. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. 004301199099999195 0051518004... . 111) (1949...9999999N9-00111+99999999999.. our reduce function sees the following input: (1949... our reduce function sees the following input: (1949. 0043012650999991949032418004..9999999N9-00111+99999999999... cont inuing the example. 0043012650999991949032418004. cont inuing the example. 0043012650999991949032412004...) (318. These line s are presented to the map function as the key-value pairs: (0. 004301199099999195 0051518004... So...9999999N9+00001+99999999999. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. -11) (1949.9999999N9+00001+99999999999.. suspect. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. 0) (1950. So.) (106.) The keys are the line offsets within the file.... 111) (1949.. 0043012650999991949032412004... suspect... [0. 22.. . which we i gnore in our map function...0500001N9+01111+99999999999.) (318.9999999N9+00221+99999999999.0500001N9+01111+9999999 9999.9999999N9+00001+99999999999...) (318. -11]) Each year appears with a list of all its air temperature readings. 22) (1950...9999999N9+00221+99999999999. indicated by ellipses): 0067011990999991950051507004. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.) The keys are the line offsets within the file. 004301199099999195 0051512004... The map function merely extracts the year and the air temperature (indicated in bold text).) (212.0500001N9+00781+99999999999. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.. 0043011 990999991950051512004. -11]) Each year appears with a list of all its air temperature readings...0500001N9+01111+9999999 9999.. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. 111) (1949......9999999N9-00111+99999999999..) (424. 0043012650999991949032418004. So. This processing sorts and groups the key-value pairs by key. 0043012650999991949032418004.. 004301199099999195 0051512004.. 0043011 990999991950051512004. 0043011990999991950051518004. All the reduce function has to do now is iterate throu . 0043012650999991949032412004.9999999N9+00221+99999999999. . 0043012650999991949032418004... cont inuing the example. The map function merely extracts the year and the air temperature (indicated in bold text)...) (212... 0067011990999991950051507004.0500001N9 +00781+99999999999. [0. 22) (1950... [111. -11) (1949. 0067011990999991950051507004.. which we i gnore in our map function... 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. -11]) Each year appears with a list of all its air temperature readings.... To visualize the way the map works. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. This processing sorts and groups the key-value pairs by key. 22.0500001N9+00781+99999999999. [0...99999 99N9-00111+99999999999. or erroneous. 0043011990999991950051518004. To visualize the way the map works.. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. [111. This processing sorts and groups the key-value pairs by key. which we i gnore in our map function.... The map function merely extracts the year and the air temperature (indicated in bold text).) The keys are the line offsets within the file..0051518004.... our reduce function sees the following input: (1949. -11) (1949.. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year. 22.0500001N9+01111+99999999999.99999 99N9-00111+99999999999.0500001N9+01111+99999999999... 0) (1950.0500001N9 +00781+99999999999.9999999N9+00001+99999999999. 0) (1950..) (424.. These line s are presented to the map function as the key-value pairs: (0. 78]) (1950.. or erroneous.0500001N9 +00781+99999999999... [111.. 78]) (1950..9999999N9+00221+99999999999.) (106......) (424. 22) (1950. 78]) (1950.. 0043012650999991949032412004. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.

.... or erroneous. 004301199099999195 0051512004.9999999N9+00221+99999999999. 004301199099999195 0051512004.. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. 0043012650999991949032412004..9999999N9+00221+99999999999..... 0043012650999991949032418004. 22. This processing sorts and groups the key-value pairs by key... [0. 0043012650999991949032418004.) (212. 22) (1950.0500001N9+01111+9999999 9999.99999 99N9-00111+99999999999. 111) (1949.9999999N9+00001+99999999999. or erroneous. 004301199099999195 0051518004... 22) (1950.) (106...) (212....99999 99N9-00111+99999999999.. 0067011990999991950051507004.. 004301199099999195 0051512004.. 0043012650999991949032412004. our reduce function sees the following input: (1949. 111) (1949.. [111..0500001N9 +00781+99999999999.. So.. [0. -11]) Each year appears with a list of all its air temperature readings. indicated by ellipses): 0067011990999991950051507004...9999999N9+00001+99999999999...) (106. cont inuing the example.....0500001N9+01111+99999999999. ...... -11) (1949. To visualize the way the map works.0500001N9+00781+99999999999. 0043011 990999991950051512004. suspect. which we i gnore in our map function.. To visualize the way the map works. 0043012650999991949032418004. 0043012650999991949032412004.. 0043012650999991949032412004. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. 0043012650999991949032412004.9999999N9+00221+99999999999.) The keys are the line offsets within the file..0500001N9+01111+9999999 9999.. 004301199099999195 .... To visualize the way the map works..0500001N9+00781+99999999999....... These line s are presented to the map function as the key-value pairs: (0..0500001N9 +00781+99999999999.gh on it: finding the maximum temperature for each year.. 0043011990999991950051518004..) (318.. 22. indicated by ellipses): 0067011990999991950051507004. These line s are presented to the map function as the key-value pairs: (0.. -11) (1949. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page..... 0067011990999991950051507004.. indicated by ellipses): 0067011990999991950051507004..... 0043011 990999991950051512004.) (424.) The keys are the line offsets within the file. or erroneous. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. 78]) (1950...) (212.. .9999999N9+00221+99999999999. 004301199099999195 0051518004.. [111. 0067011990999991950051507004... 0043011990999991950051518004.. The map function merely extracts the year and the air temperature (indicated in bold text).. 0043011 990999991950051512004..9999999N9-00111+99999999999.9999999N9+00001+99999999999... which we i gnore in our map function.. 0) (1950.99999 99N9-00111+99999999999.0500001N9+01111+99999999999.9999999N9+00221+99999999999. suspect.. 78]) (1950. -11]) Each year appears with a list of all its air temperature readings..9999999N9+00001+99999999999. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year... All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.0500001N9+01111+9999999 9999. 0043012650999991949032418004. suspect.. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page...0500001N9+00781+99999999999. These line s are presented to the map function as the key-value pairs: (0... The map function merely extracts the year and the air temperature (indicated in bold text).. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.) (424.... 0043011990999991950051518004. cont inuing the example.. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing...9999999N9+00221+99999999999. 0) (1950.9999999N9+00001+99999999999. This processing sorts and groups the key-value pairs by key.. 0043012650999991949032418004.....) (106..9999999N9+00001+99999999999.) (318.. So.. our reduce function sees the following input: (1949. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.9999999N9-00111+99999999999.... and emits them as its output (the tempera ture values have been interpreted as integers): (1950.

0500001N9+00781+99999999999. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.. This processing sorts and groups the key-value pairs by key. 111) (1949.... 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.9999999N9+00221+99999999999... 0067011990999991950051507004.) (318.9999999N9+00221+99999999999.. which we i gnore in our map function. which we i gnore in our map function. . 78]) (1950.. [0..9999999N9+00001+99999999999.. [0.. 22) (1950.0500001N9+01111+9999999 9999. -11]) Each year appears with a list of all its air temperature readings..9999999N9+00001+99999999999.. 004301199099999195 0051518004.. This processing sorts and groups the key-value pairs by key. [111.0500001N9+01111+99999999999. indicated by ellipses): 0067011990999991950051507004. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.. [0. cont inuing the example.0500001N9+00781+99999999999. 0043012650999991949032412004. 0043012650999991949032418004.9999999N9+00221+99999999999... -11]) Each year appears with a list of all its air temperature readings. . our reduce function sees the following input: (1949. 22. 0043011 990999991950051512004......) (318. 004301199099999195 0051518004. 22) (1950..9999999N9+00221+99999999999. -11]) Each year appears with a list of all its air temperature readings.. indicated by ellipses): 0067011990999991950051507004.. 0043012650999991949032418004.... . 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. These line s are presented to the map function as the key-value pairs: (0. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. 0043012650999991949032412004..0500001N9+01111+9999999 9999..) (212.... All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year. 111) (1949... 0043012650999991949032418004.) (424.. 22) (1950. our reduce function sees the following input: (1949.) The keys are the line offsets within the file.) The keys are the line offsets within the file.. which we i gnore in our map function.) (106.) (212..9999999N9-00111+99999999999..) (318. [111... The map function merely extracts the year and the air temperature (indicated in bold text)...) The keys are the line offsets within the file... 78]) (1950.. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. To visualize the way the map works..) (424.0051518004... cont inuing the example.99999 99N9-00111+99999999999.. 004301199099999195 0051512004.... and emits them as its output (the tempera ture values have been interpreted as integers): (1950.... To visualize the way the map works.) (106.. 0043012650999991949032412004.. 22. So.. -11) (1949. 0043012650999991949032418004..0500001N9 +00781+99999999999.99999 99N9-00111+99999999999. or erroneous. So. 0043011990999991950051518004.. 0043011 990999991950051512004.. -11) (1949.9999999N9-00111+99999999999... 78]) (1950...0500001N9 +00781+99999999999.) (424. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing..0500001N9+01111+99999999999.. suspect. our reduce function sees the following input: (1949. [111.9999999N9-00111+99999999999. 0043012650999991949032412004... This processing sorts and groups the key-value pairs by key... These line s are presented to the map function as the key-value pairs: (0. All the reduce function has to do now is iterate throu . 0043011990999991950051518004... 0043012650999991949032412004. The map function merely extracts the year and the air temperature (indicated in bold text). consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. 0) (1950. 111) (1949.. cont inuing the example.. 22..... 0) (1950.. or erroneous... consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. 0067011990999991950051507004..... So.0500001N9 +00781+99999999999. 0043012650999991949032418004.9999999N9+00001+99999999999.0500001N9+01111+99999999999. 0) (1950. The map function merely extracts the year and the air temperature (indicated in bold text). suspect. -11) (1949.9999999N9+00001+99999999999.. 004301199099999195 0051512004. and emits them as its output (the tempera ture values have been interpreted as integers): (1950..

The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing..0500001N9+01111+9999999 9999. 0043012650999991949032418004....9999999N9+00221+99999999999. . The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.) (424. These line s are presented to the map function as the key-value pairs: (0.. 111) (1949......9999999N9+00001+99999999999.0500001N9+01111+99999999999.. So. our reduce function sees the following input: (1949. indicated by ellipses): 0067011990999991950051507004... 111) (1949. 0043011 990999991950051512004. 0043012650999991949032418004. . All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year..9999999N9+00001+99999999999..99999 99N9-00111+99999999999. 0043012650999991949032412004.9999999N9+00221+99999999999. 0043012650999991949032418004.9999999N9+00221+99999999999.. To visualize the way the map works.9999999N9-00111+99999999999. 0067011990999991950051507004..... or erroneous..) The keys are the line offsets within the file.) (212.. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.... cont inuing the example..99999 99N9-00111+99999999999.. 0) (1950.) (424.... which we i gnore in our map function... indicated by ellipses): 0067011990999991950051507004.....0500001N9+00781+99999999999. -11]) Each year appears with a list of all its air temperature readings. 004301199099999195 0051518004.. The map function merely extracts the year and the air temperature (indicated in bold text). 004301199099999195 0051512004.... The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.0500001N9+01111+99999999999. 0043011990999991950051518004. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.. The map function merely extracts the year and the air temperature (indicated in bold text)... suspect.... 0043012650999991949032418004.. 004301199099999195 0051518004. 0067011990999991950051507004.. suspect.. or erroneous..0500001N9 +00781+99999999999.. 004301199099999195 0051512004..... To visualize the way the map works.9999999N9+00001+99999999999. 0043012650999991949032412004. 0067011990999991950051507004..) (106. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year. our reduce function sees the following input: (1949.9999999N9+00001+99999999999... These line s are presented to the map function as the key-value pairs: (0. [0. 0043012650999991949032412004.. 004301199099999195 0051512004.0500001N9 +00781+99999999999.9999999N9+00221+99999999999.. 0043011 990999991950051512004.... 78]) (1950. This processing sorts and groups the key-value pairs by key.) (318. 0043012650999991949032412004.) (106.. 78]) (1950. -11) (1949.0500001N9+01111+9999999 9999.. [111. cont inuing the example.. This processing sorts and groups the key-value pairs by key. 0043012650999991949032418004.) The keys are the line offsets within the file. 0043011990999991950051518004... suspect.) (106... 0) (1950. indicated by ellipses): 0067011990999991950051507004..9999999N9-00111+99999999999. which we i gnore in our map function. -11) (1949. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function..9999999N9+00221+99999999999.. To visualize the way the map works.. 0043011 990999991950051512004.0500001N9+01111+9999999 9999..0500001N9+00781+99999999999... [0..... 22) (1950. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page..... and emits them as its output (the tempera ture values have been interpreted as integers): (1950.....99999 99N9-00111+99999999999.0500001N9+00781+99999999999..) (212.. 22) (1950.) (212.. 0043011990999991950051518004. 22. -11]) Each year appears with a list of all its air temperature readings.. and emits them as its output (the tempera ture values have been interpreted as integers): (1950... 0043012650999991949032412004.9999999N9+00001+99999999999.9999999N9+00001+99999999999. [111. These line s are presented to the map function as the key-value pairs: (0.......) (318..gh on it: finding the maximum temperature for each year.. 004301199099999195 . or erroneous... 22..9999999N9+00221+99999999999. So.

.) (424.. 0043012650999991949032412004. 0) (1950... The map function merely extracts the year and the air temperature (indicated in bold text).0500001N9 +00781+99999999999.. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.... 22) (1950.... 0043012650999991949032412004.. 004301199099999195 0051518004. -11]) Each year appears with a list of all its air temperature readings.) (318. -11) (1949. All the reduce function has to do now is iterate throu . [0. our reduce function sees the following input: (1949.0500001N9+01111+99999999999. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function... The map function merely extracts the year and the air temperature (indicated in bold text). 111) (1949.) The keys are the line offsets within the file. 004301199099999195 0051512004.. 0043012650999991949032412004. To visualize the way the map works. -11) (1949. 0043012650999991949032418004. -11]) Each year appears with a list of all its air temperature readings... [111. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year. These line s are presented to the map function as the key-value pairs: (0. which we i gnore in our map function.. our reduce function sees the following input: (1949... consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. 22. So.0500001N9 +00781+99999999999.9999999N9-00111+99999999999.9999999N9+00221+99999999999.0500001N9 +00781+99999999999.. 0067011990999991950051507004. To visualize the way the map works.) (212..0500001N9+00781+99999999999.. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.. 111) (1949. 22.) (106... 0043011990999991950051518004.. which we i gnore in our map function.) The keys are the line offsets within the file..9999999N9-00111+99999999999.. 0043012650999991949032418004.. 0) (1950. 78]) (1950.0500001N9+01111+99999999999... or erroneous......0500001N9+01111+9999999 9999. 0043012650999991949032418004. 111) (1949.. This processing sorts and groups the key-value pairs by key... 004301199099999195 0051518004. [111...) (106. 004301199099999195 0051512004.0500001N9+01111+9999999 9999.9999999N9+00001+99999999999.. 22) (1950....) (318..9999999N9+00221+99999999999.0500001N9+00781+99999999999.. indicated by ellipses): 0067011990999991950051507004. . and emits them as its output (the tempera ture values have been interpreted as integers): (1950. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page... . [0. suspect. 0043011 990999991950051512004. cont inuing the example... All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.. 0043011990999991950051518004.99999 99N9-00111+99999999999. cont inuing the example. 0043012650999991949032418004.9999999N9+00221+99999999999.. 22) (1950..... 0067011990999991950051507004. [111.....) (212. This processing sorts and groups the key-value pairs by key.... 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function....) The keys are the line offsets within the file.. . 0) (1950.) (424.9999999N9-00111+99999999999.. which we i gnore in our map function.. -11) (1949.... The map function merely extracts the year and the air temperature (indicated in bold text).0051518004.0500001N9+01111+99999999999. 78]) (1950. 0043012650999991949032412004.99999 99N9-00111+99999999999..9999999N9+00001+99999999999. 0043012650999991949032418004.. or erroneous.. So. So. 0043011 990999991950051512004.. -11]) Each year appears with a list of all its air temperature readings. suspect.. 78]) (1950.9999999N9+00221+99999999999.. 0043012650999991949032412004. These line s are presented to the map function as the key-value pairs: (0....) (424. [0. cont inuing the example.9999999N9+00001+99999999999... This processing sorts and groups the key-value pairs by key.9999999N9+00001+99999999999. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. indicated by ellipses): 0067011990999991950051507004.. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing..) (318.. 22. our reduce function sees the following input: (1949..

. 0043012650999991949032412004... 0043012650999991949032418004.0500001N9 +00781+99999999999.... [0. 0) (1950...9999999N9+00221+99999999999.. 111) (1949. 0043011 990999991950051512004. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. 22) (1950.. 22..9999999N9+00221+99999999999. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. To visualize the way the map works...0500001N9+00781+99999999999.9999999N9+00001+99999999999.... indicated by ellipses): 0067011990999991950051507004..9999999N9+00221+99999999999. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year...gh on it: finding the maximum temperature for each year.. indicated by ellipses): 0067011990999991950051507004.......) (212.....0500001N9+00781+99999999999.99999 99N9-00111+99999999999.9999999N9+00001+99999999999. The map function merely extracts the year and the air temperature (indicated in bold text).) (106.... So.. suspect.0500001N9+01111+99999999999.. which we i gnore in our map function.9999999N9+00001+99999999999.0500001N9+01111+9999999 9999.9999999N9+00001+99999999999. This processing sorts and groups the key-value pairs by key... 0067011990999991950051507004..0500001N9+01111+9999999 9999. -11) (1949. 0043012650999991949032418004.. The map function merely extracts the year and the air temperature (indicated in bold text). cont inuing the example.. -11) (1949. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.9999999N9+00001+99999999999. -11]) Each year appears with a list of all its air temperature readings.. 22.. indicated by ellipses): 0067011990999991950051507004. 0067011990999991950051507004.. 004301199099999195 0051518004.. 78]) (1950...) (318. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.0500001N9+01111+9999999 9999. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. [0. -11]) Each year appears with a list of all its air temperature readings.... ..9999999N9-00111+99999999999..) (212..0500001N9+00781+99999999999..) (106. These line s are presented to the map function as the key-value pairs: (0.) (106. . or erroneous..99999 99N9-00111+99999999999. suspect.... 004301199099999195 0051512004..9999999N9+00221+99999999999. 0043012650999991949032412004..9999999N9-00111+99999999999. 004301199099999195 ... consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.... This processing sorts and groups the key-value pairs by key...0500001N9 +00781+99999999999... 22) (1950. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.) (424.. 004301199099999195 0051518004.. 0) (1950. 0043012650999991949032412004. or erroneous...9999999N9+00221+99999999999.. 0043011 990999991950051512004....) (318.) (424. 0067011990999991950051507004. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. suspect. 0043011 990999991950051512004.0500001N9+01111+99999999999.. 004301199099999195 0051512004. 0043012650999991949032412004.. To visualize the way the map works...9999999N9+00001+99999999999...) (212... or erroneous.) The keys are the line offsets within the file. To visualize the way the map works...... 0043012650999991949032418004.. These line s are presented to the map function as the key-value pairs: (0... 0043012650999991949032418004.. [111. which we i gnore in our map function. our reduce function sees the following input: (1949... 78]) (1950.. So. 004301199099999195 0051512004. 0043012650999991949032412004. our reduce function sees the following input: (1949.99999 99N9-00111+99999999999.... 0043011990999991950051518004.... 111) (1949. 0043011990999991950051518004. [111. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. cont inuing the example.9999999N9+00221+99999999999... 0043011990999991950051518004..) The keys are the line offsets within the file. 0043012650999991949032418004. These line s are presented to the map function as the key-value pairs: (0. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.

0043012650999991949032412004..... and emits them as its output (the tempera ture values have been interpreted as integers): (1950.9999999N9+00221+99999999999. 0043011 990999991950051512004..9999999N9-00111+99999999999. So.) (106.9999999N9+00001+99999999999..) (424. 0043012650999991949032418004.. 004301199099999195 0051518004... . To visualize the way the map works.. 0043012650999991949032418004. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing.... These line s are presented to the map function as the key-value pairs: (0.. 111) (1949.) (212.. 22..0500001N9+00781+99999999999...9999999N9+00221+99999999999..) (212.. cont inuing the example..9999999N9-00111+99999999999. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.) (318...0051518004. 0043011990999991950051518004...9999999N9+00221+99999999999..0500001N9 +00781+99999999999..) (106. 22. -11) (1949.) (424.. suspect. cont inuing the example.9999999N9+00001+99999999999.... The map function merely extracts the year and the air temperature (indicated in bold text).. indicated by ellipses): 0067011990999991950051507004. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. -11]) Each year appears with a list of all its air temperature readings.. 22) (1950.... This processing sorts and groups the key-value pairs by key.. our reduce function sees the following input: (1949. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. 22) (1950.0500001N9+01111+99999999999.. So. suspect. . 0043012650999991949032412004. 0067011990999991950051507004. our reduce function sees the following input: (1949. To visualize the way the map works.. [0. 004301199099999195 0051512004.. .) The keys are the line offsets within the file. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing...... 78]) (1950.. [111.. These line s are presented to the map function as the key-value pairs: (0. 0043012650999991949032418004.. which we i gnore in our map function.99999 99N9-00111+99999999999.. The map function merely extracts the year and the air temperature (indicated in bold text)... [0. 111) (1949.. 004301199099999195 0051512004.. 0043011990999991950051518004.. 111) (1949. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.0500001N9+01111+99999999999.0500001N9+01111+9999999 9999.. 004301199099999195 0051518004...9999999N9-00111+99999999999. -11]) Each year appears with a list of all its air temperature readings.. 0043012650999991949032412004. [111. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. 78]) (1950... So. cont inuing the example.. 0043012650999991949032412004.. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.9999999N9+00001+99999999999.0500001N9+01111+99999999999..99999 99N9-00111+99999999999. 0043012650999991949032412004.. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. 0043012650999991949032418004...) The keys are the line offsets within the file. All the reduce function has to do now is iterate throu . 0067011990999991950051507004.) (318... 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year.... -11) (1949. [0. or erroneous..0500001N9 +00781+99999999999...0500001N9+01111+9999999 9999.. [111.9999999N9+00221+99999999999..) The keys are the line offsets within the file.. 0) (1950.. 0) (1950. which we i gnore in our map function.0500001N9 +00781+99999999999. 22) (1950.) (424. our reduce function sees the following input: (1949. 78]) (1950. The map function merely extracts the year and the air temperature (indicated in bold text).. -11]) Each year appears with a list of all its air temperature readings. 0043011 990999991950051512004.0500001N9+00781+99999999999...) (318.. which we i gnore in our map function. or erroneous. 0043012650999991949032418004.. indicated by ellipses): 0067011990999991950051507004. This processing sorts and groups the key-value pairs by key.9999999N9+00001+99999999999... 0) (1950.. This processing sorts and groups the key-value pairs by key... -11) (1949. 22..

9999999N9+00001+99999999999. 111) (1949... cont inuing the example. [111.) (212. [111.. or erroneous..........9999999N9-00111+99999999999.9999999N9+00221+99999999999....9999999N9-00111+99999999999..9999999N9+00001+99999999999. These line s are presented to the map function as the key-value pairs: (0....... 0) (1950.9999999N9+00221+99999999999. All the reduce function has to do now is iterate throu gh on it: finding the maximum temperature for each year. -11]) Each year appears with a list of all its air temperature readings.0500001N9+00781+99999999999. 004301199099999195 0051512004. our reduce function sees the following input: (1949. [0.. -11]) Each year appears with a list of all its air temperature readings. 0043011 990999991950051512004.. our reduce function sees the following input: (1949.99999 99N9-00111+99999999999..) (424......99999 99N9-00111+99999999999.. and emits them as its output (the tempera ture values have been interpreted as integers): (1950. 22. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing....9999999N9+00221+99999999999... So. -11) (1949..) The keys are the line offsets within the file..) (212. These line s are presented to the map function as the key-value pairs: (0.. . To visualize the way the map works.9999999N9+00001+99999999999. 22) (1950.. The map function merely extracts the year and the air temperature (indicated in bold text)...9999999N9+00221+99999999999. or erroneous. 22. .0500001N9+01111+9999999 9999.) (106. The map function is als o a good place to drop bad records: here we filter out temperatures that are mis sing. This processing sorts and groups the key-value pairs by key...) (106...) (318. 0043012650999991949032418004.gh on it: finding the maximum temperature for each year.0500001N9 +00781+99999999999.0500001N9+01111+9999999 9999.. 004301199099999195 0051518004.... 0043012650999991949032418004. To visualize the way the map works. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.. All the reduce function has to do now is iterate throu gh .. 0067011990999991950051507004. 111) (1949. and emits them as its output (the tempera ture values have been interpreted as integers): (1950.0500001N9+01111+99999999999. 78]) (1950.) (318. 78]) (1950.. The map function merely extracts the year and the air temperature (indicated in bold text).) (424...) The keys are the line offsets within the file. 0043012650999991949032412004.. 78) The output from the map function is processed by the MapReduce framework before being sent to the reduce function.....0500001N9 +00781+99999999999.. 0067011990999991950051507004. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page. [0. 0043012650999991949032412004. 0043011990999991950051518004.. consider the fo llowing sample lines of input data (some unused columns have been dropped to fit the page.. which we i gnore in our map function. -11) (1949..... This processing sorts and groups the key-value pairs by key. which we i gnore in our map function.... 0) (1950... 0043011990999991950051518004. suspect. 0043012650999991949032418004. 004301199099999195 0051518004. indicated by ellipses): 0067011990999991950051507004. 0043012650999991949032412004.9999999N9+00001+99999999999.0500001N9+01111+99999999999. suspect. So. 0043012650999991949032412004.. 0043011 990999991950051512004. indicated by ellipses): 0067011990999991950051507004. 004301199099999195 0051512004... 22) (1950.0500001N9+00781+99999999999... 0043012650999991949032418004.. cont inuing the example.