You are on page 1of 121

V o l .

22

Vol.

IT

22
www.itarchitect.jp

Apache Hadoop

Apache Hadoop

1,680
1,600

61503-93
2009 Printed in Japan


113-00333-4-5
200910 03-5800-2661

DIC124

024

026

034

Part 1

O/R

TCO

044

Part 2

056

Part 3

Web

Contents

Event Report

015

IT 2009 Report

Business Modeling

128

22
Vo l .

Communication Technique

080

092

IT

13

.indd 1

09/03/11 22:17

096

110

064

Apache Hadoop

065

Part 1

068

Part 2

075

Part 3

Contents

HadoopMapReduce/GFS
Hadoop
Hadoop

086

088

IT
022
085
109
134

.indd 1

22
Vo l .

News & Topics


Books
Present

09/03/11 22:17

IT Architect Summit

2 0 0 9

R e p o r t

218
IT
ITA2009

ITA 2009

ITVol.22

.indd 15

015

09/03/11 22:10

1ROI
Ret

urn On Investment

ROI

IT

IT

SQL

IT

ROI

ITA

IT

IBM


IBM

IT Architect

Summit 2009
R

016

IT

IT

ITA 2009
IT

ITA

ITA

ITVol.22

.indd 16

09/03/11 22:10

SIer

1 Part 1

UFJ

ITAIT

SIer

ITA 2009

IT


UFJIS IT

UFJ

UFJ IT

UFJIS3 IT

IT
2

UFJIS

IT

IT

UFJIS

3IT
UFJIS
2

4
UFJIS 400
20019CO2

ITVol.22

.indd 17

017

09/03/11 22:10

MSOSS

ASF

Apache Stonehenge


W3C
OASIS
SOA

MS3
7
MSASF
10

HPSolaris 10
HP ProLiant

KVMKernel-based
Virtual Machine

Linux OSRed Hat Enterprise Li


nux

Red Hat Enterprise Virtu


alization Manager for Servers
Desktops

Red Hat Enter


prise Virtualization Hypervisor

HP2

HPSolaris 10
HP
HP ProLiant

HP BladeSystem
Solaris

HP Insight
Solaris 10

HP Integrity
HP-UX

News & Top i cs


IT

IBM

IBM2

IBM Computing on De
mand
4
1

IBM Computing on Dem


and

CPU2
15
400

1CPU 1
150

022

news.indd 22

SAP
Business Suite 7

SAP 2ERP
CRMSCM

SAP Business Suite 7

enhancement package

11
5

TOGAF 9

2EA
TOGAF
The Open Group Archi
tecture Framework
Version 9

Web
htt
p://www.opengroup.org/togaf/

TOGAF
ADM
Architectu
re Development Method
SOA

IT

Exadata
1
Oracle Exadata

HP

HP Oracle Exad
ata Storage Server
Oracle
Database 11g Enterprise Edition

HP Oracle Database
Machine
Stora
ge ServerOracle Database
2 Intel

IT Vol.22

09/03/11 21:42

IBM4DITA

IBM

2
XML
DITA
Darwin Information Typi
ng Architecture

DITA

OASIS
DITA

DITA
DI
TA

VMwareMicr
osoft Hyper-V ServerCitrix Xen
Server

E vent C a l e n d a r
4
Qcon Tokyo 2009

49
10

QCon Tokyo 2009


E-mailqcontokyo2009@c-sq.com
URLhttp://qcontokyo.com/

Pulse Japan 2009

415

ibm.com
0120-300-426
URLhttp://www-06.ibm.com/itsolutions/jp/

solutions/governance/events/pulse/

Agile Japan 2009

422

2F
2009
03-5414-5363
E-mailinfo@agilejapan.org
URLhttp://www.agilejapan.org/

Oracle OpenWorld Tokyo 2009

SaaS

IBMTeam Concert

NTT 3
Webi
ntra-mart WebPlatform Ver.7
SaaS
Software as a Service

SaaS

SaaS

intra-mart
intra
-mart
intra-martSaaS

IBM2
IBM Rational Team Concert
V.1.0.1.1
Re
quirements Composer V1.0
Quality Manag
er V1.0
IBM

Jazz
Team Concert

HP

422
24

E-mailinfo-openworld_jp@oracle.com
URLhttp://www.oracle.co.jp/openworld/2009/

513
15

SODEC
03-3349-8504FAX03-3349-8500
E-mailsodec@reedexpo.co.jp
URLhttp://www.sodec.jp/SODEC/

6
Interop Tokyo 2009

68
12

URLhttp://www.interop.jp/

HP 2

HP Insight Dynamics
- VSE 4.1
Orchestration
Recovery

HP SE2120

HP
SE2120

Linux

378,000

1IT
SIer
IT40

IT

IT

2009
79
12


03-3349-8507FAX03-3349-8523
E-maildigi@reedexpo.co.jp
URLhttp://www.digi-fair.jp/

BioFuels World

722
24

BioFuels World
03-5800-3534FAX03-5800-3979
URLhttp://www.biofuels.co.jp/2009/

2009

722
24


03-3834-8134FAX03-3834-8009
E-mailwjc@ric.co.jp
URLhttp://www8.ric.co.jp/expo/wj/

IT Vol.22

news.indd 23

023

09/03/11 21:42

024

IT Vol.22

toku01.indd 24

09/03/11 23:02




IT Vol.22

toku01.indd 25

025

09/03/11 23:02

Tsutomu Hosokawa

026

PC

IT

SIer

IT Vol.22

toku01.indd 26

09/03/11 23:02

EC
SNSSocial Networking Service

Web

Web

Web

IT Vol.22

toku01.indd 27

027

09/03/11 23:02

123

Web

Web

CPU

21

CPU

2
CPUCPU

1
2

1990
PC

CPU 64
PC

CPU

CPU
CPU
CPU
CPU

...

028

IT Vol.22

toku01.indd 28

09/03/11 23:02

WebWeb

CPU

Oracle TimesTen In-Memory Databa

10GB

seCash Connect to

Oracle

3
TimesTen
SQL

SQL

SQL

SQL

TimesTen

TimesTen

TimesTen

Cash Connect
to Oracle

Cash Connect
to Oracle

Cash Connect
to Oracle

Oracle Database

toku01.indd 29

IT Vol.22

029

09/03/11 23:02

SQL

Web

Web

HTTP

Web

Web

JBoss

Web

Cache

Microsoft Velocity

Oracle Coherence

Part 2Coherence

Web

Web

030

Web

WebWeb

IT Vol.22

toku01.indd 30

09/03/11 23:02

SQL

Java EE4
2

2Java EEJDBCAPI
API4Oracle Database
2

ATM
ATM

ATM

Web

ATM

001020

021040

041060

061080

081100

IT Vol.22

toku01.indd 31

031

09/03/11 23:02

1 OS

OS
Web

VMMVirtual Machine Mo

nitor3
OS

VMM

2OSOS

IT
IT

CPU

VMM

OSWeb

3VMM
OS

4VMM2
VMM
2

Windows

UNIX

Linux

VMM

OS

OS

032

OS

Windows

UNIX

Linux

VMM

IT Vol.22

toku01.indd 32

09/03/11 23:02

Windows

UNIX

UNIX

Linux

Linux

VMM

Windows/UNIX/Linux
OS

IT Vol.22

toku01.indd 33

033

09/03/11 23:02

P a r1 t

P a r t

O / R


T C O

RDBMS

RDBMS
IT

TCOTotal Cost of Ownership

RDBMS

TCO

IT

Yuzo Ishida

034

IT Vol.22

toku01.indd 34

09/03/11 23:02

12

CPU
I/O

21

CPU

CPU

IT Vol.22

toku01.indd 35

035

09/03/11 23:02

P a r t

O / R

1I/O

RDBMS

CPUI/O

CPU

036

CPU

CPU

IT Vol.22

toku01.indd 36

09/03/11 23:02

3RDBMS

I/O

SQL

I/O

CPU

CPU

RDBMSCPU

TCO

CPU

Java

SQL

1SQL
SQL

IT Vol.22

toku01.indd 37

037

09/03/11 23:02

P a r t

O / R

Record/Table R/

CPU

1R/T

I/O

R/T

CPU

R/T

CPU

SQLRDBMS

CPU

I/O

R/T

I/O

CPU

1R/T
a1b1c1

b1

c1

a2b1c1

b1

c1

a3b1c2

b1

c2

a1b1c1
a2b1c1

c1
b1

a3b1c2

c2

a1b1c1
a2b1c1
a3b1c1

038

b1
b2
....

c1
c2
....

IT Vol.22

toku01.indd 38

09/03/11 23:02

SQL DML

Data Manipulation Language

DML

AP

DMM

AP

I/O

DMM 1

DMM1

AP

CPU

CPU

DMMData Model Management

1DMM
DMM
DMM

2AP
I/O

A
B
C

ID

AP

ID
1
2
3
4
5
6

AP

AP

IT Vol.22

toku01.indd 39

039

09/03/11 23:02

P a r t

O / R

DMM

I/O

RDBMS

I/O

I/O

I/O

I/O

I/O

I/O

I/O

DMM

3O/R

DMM

AP

R/T

040

IT Vol.22

toku01.indd 40

09/03/11 23:02

CPU

I/OI/

I/O

2
I/O

SQL

CPU

SQL

CPU

O/R

DMM

AP

DML)

R/T

AP

IT Vol.22

toku01.indd 41

041

09/03/11 23:02

P a r t

O / R

I/OCPU

SQL

SQL

SQL

AP

I/O

CPU

CPU

R/TAP

CPU1

042

IT Vol.22

toku01.indd 42

CPU

CPU2

09/03/11 23:02

RDBMSCOBOL

I/O

RDBMS

http://www.itarchitect.jp/

O/R

TCO

62

201

2
3
4
5
6

200

3
200

K2

1
201

ID

200

201

ID

K1

K2

201

201

2
3
4
5
6

toku01.indd 43

100

200

201

IT Vol.22

043

09/03/11 23:02

P a r t

Tatsuya Sugi

Fusion Middleware
Fusion Middleware

Tomofumi Nijo

Noriyuki Abe

Fusion Middleware
Fusion MiddlewareSC

044

IT Vol.22

toku01.indd 44

09/03/11 23:02

I/O

RDBMS

RDBMS

RDBMS

ROIReturn On Investment

ROI

RDBMS

IT Vol.22

toku01.indd 45

045

09/03/11 23:02

P a r t

RDBMS

JBoss Cache
1

1 RDBMS

Java Map APIput/get

RDBMS

Java

RDBMS

SQL

RDBMS

1
RDBMS
SQL
API JDBCODBC

046

put/get

put/get
API

Oracle TimesTenIBM solidDB

API

API

memcachedJBoss Cache

Oracle CoherenceGemstone GemFi


reGigaSpaces XAP

IT Vol.22

toku01.indd 46

09/03/11 23:02

RDBMS

Java API

MapReduce

ITWeb 2.0

1JBoss Cache 3.0


Java Map APIJava

// Person
// CacheFactoryDefaultCacheFactoryCacheFqnNodeJBoss Cache
CacheFactory factory = new DefaultCacheFactory();
//
Cache cache = factory.createCache();
Fqn personData = Fqn.fromString("/person");
// Person
Node personNode = cache.getRoot().addChild(personData);
// Person
Person p1 = new Person(1234, "", "", "");
//
personNode.put(1234, p1);

1
1

2Oracle Coherence

JBoss Cache

// Person
// CacheFactoryOracle Coherence
// Person
Map personCache = CacheFactory.getCache("person");
// Person
Person p1 = new Person(1234, "", "", "");
//
personCache.put(1234, p1);

toku01.indd 47

IT Vol.22

047

09/03/11 23:02

P a r t

get

put
A

CPU

1,000

1 1,000

21

048

IT Vol.22

toku01.indd 48

09/03/11 23:02

41,000

11,000

111

2,000

500

1
2

22

Person

get
P
B

put

P
B

P
B

P
B

P
B

P
B
3

P
B

A
D

1
2AA
4D

IT Vol.22

toku01.indd 49

049

09/03/11 23:02

P a r t

MapEvent

Oracle Coherence

2put/get

100

RDBMS

0.1

Coherence

Coherence3

Coherence

Invocation

Java

Java

// 100
Filter filter = new EqualsFilter("getProductId", 100);
//
Set<Order> orders = orderCache.entrySet(filter);

//
public class ProcessOrderListener implements MapListener {
public void entryInserted(MapEvent event) { /* */ }
public void entryUpdated(MapEvent event) {}
public void entryDeleted(MapEvent event) {}
}

5
orderCache.addMapListener(new ProcessOrderListener());

050

IT Vol.22

toku01.indd 50

09/03/11 23:02

Coherence

HTTP

Invocation

toku01.indd 51

IT Vol.22

051

09/03/11 23:02

P a r t

HTTP

JVM

HTTP

GC

GC

GC

OutOfMemoryError

HTTP

77

GC

052

IT Vol.22

toku01.indd 52

09/03/11 23:02

7HTTP

3
HTTP

GC

IT Vol.22

toku01.indd 53

053

09/03/11 23:02

P a r t

Coherence

054

IT Vol.22

toku01.indd 54

09/03/11 23:02

10

10

Java EE

Java EE

Java EE

Java EE

Java EE

Java EE

MQ/JMS

toku01.indd 55

MQ/JMS

IT Vol.22

055

09/03/11 23:02

P a r3 t

P a r t


W e b


Web1

24 365

Kazuhiko Komabayashi

CA

1
2007Web
1
Web

Web

Web

SOA

Service Oriented Architecture

Web

24
365

056

IT Vol.22

toku01.indd 56

1Web 1

Typical Revenue Loss per Hour of Do


wntime, 2007
Web

650

260

TV

15

TV

113,000

89,500

69,000

28,000

ATM

14,500

09/03/11 23:02

SLAService Level Agreement

UNIX

IT

CPU

RDBMS12

Web

Web

30

CPU

COBITCo

ntrol Objectives for Information and related Te

chnology

ITILIT Infrastructure Library

COBIT
ITIL

Web

Web

IT Vol.22

toku01.indd 57

057

09/03/11 23:02

P a r t

W e b

SLA

Web

End-User Experience Management

Web

SLA

UNIX

Web

058

Web

IT Vol.22

toku01.indd 58

09/03/11 23:02

CPU

SOA

SOA

SOA

Web

SaaSSoftware as a Servi

ce
PaaSPlatform as a Service

SOA

IT Vol.22

toku01.indd 59

059

09/03/11 23:02

P a r t

W e b

SOA

IT

SLA

SLA

SLA

SLA SLA

Java

Web

060

IT Vol.22

toku01.indd 60

09/03/11 23:02

APM

Web

APM

Application Performance Management

APM

Web

CAAPM
Wily Intr

oscope

Wily CEM
APM

CPU

APM

IT Vol.22

toku01.indd 61

061

09/03/11 23:02

P a r t

W e b

APM

Web

APM

SLA

SLA

SLA

Introscope

EC
1,000

APM

APM1

062

1,300

My Circle

24 365Web

1SLA
Introscope

Introscope

IT Vol.22

toku01.indd 62

09/03/11 23:02

Web

Web

WebLogic ServerWebSphere Applicati

on ServerWeb

APM

Oracle DatabaseEAI

1 7

1 5,000

Web

Web

Web

Web

Web

APM

9 Web

2,000

My Circle

3APM
Introscope

APM3

JSP
Web

IT Vol.22

toku01.indd 63

063

09/03/11 23:02

MapReduce
Apache Hadoop
MapReduceGFS
Google File System
Hadoop

Hadoop
3
Hadoop

Yohei Sasaki
Michihiro Koide
Hiroshi Itou

IBM

064

IT Vol.22

022_.indd 64

09/03/11 22:59

Part 1

Hadoop

MapReduce/GFS

Apache HadoopApache HTTP

ServerTomcatStruts

Hadoop

15

HadoopMapReduce

OSS

1
Map

Reduce

Core

3Hadoop

HadoopHadoop Core

Hadoop
HDFSHadoop Distrib

Hadoop1

uted File System

Hadoop

Hadoop

HadoopWeb
http://hadoop.apache.org/

core/
Hadoop

Hadoop

Hadoop

Hadoop

Hadoop

MPI Message Passing

Interface


Hadoop

1Hadoop

1Hadoop
Apache Hadoop
Core
Core

Hadoop

3MapReduce

Hadoop

IT Vol.22

022_.indd 65

065

09/03/11 22:59

1Hadoop

HDFS

Hadoop1

Map
Reduce

MapReduceMap

CPU

Reduce2Map

MapReduce
HadoopMapReduce

1
1 ReduceMap

HDFS

MapReduce

MapReduceGFS

MapReduce

1512

Hadoop

Apache HadoopOSS

2Map1

HadoopMapReduceHDFS

Reduce5
2

MapReduceGFS

46810

MapReduce/GFS

Map

1Apache HadoopOSS

Apache HadoopOSS

Hadoop Core
MapReduce Framework

MapReduce

Hadoop Core
HDFS

GFS

ZooKeeper

Chubby

hBase

BigTable

Part 1

1 <key,
value>3

Hadoop

MapReduce/GFS

066

IT Vol.22

022_.indd 66

Apache
09/03/11 22:59

2152

30

Reduce

246810

246810

1
Map

12

22

32

42

52

<,1><,5>
<,2><,3>
<,4>
Map3

<2,2><2,10>
<2,4><2,6>
<2,8>

MapRedu
ce

GFS
HDFSGFS

GFS

Reduce

100MBGB

<2 ,18><2
,12>

append

Reduce1

GFS3

MapReduce

GFS1

Map

Reduce

64MB

MapReduceSimplifi

ed Data Processing on Large Clusters

Hadoop
022_.indd 67

MapReduce

IT Vol.22

067

09/03/11 22:59

Map

3MapReduce

Map
Map

MapReduceMapReduce

GFS

3GFS
MapReduce

GFS

Map

CPU

CPU

CPU

Part 2

CPU

Hadoop

Hadoop

1MapReduce

Hadoop

Hadoop MapReduce

Hadoop2

FrameworkMap/Reduce

Map/ReduceMapReduce

Hadoop2

Framework

HadoopMapReduce

1 HDFS

GFSJavaOSSMapRe

HDFS

duceGFS2

Map/ReduceHDFS

068

IT Vol.22

022_.indd 68

Apache
09/03/11 22:59

Hadoop 0.19.0

JobTrackerJob
TrackerMapRe

MapReduce Framework

duce FrameworkJobClient

MapReduce Framework

JobConf

Hadoop

JobTracker

Map/Reduce
JobTrack

JobConfMap/Reduce

er

TaskTracker
2

4JobTrackerTaskTracker

JobClient

JobClientJobTracker

Hadoop
Had

JobTracker

oop1JobTracker

TaskTracker

MPI

JobTrackerMapReduce

Hadoop

First come, first served

mainJobClient

TaskTracker
TaskTrackerJobTrackerMap
/ReduceMap
Map
Reduce
Reduce

Map
map

Reduce
reduce

JobTracker

4MapReduce Framework
JobTrackerTaskTracker
JobTracker

JobTrackerHDFS

JobTracker

HDFS

Map/Reduce

Task
Tracker

Task
Tracker

Task
Tracker

Map/
Reduce

Map/
Reduce

CPU

Map/
Reduce

CPU

CPU

Hadoop
022_.indd 69

Map/
Reduce

CPU

Task
Tracker

IT Vol.22

069

09/03/11 22:59

MapReduce Framework 5

MapReduce

HDFS

HDFS

Mapmap

Combiner

Map

HDFS

MapReduce

Reduce

MapReduce Framework

ReduceJobConf

Shue/SortMap

6HDFS

<key,{value1,value2,val

NamenodeDatanode HDFS

ue3,...valueN}>

Reducereduce

Namenode

Reduce

MapReduce Framework

5MapReduce Framework
Map

Combiner

ShueSort

Reduce

Task
Tracker

Task
Tracker

Task
Tracker

Task
Tracker

JobTracker

Part 2

Hadoop

070

IT Vol.22

022_.indd 70

Apache
09/03/11 22:59

6HDFS

Namenode

Namenode

Datanode

Datanode

Datanode

CPU

CPU

CPU

DatanodeGFS

HDFS1

GFS

64MB
Datanode

HDFS

Namenode

NamenodeDatanode

Datanode

Namenode

HDFS

HDFS

HDFSMapReduce

HDFS

HDFSMapReduce

HDFS

HDFS

HDFS

1
1

Hadoop
022_.indd 71

Hadoop
Hadoop

IT Vol.22

071

09/03/11 22:59

Hadoop

Hadoop3

HadoopWeb
http://hadoop.apache.

1JavaHadoop

org/core/releases.html#Download
Hadoop

NamenodeJobTracker

TAR

0.19.0tar

/home/hadoop/hadoop-0.19.0

Java

hadoop-0.19.0

confbin
hadoop

MapReduce

HADOOP_HOME

hadoop.bashrc1

OSCent OS 5.2
JavaJDK 6u12
HadoopHadoop 0.19.0
localhost
Firewall/SELinuxDisabled

export HADOOP_HOME=/home/hadoop/hadoop-0.19.0

$ . .bashrc

HadoopSSH
SSHSSH

Hadoop3

SSH

IPv6Hadoop0.0.0.0

IPv6
IPv6

$
$
$
$

ssh-keygen -t rsa
cat .ssh/id_rsa.pub >> .ssh/authorized_keys
chmod 600 .ssh/authorized_keys
ssh localhost

HadoopHadoop

Hadoop

hadoop

${HADOOP_HOME}/conf/

JDK 6.0Hadoop0.19.0

hadoop-env.shJAVA_HOME

JDK 6.0

Java

Web

http://java.sun.com/java

1JAVA_HOME

se/ja/6/download.html

Hadoop

hadoop-default.xmlHadoop

hadoop-site.xml

5Amazon EC2Elastic Computer Cloud

HadoopAMI
Amazon Machine Image
OpenSolaris
HadoopLive CD
Hadoop
Part 2

hadoop-default.xml

Hadoop

072

IT Vol.22

022_.indd 72

Apache
09/03/11 22:59

1hadoop-env.shJAVA_HOME

HDFS
hadoop.tmp.dir/tmp
Linuxtmpwatch
2-
fs.default.namemapred.job.tracker
NamenodeJobTracker
2-dfs.replication

1
1
2-
mastersNamenode
11

slavesTaskTrackerDatanode
masters11

# The java implementation to use. Required.


export JAVA_HOME=/usr/java/default

2hadoop-site.xml
<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<!-- Put site-specific property overrides in this file. -->
<configuration>
<property>
<name>hadoop.tmp.dir</name>
<value>/home/${user.name}/hadoop-tmp/</value>
</property>
<property>
<name>fs.default.name</name>
<value>hdfs://localhost:51010</value>
</property>
<property>
<name>mapred.job.tracker</name>
<value>localhost:52010</value>
</property>
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
</configuration>

HDFSName
node

$ /home/hadoop/hadoop-0.19.0/bin/hadoop namenode -format

HadoopWikihttp://wiki.apache.org/hadoop/

Hadoop

WordCounthttp://wiki.apache.org/

$ /home/hadoop/hadoop-0.19.0/bin/start-all.sh

$ /home/hadoop/hadoop-0.19.0/bin/hadoop dfsadmin -report

hadoop/WordCount

WordCount

HadoopWikiWordCount
13
MapReduce

Webhttp://localhost:50070

HDFS

dfs.http.address50070

/home/hadoop/mydomain/

mypkg

JobTraker

WordCount
WordCount.java

$ /home/hadoop/hadoop-0.19.0/bin/hadoop job -list

WordCount
JobTrackermain

Webhttp://localhost:50030

3JobConf

JobTraker

Map/Reduce

mapred.job.tracker.http.

address50030
Hadoop

Hadoop
022_.indd 73

6
Web
http://www.itarchitect.jp/

IT Vol.22

073

09/03/11 22:59

HDFS

MapWordCount

MyMap
MyMap.java

JobConfTextInputFor

MapMapReduceBase

mat1 <key

Mapperimplements

,value>

4
LongWritable

mapmap1

Text
Text
IntWritab

<,1>

le

Mapper
Gene

MyReduce
MyReduce.java

rics

ReduceMapReduce

BaseReducerimplements

5MyMap

3WordCount
WordCount.java

public class WordCount {


public static void main(String[] args) throws Exception {
JobConf conf = new JobConf(WordCount.class);
conf.setJobName("wordcount");
conf.setOutputKeyClass(Text.class);
conf.setOutputValueClass(IntWritable.class);
conf.setMapperClass(MyMap.class);
conf.setCombinerClass(MyReduce.class);
conf.setReducerClass(MyReduce.class);
conf.setInputFormat(TextInputFormat.class);
conf.setOutputFormat(TextOutputFormat.class);
FileInputFormat.setInputPaths(conf, new Path(args[0]));
FileOutputFormat.setOutputPath(conf, new Path(args[1]));
JobClient.runJob(conf);
}
}

4MyMap
MyMap.java

public class MyMap extends MapReduceBase


implements Mapper<LongWritable, Text, Text, IntWritable> {
private final static IntWritable one = new IntWritable(1);
private Text word = new Text();
public void map(
LongWritable key, Text value, OutputCollector<Text,
IntWritable> output, Reporter reporter)
throws IOException {
String line = value.toString();
StringTokenizer tokenizer = new StringTokenizer(line);
while (tokenizer.hasMoreTokens()){
word.set(tokenizer.nextToken());
output.collect(word, one);
}
}
}

Reducereduce
MyMapmap<,1>
<,{1,1,1,1,1,}>
reduce
<

,
>

JAR
$ cd /home/hadoop/mysample/mydomain/mypkg/
$ javac -classpath /home/hadoop/:/home/hadoop/hadoop-0.19.0/hado
op-0.19.0-core.jar WordCount.java
$ cd /home/hadoop/
$ jar cvf mysample.jar ./mydomain

HDFS
Hadoophadoop

$ bin/hadoop fs -mkdir /user/hadoop/input


$ bin/hadoop fs -copyFromLocal /home/hadoop/messages /user/hadoop
/input/

HDFS
5MyReduce
MyReduce.java

public class MyReduce extends MapReduceBase implements Reducer


<Text, IntWritable, Text, IntWritable>{
public void reduce(
Text key, Iterator<IntWritable> values,
OutputCollector<Text, IntWritable> output, Reporter reporter)
throws IOException {
int sum = 0;
while (values.hasNext()){
sum += values.next().get();
}
output.collect(key, new IntWritable(sum));
}
}

Part 2

hadoop

$ bin/hadoop jar /home/hadoop/mysample.jar


mydomain.mypkg.WordCount /user/hadoop/input /user/hadoop/output

Hadoop

074

IT Vol.22

022_.indd 74

Apache
09/03/11 22:59

Part 3

Hadoop

Hadoop

Excel

Hadoop

0.19.01Hadoop

Wiki

Wiki

QAMapRed

Hadoop

uce

Hadoop

QA

Hadoop

Hadoop

Hadoop

Hadoop

HadoopRDBMS

Hadoop

QAHDFS

QA

QA

HDFS

QA

7- Hadoop

JavaMapReduceMapRe

10QA

duce1

QA

HDFS 7-

7-

QAQA

Hadoop
022_.indd 75

Hadoop

7-

IT Vol.22

075

09/03/11 22:59

QA
QA
QA

QA

HDFS

Map

Reduce

Map

Reduce

Map

Reduce

Hadoop

Web

Map

MapReduce

Reduce

MapReduce

Hadoop

MapReduce

Hadoop

QAQA

Hadoop

HDFS

1 1 QA

JSON
JSON

9 10

JSON <key,value>

MapReduce

MapReduce

JSON

Map1QA

MapReduce

{"QAStartDate" : "2004/02/01", "Category" : ["Web", "DataManagemen


t"], }

MapReduce
MapReduce
Part 3

7
7 9 10
Hadoop

Hadoop

076

IT Vol.22

022_.indd 76

Apache
09/03/11 22:59

8MapReduce
Map

MapReduce

120

960

260

1,560

300

11

3,300

HDFS
Reduce

2
Hadoop

5,820

HDFSNamenode
8Hadoop

MapReduce

HDFS

1HDFS

XenVMware

HadoopXen

HDFS

VMware

Hadoop

8NamenodeNamenode

9
1200

1000

800

2
4
6
8
10

600

400

200

0
1

10

20

30

40

50

60

70

80

90

100

10
1200

1
10
20
30
40
50
60
70
80
90
100

1000

800

600

400

200

0
2

Hadoop
022_.indd 77

10

IT Vol.22

077

09/03/11 22:59

HadoopHDFS

TCP/IPHDFS

11Hadoop

Xen/VMwareHa

doop192.

168.128.0/24LANVMware/

Xen 192.168.128.0/24

Hadoop

Hadoop

Hadoop

Part 1Hadoop

Hadoop

11

http://www.forrester.com/Research/Document/

Excerpt/0,7211,47100,00.html

9OS

11HDFS
192.168.0.254
Hadoop

dfs -cat /path/to/secret

hadoop

192.168.0.0/24
Hadoop Hadoop Hadoop Hadoop
HDFShadoop

OS

HDFSHDFS
OShadoopdfs
Part 3

Hadoop

078

IT Vol.22

022_.indd 78

Apache
09/03/11 22:59

12Hadoop

Hadoop

QA

QA

QA

HDFS

Apache Pig

Pig Latine

10

PigHadoop

Hadoop

12

Hive

Hadoop

HiveQL
SQL

HadoopOSS

10
SLA
Service Level Agreement

1
99.9
20

24

0.001
1
0.48

Hadoop
DatanodeTaskTracker Namenode
JobTracker
0.48

HBaseApache PigHive

HadoopMapReduce
FrameworkHDFSApa
che HadoopOSS

Hadoop
Wiki
OSSWiki

HBase
BigTable
HDFS

HBaseHBase
HDFS

Hadoop
022_.indd 79

MapReduceSimplied Data Processing on Large Clust


ers
URLhttp://labs.google.com/papers/mapreduceosdi04.pdf
The Google File System
URLhttp://labs.google.
com/papers/gfs-sosp2003.pdf
The Chubby lock service for loosely-coupled distribu
ted systems
URLhttp://labs.google.com/papers/
chubby-osdi06.pdf
BigtableA Distributed Storage System for Structured
Data
URLhttp://labs.google.com/papers/bigtableosdi06.pdf

IT Vol.22

079

09/03/11 23:00

IT
IT

IT

Takashi Arai

11

2,500

1991

080

IT Vol.22

.indd 80

09/03/11 21:37

11

13

IT Vol.22

.indd 81

081

09/03/11 21:37

SCMSupply Chain Management

082

IT Vol.22

.indd 82

09/03/11 21:37

Everio GZ-MG330

Everio GZ-

MG3305117

Everio GZ-MG330

11

IT Vol.22

.indd 83

083

09/03/11 21:37

3-3

10

IT

33IT

IT

IT

IC

100

IT

IT

IT

084


100

IT Vol.22

.indd 84

09/03/11 21:37


James Shore

3,780

B o oks

1
XP
1

2,310

882

1,575

Release It!

Michael T. Nygard

3,780

IT

IT
1714

57
IT

IT

Java
Scott W. Ambler

1,995

Java

ITVol.22

book.indd 85

085

09/03/11 22:43

EAI

1990

SOA
ESBEnterpr

SCM
EAI
SOA

ise Service Bus

ERPRDBMS

1985

Web 2.0

UNIX

SOA
Web 2.0

X Wi

ndow System

SOABPMB

usiness Process Management

Vol.

IT

21

Hiroaki Oizumi

IT

1990

SCM

IT

IT

IT

IT

IT

SCM

2004

IT

EAI1

ERP

Masanori Sakaguchi

KOYO

P e r s o n a l

086

.indd 86

H i s t o r y

o f

T o p

A r c h i t e c t

IT Vol.22

09/03/11 22:20

1993

1997
1999

2002
2004
2006

2009

UNIXC/S

SCM

EAI
SOA

SOACRM

SOA

IT

1959
1985

IT Vol.03
0 8
7

.indd 87

09/03/11 22:20

To Do

To Do

To Do

4
5

100

To Do

Outlook

To

To Do

To Do

30

IT

IT

IT

Do

To Do

David Allen
David Allen

Vol.20

Norio kasahara

1 1

3OutlookTo Do

41

1
PC

2 5

088

.indd 88

IT Vol.22

09/03/11 14:55

10

IT

11

67

PDA

PC

11

8PDA
PC

PC

12

IT

Analyzing

the
architecture
freely

6
10

8 Gmail 1

11
7

12

IT Vol.22

.indd 89

089

09/03/11 14:55

14

13

2 Allen

FAX

15

2
LT
GPS

10km

LSDLong Slow Distance

13

14

151

090

.indd 90

IT Vol.22

09/03/11 14:55

20

19

Mandal-Art
http://www.ma

ndal-art.com/
9

17

18

Vol.20

IT

16

To DoTo Be

16


19Allen
17

20
18Allen


IT Vol.22

.indd 91

091

09/03/11 14:55

Mizuhiro Kaimai

Vol.21

13

3
3
A

2
A

12
3

1
C

2
A

092

IT Vol.22

.indd 92

09/03/11 20:40

11











A1.


A2.

A
B
2

A1BC

B C A2

12
A1 B C

A1.

B.

C.

A2.

IT Vol.22

093

.indd 93

09/03/11 20:40

A2

2
A

UML


A








IDEF1X

UML

094

IT Vol.22

.indd 94

09/03/11 20:40

3 1

13

3 A 1 B C

A2 4

311

2 3

33

1
23

13

Vol.21

IT Vol.22

095

.indd 95

09/03/11 20:40

09/03/11 21:35

022_toku2.indd 96

IT Vol.22

096

IT

IT

IT

MALT

Koichi Hayashi

IT Vol.22

022_toku2.indd 97

097

09/03/11 21:36

What

So What/Why So
So

Why So

MECE

So

What/Why So
MECE

MECE

MECE
Mutually Exclusive and Collectively

Exhaustive

So What/Why So

MECE

13

So What/Why So

MECE

098

IT Vol.22

022_toku2.indd 98

09/03/11 21:36

MECE

1
MECE

IT

MALTMo
deling As Logical Thinking
2

IT

MALT2
1IT

3C

PDCA

IT

Modeling

IT Vol.22

022_toku2.indd 99

099

09/03/11 21:36

2MALTIT

MALT

So What/Why So

MECE

IT

MALT

MALTIT

1
100ml

40
1

100ml

100100

200ml

MALTIT

ERUML

MALT
2

100

IT Vol.22

022_toku2.indd 100

09/03/11 21:36

3
IT

11 1

2 3

IT

20
40020

MALT

40/100
100
0/100
100/100100
20

MALT7

IT

IT Vol.22

022_toku2.indd 101

101

09/03/11 21:36

4MALT7

MALT

MVC

IT

MALT

102

IT Vol.22

022_toku2.indd 102

09/03/11 21:36


7 FIRE

AIM

5Web

Web

So What

Why So

4
DQQDocument Quality Quad

Web

MALT

AIMFIRE

6-

6-

Web

6-

IT Vol.22

022_toku2.indd 103

103

09/03/11 21:36

Web

Web

FactInterpretation

REsponse
3

FIRE

73

AIMFIRE

MALTA

Web

IMFIRE

FIRE
AI
M2
FIRE
FIRE

104

31980Donald A. Norman

IT Vol.22

022_toku2.indd 104

09/03/11 21:36

7FIRE

Interpretation
Fact

REesponse

Web

8 6-

AIM

6-

6-

6-

6-

MALT

AIM
9

Ad

vanced Interpretation Matching

Web

AIM

9AIM

IT Vol.22

022_toku2.indd 105

105

09/03/11 21:36

AIM

Web

Web

3
Web

AIM

Web

10-

Web
10

10Web

Web

Web

106

Web

IT Vol.22

022_toku2.indd 106

09/03/11 21:36

MAL
T
Web

10

10-

Web

11 11-

Web

Web

Web

MALT

11

11-

10-

11-

IT Vol.22

022_toku2.indd 107

107

09/03/11 21:36

11

Web

MALT

5IDG

7MALT

IT

MALT

IT

IT

MALT

5
IT

IT

IT

IT

Web5


2,000

2009524

108

IT Vol.22

022_toku2.indd 108

09/03/11 21:36


IDG
1

1
2

1
4
4
2

IDG
1

2
PLAY Mou
se
1
PC

OSWindo
ws 2000/XP/VistaMac
OS X 10.3.9

IDG
1

12

2009424

11
Vol.23
2009525

P resent

19cm
5cm

4
Vol.21
DSi

USB

IT

Java

5
85
Java
2

Java

85

IT

6
85

IT Vol.22

Present.indd 109

109

09/03/11 23:00

100IT

Noriaki Okui

09/03/11 14:49

toku03.indd 110

IT Vol.22

110

IT

1IT
http://www.gartner.co.jp/b3i/analy

st/090107/
2003 6

IT

IT

200711IT

4 IT

IT

PM

IT Vol.22

toku03.indd 111

111

09/03/11 14:49

IT

IT

NHK

2007

PM

112

IT Vol.22

toku03.indd 112

09/03/11 14:50

13

IT Vol.22

toku03.indd 113

113

09/03/11 14:50

114

IT Vol.22

toku03.indd 114

09/03/11 14:50

IT Vol.22

toku03.indd 115

115

09/03/11 14:50

IT

116

IT Vol.22

toku03.indd 116

09/03/11 14:50

IT Vol.22

toku03.indd 117

117

09/03/11 14:50

IT

IT

118

IT Vol.22

toku03.indd 118

09/03/11 14:50

11

IT Vol.22

toku03.indd 119

119

09/03/11 14:50

5 BMW400 NHK

120

IT Vol.22

toku03.indd 120

09/03/11 14:50

IT Vol.22

toku03.indd 121

121

09/03/11 14:50

IT

122

IT Vol.22

toku03.indd 122

09/03/11 14:50

SWOT

SWOT

SWOTSStrengthsWWeaknes

ses OOpportunities T

Threats30

A
B

SWOT2

SWOT

2A

SStrengths

WWeaknesses

3A

OOpportunities

TThreats

4
S

IT Vol.22

toku03.indd 123

123

09/03/11 14:50

A
B
C

A
B
C

A
B
C

A:
B

A
B
C
A
B
C

BC

BC

A
B
C

A
B
C

124

IT Vol.22

toku03.indd 124

09/03/11 14:50

A
B
C

A
B
C

A
B
C

A
B
C

A
B
C

A
B
C

A
B
C

A
B
C

A
B
C

A
B
C

A
B
C

A
B
C

A
B
C

A
B
C

A
B
C

A
B
C

A
B
C

A
B
C

IT Vol.22

toku03.indd 125

125

09/03/11 14:51

126

IT Vol.22

toku03.indd 126

09/03/11 14:51

IT Vol.22

toku03.indd 127

127

09/03/11 14:51

IT BPMS
Business Process Management Suites
BPMS IT

Satoshi Akeniwa

I
T

I
T

!!

1
10

BPM
N

128

IT Vol.22

BPM.indd 128

09/03/11 22:32

1
1

As Is
To Be
2

2
3

BPMS

IT Vol.22

BPM.indd 129

129

09/03/11 22:32

21

32

10

10

130

IT Vol.22

BPM.indd 130

09/03/11 22:32

3 1

52

IT Vol.22

BPM.indd 131

131

09/03/11 22:32

1015

1035

1540

1825

1035

1330

1330

1540

810

132

65W1H

5W1H

IT Vol.22

BPM.indd 132

09/03/11 22:32

65W1H
What

Who

Why

Where
How

When
WhatWhyWho
WhenWhereHow

BPMN

2
7

IT

7BPMN

IT
IT

IT

IT

IT Vol.22

BPM.indd 133

133

09/03/11 22:32


Author's Profile

1993 1999
IT

2001

MBA

Java/SOA
WebLogic
Application Grid

Japan OSOA
Community Java

2001

Cohere
nce

P.026

P.034

P.044

P.044

2000
Java/C++Web
2006
Coheren
ce/WebLogic

SI

RDB
ORDB

XML WebSOA

IT


IBM

P.044

P.056

P.096

P.110

20

CIO

134

IT Vol.22

.indd 134

09/03/11 17:42

IBM
2004

2006
Web

2002IBM
1990
Linux
Web

IBM
2001
AIXHACMP

2003

P.064

P.064

P.064

P.080

P.088

SE

90 SE

BP

P.092

SI


BPMN

P.128


Unified Process
IT

IT Vol.22

.indd 135

135

09/03/11 17:42

Staff

Next
Issue

Vol.

232009525

IT
IT

IDG
113-0033
3-4-5

03-5800-3111

03-5800-2931

03-5800-4851

03-5800-2661

AD

index

2
2-3

11

18

19

20

21

10

URL

.indd 136

136

http://www.itarchitect.jp/adlist/

IT Vol.22

ITVol.22

09/03/11 22:41

You might also like