Professional Documents
Culture Documents
---------------------------------------
inh Quang nh
TM TT LUN VN THC S
H NI - 2013
Phn bin 1:
Phn bin 2: ..
H NI - 2013
M U
S pht trin khng ngng ca khoa hc cng ngh, c bit l tc bng pht
mnh m ca cng ngh thng tin khin cho Internet tr thnh mt cng c hu ch
khng th thiu trong i sng hin nay. Khng ch l kho ti nguyn thng tin khng l,
Internet cn cung cp cho ngi s dng cc cng c khai thc thng tin tin li, nhanh
chng v hin i. Mt trong s cc dch v mng c ngi dng s dng rng ri nht l
dch v th in t (e-Mail). Dch v ny cho php ngi s dng trao i th tn mt cch
d dng, ng thi c th ph bin kin thc, thng tin, thng bo v ni dung chnh sch
ca mt vn trong mt c quan, t chc Vi tnh cht d s dng, khng ph thuc
vo khng gian v thi gian, th in t tr thnh mt phng tin trao i thng tin
quan trng i vi nhiu ngi.
Cng vi s pht trin tin li ca Internet, vic ly cp thng tin, thm nhp bt hp
php, ph hoi thng qua Internet cng gia tng v s lng, loi hnh v k thut. Bn cnh
, cc i tng c t tng chnh tr cc oan, cc th lc phn ng trong nc v
ngoi nc tn dng trit nhng kh nng ca mng Internet phc v cho mc ch
tuyn truyn, pht tn ti liu phn ng v thc hin cc hnh vi phn ng khc nhm
chng ph nh nc cng ha x hi ch ngha Vit Nam. Do , vic u tranh chng cc
hot ng s dng cc dch v trn mng phc v cho mc ch xu tr thnh mt yu
cu cp thit ca lc lng Cng an.
Trc nhu cu thc t , ti nghin cu v mnh dn chn ti: Nghin cu
xy dng h thng lc ni dung th in t gi theo giao thc SMTP lm bo co tt
nghip nhm xut mt gii php kim sot cc ni dung ca th in t gi i trn mng
Internet.
Mc tiu ca ti
Xy dng h thng lc nhm tm ra nhng th c gi trn mng Internet theo giao
thc SMTP c ni dung lin quan n cng tc nghip v Cng an cn quan tm; phc
v yu cu nghip v trong lc lng.
Phm vi nghin cu
Nhng th c ni dung lin quan n cc vn m cng tc Cng an quan tm (ni
dung ca bc th c gi c th hin th di dng text .doc, .docx, .pdf, .txt; th c ni
dung thuc cc a ch cn theo di v nhng cm t m cng tc nghip v cng an phi
quan tm).
Gii hn ni dung nhng th c vit bng ngn ng ting Vit vi cc loi font
ch ting Vit hin c (TCVN3, Unicode, VNI, UTF8). ng thi, th c gi i trn
mng theo giao thc SMTP.
Phng php nghin cu
- Nghin cu mt th c gi theo giao thc SMTP phn tch ra cc phn trong
ni dung bc th nh: a ch gi, nhn, ch bc th, ni dung bc th, cc file vn bn
nh km Trn c s s dng phng php so snh a chui (multi matching) tm
kim cc cm t cn quan tm v i snh xy dng chnh sch lc theo yu cu nghip
v.
- Nghin cu cc font ch ting Vit v phng php chuyn i cc font ch, t
chuyn i cc font ch sang mt font tiu chun ri p dng phng php tm kim v so
snh a chui thc hin vic lc ni dung th.
Ngoi phn li m u, kt lun, ti liu tham kho v ph lc, lun vn c chia
thnh 3 chng:
Chng 1: Tng quan v h thng th in t
Chng ny ni v cu trc ca mt th in t, mt h thng th tn in t v cc
giao thc c s dng cho dch v th in t. Ngoi ra, chng ny cng gii thiu giao
thc SMTP, i su vo nghin cu vic gi mt th in t qua giao thc SMTP, nghin
cu proxy cho SMTP.
Chng 2: Phng php v cc k thut s dng cho h thng lc ni dung th
in t
Chng ny trnh by v nhng thng tin cn kim sot i vi mt th in t;
mt s bin php kim sot ni dung th in t; cc k thut s dng phc v cho vic
lc th: k thut tm kim, so snh chui (multi matching), x l phng ting Vit
(Unicode, TCVN3,VNI, UTF8)
Chng 3: Lc ni dung th in t gi theo giao thc SMTP
Chng ny xut phng php lc v m hnh, xy dng h thng lc ni dung
th in t gi theo giao thc SMTP.
1.1
L thuyt chung v th in t
H thng th in t cho php ngi dng trao i th in t vi nhau. H thng
MUA
gi
System Call
SMTP
( Direct connection )
TCP
MDA
SMTP
Mail
Retreival
Agent
MTA nhn
System Call
POP/IMAP
c/ Vit
MDA
cc b
MUA
Hp th
Vit
System Call
nhn
1.1.2 Cu trc mt th in t
Th in t thng c hai phn chnh: phn u (Header) v phn thn (Body) l vn
bn cha ni dung ca th. Khi gi i, ton b th in t c gi trong ni dung
(content). Ngoi ra, h thng th cn to thm mt phn na c gi l b th (envelope),
phn ny cha cc thng tin cn thit cho vic chuyn th n ni nhn. Hnh 1.2 trnh by
cu trc mt th in t.
B th
Tiu
Phn
thn
Ni dung
1.1.3.1
1.1.3.2
H
thng
SMTP
khch
SMTP khch
SMTP
Commands/Replies
And Mail
SMTP
ch
H
thng
SMTP ch
Client lin quan n th i, Server lin quan n nhn th. H thng th cc b cha
hp th (mailbox) ca mi user. Mailbox c 2 phn: phn cc b v phn ton cc.
Sau khi tho bc th trong khun dng chun, h thng th cc b xc nh tn ngi
nhn hp th cc b hay phi gi ra ngoi. bc th c gi i, Client SMTP phi bit
a ch IP ca ni nhn thng qua DNS v gi qua cng a ch SMTP (25) bt u thit
lp kt ni server SMTP ni nhn. Khi mi ni c thit lp, Client bt u chuyn th
n Server bi cc lnh ca SMTP.
1.2.2 M hnh giao thc SMTP
M hnh SMTP h tr c hai phng php truyn pht th end-to-end (khng c cc
MTA trung gian) v store-and-forward. Phng php end-to-end c s dng gia cc
mng ni b ca cc t chc v phng php store-and-forward c la chn cho cc h
iu hnh gia cc t chc c mng s dng giao thc TCP/IP v SMTP c s.
Mt tin trnh SMTP c bn c th truyn ti th in t ti mt tin trnh khc trn
cng mt mng hoc ti mt mng khc thng qua mt tin trnh truyn tip hoc qua cng
ni c th ti c c hai mng. Mt m hnh n gin cc thnh phn ca h thng SMTP
c trnh by trong hnh 1.5:
User
(ngi nhn)
User
( ngi gi)
UA
(User Agent)
UA
(User Agent)
Queue of mail
Hng i th
User
Mailboxes
MTA client
(Mail Transfer Agent)
SMTP Commands,
Replies and Mail
Kt ni giao thc
TCP, cng 25
MTA Server
(Mail Transfer Agent)
My ch gi
User
(nginhn)
User
(nginhn)
)
UA
(User Agent)
UA
(UserAgent)
User
Mailboxes
Hp th nhn
Queue of mail
i th gi
Local
MTA
Local
MTA
Relay MTA
Local
MTA
Local
MTA
SMTP Commands,
Replies and Mail
Relay MTA
Queue of
mail
Hng i th
Local
MTA
Local
MTA
Queue of
mail
Hng i th
H thng cc MTA relay cho php nhng ni khng s dng b giao thc TCP/IP
gi th in t ti nhng ngi dng nhng ni khc c th hoc khng th s dng b
giao thc TCP/IP.
Cc dch v Proxy khng cho php kt ni trc tip, chng buc tt c cc gi tin trn
mng phi c kim tra v lc theo quy tc thch hp. Thay v trao i thng tin trc tip
vi dch v thc s, ngi dng phi trao i thng tin vi Proxy server.
10
Hnh 1.8: M hnh trao i thng tin gia Client v Server qua Proxy
Server
Client
POP,IMAP
,
SMTP
Client
thc
SMTP
POP,IMAP
,
Spool
SMTP
Server
thc
SMTP
11
2.1
.doc,
docx,txt,pdf.).
2.2
2.2.1 Lc th rc
Th rc (spam) l th in t, tin nhn c gi n ngi nhn m ngi nhn
khng mong mun hoc khng c trch nhim phi tip nhn theo quy nh ca php lut.
Vn th rc l vn gy nhc nhi trong x hi trong nhng nm gn y. Nhiu
cng trnh nghin cu v phng php lc th rc c u t v tin hnh t kh lu.
nh gi hiu qu ca mt cng c lc th rc ngi ta thng da trn hai o sau:
False Positive T l th thng b lc nhm thnh th rc.
False Negative T l th rc b lc nhm thnh th thng.
Tt c nhng cng c lc c gi tr ngy nay thng s dng mt trong s nhng
phng php hoc kt hp ca cc phng php sau:
Phng php lc Bayesian
Phng php lc SpamAssassin
Phng php dng danh sch trng/en
Phng php lc th rc dng chui hi p (Challenge/Response filters)
Phng php lc da vo v tr ca cc b lc (Filter Placement)
Phng php lc da trn xc nhn danh tnh ca ngi gi
12
13
2.3.2.3
Thut ton Boyer - Moore l thut ton tm kim chui rt c hiu qu trong thc
tin, hot ng theo quy tc kim tra cc k t ca mu t phi sang tri v khi pht hin s
khc nhau u tin, thut ton s tin hnh dch chuyn.
14
15
Chng 3- LC NI DUNG CHO TH IN T GI THEO GIAO THC SMTP
3.1
Mail
Server
FIRE WALL
SMTP PROXY
Server
Client
Client
Server
M un lc
Lu gi th
vi phm
M un h thng lc ni
dung th
Chnh sch
lc
CSDL
M un Qun Tr b lc
16
3.1.1 M un Firewall
M un ny thc hin chc nng nh hng li gi tin ca th in t c gi t
my Client ti mt SMTP server theo cng 25. Cc gi tin c a ch cng ch TCP l 25
s c nh hng li i ti m un SMTP proxy server vi mt cng ch c xc
nh.
17
3.1.4 M un lc
y l phn quan trng nht ca h thng lc ni dung th in t. Khi ni dung bc
th c m un x l font ting Vit thc hin, m un lc s s dng cc k thut tm
kim v so snh chui (multi matching) thc hin i snh cc phn a ch gi i, a
ch nhn th, v cc t kha ca phn tiu bc th v trong ni dung vi cc t v cm
t kha cm c trong c s d liu ca b lc (t in lc) kim tra nhng ni dung
c trng khp hay khng? Nu mt trong cc thng tin i snh trng nhau th b lc s
thc hin vic ghi li nht k v thc hin chnh sch lc.
3.1.5 M un Qun tr
Cho php ngi qun tr h thng iu khin b lc, thit lp cc chnh sch lc v
cc cng c lc nh: thit lp cc a ch cn lc, cc t kha cn lc, kt xut nht k vi
phm, lu gi th vi phm v.v...
3.2
Lu hot ng ca h thng lc th in t
18
Proxy SMTP
khi to
E-mail Client
Gi th
Proxy SMTP
Server
X l font ting
Vit
Kim tra th
vi phm ?
Vi phm
Khng vi phm
Proxy Client
SMTP SERVER
Hnh 3.2: Lu tip nhn v gi th in t khng vi phm
3.2.2 Lu x l th vi phm
Hnh 3.3 th hin cc bc x l i vi mt th vi phm chnh sch an ninh an ton
thng tin.
19
CSDL
Cn i snh
Proxy SMTP
server
B nh
Th gi i
Kim tra th
vi phm?
?
D liu cn i snh
Khng vi phm
Ghi nht k
Thay i ni dung
th
Vi phm
Client proxy
SMTP SERVER
Hnh 3.3: Lu x l th vi phm
20
21
KT LUN
Lc ni dung th in t theo giao thc SMTP nhm mc ch kim sot ni dung
th in t c gi trn Internet l mt hng mi trong vic m bo an ninh an ton
thng tin. Hng tip cn ti c bit hiu qu trong vic thu thp thng tin, theo di i
tng phc v cho cc hot ng nghip v ca lc lng Cng an. cng nh gp phn
khng nh trong vic hn ch vic pht tn ti liu, tuyn truyn, kch ng v ni xu
ng, lnh o, v Nh nc. ti cng p ng c mt phn yu cu thc tin t ra
v vn kim sot ni dung thng tin trn Internet.
Kha lun h thng ha c mt s vn l thuyt v h thng th in t, gii
thiu mt s bin php kim sot ni dung th in t, mt s phng php c ng dng
ph bin. Tuy nhin vi mc tiu ti t ra th vic s dng phng php lc t kha s
dng k thut multi matching thu c cc kt qu thc s kh quan v p ng c
nhu cu rt cp thit hin nay i vi mt s vn cn quan tm.
Tuy nhin, do thi gian nghin cu c hn chng trnh lc mi ch thc hin c
cc chc nng lc c bn da theo cu trc ca mt bc th in t nh lc theo cc trng
a ch ngi nhn, ngi gi, tiu th v ni dung th theo t kha ting vit.
Vi cc kt qu t c, ti ang tin hnh th nghim ti phng th nghim
An ninh an ton thng tin, Cc Tin hc nghip v, B Cng an. Nu c iu kin, thi gian
v c s cho php, tc gi mong mun c nghin cu pht trin hon thin cc chc
nng lc ca chng trnh nh lc ni dung cc tp nh km vi cc nh dng khc nhau
Trong thi gian ti s tin hnh th nghim tch hp vi cc m un lc khc nh:
lc th in t s dng giao thc khc nh: giao thc POP, IMAP; lc ni dung trang web,
kim tra l hng an ninh trang web a h thng vo hot ng trong thc t.