JOURNAL OF SCIENCE, Hue University, No. 58, 2010

AN IMPROVED METHOD FOR INFORMATION HIDING IN TAGGED DOCUMENTS BASED ON STEGANOGRAPHY
Nguyen Thi Huong Giang
College of Pedagogy, Hue University
Nguyen Xuan Linh
Thua Thien Hue Center of Information Technology

ABSTRACT
Information hiding is the technique of embedding secret information in cover data so that the secret information becomes invisible. Today, tagged documents such as HTML, XML, XHTML and WML are known as the standard formats for storing structured data and for presenting data in web browsers. They are the basic languages for exchanging information on the Internet. In contrast to information hiding in image or audio data, only a few methods currently exist for hiding information in text data, and in tagged documents in particular. Moreover, one limitation of these methods is that the hidden information is easily discovered if the attacker knows the method used to hide it (the stego-key). In this paper, we propose an improved method that strengthens the security of the traditional methods by using the concept of a dynamic key (dynamic stego-key) to hide information in tagged documents.

1. Introduction
Steganography, or hiding data within data, takes its name from the Greek words stegos, meaning covered or hidden, and graphia, meaning writing. It is both the art and the science of hiding information inside other information. With the rapid development of data technologies on the Internet, the volume of information sent and received electronically over networks is growing continuously. Because the transmission of information over networks needs to be secured, the importance of information hiding is being recognized more widely.
By using cover data for hiding, the secret information inside it can be transmitted safely over the network without arousing suspicion or appearing abnormal, and the secret information can be extracted later when needed [3], [4], [5]. Figure 1 below illustrates the general information hiding scheme [4].
Different types of data call for different information hiding methods. For example, to hide secret information in image data, we can use the insignificant bits of the pixels of the image. After the information is hidden, the change in the shape and color of the image is hard to notice with the naked eye. For text data, changing the position of punctuation marks or the spacing between words is one of the simple ways to add information without changing the content of the original text. Similarly, tagged documents can be used to hide information by changing the structure of the document without affecting how its content is displayed in a Web browser.
[Figure 1: block diagram. Blocks: cover data, secret message, secret key, information hiding algorithm, transmission over the network, information extraction algorithm, secret key, secret message.]

Figure 1. General information hiding scheme [4]

2. Information hiding methods for tagged documents

Although there are many methods for hiding information in tagged documents, in this paper we present only two, Inserting whitespace in tags and Changing the order of attributes in tags, because these two methods let us use every tag in the document to hide information (the amount of information that can be hidden is the largest compared with other methods). We also propose an improvement that increases security by using a dynamic key instead of the static key of the traditional methods.
2.1. Traditional methods
a. Inserting whitespace in tags
Inserting whitespace in tags is one of the most effective methods for hiding information in tagged documents, because we can exploit every tag in the document. According to the W3C, a tag may contain any number of whitespace characters, or none, before its closing bracket. By adding or removing this whitespace, we can embed data while keeping the meaning of the original tag content intact [4]. For instance, we can define a rule as in the following example:
Example 1: XML document [4]
Static key:
<tag>, </tag>, or <tag attribute=value>: no space before the closing bracket of the tag represents bit 0
<tag >, </tag >, or <tag attribute=value >: one space before the closing bracket of the tag represents bit 1
Cover data before embedding:
<user><name>Peter</name><id>01</id></user>
<user><name>Mary</name><id>02</id></user>
Cover data after embedding:
<user ><name>Peter</name ><id >01</id></user>
<user><name >Mary</name><id>02</id ></user >
Clearly, the cover data before and after embedding is displayed identically in a Web browser.
If (there is a space before the closing bracket of the tag) {
Bit 1 is encoded in the tag
}
Else {
Bit 0 is encoded in the tag
}
In the example above, the embedded data is: 101100 010011
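The static-key reading rule of Example 1 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name extract_bits_static and the regular expression are ours, not part of the method in [4], and the markup is assumed to contain only element tags whose attribute values hold no '<' or '>' characters (comments, processing instructions and self-closing tags are not handled).

```python
import re

# One match per tag; group 1 captures any whitespace directly before '>'.
# Assumption: no '<' or '>' appears inside attribute values.
TAG_RE = re.compile(r"<[^<>]*?(\s*)>")

def extract_bits_static(markup):
    """Static key: a space before '>' reads as bit 1, no space as bit 0."""
    return "".join("1" if m.group(1) else "0" for m in TAG_RE.finditer(markup))

# Stego data of Example 1, both lines joined.
stego = (
    "<user ><name>Peter</name ><id >01</id></user>"
    "<user><name >Mary</name><id>02</id ></user >"
)
print(extract_bits_static(stego))
```

Nothing secret is needed to run this extraction, which is the weakness of the static key discussed in this section.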
Example 2: HTML document [4]
Hiding rule using a static key (static stego-key):
<tag>, </tag>, or <tag attribute=value>: no space before the closing bracket of the tag represents bit 0
<tag >, </tag >, or <tag attribute=value >: one space before the closing bracket of the tag represents bit 1
Suppose we want to embed the character A (A = 01000001).
Cover data after hiding (stego data); the bits encoded on each line are shown at its right:
<html xmlns="http://www.w3.org/1999/xhtml"> ... 0
<head ><title>Microsoft Corporation</title> ... 100
<meta http-equiv= "X-UA-Compatible"> ... 0
<meta http-equiv="Content-Type" charset="UTF-8"> ... 0
</meta> ... 0
</html > ... 1
In the example above, the embedded data is: 01000001 (A)
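The embedding direction of the static-key rule can be sketched in the same way. The helper embed_bits_static and the short cover string below are hypothetical and serve only as an illustration; the sketch assumes element tags only and attribute values without '<' or '>'.

```python
import re

# Group 1 is the tag body without '<', '>' and without whitespace before '>'.
TAG_RE = re.compile(r"<([^<>]*?)\s*>")

def embed_bits_static(markup, bits):
    """Static key: write ' >' for bit 1 and '>' for bit 0, one bit per tag."""
    if len(TAG_RE.findall(markup)) < len(bits):
        raise ValueError("cover data has too few tags for the message")
    pending = iter(bits)

    def rewrite(match):
        bit = next(pending, None)
        if bit is None:               # message exhausted: keep the tag as is
            return match.group(0)
        return "<" + match.group(1) + (" >" if bit == "1" else ">")

    return TAG_RE.sub(rewrite, markup)

# Hypothetical cover data with eight tags, enough for one character.
cover = "<html><head><title>Microsoft Corporation</title></head><body></body></html>"
message_bits = format(ord("A"), "08b")    # 'A' -> 01000001
print(embed_bits_static(cover, message_bits))
```

Tags that come after the end of the message are left untouched, so a receiver has to know the message length or use an agreed terminator; the text does not specify this point.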
The greatest limitation of this traditional method is its use of a static key: a given tag form, <tag> or </tag>, always represents the same value (bit 0 or bit 1) in the resulting data that carries the secret information (stego data). Consequently, an attacker who knows the hiding method can recover the hidden information.
b. Changing the order of attributes in tags
Changing the order in which attributes appear in a tag is also regarded as an effective method for hiding secret messages in tagged documents. We cannot insert arbitrary additional data into a tagged document, because it would either be displayed in the browser or be visible in the source code. Secret messages can nevertheless be hidden in XML, HTML, XHTML and similar documents by changing the order in which attributes appear inside a tag, without changing how the tag's content is displayed in the browser and without increasing the size of the document [4], [6].
In this method, the order of appearance of the attributes is agreed upon as a key (stego-key) for hiding and extracting information. This convention is fixed and applies to the entire document, so it is called a static key (static stego-key). We can define several attribute pairs to increase the amount of information that can be hidden. The following example describes the method in more detail.
Example 3: XML fragment [4]
Key definition through the order of an attribute pair:
<event month=MONTH date=DATE>EVENT</event> represents bit 0 (the month attribute precedes the date attribute)
<event date=DATE month=MONTH>EVENT</event> represents bit 1 (the month attribute follows the date attribute)
The data after hiding has the form:
<event month="JUL" date="4">Independence day</event>
<event date="24" month="DEC">Christmas</event>
The message bit string hidden in the data above is: 01
Example 4: HTML fragment [4]
<span class="normal Text" style="color:#012388ff">
What is the difference?
</span>
<span style="color:#012388ff" class="normal Text">
What is the difference?
</span>
Based on the examples above, we can define the order of the attribute pairs as follows:
Table 1. Static key for the Changing the order of attributes in tags method

First attribute     Second attribute
Month               Date
Class               Style
Cellpadding         Cellspacing
Align               Valign
Width               Height

If (the first attribute precedes the second attribute) {
Bit "0" is encoded
}
Else {
Bit "1" is encoded
}
The hidden data is: 01
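A minimal Python sketch of reading bits from attribute order with the static key of Table 1 follows. The names PAIRS and extract_bits_by_order and the two regular expressions are illustrative assumptions, and the sketch assumes that attribute values contain no '<' or '>' characters.

```python
import re

# Attribute pairs of Table 1, lower-cased: (first attribute, second attribute).
PAIRS = [("month", "date"), ("class", "style"), ("cellpadding", "cellspacing"),
         ("align", "valign"), ("width", "height")]

# Opening tags only: group 1 is the tag name, group 2 the attribute text.
OPEN_TAG_RE = re.compile(r"<([A-Za-z][\w:-]*)([^<>]*)>")
# One attribute: name, '=', then a quoted or unquoted value.
ATTR_RE = re.compile(r"""([\w:-]+)\s*=\s*("[^"]*"|'[^']*'|[^\s"'>]+)""")

def extract_bits_by_order(markup, pairs=PAIRS):
    """Static key: first attribute before second reads as 0, otherwise 1."""
    bits = []
    for tag in OPEN_TAG_RE.finditer(markup):
        names = [a.group(1).lower() for a in ATTR_RE.finditer(tag.group(2))]
        for first, second in pairs:
            if first in names and second in names:
                bits.append("0" if names.index(first) < names.index(second) else "1")
    return "".join(bits)

# Stego data of Example 3.
stego = ('<event month="JUL" date="4">Independence day</event>'
         '<event date="24" month="DEC">Christmas</event>')
print(extract_bits_by_order(stego))
```

Tags that carry none of the agreed pairs contribute no bits, so the capacity depends on how many tags in the cover data have both attributes of a pair.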
2.2. Improved methods
In the improved methods, we embed the information with a dynamic key instead of a static key (which represents bit 0 and bit 1 with fixed symbols), in order to make the transmitted secret information safer. The dynamic key is the result of an XOR operation that combines the traditional static key with a secret auxiliary key, where the auxiliary key is agreed upon by the sender and the receiver. With a dynamic key, bit 0 and bit 1 are represented by symbols that are not fixed but depend on the secret password string and on the name of each tag.
a. Whitespace in tags
We can define a dynamic key based on the combination of a predefined key and the tag name of each tag. The following table describes the dynamic key for the Whitespace in tags method.
Table 2. Dynamic key for the Whitespace in tags method

Dynamic stego-key                                        If ((N mod 2) = 1)    If ((N mod 2) = 0)
<tag-name attribute=value> or </tag-name>                Represents bit 1      Represents bit 0
<tag-name attribute=value > or </tag-name >              Represents bit 0      Represents bit 1


Step 1: Obtain a secret key from the user
Step 2: For each tag in the tagged document
Step 2.1: Extract the name of the tag, for example html, title, meta, head, etc.
Step 2.2: Split the secret key string and the tag name into individual characters.
Example: the secret key pswd is split into
p: 01110000 (the corresponding ASCII code), s: 01110011, w: 01110111, d: 01100100
Tag name: title
t: 01110100, i: 01101001, t: 01110100, l: 01101100, e: 01100101
Step 2.3: XOR the secret key with the tag name
title
XOR
p s w d p (the characters of the secret key are repeated when the key is shorter than the tag name)
This corresponds to:
01110100 01101001 01110100 01101100 01100101
XOR
01110000 01110011 01110111 01100100 01110000
--------------------------------------------
00000100 00011010 00000011 00001000 00010101
Let N be the number of 1 bits in the bit string produced by the XOR operation; in this case, N = 10.
Step 2.4: Embed the secret information
If (there is a space before the closing bracket of the tag)
{
If ((N mod 2) = 1)
Bit "0" is encoded in the tag
Else
Bit "1" is encoded in the tag
}
Else
{
If ((N mod 2) = 1)
Bit "1" is encoded in the tag
Else
Bit "0" is encoded in the tag
}
Step 3: End of procedure
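Steps 2.2 to 2.4 can be sketched in Python as follows. The function names count_one_bits and bit_for_tag are ours, and the sketch assumes that both the key and the tag name are plain ASCII strings, as in the worked example.

```python
from itertools import cycle

def count_one_bits(tag_name, secret_key):
    """N: number of 1 bits in (tag name XOR secret key), character by character.

    The key is repeated when it is shorter than the tag name and cut off
    when it is longer.
    """
    if not secret_key:
        raise ValueError("the secret key must not be empty")
    key_bytes = cycle(secret_key.encode("ascii"))
    return sum(bin(t ^ k).count("1")
               for t, k in zip(tag_name.encode("ascii"), key_bytes))

def bit_for_tag(tag_name, secret_key, has_space):
    """Step 2.4: the bit carried by a tag, given whether a space precedes '>'."""
    n_is_odd = count_one_bits(tag_name, secret_key) % 2 == 1
    # Space with odd N, or no space with even N, means 0; otherwise 1.
    return "0" if has_space == n_is_odd else "1"

print(count_one_bits("title", "pswd"))              # N for the worked example
print(bit_for_tag("title", "pswd", has_space=False))
```

Only the parity of N enters the rule, so two keys that give the same parity for every tag name in a document would decode it identically.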
Example 5: HTML fragment with a dynamic key [4]
Data used for hiding (cover data):
<html xmlns="http://www.w3.org/1999/xhtml">
<head><title>Microsoft Corporation</title>
<meta http-equiv= "X-UA-Compatible">
<meta http-equiv="Content-Type" charset="UTF-8">
</meta>
</html>
Suppose we need to hide the character A in the data above for transmission over the network (A = 01000001, ASCII code).
Predefined password: pswd
Tag names: html, head, title, /title, meta, /meta, /html
html XOR pswd gives:
01101000 01110100 01101101 01101100
XOR
01110000 01110011 01110111 01100100
-----------------------------------
00011000 00000111 00011010 00001000 (nine 1 bits in the result)
Therefore N = 9.
For the remaining tag names we have:
head XOR pswd gives N = 8
title XOR pswdp gives N = 10
/title XOR pswdps gives N = 20
meta XOR pswd gives N = 11
/meta XOR pswdp gives N = 15
/html XOR pswdp gives N = 17
The data after hiding is (the bits encoded on each line are shown at its right):
<html xmlns="http://www.w3.org/1999/xhtml" > ... 0
<head ><title>Microsoft Corporation</title> ... 100
<meta http-equiv= "X-UA-Compatible" > ... 0
<meta http-equiv="Content-Type" charset="UTF-8" > ... 0
</meta > ... 0
</html> ... 1
Hidden bit string: 01000001
In Example 5, the data after hiding the character A (01000001) differs from that of Example 2 (the traditional method). The stego data in Example 5 also changes whenever the password changes, because a space in a tag represents bit 0 or bit 1 (not a fixed value) depending on the tag name and the password. At the receiving end, the password is required to extract the secret information.
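The whole dynamic-key procedure for whitespace in tags, embedding and extraction, can be sketched as below. This is a sketch under assumptions rather than the authors' implementation: the function names, the regular expression and the convention of passing the message length to the receiver are ours, the cover string is a tidied copy of the data of Example 5, and the markup is assumed to contain only element tags with ASCII names.

```python
import re
from itertools import cycle

# Group 1: tag name (with a leading '/' for closing tags); group 2: attributes;
# group 3: whitespace directly before '>'. Assumes no '<' or '>' inside values.
TAG_RE = re.compile(r"<(/?[A-Za-z][\w:-]*)([^<>]*?)(\s*)>")

def ones(tag_name, key):
    """N: count of 1 bits in tag_name XOR key, with the key repeated as needed."""
    pairs = zip(tag_name.encode("ascii"), cycle(key.encode("ascii")))
    return sum(bin(t ^ k).count("1") for t, k in pairs)

def embed_dynamic(markup, bits, key):
    pending = iter(bits)

    def rewrite(m):
        bit = next(pending, None)
        if bit is None:                  # message exhausted: keep the tag as is
            return m.group(0)
        n_odd = ones(m.group(1), key) % 2 == 1
        # Table 2: without a space, odd N means 1 and even N means 0;
        # a space before '>' flips the meaning.
        need_space = (bit == "1") != n_odd
        return "<" + m.group(1) + m.group(2) + (" >" if need_space else ">")

    return TAG_RE.sub(rewrite, markup)

def extract_dynamic(markup, key, nbits):
    bits = []
    for m in TAG_RE.finditer(markup):
        n_odd = ones(m.group(1), key) % 2 == 1
        has_space = bool(m.group(3))
        bits.append("0" if has_space == n_odd else "1")
    return "".join(bits)[:nbits]

cover = ('<html xmlns="http://www.w3.org/1999/xhtml">\n'
         '<head><title>Microsoft Corporation</title>\n'
         '<meta http-equiv="X-UA-Compatible">\n'
         '<meta http-equiv="Content-Type" charset="UTF-8">\n'
         '</meta>\n'
         '</html>')
stego = embed_dynamic(cover, "01000001", "pswd")
print(stego)
print(extract_dynamic(stego, "pswd", 8))
```

Extraction with a different password generally yields a different bit string, because the parity of N changes for some tag names; the hiding rule itself can therefore be public as long as the password is not.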
b. Changing the order of attributes in tags
For this method, we likewise propose using a dynamic key in place of the traditional static key, in the same way as for the method described in the previous subsection. The table below describes the dynamic key for the Changing the order of attributes in tags method.
Table 3. Dynamic key for the Changing the order of attributes in tags method

First attribute    Second attribute    If ((N mod 2) = 1)    If ((N mod 2) = 0)
Cellpadding        Cellspacing         Represents bit 1      Represents bit 0
Cellspacing        Cellpadding         Represents bit 0      Represents bit 1
Align              Valign              Represents bit 1      Represents bit 0
Valign             Align               Represents bit 0      Represents bit 1
Month              Date                Represents bit 1      Represents bit 0
Date               Month               Represents bit 0      Represents bit 1

Let N be the number of 1 bits in the result of XORing the password with the tag name. We then have:
If (the first attribute precedes the second attribute)
{
If ((N mod 2) = 1)
Bit 1 is encoded by the tag
Else
Bit 0 is encoded by the tag
}
Else {
If ((N mod 2) = 1)
Bit 0 is encoded by the tag
Else
Bit 1 is encoded by the tag
}
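The decoding rule above can be sketched in Python as follows. The names PAIRS, ones and extract_by_order are illustrative assumptions; the pairs are those of Table 3 in lower case, and attribute values are assumed to contain no '<' or '>' characters.

```python
import re
from itertools import cycle

# Attribute pairs of Table 3, lower-cased: (first attribute, second attribute).
PAIRS = [("cellpadding", "cellspacing"), ("align", "valign"), ("month", "date")]

OPEN_TAG_RE = re.compile(r"<([A-Za-z][\w:-]*)([^<>]*)>")
ATTR_RE = re.compile(r"""([\w:-]+)\s*=\s*("[^"]*"|'[^']*'|[^\s"'>]+)""")

def ones(tag_name, key):
    """N: count of 1 bits in tag_name XOR key, with the key repeated as needed."""
    pairs = zip(tag_name.encode("ascii"), cycle(key.encode("ascii")))
    return sum(bin(t ^ k).count("1") for t, k in pairs)

def extract_by_order(markup, key, pairs=PAIRS):
    bits = []
    for tag in OPEN_TAG_RE.finditer(markup):
        names = [a.group(1).lower() for a in ATTR_RE.finditer(tag.group(2))]
        n_odd = ones(tag.group(1), key) % 2 == 1
        for first, second in pairs:
            if first in names and second in names:
                first_before = names.index(first) < names.index(second)
                # First-before-second with odd N, or the reverse order with
                # even N, encodes 1; the other two cases encode 0.
                bits.append("1" if first_before == n_odd else "0")
    return "".join(bits)

# Hypothetical stego fragment; with the key "pass", N is 3 for td and 4 for tr.
stego = ('<td align="left" valign="bottom">'
         '<tr align="center" valign="middle">'
         '<td valign="middle" align="center">')
print(extract_by_order(stego, "pass"))
```

Closing tags carry no attributes, so only opening tags that contain both attributes of an agreed pair contribute bits here.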
Example 6: HTML cover data with the dynamic-key method
Suppose we need to embed the character B = 01000010.
Secret key (stego-key): pass
Cover data:
<table border="0" cellpadding="0" cellspacing="0">
<tr>
<td align="left" valign="bottom">
<a href="http://www.microsoft.com">
<img src="http://www.microsoft.com/ssdis100.gif"
align=right valign=top
border="0" width="100" height="100" alt="click"></a></td>
<td align="top" valign="left" >
<font color="#ffffff"></font></td></tr>
<tr align="center" valign="middle">
<td align="center" valign="middle" nowrap="nowrap">
<td align="center" valign="middle" nowrap="nowrap">
<td align="center" valign="middle" nowrap="nowrap">
<a href="http://www.storagesearch.com/ssd.html">ssd news</a></td>
<td></td>
</tr>
</table>
Tag names: table, tr, td, img
table XOR passp gives:
01110100 01100001 01100010 01101100 01100101
XOR
01110000 01100001 01110011 01110011 01110000
--------------------------------------------
00000100 00000000 00010001 00011111 00010101 (eleven 1 bits in the result)
Therefore N = 11.
For the remaining tag names we have:
td XOR pa gives N = 3
img XOR pas gives N = 7
tr XOR pa gives N = 4
Secret information to hide: B = 01000010
Data after hiding (the bit encoded on each line is shown at its right; each attribute moves together with its value):
<table border="0" cellspacing="0" cellpadding="0"> ... 0
<tr>
<td align="left" valign="bottom"> ... 1
<a href="http://www.microsoft.com">
<img src="http://www.microsoft.com/ssdis100.gif"
valign=top align=right ... 0
border="0" height="100" width="100" alt="click"></a></td> ... 0
<td valign="left" align="top" > ... 0
<font color="#ffffff"></font></td></tr>
<tr align="center" valign="middle"> ... 0
<td align="center" valign="middle" nowrap="nowrap"> ... 1
<td valign="middle" align="center" nowrap="nowrap"> ... 0
<td valign="middle" align="center" nowrap="nowrap">
<a href="http://www.storagesearch.com/ssd.html">ssd news</a></td>
<td></td>
</tr>
</table>
Data hidden with the dynamic key: 01000010
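Embedding by attribute order with a dynamic key can be sketched as follows. The names embed_by_order, PAIRS and ones, the choice to consume bits in the order of the PAIRS list within a tag, and the short cover string are all assumptions made for illustration. Each attribute is moved together with its value so that the rendering of the tag is unchanged.

```python
import re
from itertools import cycle

# Attribute pairs of Table 3, lower-cased: (first attribute, second attribute).
PAIRS = [("cellpadding", "cellspacing"), ("align", "valign"), ("month", "date")]

OPEN_TAG_RE = re.compile(r"<([A-Za-z][\w:-]*)([^<>]*)>")
ATTR_RE = re.compile(r"""([\w:-]+)\s*=\s*("[^"]*"|'[^']*'|[^\s"'>]+)""")

def ones(tag_name, key):
    """N: count of 1 bits in tag_name XOR key, with the key repeated as needed."""
    pairs = zip(tag_name.encode("ascii"), cycle(key.encode("ascii")))
    return sum(bin(t ^ k).count("1") for t, k in pairs)

def embed_by_order(markup, bits, key, pairs=PAIRS):
    pending = iter(bits)

    def rewrite(tag):
        name, attrs = tag.group(1), tag.group(2)
        n_odd = ones(name, key) % 2 == 1
        for first, second in pairs:
            # Re-scan after every swap, since a swap shifts positions.
            found = {a.group(1).lower(): a for a in ATTR_RE.finditer(attrs)}
            if first not in found or second not in found:
                continue
            bit = next(pending, None)
            if bit is None:              # message exhausted
                break
            # Odd N: first-before-second encodes 1; even N: it encodes 0.
            want_first_before = (bit == "1") == n_odd
            a, b = sorted((found[first], found[second]), key=lambda m: m.start())
            if (a is found[first]) != want_first_before:
                attrs = (attrs[:a.start()] + b.group(0) + attrs[a.end():b.start()]
                         + a.group(0) + attrs[b.end():])
        return "<" + name + attrs + ">"

    return OPEN_TAG_RE.sub(rewrite, markup)

# Hypothetical cover fragment with three usable tags: table, td and img.
cover = ('<table border="0" cellpadding="0" cellspacing="0"><tr>'
         '<td align="left" valign="bottom">'
         '<img src="a.gif" align="right" valign="top"></td></tr></table>')
print(embed_by_order(cover, "010", "pass"))
```

Because whole name=value strings are exchanged, the stego document has the same length as the cover document and the same set of attributes on every tag.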
3. Conclusion
Hiding information in tagged documents is a rather complex technique compared with other types of cover data, because the structure of a tagged document makes it extremely hard to add or remove data without affecting the meaning and the display of the document in the browser. However, by exploiting the properties of tagged documents, we can change the structure of a document to hide data in several ways. In this paper, we proposed the dynamic-key method to strengthen the security of the traditional methods. As a result, it is hard for an attacker to detect and steal the hidden information.
In the future, these methods can be combined with encryption techniques such as DES, 3DES and RSA to provide double security for information hiding [2], [8], [9], [10]. An attacker may discover the key or may know the hiding technique, but it is hard to obtain both at the same time in order to decode the hidden secret information. This makes the attacker's task much harder and increases the security of information exchange.

REFERENCES

1. W. Bender et al. Techniques for Data Hiding, IBM Systems Journal, 1996.
2. Neil F. Johnson, Sushil Jajodia. Steganalysis: The Detection of Hidden Information, Proceedings of the IEEE Information Technology Conference, Sep. 1998.
3. R. Anderson, F. Petitcolas. On the Limits of Steganography, IEEE Journal on Selected Areas in Communications, Vol. 16, No. 4, May 1998.
4. Shingo Inoue, Kyoko Makino, Ichiro Murase, Osamu Takizawa, Tsutomu Matsumoto, Hiroshi Nakagawa. A Proposal on Information Hiding Method using XML, Mitsubishi Research Institute, Inc., Yokohama National University, 2001.
5. Donovan Artz. Digital Steganography: Hiding Data within Data, IEEE, 2001.
6. M. M. Amin, M. Salleh, S. Ibrahim, M. R. Katmin and M. Z. I. Shamsuddin. Information Hiding using Steganography, National Conference on Telecommunication Technology Proceedings, Shah Alam, Malaysia, IEEE, 2003.
7. Mohamed Lahcen BenSaad, Sun XingMing. Techniques with Statistics for Web Page Watermarking, 2005.
8. Aasma Ghani Memon, Sumbul Khawaja, Asadullah Shah. Steganography: A New Horizon for Safe Communication through XML, 2005.
9. Marc Smeets, Matthijs Koot. Covert Channels - Research Report, 2006.
10. Shirali-Shahreza. Text Steganography by Changing Words Spelling, 10th International Conference on Advanced Communication Technology (ICACT 2008), Vol. 3, 2008.

IMPROVEMENT METHOD FOR INFORMATION HIDING IN TAGGED DOCUMENT BASED ON STEGANOGRAPHY APPROACH
Nguyen Thi Huong Giang
College of Pedagogy, Hue University
Nguyen Xuan Linh
Thua Thien Hue Center of Information Technology

SUMMARY
Information hiding is a technique for embedding secret information into cover data in a way that keeps the secret information invisible. Nowadays, tagged documents such as HTML, XML, XHTML and WML are known as the universal formats for preserving structured documents and data as well as presenting data in web browsers. They are used as basic languages for exchanging information on the Web. Compared with the information hiding methods intended for images and sound, there are few methods for hiding information in text, especially in tagged documents. Furthermore, one limitation of the traditional methods is that they are easy to break if the attacker knows the method and its static stego-key. In this paper, we propose improved methods that raise the security level of the traditional methods by using the dynamic stego-key concept to hide information in tagged documents.
