FB 210606222711
Department of Computer Science and Communication Engineering
Providence University
Master Thesis
Graduate Student: Shuo-Fang Hsu
Advisor: Chih-Hung Chang, Ph.D.
July 2019
A Thesis
Submitted to Department of Computer Science and Communication Engineering
College of Computing and Informatics
Providence University
in Partial Fulfillment of the Requirements
for the Degree of Master
in
Computer Science and Communication Engineering
July 2019
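An Approach of Real-Time Object Detection using YOLO Model
Student: Shuo-Fang Hsu    Advisor: Chih-Hung Chang
Master's Program, Department of Computer Science and Communication Engineering, Providence University
Abstract
Keywords: real-time object recognition; YOLO; OpenCV; Darknet; LabelImg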
An Approach of Real-Time Object Detection using YOLO Model
ABSTRACT
Acknowledgements
ҁፕЎૈֹԋाགᖴ߃ఆ҉۸ԴৣᔅךቪӳԾک௲ךय़၂מѯǴᡣךё
аࡐᄪ۩ޑډ೭ঁسǴགᖴᓉەεᏢޑ௲Ǵ๏ϒךᐒඤډঁཥᕉნᡣ
ԾρׯᡂǶᗨฅԖਔӢࣁԾݩރޑيǴࡐӭԛགྷाܫకǴགᖴఆ҉۸Դৣុո
Κό໔ᘐޑႴᓰךǴᡣךԖᝩុֹԋ೭ঁᏢޑΚǶ
གᖴߋࠄमᙴৣᔅך໒ҥ҉ΦԖਏख़εੰьූکምЋнǴᡣૈך๓ҔᏢ
ਠ๏ϒਸғޑၗྍǴԶճǶ
གᖴᏢਠӚೀ࠻ჹޑךхǴᗋԖၗྍ௲࠻ߏයڐޑշǶགᖴֻדԴৣᜫ
ཀԏ੮ࡰᏤךǶགᖴᏢຼډၗྍ௲࠻ڐշךፕЎǴᗋٰפঁᏢڐۂշך
ޑჴᡍǴᗨฅךᗋࢂϙሶόǴՠࢂाགᖴֻדԴৣࡰᏤךঅׯፕЎᗋԖα
၂ޑቹТǴၸΑ೭ሶӭ॥॥ߘߘǴԾيགૈډڙѳӼޑ൩ӳǶനࡕགᖴ
ޑךР҆ᡣךՊ१คલߏޑεډӧֹԋፕЎǶ
Table of Contents
Abstract (Chinese)
ABSTRACT
Acknowledgements
Table of Contents
List of Figures
Chapter 1 Introduction
Chapter 2 Literature Review
2-1 YOLO
2-2 OpenCV
2-3 Darknet
2-4 LabelImg
Chapter 3 Experimental Method
3-1 Research Design
3-2 Experimental Steps
Chapter 4 Experimental Results
4-1 Recognizing Two Kinds of Fruit: Tangerine and Grapefruit
4-2 Recognizing Two Kinds of Fruit: Tangerine and [...]
4-3 Recognizing Two Kinds of Fruit: Grapefruit and [...]
4-4 Recognizing Three Kinds of Fruit: Tangerine, Grapefruit, and [...]
4-5 Recognizing Data Outside the Training Set
Chapter 5 Conclusion
References
List of Figures
Figure 42. Papaya test image 2
Figure 43. Recognition result for papaya test image 2
Figure 44. Test combination of banana, papaya, and lemon
Figure 45. Recognition result for the banana, papaya, and lemon test combination
Chapter 1 Introduction
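In recent years, image recognition and deep learning have remained popular application areas. Real-time object recognition requires identifying every object in an image and marking its position; achieving this [...] a rapidly developing new field.
[...] applies a filter to the original image to produce another, simplified image from which image [...] are extracted [...] fruit recognition model training and detection. We train on three kinds of fruit: tangerine, [...], and grapefruit. Why detect three kinds of fruit? Two similar-looking fruits are hard to tell apart by eye, so we want a computer to distinguish two similar fruits plus one dissimilar fruit, which serves as a point of comparison.
Because Taiwan is a [...] country and we happen to live in Taiwan, we chose fruits found in Taiwan, where tangerines and [...] are very common. Tangerines and grapefruits look very similar, whereas [...] is a dissimilar fruit whose appearance is clearly different. [...] picking out tangerines and grapefruits that have been mixed together might help farmers in Taiwan.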
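The rest of this thesis is organized as follows. Chapter 2 reviews the techniques and literature used in this thesis. Chapter 3 explains the process and steps we follow to build a recognition pipeline with the YOLO model. Chapter 4 presents our experiments and the recognition results. Finally, Chapter 5 gives a brief explanation and summary of the results of our implementation.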
Chapter 2 Literature Review
In this chapter, we briefly review the techniques and related references cited in this thesis.
2-1 YOLO
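YOLO was proposed in the paper You Only Look Once: Unified, Real-Time Object Detection [1] [...] This architecture avoids the drawback of having to train the components of object detection separately.
Many works use YOLO for real-time image recognition, for example: applying image transformation to augment training [...] 94%. [...] the accuracy is 83.43%.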
Figure 1. Differences among the three YOLO model versions
Source: [4]
2-2 OpenCV
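[...] application programming interfaces for languages such as Python, Ruby, and MATLAB, and can be used to develop image processing, computer vision, and pattern recognition programs.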
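OpenCV can be used to solve problems in the following areas:
Augmented reality
Face recognition
Gesture recognition
Human-computer interaction
Action recognition
Motion tracking
Object recognition
Image segmentation
Robotics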
2-3 Darknet
[...] is very valuable, for example in industrial image inspection and pedestrian detection.
2-4 LabelImg
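LabelImg [7] is an image annotation tool that can mark the objects in an image to produce [...] used for training [...]
When object detection in images is implemented with a neural network, a large amount of known (labelled) data is required, and in the early stage this data must be annotated by hand, marking the location and name of each object in the image. LabelImg is the tool used to annotate the positions and names of objects in images.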
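To make the annotation step concrete, the following is a minimal sketch of how one labelled box can be turned into a line of a YOLO-format label file. It assumes the box is given in pixel coordinates, as in the Pascal VOC XML that LabelImg can export. The function names, the class list, and the sample box are illustrative assumptions, not values taken from this thesis.

```python
def voc_box_to_yolo(xmin, ymin, xmax, ymax, img_w, img_h):
    """Convert a pixel-coordinate box to normalized (x_center, y_center, width, height)."""
    x_center = (xmin + xmax) / 2.0 / img_w
    y_center = (ymin + ymax) / 2.0 / img_h
    width = (xmax - xmin) / img_w
    height = (ymax - ymin) / img_h
    return x_center, y_center, width, height


def yolo_label_line(class_id, box, img_size):
    """Format one annotation as '<class> <x_center> <y_center> <width> <height>'."""
    x, y, w, h = voc_box_to_yolo(*box, *img_size)
    return f"{class_id} {x:.6f} {y:.6f} {w:.6f} {h:.6f}"


if __name__ == "__main__":
    # Hypothetical class list and box, for illustration only.
    classes = ["tangerine", "grapefruit"]
    box = (100, 50, 300, 250)      # xmin, ymin, xmax, ymax in pixels
    img_size = (400, 300)          # image width, height in pixels
    print(yolo_label_line(classes.index("grapefruit"), box, img_size))
```

Each label line holds a class index followed by the box centre and size, all normalized by the image dimensions. This is the layout Darknet reads from the text file stored alongside each training image.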
Chapter 3 Experimental Method
In this chapter, we introduce our research design and implementation approach.
3-1 Research Design
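[...] avoids the drawback of having to train the components of object detection separately.
[...] similar-looking fruits are hard to tell apart by eye, so we want a computer to distinguish two similar fruits plus one dissimilar fruit, which serves as a point of comparison. Because Taiwan is a [...] country and we happen to live in Taiwan, we chose fruits found in Taiwan, where tangerines and [...] are very common. Tangerines and grapefruits look very similar, whereas [...] is a dissimilar fruit whose appearance is clearly different. Helping to pick out tangerines and grapefruits that have been mixed together might help farmers in Taiwan.
Our approach is as follows. We first train on tangerine, [...], and grapefruit individually, with 90 images each. The test data cover four combinations (tangerine and grapefruit; tangerine and [...]; grapefruit and [...]; tangerine, grapefruit, and [...]) with 40 images each, and the data outside the training set (for example, [...] and banana) comprise 25 images each. Why use data outside the training set? [...] whether YOLO can judge such images correctly is also a research question that interests us.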
3-2 Experimental Steps
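1. Install Ubuntu 16.04
3. Install a text editor
4. Install a command-line tool that can fetch content from web URLs
5. Install OpenCV
6. Install Python 3.6
7. Install Git
(1) cd darknet
(2) make
9. Test the model, using a sample image from the official website
10. Install LabelImg
11. Install pip
(1) ls
(2) cd labelImg-master
13. Set up five configuration files: obj.names, obj.data, train.txt, test.txt, and yolov3.cfg.
[...] file.
This file is read during both training and prediction.
14. Train on the data
/home/root1/fruit/weights/yolov3_2110000.weights -gpu
15. Test on the data
/home/root1/fruit/weights/yolov3.cfg
/home/root1/fruit/weights/yolov3_2220000.weights
/home/root1/Grapefruit/Grapefruit01.jpg -gpu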
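The configuration files listed in step 13 can be generated with a short script. The sketch below is one possible way to do it, assuming the usual Darknet convention of an `obj.data` file that points to `train.txt`, `test.txt`, `obj.names`, and a backup folder. The function names, the 80/20 split, the class names, and the directory layout are hypothetical and are not the settings used in this thesis.

```python
import random
import tempfile
from pathlib import Path


def yolo_filters(num_classes):
    """Filters of the conv layer before each [yolo] layer: 3 anchors x (classes + 5)."""
    return (num_classes + 5) * 3


def write_darknet_config(image_paths, class_names, out_dir, test_fraction=0.2, seed=0):
    """Write obj.names, obj.data, train.txt and test.txt into out_dir.

    test_fraction and seed are illustrative assumptions, not thesis settings.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)

    paths = sorted(str(p) for p in image_paths)
    random.Random(seed).shuffle(paths)            # reproducible shuffle
    n_test = int(len(paths) * test_fraction)
    test, train = paths[:n_test], paths[n_test:]

    # One image path per line, as Darknet expects.
    (out / "train.txt").write_text("\n".join(train) + "\n")
    (out / "test.txt").write_text("\n".join(test) + "\n")
    # One class name per line; the line index is the class id.
    (out / "obj.names").write_text("\n".join(class_names) + "\n")

    data_lines = [
        f"classes = {len(class_names)}",
        f"train = {out / 'train.txt'}",
        f"valid = {out / 'test.txt'}",
        f"names = {out / 'obj.names'}",
        f"backup = {out / 'weights'}",
    ]
    (out / "obj.data").write_text("\n".join(data_lines) + "\n")
    return train, test


if __name__ == "__main__":
    # Hypothetical class names and image paths, for illustration only.
    classes = ["tangerine", "grapefruit", "third_fruit"]
    images = [f"/data/fruit/img_{i:03d}.jpg" for i in range(270)]
    with tempfile.TemporaryDirectory() as tmp:
        train, test = write_darknet_config(images, classes, tmp)
        print(len(train), "training images,", len(test), "test images")
        print((Path(tmp) / "obj.data").read_text())
        print("filters =", yolo_filters(len(classes)))
```

For YOLOv3, the `classes` value in each `[yolo]` section of `yolov3.cfg`, and the `filters` value of the convolutional layer directly before it, also have to match the number of classes; `yolo_filters` computes the latter.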
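Chapter 4 Experimental Results
Below, we present the experimental results.
4-1 Recognizing Two Kinds of Fruit: Tangerine and Grapefruit
After completing the environment setup, we first trained on tangerine, [...], and grapefruit individually, with 90 images each. We then used the trained model to recognize tangerines and grapefruits in images. Figure 2 shows outdoor tangerines and grapefruits [...]
Figure 2. Tangerine and grapefruit test image 1
Figure 3. Tangerine and grapefruit test result for Figure 2
Figure 4. Tangerine and grapefruit test image 2
Figure 5. Recognition result for Figure 4
Figure 6. Recognition of 3 tangerines and 4 grapefruits
Figure 7. Recognition result for 3 tangerines and 4 grapefruits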
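Figure 8 shows the third tangerine and grapefruit test case, in which we changed the environment to indoors and [...] the types and numbers [...] 98% and 61%, and grapefruit 92% and 86%.
Figure 8. Indoor test image of 2 tangerines and 2 grapefruits
Figure 9. Recognition result for 2 tangerines and 2 grapefruits
Figure 10 shows the fourth tangerine and grapefruit test case, in which, indoors, we used 1 tangerine and 1 grapefruit [...]
Figure 11. Recognition result for Figure 10
In the results above, recognition was poor for some test images, both outdoors and indoors. The likely cause is an insufficient number of training samples, while ambient lighting appears to have little effect.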
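4-2 Recognizing Two Kinds of Fruit: Tangerine and [...]
In the next experiment, we switched to recognizing tangerines and [...]. We first tested outdoors with tangerines and [...] % were recognized correctly.
Figure 13. Recognition result for Figure 12
Next, in Figure 14, we again tested outdoors, this time with 1 tangerine and 2 [...]; from Figure 15 [...]
Figure 16. Outdoor test image of 1 tangerine and 3 [...]
In Figure 18, again outdoors, we increased the number of tangerines to 5 and the number of [...] to [...]
Figure 18. Outdoors: 5 tangerines and 4 [...]
Figure 19. Recognition result for 5 tangerines and 4 [...] outdoors
Figure 20. Outdoors: 1 tangerine and 4 [...]
Figure 21. Recognition result for Figure 20
Figure 22. Outdoors: 3 tangerines and 4 [...]
Figure 23. Recognition result for 3 tangerines and 4 [...] outdoors
The results above show that for [...] and tangerine, two fruits whose appearance differs considerably, recognition generally achieves a good rate.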
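4-3 Recognizing Two Kinds of Fruit: Grapefruit and [...]
Next, we switched the test samples to grapefruit and [...]. These two fruits likewise differ considerably in appearance [...] we found that recognition achieves good results; even in the slightly out-of-focus, blurry case of Figure 30, the fruits are still recognized correctly (Figure 31).
Figure 24. 2 grapefruits and 3 [...] on a wooden tabletop
Figure 26. Recognition result for 2 grapefruits and 2 [...] on a [...] tabletop
Figure 27. Recognition result for Figure 26
Figure 28. 1 grapefruit and 2 [...] on a wooden tabletop
Figure 29. Recognition result for 1 grapefruit and 2 [...] on a wooden tabletop
Figure 31. Recognition result for 1 grapefruit and 1 [...] on a metal tabletop, slightly out of focus
Figure 32. 1 grapefruit and 1 [...] on a light-colored tabletop indoors
Figure 33. Recognition result for 1 grapefruit and 1 [...] on a light-colored tabletop indoors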
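4-4 Recognizing Three Kinds of Fruit: Tangerine, Grapefruit, and [...]
In the next test, we placed tangerine, grapefruit, and [...] together (Figure 34). In the recognition result in Figure 35, the results for tangerine, grapefruit, and [...] are 98%, 99%, and 100%, respectively.
Figure 34. Test combination of tangerine, grapefruit, and [...]
Figure 35. Recognition of the tangerine, grapefruit, and [...] test combination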
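4-5 Recognizing Data Outside the Training Set
[...] papaya; therefore these two fruits cannot be recognized by our recognition model.
Figure 36. Banana test image 1
Figure 37. Recognition result for banana test image 1
Figure 38. Banana test image 2
Figure 39. Recognition result for banana test image 2
Figure 40. Papaya test image 1
Figure 41. Recognition result for papaya test image 1
Figure 42. Papaya test image 2
Figure 43. Recognition result for papaya test image 2
In the next test, shown in Figure 44, in addition to banana and papaya we added lemon, which was likewise not in the training data. Again, none of the three fruits could be recognized by our recognition model.
Figure 44. Test combination of banana, papaya, and lemon
Figure 45. Recognition result for the banana, papaya, and lemon test combination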
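Chapter 5 Conclusion
[...] 90 images each to train the recognition model.
On the test data, we ran the model on tangerine and grapefruit; tangerine and [...]; grapefruit and [...]; tangerine, grapefruit, and [...]; and on banana, papaya, and lemon, which are not in the training data. Except for the tangerine and grapefruit combination, where the recognition rate was somewhat low for some images, most combinations achieved good recognition results, and environmental factors (indoor, outdoor, shadow, tabletop) made no obvious difference to the results. The likely reason is that tangerines and grapefruits look alike: with insufficient training data, the model trained with YOLO may struggle to tell them apart.
In addition, the untrained banana, papaya, and lemon images could not be recognized by our model, as expected. This shows that a deep learning image recognition model must be given [...] training data before it can recognize objects correctly.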
References
[1] Redmon, J., Divvala, S., Girshick, R., & Farhadi, A. (2016). You only look once:
Unified, real-time object detection. In Proceedings of the IEEE conference on
computer vision and pattern recognition (pp. 779-788).
[2] ᙁߪٺ, ”ᔈҔቹႽᙯඤᘉቚ૽ግၗϐόؼᕉნΠًจుࡋᒣ”ೌמ, ύ
εᏢၗૻπำᏢس, ᅺγፕЎ, 2018
[3] ྷь, “Cat Nose Recognition”, ୯ҥᆵࣽמεᏢၗૻᆅس, ᅺγፕЎ, ҇
୯ 107 ԃ
[4] Huang, T. "Deep Learning - Object Detection: Interpreting the YOLOv1, YOLOv2, and YOLOv3 cfg Files",
https://medium.com/@chih.sheng.huang821/%E6%B7%B1%E5%BA%A6%E5%AD%B8%E7%BF%92-%E7%89%A9%E4%BB%B6%E5%81%B5%E6%B8%ACyolov1-yolov2%E5%92%8Cyolov3-cfg-%E6%AA%94%E8%A7%A3%E8%AE%80-75793cd61a01, accessed on 2019/07/11
[5] Wikipedia, OpenCV, https://zh.wikipedia.org/wiki/OpenCV, accessed on 2019/07/11
[6] Redmon, J. "Darknet: Open Source Neural Networks in C",
http://pjreddie.com/darknet/, accessed on 2019/07/11
[7] LabelImg, https://github.com/tzutalin/labelImg, accessed on 2019/07/11