Formulario - de - Extraccion - de - Datos - 1

FORMULARIO DE EXTRACCIÓN DE DATOS
Publication name: Modeling public mood and emotion: Blog

and news sentiment and socio-economic phenomena
N° INFORMACION A RECOPILAR INFORMACION DEL ARTICULO

1 Idioma en que se realizó la investigación English
2 Nombre de la revista / fecha de publicación Elsevier B.V - 18 October 2017
Mu-Yen Chen, Ting-Hsuan Chen
Department of Information Management, National

Taichung University of Science and Technology,
3 Datos de Investigadores Taichung 404, Taiwan, R.O.C
Department of Finance, National Taichung University

of Science and Technology, Taichung 404, Taiwan,
R.O.C
National Taichung University of Science and

4 Centro de Investigación Technology
This research proposes and a sentiment mining

approach designed to address massive quantities of
5 Hipótesis / objetivo de la investigación short text articles based on the Academia Sinica
Bilingual Ontological Wordnet (BOW-WordNet)
Sentiment classification focuses on four aspects:

subjectivity classification, word
sentiment classification, document sentiment

6 Metodología utilizada en la investigación classification, and opinion extraction. This paper
mainly focuses on the sentiment classification of
document content to
identify positive or negative opinions
7 Técnicas de procesamiento del lenguaje natural Categorical Proportional Difference (CPD) is a feature
utilizadas para el procesamiento de información selection method
extraída de redes sociales
proposed by Simeon and Hilderman (2008) and is
often applied to multi-category
document classification [38]. O'Keefe and Koprinska

(2009) used this method in two
categories of semantic classification studies [30]. This

method is mainly used to
calculate the difference between a feature’s positive

and negative document frequency,
allowing for the selection of features that can

effectively distinguish between the two
categories.
8 Algoritmos de clasificación utilizados

Matlab
Python x
¿Qué software de aprendizaje automático y
9 Seleccione un campo R
minería de datos se utilizó?
Weka
Otro
Si
10 Se utilizó un corpus Seleccione un campo
No X
11 El corpus fue anotado manualmente
12 Información de las fuentes de datos utilizadas
Vindhini et al. (2012) pointed out that sentiment
mining is primarily a natural language processing
technique used to analyze emotions and opinions in
text. - G. Vinodhini, R.M. Chandrasekaran, Sentiment
Analysis and Opinion Mining: A Survey. International
Journal of Advanced Research in Computer Science
13 and Software Engineering 2(6) (2012) 282-292.
Referencias útiles del artículo
New opinion words and feature vocabulary items can
then again be used to identify new feature vocabulary
items and opinion words - X. Ding, B. Liu, P.S. Yu, A
Holistic Lexicon-Based Approach to Opinion Mining.
WSDM’08, February 11-12, Palo Alto, California, USA,
2008.
This research processed financial data collected from

Taiwan's most popular
news websites including ChinaTi2Es.com, Yahoo, and

Google from
¿Qué técnicas se utilizaron para el análisis de
14
datos? Januaty 1, 2016 to July 31, 2017_ Stock prices of the
individual stock and the
TWSE capitalization weighted stock index (TAEX) for

the period wete also
15 ¿Cómo se evaluó la calidad de los resultados? The prcposed model was verifed using experimental
datasets from
ctñES.cotn, Yahoo stock market news, and Google

stock market
news fromJanuary 1, 2016 to July 311 2017.
Based on the lexicon training results, this research was

calculated the positil.e or
negative emotion of each uticle from the time period

t-5 to ta to determine úe
cotrelation of stock price movement on time t,

Experiments were designed to ad&ess
the following research questiotE. (1) Does the amount

of data used for prediction
This research integrated content from financial blogs

and news articles to
develop a public mood dynamic prediction model for

stock prices, referencing
behavioral finance and online financial community

characteristics. A public mood
16 Notas del articulo time series prediction model is also presented,

integrating features from social
networks and behavioral finance, and uses big data

analysis to assess emotional
content of commentary on current stock or financial

issues to forecast changes for
Taiwan stock index.
This step uses the word identificatioa to analyze the

emotional moment
for online financial news articles. The target feature

wor& manually acquired
and expanded using Python-Jieba- We used Python-

Jieba to extend the collected
17 Trabajos futuros feature words. This can be acconplished through two

approaches- The first
segments words through the fill uticle without respect

to part of speech or polarity.
The oúer captures nouns because it is easier to

distinguish between sentences based
on polarity.

Formulario - de - Extraccion - de - Datos - 1

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Formulario - de - Extraccion - de - Datos - 1

Uploaded by

Copyright:

Available Formats

FORMULARIO DE EXTRACCIÓN DE DATOS

Publication name: Modeling public mood and emotion: Blog

N° INFORMACION A RECOPILAR INFORMACION DEL ARTICULO

Mu-Yen Chen, Ting-Hsuan Chen

Department of Information Management, National

Department of Finance, National Taichung University

National Taichung University of Science and

This research proposes and a sentiment mining

Sentiment classification focuses on four aspects:

sentiment classification, document sentiment

identify positive or negative opinions

document classification [38]. O'Keefe and Koprinska

categories of semantic classification studies [30]. This

calculate the difference between a feature’s positive

allowing for the selection of features that can

8 Algoritmos de clasificación utilizados

This research processed financial data collected from

news websites including ChinaTi2Es.com, Yahoo, and

TWSE capitalization weighted stock index (TAEX) for

ctñES.cotn, Yahoo stock market news, and Google

Based on the lexicon training results, this research was

negative emotion of each uticle from the time period

cotrelation of stock price movement on time t,

the following research questiotE. (1) Does the amount

This research integrated content from financial blogs

develop a public mood dynamic prediction model for

behavioral finance and online financial community

16 Notas del articulo time series prediction model is also presented,

networks and behavioral finance, and uses big data

content of commentary on current stock or financial

Taiwan stock index.

This step uses the word identificatioa to analyze the

for online financial news articles. The target feature

and expanded using Python-Jieba- We used Python-

17 Trabajos futuros feature words. This can be acconplished through two

segments words through the fill uticle without respect

The oúer captures nouns because it is easier to

You might also like