Professional Documents
Culture Documents
PDF Analysis of Images Social Networks and Texts 7Th International Conference Aist 2018 Moscow Russia July 5 7 2018 Revised Selected Papers Wil M P Van Der Aalst Ebook Full Chapter
PDF Analysis of Images Social Networks and Texts 7Th International Conference Aist 2018 Moscow Russia July 5 7 2018 Revised Selected Papers Wil M P Van Der Aalst Ebook Full Chapter
https://textbookfull.com/product/analysis-of-images-social-
networks-and-texts-8th-international-conference-aist-2019-kazan-
russia-july-17-19-2019-revised-selected-papers-wil-m-p-van-der-
aalst/
https://textbookfull.com/product/communication-systems-and-
networks-10th-international-conference-comsnets-2018-bangalore-
india-january-3-7-2018-extended-selected-papers-subir-biswas/
https://textbookfull.com/product/parallel-computational-
technologies-12th-international-conference-pct-2018-rostov-on-
don-russia-april-2-6-2018-revised-selected-papers-leonid-
sokolinsky/
Analysis of Images,
Social Networks and Texts
7th International Conference, AIST 2018
Moscow, Russia, July 5–7, 2018
Revised Selected Papers
123
Lecture Notes in Computer Science 11179
Commenced Publication in 1973
Founding and Former Series Editors:
Gerhard Goos, Juris Hartmanis, and Jan van Leeuwen
Editorial Board
David Hutchison
Lancaster University, Lancaster, UK
Takeo Kanade
Carnegie Mellon University, Pittsburgh, PA, USA
Josef Kittler
University of Surrey, Guildford, UK
Jon M. Kleinberg
Cornell University, Ithaca, NY, USA
Friedemann Mattern
ETH Zurich, Zurich, Switzerland
John C. Mitchell
Stanford University, Stanford, CA, USA
Moni Naor
Weizmann Institute of Science, Rehovot, Israel
C. Pandu Rangan
Indian Institute of Technology Madras, Chennai, India
Bernhard Steffen
TU Dortmund University, Dortmund, Germany
Demetri Terzopoulos
University of California, Los Angeles, CA, USA
Doug Tygar
University of California, Berkeley, CA, USA
More information about this series at http://www.springer.com/series/7409
Wil M. P. van der Aalst Vladimir Batagelj
•
Analysis of Images,
Social Networks and Texts
7th International Conference, AIST 2018
Moscow, Russia, July 5–7, 2018
Revised Selected Papers
123
Editors
Wil M. P. van der Aalst Irina A. Lomazova
RWTH Aachen University National Research University Higher School
Aachen, Germany of Economics, Moscow, Russia
Vladimir Batagelj Natalia Loukachevitch
University of Ljubljana Moscow State University
Ljubljana, Slovenia Moscow, Russia
Goran Glavaš Amedeo Napoli
University of Mannheim Loria
Mannheim, Germany Vandoeuvre lès Nancy, France
Dmitry I. Ignatov Alexander Panchenko
National Research University Higher School University of Hamburg
of Economics, Moscow, Russia Hamburg, Germany
Michael Khachay Panos M. Pardalos
Institute of Mathematics and Mechanics University of Florida
Yekaterinburg, Russia Gainesville, FL, USA
Sergei O. Kuznetsov Marcello Pelillo
National Research University Higher School Ca Foscari University of Venice
of Economics, Moscow, Russia Venice, Italy
Olessia Koltsova Andrey V. Savchenko
National Research University Higher School National Research University Higher School
of Economics, Saint Petersburg, Russia of Economics, Nizhny Novgorod, Russia
LNCS Sublibrary: SL3 – Information Systems and Applications, incl. Internet/Web, and HCI
This Springer imprint is published by the registered company Springer Nature Switzerland AG
The registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland
Preface
This volume contains the refereed proceedings of the 7th International Conference on
Analysis of Images, Social Networks, and Texts (AIST 2018)1. The previous confer-
ences during 2012–2017 attracted a significant number of data scientists – students,
researchers, academics, and engineers working on interdisciplinary data analysis of
images, texts, and social networks.
The broad scope of AIST made it an event where researchers from different
domains, such as image and text processing, exploiting various data analysis tech-
niques, can meet and exchange ideas. We strongly believe that this may lead to the
cross-fertilisation of ideas between researchers relying on modern data analysis
machinery.
Therefore, AIST brought together all kinds of applications of data mining and
machine learning techniques. The conference allowed specialists from different fields to
meet each other, present their work, and discuss both theoretical and practical aspects
of their data analysis problems. Another important aim of the conference was to
stimulate scientists and people from industry to benefit from the knowledge exchange
and identify possible grounds for fruitful collaboration.
The conference was held during July 5–7, 2018. The conference was organised in
Moscow, the capital of Russia, on the campus of Moscow Polytechnic University
(MPU)2.
This year, the key topics of AIST were grouped into six tracks:
1. General Topics of Data Analysis chaired by Sergei Kuznetsov (Higher School of
Economics, Russia) and Amedeo Napoli (LORIA, France)
2. Natural Language Processing chaired by Natalia Loukachevitch (Lomonosov
Moscow State University, Russia) and Goran Glavaš (University of Mannheim,
Mannheim, Germany)
3. Social Network Analysis chaired by Vladimir Batagelj (University of Ljubljana,
Slovenia) and Olessia Koltsova (Higher School of Economics, Russia)
4. Analysis of Images and Video chaired by Marcello Pelillo (University of Venice,
Italy) and Andrey Savchenko (Higher School of Economics, Russia)
5. Optimization Problems on Graphs and Network Structures chaired by Panos
Pardalos (University of Florida, USA) and Michael Khachay (IMM UB RAS and
Ural Federal University, Russia)
6. Analysis of Dynamic Behavior Through Event Data chaired by Wil van der Aalst
(RWTH Aachen University, Germany) and Irina Lomazova (Higher School of
Economics, Russia)
1
http://aistconf.org/.
2
http://mospolytech.ru/?eng.
VI Preface
The Program Committee and the reviewers of the conference included 154
well-known experts in data mining and machine learning, natural language processing,
image processing, social network analysis, and related areas from leading institutions of
32 countries including Argentina, Australia, Austria, Bangladesh, Canada, China,
Croatia, Czech Republic, Denmark, Egypt, Finland, France, Germany, Greece, India,
Italy, Kazakhstan, Lithuania, Norway, Qatar, Romania, Russia, Slovenia, Spain,
Switzerland, Vietnam, Taiwan, Turkey, Ukraine, The Netherlands, UK, and USA. This
year we received 107 submissions mostly from Russia but also from Austria,
Bangladesh, Finland, Germany, Norway, Switzerland, Taiwan, Turkey, Ukraine, USA,
and Vietnam.
Out of 107 submissions, only 29 papers were accepted as regular oral papers. Thus,
the acceptance rate of this volume was around 36% (not taking into account 26
automatically rejected papers). In order to encourage young practitioners and
researchers, we included 28 papers in the supplementary proceedings after their poster
presentation at the conference. Each submission was reviewed by at least three
reviewers, experts in their fields, in order to supply detailed and helpful comments.
The conference featured several invited talks and an industry session dedicated to
current trends and challenges.
The topical opening talk was presented by Rostislav Yavorskiy on “Visualization of
Data Science Community in Russia.”
The invited talks were:
– Anne Rozinat (Fluxicon), “Process Mining: Discovering Process Maps from Data”
– Goran Glavaš (University of Mannheim), “Cars, Drivers, Vehicles, and Wheels:
Specializing Distributional Word Vectors for Lexico-Semantic Relations”
– Gleb Gusev (Yandex and Moscow Institute of Physics and Technology), “Machine
Learning Research in Yandex and CatBoost”
– Ekaterina Chernyak of HSE and Sberbank presented a tutorial on “Supervised
Methods in Natural Language Processing”
The business speakers also covered a wide variety of topics:
– Iosif Itkin (Exactpro Systems), “The World’s Most Valuable Commodity”
– Maksim Balashevich (Santiment), “Crypto Assets Valuation”
– Ilya Utekhin (European University at St. Petersburg), “On Understanding Users for
Human-Centered Design”
– Denis Stepanov (JetBrains), “Workshop: Data Visualization with Datalore.plot in
Research, Education, and Just for Fun”
– Eugeny Malyutin (OK.ru), “Trend Detection at OK”
– Oleg Klepikov (Center for Applied Neuroeconomics), “Psychometric Behavorial
Classification of Banking Clients – A Case Study”
– Dmitriy Skougarevskiy (European University at St. Petersburg), “Semantic Rep-
resentation of Accident Reports to Understand the Dark Figure of Crime”
Preface VII
Under a special agreement between the responsible program co-chairs and the
proceedings chair, due to its interdisciplinary nature, one paper was assigned to a
special section on Innovative Systems (outside of the main tracks).
We would like to thank the authors for submitting their papers and the members
of the Program Committee for their efforts in providing exhaustive reviews.
According to the program chairs, and taking into account the reviews and presen-
tation quality, the best paper awards were granted to the following papers:
Track 1. General Topics of Data Analysis: “Lookup Lateration: Mapping of
Received Signal Strength to Position for Geo-Localization in Outdoor Urban Areas” by
Andrey Shestakov, Attila Kertesz-Farkas, Dmitri Shmelkin, and Danila Doroshin
Track 2. Natural Language Processing: “Learning Representations for Soft Skill
Matching” by Luiza Sayfullina, Eric Malmi, and Juho Kannala
Track 3. Social Network Analysis: “Organizational Networks Revisited: Predictors
of Headquarters–Subsidiary Relationship Perception” by Antonina Milekhina,
Valentina Kuskova, and Elena Artyukhova
Track 4. Analysis of Images and Video: “Application of Fully Convolutional Neural
Networks to Mapping Industrial Oil Palm Plantations” by Artem Baklanov, Mikhail
Khachay, and Maxim Pasynkov
Track 5. Optimization Problems on Graphs and Network Structures: “Efficient
PTAS for the Euclidean CVRP with Time Windows” by Michael Khachay and Yuri
Ogorodnikov
Track 6. Analysis of Dynamic Behavior Through Event Data: “Neural Approach to
the Discovery Problem in Process Mining” by Timofey Shunin, Natalia Zubkova, and
Sergey Shershakov
We would also like to express our special gratitude to all the invited speakers and
industry representatives.
We deeply thank all the partners and sponsors. Our golden sponsor was Exactpro.
Exactpro, a fully owned subsidiary of the London Stock Exchange Group, spe-
cialises in quality assurance for exchanges, investment banks, brokers, and other
financial sector organisations. Our special thanks go to Springer for their help, starting
from the first conference call to the final version of the proceedings. JetBrains covered
important conference expenses while Springer sponsored the best paper awards. Last
but not least, we are grateful to the dean of Information Technologies Faculty of MPU,
Andrey Phillipovich, and all the organisers, especially to Andrey Novikov and Marina
Danshina, and the volunteers, whose endless energy saved us at the most critical stages
of the conference preparation.
Here, we would like to mention the Russian word “aist” is more than just a simple
abbreviation (in Cyrillic) – it means a “stork.” Since it is a wonderful free bird, a
symbol of happiness and peace, this stork gave us the inspiration to organise the AIST
VIII Preface
conference. So we believe that this young and rapidly growing conference will likewise
be bringing inspiration to data scientists around the world!
Proceedings Chair
Dmitry I. Ignatov National Research University Higher School of Economics,
Russia
Steering Committee
Dmitry I. Ignatov National Research University Higher School of Economics,
Russia
X Organisation
Program Committee
Ekaterina St. Petersburg Department of Steklov Institute of
Arkhangelskaya Mathematics of Russian Academy of Sciences, Russia
Aleksey Artamonov Neuromation, USA
Xiang Bai Huazhong University of Science and Technology, China
Jaume Baixeries Universitat Politècnica de Catalunya, Spain
Amir Bakarov National Research University Higher School of Economics,
Russia
Artem Baklanov International Institute for Applied Systems Analysis,
Austria
Lamberto Ballan University of Padova, Italy
Vladimir Batagelj Institute of Mathematics, Physics and Mechanics, Slovenia
Timo Baumann University of Hamburg, Germany
Malay Bhattacharyya Indian Statistical Institute, Kolkata
Chris Biemann University of Hamburg, Germany
Battista Biggio University of Cagliari, Italy
Elena Bolshakova Lomonosov Moscow State University, Russia
Samuel Bulò Fondazione Bruno Kessler, Italy
Andrea Burattin Technical University of Denmark, Denmark
Evgeny Burnaev Skolkovo Institute of Science and Technology, Russia
Aleksey Buzmakov National Research University Higher School of Economics,
Perm, Russia
Ignacio Cassol Universidad Austral, Argentina
Artem Chernodub Institute of Mathematical Machines and Systems Problems
of National Academy of Sciences of Ukraine, Ukraine
Mikhail Chernoskutov Krasovskii Institute of Mathematics and Mechanics of the
Ural Branch of the Russian Academy of Sciences and
Ural Federal University, Russia
Bonaventura Coppola University of Trento, Italy
Hernani Costa University of Malaga, Spain
Massimiliano de Leoni Eindhoven University of Technology, The Netherlands
Boris Dobrov Research Computing Center of Lomonosov Moscow State
University, Russia
Sofia Dokuka National Research University Higher School of Economics,
Russia
Dirk Fahland Eindhoven University of Technology, the Netherlands
Organisation XI
Additional Reviewers
Mikhail Batsyn
Marc Chaumont
Ivan Grechikhin
Alexander Rassadin
Yuri Rykov
François Suro
Arsenis Tsokas
Organising Committee
Rostislav Yavorskiy National Research University Higher School of Economics,
(Conference Chair) Russia
Andrey Novikov (Head National Research University Higher School of Economics,
of Organisation) Russia
Marina Danshina Moscow Polytechnic University, Russia
(Venue Organisation
and Management)
Organisation XV
Volunteers
Daniil Bannyh Moscow Polytechnic University, Russia
Ksenia Belkova Moscow Polytechnic University, Russia
Tatiana Mishina Moscow Polytechnic University, Russia
Zahar Kuhtenkov Moscow Polytechnic University, Russia
Maxim Pasynkov Krasovsky Institute of Mathematics and Mechanics of RAS,
Russia, Yekaterinburg
Ivan Poylov Moscow Polytechnic University, Russia
Sponsors
Golden sponsor
Exactpro
Bronze sponsors
JetBrains
Springer
Invited Talks and Tutorials
Process Mining: Discovering Process Maps
From Data
Anne Rozinat
Ekaterina Chernyak
Goran Glavaš
Opening Talk
RusNLP: Semantic Search Engine for Russian NLP Conference Papers . . . . . 111
Irina Nikishina, Amir Bakarov, and Andrey Kutuzov
Exact Algorithms for the Special Cases of Two Hard to Solve Problems
of Searching for the Largest Subset . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 294
Alexander Kel’manov, Vladimir Khandeev, and Anna Panasenko
Efficient PTAS for the Euclidean CVRP with Time Windows . . . . . . . . . . . 318
Michael Khachay and Yuri Ogorodnikov
Innovative Systems
Generic Distributed In Situ Aggregation for Earth Remote Sensing Imagery . . . 331
Ramon Antonio Rodriges Zalipynis
Importance and high demand for data science competencies have been widely
discussed in literature, see e.g. [4,5]. It was shown in [1] that the ascendancy of
market and hierarchy principles in the organization of professional work has not
diminished the role of community. Important role of community in education,
research and professional development has been studied for many years, see
[2,3,10,11].
Systematic study of regional community of data science professionals provides
useful information to:
– students, who build their career path and search for scientific supervisor or
an active research group to join;
– scientist, who are looking for new collaboration and cooperation opportuni-
ties;
– research managers who want to know the current situation in the field, trends,
strong and weak spots, and better plan development programs and projects.
2 Education
Good education is the key factor in facilitating research, so one of our goals
is to measure data science education opportunities across the country. We are
collecting data from the official web sites of Russian universities. Some bachelor
and master programs have vary clear focus on data science and include that by
using specific keywords in their titles, e.g. “data analysis”, “big data”, “data
science” etc. For other the relation to data science is not that clear, that is why
we manually inspected all the programs on applied mathematics and informatics.
The following Table 1 presents some of the numbers. In order to make the
data for different cities better comparable we have added a column with figures
for enrollments per 100k citizens rate.
Table 1. The numbers of annual enrollments in data science related bachelor and
master programs in major Russian cities
City Enrolment per year City population x1000 Enrollments per 100k citizens
Moscow 2946 12381 23.79
St. Petersburg 892 5282 16.89
Nizhniy Novgorod 463 3248 14.25
Samara 342 3204 10.67
Tomsk 308 1079 28.54
Kazan 212 3885 5.46
Irkutsk 185 2409 7.68
Ekaterinburg 180 4329 4.16
Chelyabinsk 151 3502 4.31
Krasnoyarsk 145 2875 5.04
Perm 131 1007 13.01
Novosibirsk 119 1511 7.88
Fig. 1. Map of data science researchers in Russia by field. Size of the circle nodes is
proportional to the total number of citations in Google Scholar (minimum is 100). Fill
color indicates the region of the affiliation institute or university. (Color figure online)
6 R. Yavorskiy et al.
Table 2. Top 10 data science researchers in Russia with respect to the total number
of citations in Google Scholar
4 On-line Activities
In the research we decided to examine VKontakte1 (VK). VK is the most pop-
ular website and social networking platform in Russia2 . VK provides access for
working with data through the API3 . In this way, we used VK API to identify
people attendance at on-line activities.
At the first step, we defined the following keywords: Big Data, Data Science,
Deep Learning, Kaggle, Machine Learning, Natural Language Processing, Neural
Network, Reinforcement Learning, Mashinnoe Obuchenie (Machine Learning in
Russian), Nejronnye Seti (Neural Network in Russian). At the second step, we
used the API methods at target groups (groups, pages, events) searching accord-
ing to the keywords. At the final step, we fetched groups member count, people
distribution in relative to cities affiliation, and people membership intersection
in the groups.
The following Table 3 presents top five data science on-line communities on
VK platform (those which have 5000 or more members).
We found 19 on-line communities with 1000 members or more (as of June
2018). They are depicted on Fig. 2. Size of a node is proportional to the number
of involved users. Edges are used to indicate popularity of these communities in
major Russian cities.
1
https://vk.com.
2
https://www.alexa.com/siteinfo/vk.com.
3
https://vk.com/dev/methods.
Visualization of Data Science Community in Russia 7
Fig. 2. Map of on-line communities on data science with minimum 1000 members on
VK platform. Size of a node is proportional to the number of involved users. Edges
indicate popularity in major Russian cities.
Another random document with
no related content on Scribd:
*** END OF THE PROJECT GUTENBERG EBOOK MOTION
PICTURES, JANUARY-JUNE 1974 : CATALOG OF COPYRIGHT
ENTRIES, THIRD SERIES, VOLUME 28, PARTS 12-13, NUMBER 1
***
Updated editions will replace the previous one—the old editions will
be renamed.
1.D. The copyright laws of the place where you are located also
govern what you can do with this work. Copyright laws in most
countries are in a constant state of change. If you are outside the
United States, check the laws of your country in addition to the terms
of this agreement before downloading, copying, displaying,
performing, distributing or creating derivative works based on this
work or any other Project Gutenberg™ work. The Foundation makes
no representations concerning the copyright status of any work in
any country other than the United States.
• You pay a royalty fee of 20% of the gross profits you derive from
the use of Project Gutenberg™ works calculated using the
method you already use to calculate your applicable taxes. The
fee is owed to the owner of the Project Gutenberg™ trademark,
but he has agreed to donate royalties under this paragraph to
the Project Gutenberg Literary Archive Foundation. Royalty
payments must be paid within 60 days following each date on
which you prepare (or are legally required to prepare) your
periodic tax returns. Royalty payments should be clearly marked
as such and sent to the Project Gutenberg Literary Archive
Foundation at the address specified in Section 4, “Information
about donations to the Project Gutenberg Literary Archive
Foundation.”
• You comply with all other terms of this agreement for free
distribution of Project Gutenberg™ works.
1.F.
1.F.4. Except for the limited right of replacement or refund set forth in
paragraph 1.F.3, this work is provided to you ‘AS-IS’, WITH NO
OTHER WARRANTIES OF ANY KIND, EXPRESS OR IMPLIED,
INCLUDING BUT NOT LIMITED TO WARRANTIES OF
MERCHANTABILITY OR FITNESS FOR ANY PURPOSE.
Please check the Project Gutenberg web pages for current donation
methods and addresses. Donations are accepted in a number of
other ways including checks, online payments and credit card
donations. To donate, please visit: www.gutenberg.org/donate.
Most people start at our website which has the main PG search
facility: www.gutenberg.org.