[5 DS ]
Supervised By :
Wiem Zaouga & Bilel Farah
Technologist Assistant at Esprit Engineering School
Project outlines
Context
Only 32% of IT projects were delivered on time, within budget, and with the quality
requirements initially specified [Standish Group International, 2020].
Project Failure
Motivation & Objectives
Motivation
To help practitioners and researchers better study the impact of their decisions and
evaluate Project Risk (PR) as precisely as possible:
1 Propose an approach for automatically building a conceptual graph for Project Risk
Management (PRM).
Learn from the unstructured corpus “PMBOK 6th”: the knowledge retrieval process.
Enhance the resulting ontology according to the PMI standard for RM: ontology enrichment.
Objectives
2 Enrich the resulting conceptual graph with the knowledge extracted from the PMI
standard for RM.
System architecture
Project Outlines: Steps to be followed
The automatic knowledge retrieval process extracts knowledge from unstructured text,
namely the PMBOK 6th (41 pages [from page 309 to page 354]) and PMI’s standard for PRM
(116 pages), using NLP techniques and transformer models.
The retrieved knowledge will be parsed, following the ontology learning (OL) layer cake,
into concepts, object/data properties, axioms, and rules for building a conceptual graph
such as an “ontology”, by applying machine learning techniques and deep learning
architectures to structure the ontological elements.
Knowledge retrieval module (1): aims to extract the relevant knowledge (concepts,
instances, relationships) from PMI’s corpus, which is the reference for best practice in
PM.
Conceptual graph building module (2): following the OL process, we
automatically structure and represent the PM knowledge in OWL, RDFS, and RDF
formats. Then, a recommendation engine will be developed to encapsulate the
interdependencies between PM concepts (e.g. “IF a PR has a probability value between
0.8 and 1 and an impact rating value of 100, THEN we obtain very high exposure and
a score value of 100”).
Recommendation module (3): aims to infer personalized recommendations and advice
on whether additional information is needed, and suggests ways to obtain such
information using recommendation techniques and approaches.
EXAMPLE:
For each process, the related tools and deliverables (documents, lessons learned)
are inferred based on the user’s search query and profile.
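The IF-THEN rule quoted for module (2) could be encoded roughly as follows. This is only a minimal sketch: the names `ProjectRisk`, `Exposure`, and `very_high_exposure_rule` are hypothetical, and only the single rule stated in the text is covered. How other probability/impact combinations map to exposure levels is not specified here, so the function returns `None` for them.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ProjectRisk:
    """Hypothetical container for one Project Risk (PR)."""
    name: str
    probability: float  # probability value, expected in [0, 1]
    impact_rating: int  # impact rating value


@dataclass(frozen=True)
class Exposure:
    """Hypothetical result of a fired rule."""
    level: str
    score: int


def very_high_exposure_rule(risk: ProjectRisk) -> Optional[Exposure]:
    """Encode only the example rule from the text.

    IF probability is between 0.8 and 1 AND impact rating is 100,
    THEN exposure is "very high" with score 100.
    Returns None when the rule does not fire.
    """
    if 0.8 <= risk.probability <= 1.0 and risk.impact_rating == 100:
        return Exposure(level="very high", score=100)
    return None
```

In an ontology-based setting such a rule would more likely live in the ontology itself (for example as a rule attached to the OWL model) than in application code; the function above only illustrates the condition and its consequence.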
The knowledge retrieval process
Step 1: Data preprocessing
Step 2: Concepts and properties extraction
Step 3: Weighting concepts pertinence
Step 4: Synonyms discovery
Step 5: Concepts hierarchy
Step 6: Non-Taxonomic relations
1 Prepare and analyze the PMBOK 6th corpus for the semantic annotation tasks.
Segmentation recognizes the boundaries of sentences [from page 309 to page 354].
Tokenization splits a character sequence into units (words/phrases) called tokens.
Part-of-speech tagging assigns each token its related part of speech [noun (NN), verb (VB), determiner (DT), etc.], using nltk.tag.
Chunking is the task that segments and labels multi-token sequences. Each group of tokens is called a “chunk”.
Stop word removal removes from the text the stop words that have no significant relevance [adjective ADJ or adverb ADV, DT].
Stemming reduces derived words to their stem (root).
This step is achieved by means of NLTK.
2 Identify the concepts and properties.
Identify the concepts list (NN) and the properties list (VB) from the set of stemmed terms, through spaCy Rule-Based Matching using the pattern-based extraction approach.
3 Weight the pertinence of each candidate term.
Term Frequency-Inverse Document Frequency (TF-IDF) calculates the frequency of occurrence of each term x within document y.
4 Extract synonyms.
The Wu-Palmer Score [Wang et al., 2020] computes the relatedness between two terms by considering the depth of the two synsets in the WordNet taxonomy and the depth of their Least Common Subsumer (LCS).
5 Identify the concept hierarchy.
The resulting concept hierarchy is extracted using “Subject - is a/is part of - Object” patterns.
6 Generate the non-taxonomic relations.
The spaCy 3.2 pipeline is used.
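The TF-IDF weighting and the Wu-Palmer score named in the knowledge retrieval process can be illustrated with a small, library-free sketch. Several things here are assumptions rather than the project's actual code: the TF-IDF variant (relative term frequency times log(N / document frequency)), the toy taxonomy `TOY_TAXONOMY`, and the convention that the root has depth 1. In practice the score would more likely come from WordNet, for instance through NLTK's `Synset.wup_similarity`.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Weight each term x in each document y.

    docs is a list of token lists. Assumed variant:
    tf = count / document length, idf = log(N / document frequency).
    Returns one {term: weight} dict per document.
    """
    n_docs = len(docs)
    # Number of documents in which each term appears at least once.
    doc_freq = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append({
            term: (count / len(doc)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights


# Hypothetical single-rooted taxonomy, stored as child -> parent.
TOY_TAXONOMY = {
    "risk": "entity",
    "threat": "risk",
    "opportunity": "risk",
    "process": "entity",
}


def path_to_root(term, parent):
    """Return the list [term, parent, ..., root]."""
    path = [term]
    while path[-1] in parent:
        path.append(parent[path[-1]])
    return path


def wu_palmer(a, b, parent):
    """Wu-Palmer score: 2 * depth(LCS) / (depth(a) + depth(b)).

    Depth is the number of nodes on the path to the root (root = 1).
    Assumes both terms share a root, so an LCS always exists.
    """
    path_a = path_to_root(a, parent)
    path_b = path_to_root(b, parent)
    ancestors_a = set(path_a)
    # First node on b's path that is also an ancestor of a is the LCS.
    lcs = next(node for node in path_b if node in ancestors_a)
    depth_lcs = len(path_to_root(lcs, parent))
    return 2 * depth_lcs / (len(path_a) + len(path_b))
```

With the toy taxonomy, sibling terms such as "threat" and "opportunity" score higher than terms that only meet at the root, which is the behaviour the synonym discovery step relies on.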
The Final output: illustrations
Ontology illustration
System illustration
2022/2023