Professional Documents
Culture Documents
Knowledge Graphs
A Project report submitted In fulfilment of the requirements for the Award of
the Degree of
Submitted By
ANUMULA. VENKATA REDDY
Reg No: 14AH1F0002
CERTIFICATE
This is to certify that this Project work entitled Online Subgraph Skyline
Analysis Over Knowledge Graphs that is being submitted by
EXTERNAL EXAMINER
ACKNOWLEDGEMENT
We extend my sincere thanks to our guide Mr U. BABU, Assoc. Prof, whose valuable
guidance and unstinting encouragement enabled us to accomplish our project successfully in
time.
9. CONCLUSION 53
11. REFERENCE 55
ABSTRACT
1. INTRODUCTION
Recently, graphs have attracted the increasing attention from the database
community. A lot of real-world data (e.g., social networks, knowledge graphs, heterogenous
information networks, and the Semantic Web ) can be represented by the graph model. In the
literature, various research problems over graphs have been investigated, such as the shortest
path query subgraph search the reachability query, and so on.
"59
1|Page
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
2.LITERATURE SURVEY
2|Page
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
3.SYSTEM ANALYSIS
The input design is the link between the information system and the user.
It comprises the developing specification and procedures for data preparation and those steps
are necessary to put transaction data in to a usable form for processing can be achieved by
inspecting the computer to read data from a written or printed document or it can occur by
having people keying the data directly into the system. The design of input focuses on
controlling the amount of input required, controlling the errors, avoiding delay, avoiding extra
steps and keeping the process simple. The input is designed in such a way so that it provides
security and ease of use with retaining the privacy. Input Design considered the following
things:
1.Input Design is the process of converting a user-oriented description of the input into a
computer-based system. This design is important to avoid errors in the data input process and
show the correct direction to the management for getting correct information from the
computerized system.
2. It is achieved by creating user-friendly screens for the data entry to handle large volume of
data. The goal of designing input is to make data entry easier and to be free from errors. The
data entry screen is designed in such a way that all the data manipulates can be performed. It
also provides record viewing facilities.
3.When the data is entered it will check for its validity. Data can be entered with the help of
screens. Appropriate messages are provided as when needed so that the user
3|Page
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
will not be in maize of instant. Thus the objective of input design is to create an input layout
that is easy to follow
A quality output is one, which meets the requirements of the end user and
presents the information clearly. In any system results of processing are communicated to the
users and to other system through outputs. In output design it is determined how the
information is to be displaced for immediate need and also the hard copy output. It is the most
important and direct source information to the user. Efficient and intelligent output design
improves the systems relationship to help user decision-making.
1. Designing computer output should proceed in an organized, well thought out manner; the
right output must be developed while ensuring that each output element is designed so that
people will find the system can use easily and effectively. When analysis design computer
output, they should Identify the specific output that is needed to meet the requirements.
3.Create document, report, or other formats that contain information produced by the system.
The output form of an information system should accomplish one or more of the following
objectives.
Graphs have attracted the increasing attention from the database community . A lot of real-
world data (e.g., social networks , knowledge graphs , heterogenous information networks ,
and the Semantic Web) can be represented by the graph model. In the literature, various
research problems over graphs have been investigated, such as the shortest path query ,
4|Page
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
subgraph search , the reachability query , and so on. As a well-known research problem, the
subgraph search problem is meaningful and useful in many applications. For example,
answering SPARQL queries in the Semantic Web is equivalent to conducting the subgraph
match over graphs . However, users may be overwhelmed by the enormous matching results
returned from queries. Owing to different requirements in a variety of applications, it is
nontrivial how to design a generic function to measure/rank the goodness of these matches.
3.5 PROPOSED SYSTEM
In this proposed system, we study the subgraph skyline problem on large graphs,
which can retrieve the matching subgraphs by considering both graph structures and graph
entity dominance relationships. We propose the problem of subgraph skyline analysis (denoted
by S2A) over large graphs, and present an efficient method to answer S 2A queries. We partition
the data space into grid cells, and compute skylines cell by cell, instead of entity by entity.
Most importantly, we propose optimizations on how to find a good partitioning. By clustering
numeric attributes, we devise an effective method to deal with the curse of dimensionality.
Extensive experiments over real dataset have demonstrated the effectiveness and efficiency of
our method.
3.6 IMPLEMENTATION
Module Description
1.Subgraph search Module
2.Subgraph skyline Module
3.High dimensionality Module
4. Query Processing With Clusters
5|Page
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
on querying knowledge graphs. To avoid costly graph isomorphism and edit distance
computation, exploits a neighborhood-based subgraph matching technique for querying real-
life networks.
2.Subgraph Skyline Module
A subgraph g 2 G is in the subgraph skyline, if g is graph isomorphic to the query
graph q (without considering the numeric attributes and values) and not dominated by any other
subgraphs g0 2 G, on those specified numeric attributes in q. Subgraph Skyline Analysis over
large graphs (denoted as S 2A). Specifically, given a large graph G and a userspecified query
graph pattern q (which contains numeric attributes in graph entities), S 2A returns all the
subgraphs in G that are isomorphic to q and not dominated by any other isomorphic subgraphs
in terms of numeric attributes in q.
3.High Dimensionality Module
In a heterogenous knowledge graph, there are enormous numeric attributes
associated with entities of different types. Thus, the weighted dominating graph is usually flat,
which may result in too many cells in each single layer and diminish the pruning ability
definitely. This phenomenon is actually the curse of dimensionality. Note that entities in a
knowledge graph
G are naturally distinguished by types. More importantly, it is not required to check the
dominating relation between entities of different types. Hence, it is reasonable to build index
for entities of each type separately. Hereon, all the following discussions focus on the entities
of a certain type.
4. Query Processing With Clusters
In order to solve the curse of dimensionality, we organize the numeric attributes
into several subspaces (i.e., clusters) for entities of each type and build a skylayer for each
subspace. If the numeric attributes of an entity in the query q are covered by a subspace,
Otherwise, it requires several subspaces to cover the entity. Here, we design an efficient
strategy to determine the skyline candidates. The procedure starts from the first layer of a
subspace Di that covers one or more dimensions of
2
u q. If the candidate entity v is in the skyline over Di, we check the structure constraint
directly. Otherwise, we need to check whether v is in skyline on the dimensions involved in u.
All the candidates dominated by v are denoted as Mv(Di). The candidates in the intersection
of all Mv(Di) can be pruned.
6|Page
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
Architecture Diagram
Algorithm
K Means algorithm:
7|Page
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
RAM - 4 GB
Hard Disk - 1 TB
IDE : MY Eclipse
8|Page
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
4. SYSTEM STUDY
The feasibility of the project is analyzed in this phase and business proposal is put
forth with a very general plan for the project and some cost estimates. During system
analysis the feasibility study of the proposed system is to be carried out. This is to ensure
that the proposed system is not a burden to the company. For feasibility analysis, some
ECONOMICAL FEASIBILITY
TECHNICAL FEASIBILITY
SOCIAL FEASIBILITY
This study is carried out to check the economic impact that the system will have on
the organization. The amount of fund that the company can pour into the research and
development of the system is limited. The expenditures must be justified. Thus the developed
system as well within the budget and this was achieved because most of the technologies used
This study is carried out to check the technical feasibility, that is, the technical
requirements of the system. Any system developed must not have a high demand on the
available technical resources. This will lead to high demands on the available technical
resources. This will lead to high demands being placed on the client. The developed system
9|Page
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
must have a modest requirement, as only minimal or null changes are required for
The aspect of study is to check the level of acceptance of the system by the user. This
includes the process of training the user to use the system efficiently. The user must not feel
threatened by the system, instead must accept it as a necessity. The level of acceptance by the
users solely depends on the methods that are employed to educate the user about the system
and to make him familiar with it. His level of confidence must be raised so that he is also able
to make some constructive criticism, which is welcomed, as he is the final user of the system.
10 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
5. SYSTEM DESIGN
11 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
User
12 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
Admin
13 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
Admin
14 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
15 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
Admin
16 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
Admin
17 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
Admin
18 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
5.3.7 ER Diagram
User
Admin
19 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
20 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
6. SYSTEM TESTING
The purpose of testing is to discover errors. Testing is the process of trying to discover
every conceivable fault or weakness in a work product. It provides a way to check the
process of exercising software with the intent of ensuring that the Software system meets its
requirements and user expectations and does not fail in an unacceptable manner. There are
various types of test. Each test type addresses a specific testing requirement.
TYPES OF TESTS
Unit testing
Unit testing involves the design of test cases that validate that the internal program logic
is functioning properly, and that program inputs produce valid outputs. All decision branches
and internal code flow should be validated. It is the testing of individual software units of the
application .it is done after the completion of an individual unit before integration. This is a
structural testing, that relies on knowledge of its construction and is invasive. Unit tests perform
basic tests at component level and test a specific business process, application, and/or system
configuration. Unit tests ensure that each unique path of a business process performs accurately
to the documented specifications and contains clearly defined inputs and expected results.
Integration testing
Integration tests are designed to test integrated software components to determine if
they actually run as one program. Testing is event driven and is more concerned with the basic
outcome of screens or fields. Integration tests demonstrate that although the components were
individually satisfaction, as shown by successfully unit testing, the combination of components
is correct and consistent. Integration testing is specifically aimed at exposing the problems
that arise from the combination of components.
Functional test
Functional tests provide systematic demonstrations that functions tested are available as
specified by the business and technical requirements, system documentation, and user manuals.
21 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
System Test
System testing ensures that the entire integrated software system meets requirements. It tests
a configuration to ensure known and predictable results. An example of system testing is the
configuration oriented system integration test. System testing is based on process descriptions
and flows, emphasizing pre-driven process links and integration points.
22 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
Unit testing is usually conducted as part of a combined code and unit test phase of the
software lifecycle, although it is not uncommon for coding and unit testing to be conducted as
two distinct phases.
Features to be tested
Verify that the entries are of the correct format
No duplicate entries should be allowed
All links should take the user to the correct page.
Test Results: All the test cases mentioned above passed successfully. No defects encountered.
Test Results: All the test cases mentioned above passed successfully. No defects encountered.
23 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
7.CODING
Java Technology
Java technology is both a programming language and a platform.
Multithreaded Simple
Architecture neutral
Object oriented
Portable
Distributed
High performance
Interpreted
Robust
Dynamic
Secure
With most programming languages, you either compile or interpret a program so that
you can run it on your computer. The Java programming language is unusual in that a program
is both compiled and interpreted. With the compiler, first you translate a program into an
intermediate language called Java byte codes the platform-independent codes interpreted by
the interpreter on the Java platform. The interpreter parses and runs each Java byte code
instruction on the computer. Compilation happens just once; interpretation occurs each time
the program is executed. The following figure illustrates how this works.
24 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
You can think of Java byte codes as the machine code instructions for the Java Virtual
Machine (Java VM). Every Java interpreter, whether its a development tool or a Web browser
that can run applets, is an implementation of the Java VM. Java byte codes help make write
once, run anywhere possible. You can compile your program into byte codes on any platform
that has a Java compiler. The byte codes can then be run on any implementation of the Java
VM. That means that as long as a computer has a Java VM, the same program written in the
Java programming language can run on Windows 2000, a Solaris workstation, or on an iMac.
The Java API is a large collection of ready-made software components that provide
many useful capabilities, such as graphical user interface (GUI) widgets. The Java API
is grouped into libraries of related classes and interfaces; these libraries are known as
packages. The next section, What Can Java Technology Do? Highlights what
functionality some of the packages in the Java API provide.
The following figure depicts a program thats running on the Java platform. As the
figure shows, the Java API and the virtual machine insulate the program from the
hardware.
Native code is code that after you compile it, the compiled code runs on a
specific hardware platform. As a platform-independent environment, the Java platform
can be a bit slower than native code. However, smart compilers, well-tuned interpreters,
and just-in-time byte code compilers can bring performance close to that of native code
without threatening portability.
However, the Java programming language is not just for writing cute, entertaining
applets for the Web. The general-purpose, high-level Java programming language is
also a powerful software platform. Using the generous API, you can write many types
of programs.
26 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
An application is a standalone program that runs directly on the Java platform. A special
kind of application known as a server serves and supports clients on a network.
Examples of servers are Web servers, proxy servers, mail servers, and print servers.
Another specialized program is a servlet. A servlet can almost be thought of as an applet
that runs on the server side. Java Servlets are a popular choice for building interactive
web applications, replacing the use of CGI scripts. Servlets are similar to applets in that
they are runtime extensions of applications. Instead of working in browsers, though,
servlets run within Java Web servers, configuring or tailoring the server.
How does the API support all these kinds of programs? It does so with packages of
software components that provides a wide range of functionality. Every full
implementation of the Java platform gives you the following features:
The essentials: Objects, strings, threads, numbers, input and output, data
structures, system properties, date and time, and so on.
Applets: The set of conventions used by applets.
Networking: URLs, TCP (Transmission Control Protocol), UDP (User Data
gram Protocol) sockets, and IP (Internet Protocol) addresses.
Internationalization: Help for writing programs that can be localized for users
worldwide. Programs can automatically adapt to specific locales and be
displayed in the appropriate language.
Security: Both low level and high level, including electronic signatures, public
and private key management, access control, and certificates.
Software components: Known as JavaBeansTM, can plug into existing
component architectures.
Object serialization: Allows lightweight persistence and communication via
Remote Method Invocation (RMI).
Java Database Connectivity (JDBCTM): Provides uniform access to a wide
range of relational databases.
The Java platform also has APIs for 2D and 3D graphics, accessibility, servers,
collaboration, telephony, speech, animation, and more. The following figure depicts
what is included in the Java 2 SDK.
27 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
We cant promise you fame, fortune, or even a job if you learn the Java
programming language. Still, it is likely to make your programs better and requires less
effort than other languages. We believe that Java technology will help you do the
following:
28 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
29 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
and plain text files can be turned into data sources. The operating system uses the Registry
information written by ODBC Administrator to determine which low-level ODBC drivers are needed to
talk to the data source (such as the interface to Oracle or SQL Server). The loading of the ODBC drivers
is transparent to the ODBC application program. In a client/server environment, the ODBC API even
handles many of the network issues for the application programmer.
The advantages of this scheme are so numerous that you are probably thinking there
must be some catch. The only disadvantage of ODBC is that it isnt as efficient as talking
directly to the native database interface. ODBC has had many detractors make the charge that
it is too slow. Microsoft has always claimed that the critical factor in performance is the quality
of the driver software that is used. In our humble opinion, this is true. The availability of good
ODBC drivers has improved a great deal recently. And anyway, the criticism about
performance is somewhat analogous to those who said that compilers would never match the
speed of pure assembly language. Maybe not, but the compiler (or ODBC) gives you the
opportunity to write cleaner programs, which means you finish sooner. Meanwhile, computers
get faster every year.
JDBC
In an effort to set an independent database standard API for Java; Sun Microsystems
developed Java Database Connectivity, or JDBC. JDBC offers a generic SQL database access
mechanism that provides a consistent interface to a variety of RDBMSs. This consistent
interface is achieved through the use of plug-in database connectivity modules, or drivers. If
a database vendor wishes to have JDBC support, he or she must provide the driver for each
platform that the database and Java run on.
To gain a wider acceptance of JDBC, Sun based JDBCs framework on ODBC. As you
discovered earlier in this chapter, ODBC has widespread support on a variety of platforms.
Basing JDBC on ODBC will allow vendors to bring JDBC drivers to market much faster than
developing a completely new connectivity solution.
JDBC was announced in March of 1996. It was released for a 90 day public review that
ended June 8, 1996. Because of user input, the final JDBC v1.0 specification was released soon
after.
30 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
The remainder of this section will cover enough information about JDBC for you to know what
it is about and how to use it effectively. This is by no means a complete overview of JDBC.
That would fill an entire book.
JDBC Goals
Few software packages are designed without goals in mind. JDBC is one that, because
of its many goals, drove the development of the API. These goals, in conjunction with early
reviewer feedback, have finalized the JDBC class library into a solid framework for building
database applications in Java.
The goals that were set for JDBC are important. They will give you some insight as to why
certain classes and functionalities behave the way they do. The eight design goals for JDBC
are as follows:
2. SQL Conformance
SQL syntax varies as you move from database vendor to database vendor. In an effort
to support a wide variety of vendors, JDBC will allow any query statement to be passed
through it to the underlying database driver. This allows the connectivity module to handle
non-standard functionality in a manner that is suitable for its users.
31 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
5.Keep it simple
This goal probably appears in all software design goal listings. JDBC is no exception.
Sun felt that the design of JDBC should be very simple, allowing for only one method of
completing a task per mechanism. Allowing duplicate functionality only serves to confuse
the users of the API.
And for dynamically updating the cache table we go for MS Access database.
Simple Architecture-neutral
Object-oriented Portable
Distributed High-performance
Interpreted Multithreaded
Robust Dynamic
Secure
Java is also unusual in that each Java program is both compiled and interpreted.
With a compile, you translate a Java program into an intermediate language called
32 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
Java byte codes the platform-independent code instruction is passed and run on the
computer.
Compilation happens just once; interpretation occurs each time the program is
executed. The figure illustrates how this works.
Compilers My Program
You can think of Java byte code as the machine code instructions for the Java
Virtual Machine (Java VM). Every Java interpreter, whether its a Java development
tool or a Web browser that can run Java applets, is an implementation of the Java
VM. The Java VM can also be implemented in hardware.
Java byte codes help make write once, run anywhere possible. You can
compile your Java program into byte codes on my platform that has a Java compiler.
The byte codes can then be run any implementation of the Java VM. For example,
the same Java program can run Windows NT, Solaris, and Macintosh.
33 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
Networking
TCP/IP stack
IP datagrams
34 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
UDP
TCP
Internet addresses
In order to use a service, you must be able to find it. The Internet uses an address
scheme for machines so that they can be located. The address is a 32 bit integer which
gives the IP address. This encodes a network ID and more addressing. The network
ID falls into various classes according to the size of the network address.
Network address
Class A uses 8 bits for the network address with 24 bits left over for other
addressing. Class B uses 16 bit network addressing. Class C uses 24 bit network
addressing and class D uses all 32.
Subnet address
Host address
8 bits are finally used for host addresses within our subnet. This places a limit of
256 machines that can be on the subnet.
35 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
Total address
Port addresses
A service exists on a host, and is identified by its port. This is a 16 bit number.
To send a message to a server, you send it to the port for that service of the host that
it is running on. This is not location transparency! Certain of these ports are "well
known".
Sockets
#include <sys/types.h>
#include <sys/socket.h>
int socket(int family, int type, int protocol);
36 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
J Free Chart
JFreeChart is a free 100% Java chart library that makes it easy for developers
to display professional quality charts in their applications. JFreeChart's extensive
feature set includes:
A flexible design that is easy to extend, and targets both server-side and client-
side applications;
Support for many output types, including Swing components, image files
(including PNG and JPEG), and vector graphics file formats (including PDF, EPS and
SVG);
1. Map Visualizations
Charts showing values that relate to geographical areas. Some examples
include: (a) population density in each state of the United States, (b) income per capita
for each country in Europe, (c) life expectancy in each country of the world. The tasks
in this project include:
Sourcing freely redistributable vector outlines for the countries of the world,
states/provinces in particular countries (USA in particular, but also other areas);
37 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
Implement a new (to JFreeChart) feature for interactive time series charts --- to display a
separate control that shows a small version of ALL the time series data, with a sliding "view"
rectangle that allows you to select the subset of the time series data to display in the main chart.
3. Dashboards
4. Property Editors
The property editor mechanism in JFreeChart only handles a small subset of the
properties that can be set for charts. Extend (or reimplement) this mechanism to provide
greater end-user control over the appearance of the charts.
38 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
Admin:jsp
<body>
<div id="head"></div>
<div id="topPan"><a href="index.html"><img src="images/logo2.jpg" width="204"
height="57" border="0"/></a>
<ul>
<li><span>Home</span></li>
<li><a href="index.html">User</a></li>
<li><a href="admin.jsp">Admin</a></li>
<li><a href="index.html">Logout</a></li>
<li><a href="#">Contact</a></li>
</ul>
39 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
</div>
<div id="headerPan">
<div id="headerPanleft">
<div id="ourblog">
<h2>Upload </h2>
<div id="bodyPan">
40 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
41 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
</form>
</div>
</body>
</html>
File Uploading.jsp:
<body>
<div id="head"></div>
<div id="topPan"><a href="index.html"><img src="images/logo2.jpg" width="204"
height="57" border="0"/></a>
<ul>
<li><span>Home</span></li>
<li><a href="index.html">User</a></li>
<li><a href="admin.jsp">Admin</a></li>
<li><a href="amenu.jsp">Menu</a></li>
<li><a href="index.html">Logout</a></li>
</ul>
</div>
<div id="headerPan">
<div id="headerPanleft">
<div id="ourblog">
<h2>Upload </h2>
42 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
<div id="bodyPan">
43 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
<h3>Latest services</h3>
<ul>
<li><a href="#">Lorem Ipsum has been the industry's standardd</a></li>
<li><a href="#">Text ever since the 1500s, when an unknownpri</a></li>
<li><a href="#">Took a galley of type and scrambled it to make</a> </li>
<li><a href="#">Especimen book. It has</a></li>
</ul>
<p>been the industry's standard dummy text ever since the 1500s, when an
unknown printer when an been took a galley of type and </p>
<div id="rightbodymorenext">
<p class="textposition">sbut also the leap into electronic</p>
<a href="#">more</a></div>
</div>
</div>-->
<div id="ufhead" style="margin-left:250px;">Upload Document File</div>
<table class="tablegen3">
<form action="fileupload1.jsp" method="post" name="f" enctype="multipart/form-data"
onSubmit="return validation();">
try{
Connection
con4=databasecon.getconnection();
st4=con4.createStatement();
String sql4="select * from add_domain";
rs4=st4.executeQuery(sql4);
while(rs4.next()){%>
<option value="<%=rs4.getString("dname")%>">
<%=rs4.getString("dname")%>
</option>
<%}
}
catch (Exception eq4)
{
out.println(eq4.getMessage());
}
%>
</select></td></tr>
44 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
45 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
</td>
</tr>
<tr><td>Choose File</td><td>
<input type="file" name="image" class="bigbutton" width="100" height="30"/>
</td>
</tr>
</table>
</form>
<script src='https://code.jquery.com/jquery-1.10.0.min.js'></script>
<script src="js/index.js"></script></p>
</div>
</body>
</html>
46 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
8.SCREEN SHOTS
47 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
48 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
49 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
50 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
51 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
52 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
9.CONCLUSION
In this paper, we formalize the problem of subgraph skyline analysis (denoted as S 2A)
over large graphs and propose an efficient algorithm to answer S 2A queries. To improve the
efficiency, we propose to partition the data space into grid cells, based on which we carefully
design feature encoding to facilitate the query process. In order to handle the curse of
dimensionality, we propose an attribute cluster-based method. The experimental results on real
datasets validate both the effectiveness and efficiency of our method. As our future work, there
are some issues to be addressed. For example, if the number of subgraph skyline answers is
still quite large, we can define the top-k subgraph skyline ,Since finding subgraphs that exactly
match the queries issued by users is difficult, it is very interesting to redefine the subgraph
skyline by considering the structural similarity as a dimension.
53 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
10.FUTURE ENHANCEMENT
As our future work, there are some issues to be addressed. For example, if the number
of subgraph skyline answers is still quite large, we can define the top-k subgraph skyline, Since
finding subgraphs that exactly match the queries issued by users is difficult, it is very
interesting to redefine the subgraph skyline by considering the structural similarity as a
dimension.
54 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
11.REFERENCES
[1] T. Neumann and G. Weikum, Rdf-3x: a risc-style engine for rdf, PVLDB, vol. 1, no. 1,
2008.
[2] J. Tang, S. Wu, and J. Sun, Confluence: conformity influence in large social networks,
in KDD, 2013.
[3] N. Jayaram, A. Khan, C. Li, X. Yan, and R. Elmasri, Querying knowledge graphs by
example entity tuples, CoRR, vol. abs/1311.2100, 2013.
[4] X. Yu, Y. Sun, P. Zhao, and J. Han, Query-driven discovery of semantically similar
substructures in heterogeneous networks, in KDD, 2012.
[5] L. Zou, J. Mo, L. Chen, M. T. Ozsu, and D. Zhao, gstore: Answering sparql queries via
subgraph matching, PVLDB, vol. 4, no. 8, 2011.
[6] E. P. F. Chan and H. Lim, Optimization and evaluation of shortest path queries, VLDB
J., vol. 16, no. 3, 2007.
[7] P. Zhao and J. Han, On graph query optimization in large networks, PVLDB, vol. 3, no.
1, 2010.
[8] R. Jin, N. Ruan, S. Dey, and J. X. Yu, Scarab: scaling reachability computation on large
graphs, in SIGMOD, 2012.
[9] C. Y. Chan, H. V. Jagadish, K. Tan, A. K. H. Tung, and Z. Zhang, Finding k-dominant
skylines in high dimensional space, in SIGMOD, Chicago, Illinois, USA, June 27-29,
2006, pp. 503514.
[10] Z. Zhang, H. Lu, B. C. Ooi, and A. K. H. Tung, Understanding the meaning of a shifted
sky: a general framework on extending skyline query, VLDB J., vol. 19, no. 2, pp. 181
201, 2010.
[11] K.-L. Tan, P.-K. Eng, and B. C. Ooi, Efficient progressive skyline computation, in VLDB,
2001.
[12] S. Borzs onyi, D. Kossmann, and K. Stocker, The skyline operator, in ICDE, 2001.
[13] Y. Tao, X. Xiao, and J. Pei, Subsky: Efficient computation of skylines in subspaces, in
ICDE, 2006.
[14] C. Sheng and Y. Tao, Worst-case i/o-efficient skyline algorithms, ACM Trans. Database
Syst., vol. 37, no. 4, 2012.
[15] J. R. Ullmann, An algorithm for subgraph isomorphism, J. ACM, vol. 23, no. 1, 1976.
[16] W. Zheng, L. Zou, X. Lian, L. Hong, and D. Zhao, Efficient subgraph skyline search over
large graphs, in CIKM, 2014, pp. 15291538.
[17] S. Yang, Y. Wu, H. Sun, and X. Yan, Schemaless and structureless graph querying,
PVLDB, vol. 7, no. 7, pp. 565576, 2014.
[18] D. Kossmann, F. Ramsak, and S. Rost, Shooting stars in the sky: An online algorithm for
skyline queries, in VLDB, 2002.
[19] D. Papadias, Y. Tao, G. Fu, and B. Seeger, Progressive skyline computation in database
systems, ACM Trans. Database Syst., vol. 30, no. 1, 2005.
[20] W. Jin, A. K. H. Tung, M. Ester, and J. Han, On efficient processing of subspace skyline
queries on high dimensional data, in SSDBM, 2007.
[21] J. Gray, A. Bosworth, A. Layman, and H. Pirahesh, Data cube: A relational aggregation
operator generalizing group-by, cross-tab, and sub-total, in ICDE, 1996.
55 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
[22] J. Pei, W. Jin, M. Ester, and Y. Tao, Catching the best views of skyline: A semantic
approach based on decisive subspaces, in VLDB, 2005.
[23] Y. Yuan, X. Lin, Q. Liu, W. Wang, J. X. Yu, and Q. Zhang, Efficient computation of the
skyline cube, in VLDB, 2005.
[24] W. Jin, M. Ester, Z. Hu, and J. Han, The multi-relational skyline operator, in ICDE, 2007.
[25] A. Vlachou, C. Doulkeridis, and N. Polyzotis, Skyline query processing over joins, in
SIGMOD, 2011.
[26] M. Nagendra and K. S. Candan, Skyline-sensitive joins with lrpruning, in EDBT, 2012.
[27] D. Sun, S. Wu, J. Li, and A. K. H. Tung, Skyline-join in distributed databases, in ICDE
Workshops, 2008.
[28] I. Bartolini, P. Ciaccia, and M. Patella, Salsa: computing the skyline without scanning the
whole sky, in CIKM, 2006.
[29] J. J. Levandoski, M. F. Mokbel, and M. E. Khalefa, Flexpref: A framework for extensible
preference evaluation in database systems, in ICDE, 2010.
[30] V. Raghavan, E. A. Rundensteiner, and S. Srivastava, Skyline and mapping aware join
query evaluation, Inf. Syst., vol. 36, no. 6, 2011.
[31] M. E. Khalefa, M. F. Mokbel, and J. J. Levandoski, Prefjoin: An efficient preference-
aware join operator, in ICDE, 2011.
[32] A. N. Papadopoulos, A. Lyritsis, and Y. Manolopoulos, Skygraph: an algorithm for
important subgraph discovery in relational graphs, Data Min. Knowl. Discov., vol. 17, no.
1, pp. 5776, 2008.
[33] A. Soulet, C. Rassi, M. Plantevit, and B. Cremilleux, Mining dominant patterns in the
sky, in ICDM, 2011, pp. 655664.
[34] P. Shelokar, A. Quirin, and O. Cordon, A multiobjective evolution- ary programming
framework for graph-based data mining, Inf. Sci., vol. 237, pp. 118136, 2013.
[35] Y. Tian and J. M. Patel, Tale: A tool for approximate large graph matching, in ICDE,
2008.
[36] S. Zhang, M. Hu, and J. Yang, Treepi: A novel graph indexing method, in ICDE, 2007.
[37] J. Cheng, Y. Ke, W. Ng, and A. Lu, fg-index: Towards verificationfree query processing
on graph databases. in SIGMOD, 2007.
[38] L. Zou, L. Chen, J. X. Yu, and Y. Lu, A novel spectral coding in a large graph database,
in EDBT, 2008.
[39] K. Zhu, Y. Zhang, X. Lin, G. Zhu, and W. Wang, Nova: A novel and efficient framework
for finding subgraph isomorphism mappings in large graphs, in DASFAA (1), 2010.
[40] A. Khan, Y. Wu, C. C. Aggarwal, and X. Yan, Nema: Fast graph search with label
similarity, PVLDB, vol. 6.
[41] N. Jayaram, A. Khan, C. Li, X. Yan, and R. Elmasri, Querying knowledge graphs by
example entity tuples, IEEE Trans. Knowl. Data Eng., vol. 27, no. 10, pp. 27972811,
2015.
[42] D. Mottin, M. Lissandrini, Y. Velegrakis, and T. Palpanas, Exemplar queries: Give me an
example of what you need, PVLDB, vol. 7, no. 5, pp. 365376, 2014.
[43] Y. Su, S. Yang, H. Sun, M. Srivatsa, S. Kase, M. Vanni, and X. Yan, Exploiting relevance
feedback in knowledge graph search, in SIGKDD, 2015, pp. 11351144.
56 | P a g e
MCA, AIET.
ONLINE SUBGRAPH SKYLINE ANALYSIS OVERKNOWLEDGE GRAPHS
57 | P a g e
MCA, AIET.