You are on page 1of 3

International Journal of Emerging Technologies and Engineering (IJETE)

Volume 3 Issue 5, May 2016, ISSN 2348 8050

REGULAR EXPRESSION BASED FRAMEWORK FOR WEB


SERVICE API DESIGN
Prof. Dinesh Satre1, Shubhravrat Deshpande2,
Sumedh Sodegaonkar3, Sakshi Tickoo4, Akshay Dalavi5
1

Computer Science, MMIT, SPPU Pune University, lohgaon


Information Technology, MMIT, SPPU Pune University, lohgaon

2, 3, 4, 5

and simple user interface. We model a tool Easy


Downloader to download different types files such as
mp3, pdf and etc. It is as the simple graphical interface
between websites and the user. This quality makes it
user friendly and easy to use.With just one touch user
can enjoy easy downloading with this tool.

ABSTRACT
Pattern matching is the technique of searching a string
containing text or binary data for some set of characters
based on a search pattern. When user use a search engine
to look for something on the Web, youre using a simple
version of pattern matching: user criteria is find these
characters. In those environments, user can often
customize the criteria in particular ways, for example, to
search for this or that, to search for this or that but not
the other thing. Pattern matching in Regex, however, can
be even more complicated but ease of use than that.
Using Regex, user can define an incredibly specific set
of search criteria, and do it in an incredibly small
amount of space using a pattern-definition minilanguage called regular expressions.Easy Downloader is
tool to download different types files such as mp3, pdf
and etc. Itas the simple graphical interface between
websites and the user (Application user interface).This
quality makes it user friendly and easy to use.With just
one touch user can enjoy easy downloading with this
tool.
Keywords- API, Beautiful
expressions, type system.

I.

Soup,

regex,

II.

Before an xml document (database) can be accessed and


manipulated it must be loaded into an XML DOM
object. (it can be accessed with JavaScript). It creates
object for every query hence its lengthy process.
Regular expression or regex is used instead of using
DOM parser. A pattern is set of objects with some
recognizable property. It determines correlation
variables for pattern matching.

III.

PROPOSED SYSTEM REGULAR


EXPRESSION

Regular expressions (called REs, or regexes, or regex


patterns) are essentially a tiny, highly specialized
programming language embedded inside Python and
made available through The remodule. Using this little
language, users specify the rules for the set of possible
strings that you want to match. The regular expression
type system expresses two different varieties of
information. First, the type system distinguishes between
Strings that are syntactically valid regular expressions
and Strings that might not be. This enables the type
system to verify that a syntactically valid regular
expression is used where required. Second, the type
system tracks the number of capturing categories in each
regular expression. This enables the type system to
verify correct parameter usage for method calls that take
a resizing group number.

regular

INTRODUCTION

Advances in information and communications


technology have increased the popularity of mobile
devices.This in turn is motivating the development of a
growing number of mobile applications as well as
Websites and services, which are having a profound
effect on peoples lifestyles.When one has to download
music, ebooks and etc, the person has to go through
manual intervention. The process is time consuming and
boring.Despite of that one is not sure if he/she will find
desired files.
This observation motivates us to explore how to
minimize with the idea of building such tool which will
make users task easier.This tool will help users to save
their time and reduce manual intervention with attractive

LITERATURE SURVEY

IV.

ARCHITECTURE
SYSTEM

OF

PROPOSED

36
www.ijete.org

International Journal of Emerging Technologies and Engineering (IJETE)


Volume 3 Issue 5, May 2016, ISSN 2348 8050

The operations performed in module1 are like parsing


query, pattern matching and detecting loose or strict
schema. In this module user enters the query and
Android Smartphone to get resultant links.User can
select links from list.

BLOCK DIAGRAM OF SYSTEM

MODULE 2

Figure 1: Block Diagram

ARCHITECTURE OF PROPOSED SYSTEM

Figure 3: Module 2

In module2 after connecting to source server, links are


fetched and with links labels. These labels are parsed to
list view adapter i.e. module 3.
MODULE 3

The proposed system basically divided into 3 modules.


These module discussed in above. The first module
working of The Machine. The Second module for
fetching links and labels. The third module for list view
adapter.
MODULE 1

Figure 4: Module 3

In module 3, Fetched links and labels are parsed to list


view adapter. List view adapter filters parameters such
as Title, time, size etc file attributes.

V.

ALGORITHM

Lets consider for following HTML tag:

<a href='mysite.com' target='_blank'>hello</a>


1. First get the value from a tag Result : a
href='mysite.com' target='_blank'

Figure 2: Module 1

37
www.ijete.org

International Journal of Emerging Technologies and Engineering (IJETE)


Volume 3 Issue 5, May 2016, ISSN 2348 8050

2. Later get the link from above extracted


value Result : mysite.com

diverse ways. The regular expression language is


relatively small and confined, so not all possible string
processing tasks can be done using regular expressions.
There are also tasks that can be done with regular
expressions, but the expressions turn out to be very
complicated. In such cases, user may be better off
writing Python code to the further processing; while
Python code will be lengthy than a tangled regular
expression, it will also probably be more comprehensive.
Hence we have implemented regular expression based
framework which can minimize users manual
intervention and it iss easy to use.

Regular Expression Pattern:


1. Extract A tag Regular Expression Pattern
General tag extraction:
(?i)<a([^>]+)>(.+?)</a>
#start of group #1

REFERENCES
?i
insensive

# all checking are case

#end of group #1

<a

#start with "<a"

# start of group #2

[^>]+
# anything except (">"), at
least one character
)

# end of group #2

>

(.+?) #
</a>

[1] N. Broberg, A. Farre, and J. Svenningsson. Regular


expression patterns. In ICFP, pages 67a78.
[2] Aiken, A. and Murphy, B. R. Implementing regular
tree expressions. In Proceedings of Functional
Programming and Computer Architecture, J. Hughes,
Ed. Lecture Notes in Computer Science, vol. 523.
Springer - Verlag.
[3] Hosoya, H. Regular expression pattern matching a a
simpler design n. Tech. Rep. 1397,RIMS, Kyoto
University.
[4]http://www.crummy.com/software/BeautifulSoup/bs4
/doc/online documents, updated.

follow by ">"

match anything
#

end with "</a>

Hence we will get the desired result i.e. text


in anchor tag.

VI.

CONCLUSION

Regular expressions are essentially a small, highly


specialized programming language embedded inside the
Python and made available. Using this language, user
may specify the rules for the set of possible strings that
he/she want to match; this set might contain TeX
commands, or e-mail addresses or English sentences, or
anything user want. You can then ask questions such as
Does this string match the patterns?, or Is there any
match for the pattern anywhere in this string?. You can
also use REGEXs to modify a string or to split it apart in
38
www.ijete.org