Web Database Annotation Techniques

This document discusses annotating search results from web databases to semantically label data units for machine processing. It proposes using part-of-speech taggers to automatically assign labels to data within search result records. The software requirements include Windows, SQL and Java, and the hardware requirements are a dual-core processor, 50GB hard disk, 1GB RAM and internet connection. The project is estimated to take 10 weeks to complete, including tasks like literature review, analysis, implementation, results analysis and documentation.

Uploaded by JOY Philip
Copyright © All Rights Reserved

ANNOTATING SEARCH USING WEB DATABASES

1. Abstract
For many search engines, the data encoded in the returned result pages comes from underlying
structured databases. A web database (WDB) returns many search result records (SRRs), and each SRR
contains multiple data units that need to be labeled semantically so that they are machine-processable.

2. Existing System
The existing system is an automatic annotation approach. It first arranges the data units on a web
result page into groups such that the data units in the same group share the same semantic label.
The six annotations are then combined to predict the final annotation label. The last step is wrapper
generation: the generated wrapper is used to annotate new result pages from the same web database.
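How the six annotations are combined is not specified here. As a rough illustration only, the sketch below assumes a simple majority vote: each annotation proposes a label for a group of data units (or null when it has no proposal), and the most frequent proposal becomes the final label. The class name LabelCombiner, the method combine, and the sample labels are hypothetical and are not taken from the existing system.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Illustrative stand-in: majority voting is an assumption, not the documented method.
class LabelCombiner {

    // Returns the label proposed most often. A null entry means "no proposal".
    // Ties are resolved in favour of the label that was proposed first.
    // Returns null when no annotation proposed anything.
    static String combine(List<String> proposals) {
        Map<String, Integer> votes = new LinkedHashMap<>();
        for (String label : proposals) {
            if (label != null) {
                votes.merge(label, 1, Integer::sum);
            }
        }
        String best = null;
        int bestCount = 0;
        for (Map.Entry<String, Integer> entry : votes.entrySet()) {
            if (entry.getValue() > bestCount) {
                best = entry.getKey();
                bestCount = entry.getValue();
            }
        }
        return best;
    }

    public static void main(String[] args) {
        // Made-up proposals from six annotations for one group of data units.
        List<String> proposals =
            Arrays.asList("Title", null, "Title", "Author", null, "Title");
        System.out.println("Final label: " + combine(proposals));
    }
}
```

A weighted or probabilistic combination could replace the vote without changing the surrounding flow: group the data units, collect one proposal per annotation, then reduce the proposals to a single label.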

3. Proposed System
In this paper, we consider how to automatically assign labels to the data units within the SRRs using
Part-of-Speech (POS) taggers. A POS tagger is a piece of software that reads text in some language and
assigns a part of speech, such as noun, verb or adjective, to each word (and other token).
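To make the idea of tagging data units concrete, here is a minimal sketch of a toy tagger. It is only an illustration under stated assumptions: a real POS tagger is a trained statistical model, whereas this one uses a tiny hand-written lexicon and two surface rules. The class name ToyPosTagger, the lexicon, the Penn Treebank style tag names, and the sample data units are all hypothetical, and how tags would be turned into semantic labels is not specified in the proposal.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Toy illustration only; not the tagger used in the proposed system.
class ToyPosTagger {

    // Hypothetical mini-lexicon of function words (Penn Treebank style tags).
    private static final Map<String, String> LEXICON = Map.of(
        "the", "DT", "a", "DT", "of", "IN", "by", "IN", "and", "CC", "to", "TO");

    // Assigns one tag to a single non-empty token.
    static String tagToken(String token) {
        String lower = token.toLowerCase();
        if (LEXICON.containsKey(lower)) {
            return LEXICON.get(lower);
        }
        if (token.matches("\\d+([.,]\\d+)*")) {
            return "CD";  // cardinal number, e.g. a year or a price
        }
        if (Character.isUpperCase(token.charAt(0))) {
            return "NNP"; // capitalised word: treated as a proper noun
        }
        return "NN";      // fallback: common noun
    }

    // Tags every whitespace-separated token of one data unit.
    static List<String> tag(String dataUnit) {
        List<String> tags = new ArrayList<>();
        for (String token : dataUnit.trim().split("\\s+")) {
            if (!token.isEmpty()) {
                tags.add(tagToken(token));
            }
        }
        return tags;
    }

    public static void main(String[] args) {
        // Made-up data units from one search result record (SRR).
        String[] srr = {"Introduction to Databases", "by Jane Doe", "2011", "49.99"};
        for (String unit : srr) {
            System.out.println(unit + " -> " + tag(unit));
        }
    }
}
```

The tag sequence of a data unit (for example, a preposition followed by proper nouns, or a single number) is the kind of signal a labeling step could use to tell apart data units of different types.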

4. Software Requirements
Windows, SQL, Java.

5. Hardware Requirements
Processor – Dual Core, Hard Disk – 50 GB, Memory – 1 GB RAM, Internet connection.

6. Project Activity Duration

Project Activity                  Suggested Duration by Faculty
Literature Survey                 2 weeks
Analysis and Design               2 weeks
Implementation/Experimentation    3 weeks
Result Analysis                   2 weeks
Documentation                     1 week
Overall                           10 weeks
