You are on page 1of 1

Aligning the Warehouse and the Web

the introduction of an intermediate data-staging layer. and Ontology Reuse. NLDB 2007: 131-142.
Instead of clumsily seeking to combine the highly struc-
Dumbill, E. (2000). The Semantic Web: A Primer. Re-
A
tured warehouse data with the lax and unpredictable
trieved Sept. 2004, 2004, from http://www.xml.com/pub/
web data, the meta-data engine we propose mediates
a/2000/11/01/semanticweb/index.html
between the disparate environments. Key features are
the composition of domain specific queries which are Eysenbach, G. (2003). The Semantic Web and healthcare
further tailor made for individual entries in the suite of consumers: a new challenge and opportunity on the ho-
search engines being utilized. The ability to disregard rizon. Intl J. Healthcare Technology and Management,
irrelevant data through the use of Information Retrieval 5(3/4/5), 194-212.
(IR), Natural Language Processing (NLP) and/or On-
tologies is also a plus. Furthermore the exceptional Hassell, J., Aleman-Meza, B., & Arpinar, I. B. (2006).
independence and flexibility afforded by our model Ontology-Driven Automatic Entity Disambiguation in
will allow for rapid advances as niche-specific search Unstructured Text. Paper presented at the ISWC 2006,
engines and more advanced tools for the warehouse Athens, GA, USA.
become available. Holzinger, W., Kr¨upl, B., & Herzog, M. (2006). Using
Ontologies for Extracting Product Features from Web
Pages. Paper presented at the ISWC 2006, Athens, GA,
REFERENCES USA.

Bergman, M. (August 2001). The deep Web:Surfacing Imhoff, C., Galemmo, N. and Geiger, J. G. (2003). Master-
hidden value. BrightPlanet. Journal of Electronic Pub- ing Data Warehouse Design: Relational and Dimensional
lishing, 7(1). Retrieved from http://beta.brightplanet. Techniques. New York: John Wiley & Sons.
com/deepcontent/tutorials/DeepWeb/index.asp Inmon, W.H. (2002). Building the Data Warehouse, 3rd
Berson, A. and Smith, S.J. (1997). Data Warehousing, ed. New York: John Wiley & Sons.
Data Mining and Olap. New York: McGraw-Hill. Kalfoglou, Y., Alani, H., Schorlemmer, M., & Walton, C.
Chakrabarti, S. (2002). Mining the web: Analysis of Hy- (2004). On the emergent Semantic Web and overlooked
pertext and Semi-Structured Data. New York: Morgan issues. Paper presented at the 3rd International Semantic
Kaufman. Web Conference (ISWC’04).

Crescenzi, V., Mecca, G., & Merialdo, P. (2001). ROAD- Kim, W., et al. (2003). “A Taxonomy of Dirty Data”. Data
RUNNER: Towards Automatic Data Extraction from Mining and Knowledge Discovery, 7, 81-99.
Large Web Sites. Paper presented at the 27th International Kimball, R. and Ross, M. (2002). The Data Warehouse
Conference on Very Large Databases, Rome, Italy. Toolkit: The Complete Guide to Dimensional Modeling,
Daconta, M. C., Obrst, L. J., & Smith, K. T. (2003). The 2nd ed. New York: John Wiley & Sons.
Semantic Web: A Guide to the Future of XML, Web Laender, A. H. F., Ribeiro-Neto, B. A., da Silva, A. S., &
Services, and Knowledge Management: Wiley. Teixeira, J. S. (2002). A Brief Survey of Web Data Extrac-
Day, A. (2004). Data Warehouses. American City & tion Tools. SIGMOD Record, 31(2), 84-93.
County. 119(1), 18. Ladley, J. (March 2007). “Beyond the Data Warehouse:
Decker, S., van Harmelen, F., Broekstra, J., Erdmann, M., A Fresh Look”. DM Review Online. Available at http://
Fensel, D., Horrocks, I., et al. (2000). The Semantic Web: dmreview.com
The Roles of XML and RDF. IEEE Internet Computing, Manning, C.D, Raghavan, P., Schutze, H. (2007). Intro-
4(5), 63-74. duction to Information Retrieval. Cambridge University
Devlin, B. (1998). Meta-data: The Warehouse Atlas. DB2 Press.
Magazine, 3(1), 8-9. Peter, H. & Greenidge, C. (2005) “Data Warehousing
Ding, Y., Lonsdale, D.W., Embley, D.W., Hepp, M., Xu, L. Search Engine”. Encyclopedia of Data Warehousing and
(2007). Generating Ontologies via Language Components



You might also like