MarkLogic Server

Defense, intelligence, and law enforcement agencies collect a tremendous volume of raw data. For these agencies, the fast, accurate analysis of this raw data to produce actionable intelligence is vital to their national security missions. With the increasing flow of textual content – emails, field reports, immigration records, open source content streams, Web content, and the like – government agencies are forced to reevaluate their analysis strategies. What may have worked before is no longer viable. Solutions based on open, industry-standard XML technologies are providing more powerful ways to integrate, discover, analyze, and share actionable intelligence.

Load content “as is” without predefined DTDs or XML schemas
Automatic conversion to XML makes “shredding” or “chunking” documents a thing of the past by eliminating the time-consuming and costly first step most organizations experience when attempting to format, load, and process content. MarkLogic Server loads XML content “as is” and converts popular document formats including Microsoft Office, HTML and Adobe PDF into structured XML – all without requiring adherence to predefined DTDs or schemas. In addition, MarkLogic Server is compatible with content archiving and interchange initiatives such as the Intelligence Community Metadata Working Group (ICMWG) and the Department of Defense Discovery Metadata Standard (DDMS), enabling full compliance with cross-agency standards for content integration and sharing.

Create powerful, custom content processing pipelines
Using the built-in Content Processing Framework, MarkLogic Server lets organizations define sequences of content processing steps and seamlessly incorporate functions such as document categorization, entity extraction, or linguistic analysis. MarkLogic Server can execute sequences of native XQuery statements plus call out to Web services-enabled external applications within the content processing flow.
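The idea of a content processing pipeline can be illustrated with a minimal sketch. It is written in Python rather than XQuery and does not use the Content Processing Framework or any MarkLogic API. The step functions, the `KNOWN_PLACES` watch list, and the sample report are all hypothetical, standing in for a real entity extractor and classifier.

```python
import xml.etree.ElementTree as ET
from typing import Callable, List

# A processing step takes a document element and returns it, possibly enriched.
Step = Callable[[ET.Element], ET.Element]

# Hypothetical watch list standing in for a real entity extractor.
KNOWN_PLACES = {"Kabul", "Basra"}


def extract_places(doc: ET.Element) -> ET.Element:
    """Append an <entities> element listing known place names found in the text."""
    text = " ".join(doc.itertext())
    entities = ET.SubElement(doc, "entities")
    for place in sorted(KNOWN_PLACES):
        if place in text:
            ET.SubElement(entities, "place").text = place
    return doc


def categorize(doc: ET.Element) -> ET.Element:
    """Set a category attribute using a simple keyword rule (illustrative only)."""
    text = " ".join(doc.itertext()).lower()
    doc.set("category", "logistics" if "convoy" in text else "general")
    return doc


def run_pipeline(doc: ET.Element, steps: List[Step]) -> ET.Element:
    """Apply each step in order, passing the enriched document along."""
    for step in steps:
        doc = step(doc)
    return doc


# The document is loaded as is; no DTD or schema is consulted.
report = ET.fromstring("<report><body>Convoy left Kabul at dawn.</body></report>")
enriched = run_pipeline(report, [extract_places, categorize])
print(ET.tostring(enriched, encoding="unicode"))
```

Each step enriches the XML in place, so later steps and later queries can use the added elements and attributes alongside the original content.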
Support mission-critical, multi-terabyte contentbases
Robust, enterprise-class capabilities allow government organizations to confidently deploy MarkLogic Server for their most mission-critical content assets. MarkLogic Server features high availability, error recovery, and cluster monitoring as well as a comprehensive administrator interface.

Find actionable intelligence within massive content sets
MarkLogic Server™ is the backbone of a powerful content integration, discovery, and analysis system that takes full advantage of XML content through the flexibility of XQuery. Designed and built to enterprise architecture standards, MarkLogic Server enables the seamless integration of COTS and custom analysis tools – such as entity extractors, classifiers, and language analyzers – to enrich content with new, vital information without incurring the time and cost typically associated with multi-vendor product integration. MarkLogic Server can easily handle multi-terabyte content sets.

The industry’s leading XML content server
MarkLogic Server is the industry’s most complete and powerful XML content server. A pure XML system that combines powerful full-text search with the W3C-standard XQuery language, MarkLogic Server unlocks the value of content by enabling government organizations to convert, query, manipulate, and render content.

Mark Logic Corporation
2000 Alameda de las Pulgas, Suite 100
San Mateo, CA 94403
Phone +1 650 655 2300
Fax +1 650 655 2310
www.marklogic.com

Rapidly query and analyze large contentbases
MarkLogic Server combines XML element query, XML proximity search, and full-text search to create a scalable, fast, and complete content retrieval and analysis solution. Use XQuery to perform detailed, highly precise queries and content processing tasks that leverage all XML structural elements, with or without a formal DTD or XML schema. MarkLogic Server delivers millisecond response times against terabyte-scale contentbases.
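As a rough illustration of combining an XML element query with a proximity search over text, the sketch below uses Python's standard library. It is not XQuery and not MarkLogic's search implementation; it performs a linear scan with no indexing. The sample corpus, the element names, and the `near` and `search` helpers are all assumptions made for this example.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical sample content; no DTD or schema is involved.
CORPUS = """
<reports>
  <report id="r1"><source>field</source><body>The shipment crossed the border near the northern checkpoint.</body></report>
  <report id="r2"><source>open</source><body>Border officials discussed a new trade policy; a shipment schedule followed weeks later.</body></report>
  <report id="r3"><source>field</source><body>Routine patrol, nothing to report.</body></report>
</reports>
"""


def words(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def near(tokens, a, b, distance):
    """True if words a and b occur within `distance` token positions of each other."""
    pos_a = [i for i, t in enumerate(tokens) if t == a]
    pos_b = [i for i, t in enumerate(tokens) if t == b]
    return any(abs(i - j) <= distance for i in pos_a for j in pos_b)


def search(root, source, a, b, distance):
    """Element query (filter on <source>) combined with a proximity test on <body>."""
    hits = []
    for report in root.findall(f"./report[source='{source}']"):
        body = report.find("body")
        if body is not None and near(words(body.text or ""), a, b, distance):
            # Return the matching passage itself, not just a document reference.
            hits.append((report.get("id"), body.text))
    return hits


root = ET.fromstring(CORPUS)
for report_id, passage in search(root, "field", "shipment", "border", 5):
    print(report_id, passage)
```

The structural filter narrows the candidates by element value before the text test runs, which is the sense in which the query uses both the content and its structure.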

Implement powerful content discovery and analysis systems

Integrate content from isolated silos
Seamlessly integrate content from multiple sources and formats. Automatically convert content into well-formed XML without requiring adherence to any pre-defined DTD or schema. Avoid the time and cost associated with “shredding” and “chunking” content for use in a relational database. Scale systems to handle multi-terabytes of content.

Mine content to discover details that search engines miss
Retrieve detailed content references with full context, not just links to documents. Unlike a search engine that will respond to a simple keyword query with a list of links to documents that contain the keywords, MarkLogic Server goes straight to the details, providing the exact context required for each query. XQuery searches both the content and the structure of that content. MarkLogic Server pinpoints and returns the specific information sought at the level of granularity required. This query precision eliminates the time-consuming manual process of following search links to evaluate the true relevance of each hit.

Assemble content into meaningful forms for analysis
Query, manipulate, and render content. MarkLogic Server goes beyond the limited content processing capabilities of relational databases, allowing you to not only find and retrieve information of interest, but to automatically combine it with other information of interest to create meaningful content products for analysis. XQuery can isolate and extract specific portions of XML content from multiple sources, analyze and manipulate the extracted content, and deliver content products in multiple formats per varying user and mission requirements. Users can ask for – and get – exactly the results they want, enabling them to more easily repurpose content and create custom documents and views that speed analysis and provide deeper insights.

MarkLogic Server Features

XML and XQuery
XML is revolutionizing the way organizations integrate, store, access, and analyze content. It is an open standard endorsed and supported by many leading technology companies. With cross-agency initiatives such as the Intelligence Community Metadata Working Group, XML is rapidly becoming the standard for information sharing within the defense and intelligence communities, enabling diverse organizations with unique mission requirements to more easily access and mine the vast amount of content-based information that must be sifted to discern vital, actionable intelligence.

XQuery is a query language being developed by the World Wide Web Consortium (W3C) for querying and manipulating XML data. XQuery is a powerful step forward in content processing technology and is on its way to widespread adoption. MarkLogic Server features the industry’s most powerful XQuery implementation that will not only search, query and manipulate content, but also transform and dynamically generate new content.

Ingestion and Conversion
• Schema-independent, automatic indexing of content and structure
• Automatic conversion from:
  - Microsoft Office 97 or later (Word, PowerPoint, Excel)
  - Adobe PDF
  - HTML
• Other format conversions via third-party applications

Content Processing Framework
• Programmable, event-driven automation
• Content processing pipelines
• Web-services integration

Search
• Full-text, relevance-ranked XML search, including ordered/unordered Boolean, proximity, stemming, wildcards, thesaurus, spell check
• Programmable highlighting
• Full-text and XML search via XQuery

XQuery Support
• Complete XQuery specification
• High-performance, optimizing XQuery evaluator
• Dynamic XPath optimization includes multi-step paths, complex predicates, // and unions
• Extensive, built-in libraries support update, search and other functions

Storage
• Multi-terabyte XML scalability
• Transactional element-level update
• Automatic directory creation and management
• XML, text and binary documents
• Metadata sheet for every document
• Failover

Performance and Scalability
• Designed for modern processor architectures
• Multi-threaded, high-performance C++ implementation
• Single host or clustered configuration

APIs and Integration
• Java and .Net APIs
• Embedded HTTP and WebDAV server
• XQuery-level HTTP, SOAP and SMTP access

Administration
• Web-based administrator interface
• Hot, cluster-wide administration, backup and restore
• Role-based security
• Two-minute installation

Operating Systems and Platforms
• Red Hat Linux ES3 on AMD Opteron and x86 architectures
• Windows Server 2000 and Windows Server 2003 on x86 architectures
• Sun Solaris 8 and 9 on SPARC architectures

Standards
• XQuery 1.0 (May 2003)
• XPath 2.0 (May 2003)
• XML 1.0
• XML Namespaces 1.0
• XML Schema 1.0

© Copyright 2005 Mark Logic Corporation, all rights reserved. Mark Logic is a registered trademark and MarkLogic Server is a trademark of Mark Logic Corporation. All other product names mentioned herein are the property of their respective owners. 08/05