/  35
 
© ISO 2006 — All rights reserved
ISO TC 46/SC 4 N
Date: 20087-11XX-XX05
ISO/DIS 28500
ISO TC 46/SC 4/WG 12Secretariat: Standards New Zealand
Information and documentation — The WARC File Format
Élément introductif — Élément central — Élément complémentaire
Warning
This document is not an ISO International Standard. It is distributed for review and comment. It is subject tochange without notice and may not be referred to as an International Standard.Recipients of this draft are invited to submit, with their comments, notification of any relevant patent rights of which they are aware and to provide supporting documentation.Document type: International StandardDocument subtype:Document stage: DISDocument language: E
 
IS0/DIS 28500
Copyright notice
This ISO document is a working draft or committee draft and is copyright-protected by ISO. While the reproductionof working drafts or committee drafts in any form for use by participants in the ISO standards developmentprocess is permitted without prior permission from ISO, neither this document nor any extract from it may bereproduced, stored or transmitted in any form for any other purpose without prior written permission from ISO.Requests for permission to reproduce this document for the purpose of selling it should be addressed as shownbelow or to ISO's member body in the country of the requester:[Indicate the full address, telephone number, fax number, telex number, and electronic mail address, asappropriate, of the Copyright Manger of the ISO member body responsible for the secretariat of the TC or SCwithin the framework of which the working document has been prepared.]Reproduction for sales purposes may be subject to royalty payments or a licensing agreement.Violators may be prosecuted.II
© ISO 2006 — All rights reserved
 
ISO/DIS 28500
Contents
Page
1 Scope ............................................................................................................................................ ..............12 Normative references............................................................................................................. ....................13 Terms, definitions and acronyms............................................................................................... ...............23.1 Terms and definitions............................................................................................................................ .......23.1.1 WARC record .......................................................................................................................................... ...23.1.2 WARC record content block ........................................................................................... .........................23.1.3 WARC record payload.............................................................................................................. .................33.1.4 WARC record header ..................................................................................................................... ...........33.1.5 WARC named fields ....................................................................................................................... ...........33.1.6 WARC logical record ......................................................................................................... .......................33.2 Acronyms................................................................................................................................. .....................34 File and record model.............................................................................................................................. ...35 Named fields................................................................................................................... ............................55.1 General.................................................................................................................................... ......................55.2 WARC-Record-ID (mandatory)............................................................................................................. ........65.3 Content-Length (mandatory).............................................................................................. .........................65.4 WARC-Date (mandatory)............................................................................................................. .................65.5 WARC-Type (mandatory)...................................................................................................................... ........65.6 Content-Type....................................................................................................................................... ..........75.7 WARC-Concurrent-To...................................................................................................................... .............75.8 WARC-Block-Digest.................................................................................................................................. ....75.9 WARC-Payload-Digest............................................................................................................................. .....75.10 WARC-IP-Address................................................................................................................................... ....85.11 WARC-Refers-To................................................................................................................... ......................85.12 WARC-Target-URI........................................................................................................................................ 85.13 WARC-Truncated.............................................................................................................. ..........................95.14 WARC-Warcinfo-ID................................................................................................................................... ...95.15 WARC-Filename........................................................................................................................... ...............95.16 WARC-Profile..................................................................................................................... .........................95.17 WARC-Identified-Payload-Type................................................................................................................ 105.18 WARC-Segment-Number................................................................................................................... .......105.19 WARC-Segment-Origin-ID........................................................................................................... .............105.20 WARC-Segment-Total-Length....................................................................................................... ...........106 WARC Record Types.................................................................................................... ............................106.1 General................................................................................................................................. .......................106.2 'warcinfo'........................................................................................................................... ..........................116.3 'response'............................................................................................................................... .....................116.3.1 General..................................................................................................................................................... .116.3.2 for 'http' and 'https' schemes........................................................................................ .........................126.3.3 for other URI schemes............................................................................................................................ .126.4 'resource'............................................................................................................................................ .........126.4.1 General.............................................................................................................................. .......................126.4.2 for 'http' and 'https' schemes........................................................................................ .........................126.4.3 for 'ftp' scheme................................................................................................................. .......................136.4.4 for 'dns' scheme...................................................................................................................................... .136.4.5 for other URI schemes............................................................................................................................ .136.5 'request'......................................................................................................................................... ..............136.5.1 General.............................................................................................................................. .......................136.5.2 for 'http' and 'https' schemes........................................................................................ .........................136.5.3 for other URI schemes............................................................................................................................ .136.6 'metadata'............................................................................................................................... .....................136.7 'revisit'................................................................................................................................ .........................146.7.1 General.............................................................................................................................. .......................14
 © ISO 2006 — All rights reserved
III

Share & Embed

More from this user

Recent Readcasters