Cambridge Judge Business School, Market Research Digital Preservation
Cambridge Judge Business School, Market Research Digital Preservation
Cambridge Judge Business School, Market Research Digital Preservation
Group 12: Yunghan Au, Rizik Kandalaft, Ming Kuang, Savita Nair
Thursday, December 9, 2010
Disclaimer: This work has been undertaken as part of a student educational project and the material should be viewed in this
context. The work does not constitute professional advice and no warranties are made regarding the information presented. The
Authors, Cambridge Judge Business School and its Faculty do not accept any liability for the consequences of any action taken a
result of the work or any recommendations made or inferred.
Agenda
Introduction
Digital Preservation
Open Planets Foundation
Project Objectives
Research Methodology
Secondary Research
Primary Research: Healthcare Industry
Volume of Data
2011 Forecasted Data
Preservation Market Potential
Preservation Cost
$0.25 / GB3
Volume of Data
Generation = 1,530 EB1,2
1,530 EB * 0.35 = 530 EB
% Data Preservation
= 35%1
~ $10 Tn / year
1
Gantz, J.F. (2008) The Diverse and Exploding Digital Universe, IDC White Paper, March. Available at:
http://www.emc.com/collateral/analyst-reports/diverse-exploding-digital-universe.pdf (Accessed: 3 November 2010).
2
EB = 1,024 PB; PB = 1,024 TB; TB = 1,024 GB
3
Schofield, J. (2010) Google finally enters the online storage arena with a free 1GB. Available at:
http://www.guardian.co.uk/technology/blog/2010/jan/12/google-docs-storage-cloud-gdrive (Accessed: 8 December 2010).
Market Size (2 of 2)
Expected
Data Generation Market Size
In 2011
$1.62 Bn1
Third party
storage industry
Data
Preservation
* 60 - 70%2
Data Storage
Minimum
Market Size
in 2011
1
Market Size in 2010 ($1.45 Bn) * CAGR (11.7%)
Source: IDC. (2009) Worldwide Archiving Software 2009-2013 Forecast. ~ $1 Bn Preservation and
Available at: http://www.idc.com/getdoc.jsp?containerId=219114
Access
2
Phone conversation with Jing Wang, Strategic Business Manager, EMC
Research Methodology
Industry Specific Considerations
Financial Services
• Strict data retention regulations around transaction data
Healthcare
• Electronic patient records (EPRs), including family medical history
Manufacturing
• Product life cycle (5-20 years) vs. technology lifecycle (4-5 years)
Refer to Appendices 2-7 for detailed analyses of the industries mentioned in this slide.
Potential Drivers of Demand
Software
Spend
Digital
Preservation
- Fines
Industry Cost of Data - Reputation
Standards Loss - Rework / Lower
productivity
- Customer dissatisfaction
Regulations - Loss of irrecoverable data
Secondary Research Findings
Financial
Financial Aerospace
Aerospace Oil
Oil&&
Relative
RelativePoints
Points Healthcare
Healthcare Manufacturing
Manufacturing Weight
Weight
Services
Services &
&Defense
Defense Gas
Gas
Cost
Costof
ofdata
datareplacement
replacement 33 22 11 22 33 66
Cost
Costof
ofdata
dataloss
loss 33 55 55 44 22 55
Regulations
Regulations
Industry
Industrystandards
Data
standards
DataStorage
Storage//Archiving
Archiving
22
11
55
55
11
44
11
33
00
11
22
55
11
55
33
* 44
33
22
Software
Software Spending
Spending 55 11 00 00 44 11
Total
Total 19
19 18
18 10
10 14
14 18
18
Industry Regulation
Telecommunication Data Retention Directive 2 years
Energy Federal Energy Regulatory Commission 5 years
Indefinite
UK Dept of Health
Healthcare
HIPAA 6 years
Financial Services Authority
6 years
Bank Secrecy Act / AML
5 years
USA Patriot Act
5 years
Financial Services MiFID
5 years
NASD 2210, 2711, 3010, 3110
3 years
SEC 17a(3,4)
6 years
Basel II Accord
5 years
US listed companies Sarbanes-Oxley Act 7 years
Indefinite
US firms in litigation Federal Rules of Civil Procedure
0 5 10 15 20 25 30
Maximum Retention Period
US firms in litigation US listed cos. Telecom Energy Healthcare Financial Services
Healthcare Records
Retention (Years)
Electronic patient records (EPRs), X-ray reports,
Mammography scre ens, Diagnostic Image Data, Indefinite
Environmental monitoring re sults
Source: Records Management, NHS Code of Practice, Part 2 (2 nd Edition). Available at: http://www.dh.gov.uk/ (Accessed: 7 December
2010)
Healthcare (1 of 2)
Surveyed (24)
Archived 80%
Long-term access
(22)
In- Third-Party
house
May use OPF’s tools May pay for a service if
quickly with further available
awareness Future
SWOT Analysis
Target Market
STRENGTHS WEAKNESSES Similar Differen
t
1. Non profit 1. Lack of brand
foundation recognition
2. Flexible service 2. No standardized
Similar
offering product
3. Alliance with 3. Reliance on non-
memory OPF personnel for
institutions service delivery Product
OPPORTUNITIES THREATS
1. First to market 1. Perceived Differen
2. Infant market instability of open-
t
3. High expected source technology
growth in demand 2. Cutbacks
3. Competitors
Consultant Talk
1
IDC. (2010) Worldwide Archiving Software 2009 Vendor Shares. Available at:
http://www.idc.com/research/viewdocsynopsis.jsp?containerId=223466§ionId=null&elementId=null&pageType=SYNOPSIS
(Accessed: 7 December 2010)
Appendix 2: Financial Services
1
Gantz, J.F. (2008) The Diverse and Exploding Digital Universe, IDC White Paper, March. Available at:
http://www.emc.com/collateral/analyst-reports/diverse-exploding-digital-universe.pdf (Accessed: 3 November 2010).
Appendix 3: Healthcare
1
Howard, C.E. (2009) ‘Secure information storage: military and aerospace equipment designers have a wealth of online options
available to fill the ever-increasing need for secure data storage in the field’, Military and Aerospace Electronics, Volume 20,
Issue 12. Factiva [Online]. Available at: http://global.factiva.com (Accessed: 4 November 2010).
Appendix 6: Oil & Gas
1
Gantz, J.F. (2008) The Diverse and Exploding Digital Universe, IDC White Paper, March. Available at:
http://www.emc.com/collateral/analyst-reports/diverse-exploding-digital-universe.pdf (Accessed: 3 November 2010).
Appendix 7: Manufacturing(1 of 2)
1
Gantz, J.F. (2008) The Diverse and Exploding Digital Universe, IDC White Paper, March. Available at:
http://www.emc.com/collateral/analyst-reports/diverse-exploding-digital-universe.pdf (Accessed: 3 November 2010).
Appendix 7: Manufacturing(2 of 2)
Technology
T T T T
Life Cycle
T Technology changes
Financial Services
25
Financial Services
Recover transaction
20 Recover transaction
information and customer trust
information and customer trust
Manufacturing
15 Manufacturing
Reverse engineering costs
Weighted Relative Points
g
es
Ga
ns
in
ar
vic
hc
tu
l&
er
De
alt
ac
lS
Oi
He
uf
e&
cia
an
an
ac
M
sp
Fin
ro
Ae
Appendix 9: Cost of Data Loss
25
Healthcare
Healthcare
Medical history irrecoverable if
Medical history irrecoverable if
20
electronic patient records
electronic patient records
(EPRs) are lost
(EPRs) are lost
15
Software obsolescence found
Software obsolescence found
Weighted Relative Points
e
es
g
Ga
ns
rin
ar
vic
hc
fe
tu
l&
er
De
alt
ac
lS
Oi
He
uf
e&
cia
an
an
ac
M
sp
Fin
ro
Ae
Appendix 10: Software Spend
Healthcare
Communications
Public Sector
18%
Retail
Manufacturing
18%
Chronopolis is a national center organization for the Chronopolis addresses this critical problem (digital
Non-for-profit;
Chronopolis management, long-term preservation, information) and provide the service of long-term data
membership
and promulgation of national digital assets. preservation and access
InApp provides reengineering & migration services InApp is committed to be a world class business
to software and hardware. It helps customers solution provider delivering exceptional value to the For-profit and
InApp
reengineer legacy systems, upgrade current products customer leveraging on an organization culture that offer service
to a new platform, architecture, OS and / or language. promotes innovation and excellence.
The largest European earth observation (EO) data Many users are calling for
Non-for-profit;
ESA-ESRIN provider and operates as the reference European the need to preserve the EO data and ensure long-term
membership
centre for EO payload data exploitation. accessibility.
No.1 company for data storage service and start Offer the best service on data storage For-profit and
EMC
researching on preservation and access offer service
Appendix 13: Consultant Advisors
Government Finance
Cross-sector
1. Business Insights. (2009) ‘Key Players in Enterprise Security – Strategy, performance and SWOT analysis’, Datamonitor
360 [Online]. Available at: http://360.datamonitor.com/ (Accessed: 1 November 2010).
2. Eastwood, G. (2006) ‘The IT Services opportunity in vertical sectors’, Datamonitor 360 [Online]. Available at:
http://360.datamonitor.com/ (Accessed: 1 November 2010).
3. http://www.slideshare.net/rsink/gartner-report-it-spending-2010 (no date). (Accessed: 1 November 2010).
4. Mohammed, A. (2007) ‘Retention challenge’, Computer Weekly, Oct 16, 2007, pp. 26, 28. ABI/INFORM Trade & Industry
[Online]. Available at: http://proquest.umi.com/pqdweb?did=1381419601 (Accessed: 1 November 2010).
5. Dr. Gishner, B. (2006) The Role of ISO 10303 (STEP) in Long Term Data Retention. Available at:
http://edge.cs.drexel.edu/LTKR/Gischner%20-%20STEP%20Presentation%20on%20LTKR.pdf (Accessed: 4 November
2010).
6. Harada, R. (2003) ‘Are you prepared for long-term data preservation? - first in/first out’, Computer Technology Review, Oct
2003 [Online]. Available at: http://findarticles.com/p/articles/mi_m0BRZ/is_10_23/ai_111062977/ (Accessed: 2 November
2010).
7. Ex Libris Ltd. CASE STUDY: DIGITAL PRESERVATION AT THE NATIONAL LIBRARY OF NEW ZEALAND. Preservation:
A Forward-Looking Mission. Available at: http://www.exlibrisgroup.com/files/CaseStudy/SunPreservationandNLNZ.pdf
(Accessed: 29 October 2010).
8. Gantz, J.F. (2008) The Diverse and Exploding Digital Universe, IDC White Paper, March. Available at:
http://www.emc.com/collateral/analyst-reports/diverse-exploding-digital-universe.pdf (Accessed: 3 November 2010).
9. Iron Mountain Incorporated. (2010) New Insights in Records Management Compliance. Available at:
http://www.ironmountain.com/webcasts/New-Insights-in-Records-Management-Compliance.html (Accessed: 29 November
2010).
10. Caldwell, F. (2010) Magic Quadrant for Enterprise Governance, Risk and Compliance Platforms. Available at:
http://www.gartner.com/technology/media-products/reprints/metricstream/vol3/article2/article2.html (Accessed: 2 November
2010).
11. InTechnology Ltd. (2004) Making Sense of Data Law, InTechnology Managed Services White Paper, April. Available at:
http://www.intechnology.co.uk/documents/whitepapers/MakingSense_DataLaw.pdf (Accessed: 2 November 2010).
Appendix 15: References (2 of 4)
12. Bleicher, P. (2002) Diamonds May Be Forever, But Data? – Applied Clinical Trials. Available at:
http://appliedclinicaltrialsonline.findpharma.com/appliedclinicaltrials/article/articleDetail.jsp?id=87252 (Accessed: 1
December 2010).
13. Covington & Burling LLP. (2006) White Paper on Data Retention of Voice mail for Regulated and Unregulated Industries in
the U.S. and E.U. Microsoft Corporation. Available at: http://download.microsoft.com (Accessed: 30 October 2010).
14. Clark, E. (2004) ‘Data retention regulations: keeping it legal’, Network Magazine, Mar 2004, 19 (3), pp. 24-30.
ABI/INFORM Trade & Industry [Online]. Available at: http://proquest.umi.com/pqdweb?did=1381419601 (Accessed: 7
November 2010).
15. Brooks, P.K. (1997). ‘Records Retention: Producing value from compliance’, ABA Bank Compliance, May/Jun 1997, 18 (5),
pp. 42. ABI/INFORM Global [Online]. Available at: http://proquest.umi.com/pqdweb?did=1381419601 (Accessed: 7
November 2010).
16. NetIQ Corporation. (2006) Addressing Regulatory Compliance in the Healthcare Industry. Available at:
http://download.netiq.com/CMS/WHITEPAPER/NetIQIndustryWP_HC.pdf (Accessed: 7 November 2010).
17. Furness, V. (2007). ‘The future of regulatory compliance - Strategies for a second wave of regulation’, Datamonitor 360
[Online]. Available at: http://360.datamonitor.com/ (Accessed: 1 November 2010).
18. Harris, R. (2010) ‘100 year data preservation’, ZDNet, 22 September. Available at:
http://www.zdnet.com/blog/storage/100-year-data-preservation/1093?utm_source=feedburner&utm_medium=feed&utm_cam
paign=Feed%3A+ZDNetBlogs+%28ZDNet+All+Blogs%29
(Accessed: 15 November 2010).
19. Crandall, W. (2008) ‘Best Practices in ECM Backup and Recovery’, Business TechEdge, August 2008 [Online]. Available at:
http://techedgemag.com/content/ContentCT.asp?P=275 (Accessed: 17 November 2010).
20. MessageSolution, Inc. Compliance - Regulations Overview. Available at:
http://www.messagesolution.com/resources.htm#frcp (no date) (Accessed: 17 November 2010).
21. CA Technologies. (2010) The Avoidable Cost of Downtime, CA Technology Research Report. Available at:
http://www.arcserve.com/files/supportingpieces/acd_report_100908_244254.pdf (Accessed: 5 November 2010).
Appendix 15: References (3 of 4)
22. University of Miami Leonard M. Miller School of Medicine. Data retention: Selected requirements by data type –
Information Technology – Miller School of Medicine at the University of Miami. Available at:
http://it.med.miami.edu/x1312.xml (no date) (Accessed: 27 November 2010).
23. http://www.metricstream.com/solutions/ferc_compliance.htm (no date) (Accessed: 27 November 2010).
24. Raffo, D. (2010) Disaster recovery/business continuity planning top spending priority, according to Forrester. Available at:
http://searchstorage.techtarget.co.uk/news/article/0,289142,sid181_gci1519940,00.html (Accessed: 27 November 2010).
25. Lyman, P. (2002) ‘Archiving the World Wide Web’, Building a National Strategy for Digital Preservation, April. Available
at: http://www.clir.org/pubs/reports/pub106/web.html (Accessed: 8 November 2010).
26. FaceTime Communications, Inc. Instant Messaging and Social Media Regulatory Requirements. Available at:
http://www3.facetime.com/solutions/regulatoryrequirements.aspx (no date) (Accessed: 27 November 2010).
27. ALSTON + BIRD LLP. What are the HIPAA rules? Available at: http://www.klsecurity.com/hipaa_regulation.htm (no date)
(Accessed: 27 November 2010).
28. Moffet, B.L. (2004) Timeframes for Document Retention. Available at:
http://www.gfrlaw.com/pubs/GordonPubDetail.aspx?xpST=PubDetail&pub=96 (Accessed: 3 December 2010).
29. Breaux, T.D. (2008) ‘Analyzing Regulatory Rules for Privacy and Security Requirements’, IEEE TRANSACTIONS ON
SOFTWARE ENGINEERING, VOL. 34, NO. 1, JANUARY/FEBRUARY 2008 [Online]. Available at:
http://www.computer.org/portal/web/csdl/doi/10.1109/TSE.2007.70746 (Accessed: 6 December 2010).
30. Swartz, N.. (2007). ‘UK Stores Phone Data for One Year’, Information Management Journal 41, no. 6: 17. Business Source
Complete, EBSCOhost [Online]. Available at:
http://search.ebscohost.com/login.aspx?direct=true&db=bth&AN=27569860&site=bsi-live (Accessed: 1 November 2010).
31. Lee, J.J. (2005) ‘The Compliance Imperative: Managing Record Retention in a Rapidly Changing Regulatory Environment’,
DM Review, 15(6), pp. 38-41. ABI/INFORM Trade & Industry [Online]. Available at: :
http://proquest.umi.com/pqdweb?did=1381419601 (Accessed: 6 December 2010).
32. Watson Hall Ltd. UK Data Retention Requirements. Available at:
https://www.watsonhall.com/methodology/uk-data-retention-requirements.pl (no date) (Accessed: 6 December 2010).
Appendix 15: References (4 of 4)
33. Preimesberger, C. (2008) ‘ON THE BRINK OF DISASTER. (cover story).’, eWeek, 25, no. 22, pp. 31-38. Business Source
Complete, EBSCOhost [Online]. Available at:
http://search.ebscohost.com/login.aspx?direct=true&db=bth&AN=33986023&site=bsi-live (Accessed: 1 November 2010).
34. Nettleton, E., and Watts, M. (2007) ‘Assessing the costs of data retention in the UK’, Journal of Database Marketing &
Customer Strategy Management, 15, no. 1, pp. 56-59. Business Source Complete, EBSCOhost [Online]. Available at:
http://search.ebscohost.com/login.aspx?direct=true&db=bth&AN=30050546&site=bsi-live (Accessed: 1 November 2010).
35. Lior, A. (2010) ‘Protecting Critical Data’, Dataquest, November 2. Factiva [Online]. Available at: http://global.factiva.com
(Accessed: 6 December 2010).
36. Smith, E.M. (2009) ‘Storage management: What radiologists need to know’, Applied Radiology, Volume 38, Issue 5. Factiva
[Online]. Available at: http://global.factiva.com (Accessed: 2 November 2010).
37. Mitchell, D.R. and Mitchell, J.A. (2006) ‘Status of clinical gene sequencing data reporting and associated risks for
information loss’, Journal of Biomedical Informatics, 40 (2007), pp. 47-54. ScienceDirect [Online]. Available at:
http://www.sciencedirect.com (Accessed: 4 November 2010).
38. Howard, C.E. (2009) ‘Secure information storage: military and aerospace equipment designers have a wealth of online
options available to fill the ever-increasing need for secure data storage in the field’, Military and Aerospace Electronics,
Volume 20, Issue 12. Factiva [Online]. Available at: http://global.factiva.com (Accessed: 4 November 2010).
39. Naccarati, P. (2003) Data Storage: Managing Obsolescence, Adtron Marketing, June. Available at:
http://www.adtron.com/pdf/AdtronMO030606.pdf (Accessed: 3 November 2010).
40. Harris, R. and Olby, N. (2001) ‘Earth observation data archiving in the USA and Europe’, Space Policy, 17 (2001), pp. 35-48.
ScienceDirect [Online]. Available at: http://www.sciencedirect.com (Accessed: 4 November 2010).
41. Scott, R.E. (2007) ‘e-Records in health – Preserving our future’, International Journal of Medical Informatics, 76 (2007), pp.
427-431. ScienceDirect [Online]. Available at: http://www.sciencedirect.com (Accessed: 4 November 2010).
42. Ball, A., Ding, L., and Patel, M. (2008) ‘An approach to accessing product data across system and software revisions’,
Advanced Engineering Informatics, 22 (2008), pp. 222-235. ScienceDirect [Online]. Available at:
http://www.sciencedirect.com (Accessed: 4 November 2010).