OAP00049599

GEORGE W. BUSH PRESIDENTIAL RECORD

OAP00049600

GEORGE W. BUSH PRESIDENTIAL RECORD

OAP00049601

• 10 are

as

4

GEORGE W. BUSH PRESIDENTIAL RECORD

OAP00049602

GEORGE W. BUSH PRESIDENTIAL RECORD

OAP00049603

GEORGE W. BUSH PRESIDENTIAL RECORD

OAP00049604

GEORGE W. BUSH PRESIDENTIAL RECORD

OAP00049605

OAP00049606

We believe that the time and resources dedicated to this have paid off for the current transition and that EOP is better positioned for the next administration to meet PRA requirements.

Recap of why Phase I and Phase II are different than the 2005 analysis:

2005 analysis was a good first pass indicating that EOP had issues with their email inventory that needed to be resolved.

2005 analysis was not able to identify all messages in the inventory for the specific time period. (eg., lower message counts)

Leveraged outside expertise to define approach to inventory, modeling, analysis, and best practices.

New tools. CMDFI limitations including record count max as well as not marking a file when counted so no ability to know if a file was skipped during inventory. Statistical model vs. a 27 day rolling average is used to identify anomalies.

If we are asked to do statistical sampling to prove we have everything:

Nota .• lot.Out theretow~rk witp, •••. T9g.~tarn~t~iflO.f~r~~[()f.2.-.~fgrna w.ith.·95% .confidence level,you need to randomly select andprocessone-in .. ~OOdocuments. ·.lwo~I(j.assume we would narrow the population down tothe redRRAdays fr9.mthe2005analysis,.lookat thatpopulationofemails and then apply the one-in-400 to that. Does that make sense? Jfso, how many is that? I'll have to run this by WHeo.

Current as of 717/08

Page: 9

GEORGE W. BUSH PRESIDENTIAL RECORD

OAP00049607