Welcome to Scribd, the world's digital library. Read, publish, and share books and documents. See more
Buy Now $31.99
Standard view
Full view
of .
Save to My Library
Look up keyword
Like this
8Activity
P. 1
Bad Data Handbook: Cleaning Up The Data So You Can Get Back To Work

Bad Data Handbook: Cleaning Up The Data So You Can Get Back To Work

Ratings:

4.0

(1)
|Views: 303 |Likes:

What is bad data? Some people consider it a technical phenomenon, like missing values or malformed records, but bad data includes a lot more. In this handbook, data expert Q. Ethan McCallum has gathered 19 colleagues from every corner of the data arena to reveal how they’ve recovered from nasty data problems.

From cranky storage to poor representation to misguided policy, there are many paths to bad data. Bottom line? Bad data is data that gets in the way. This book explains effective ways to get around it.

Among the many topics covered, you’ll discover how to:

Test drive your data to see if it’s ready for analysis Work spreadsheet data into a usable form Handle encoding problems that lurk in text data Develop a successful web-scraping effort Use NLP tools to reveal the real sentiment of online reviews Address cloud computing issues that can impact your analysis effort Avoid policies that create data analysis roadblocks Take a systematic approach to data quality analysis

What is bad data? Some people consider it a technical phenomenon, like missing values or malformed records, but bad data includes a lot more. In this handbook, data expert Q. Ethan McCallum has gathered 19 colleagues from every corner of the data arena to reveal how they’ve recovered from nasty data problems.

From cranky storage to poor representation to misguided policy, there are many paths to bad data. Bottom line? Bad data is data that gets in the way. This book explains effective ways to get around it.

Among the many topics covered, you’ll discover how to:

Test drive your data to see if it’s ready for analysis Work spreadsheet data into a usable form Handle encoding problems that lurk in text data Develop a successful web-scraping effort Use NLP tools to reveal the real sentiment of online reviews Address cloud computing issues that can impact your analysis effort Avoid policies that create data analysis roadblocks Take a systematic approach to data quality analysis

More info:

Publish date: Nov 7, 2012
Added to Scribd: Nov 08, 2012
Copyright:Traditional Copyright: All rights reservedISBN:9781449324988
List Price: $31.99 Buy Now

Availability:

Read on Scribd mobile: iPhone, iPad and Android.
See more
See less

08/19/2014

264

9781449324988

$31.99

USD

pdf

You're Reading a Free Preview
Pages 11 to 93 are not shown in this preview.
You're Reading a Free Preview
Pages 104 to 111 are not shown in this preview.
You're Reading a Free Preview
Pages 122 to 146 are not shown in this preview.
You're Reading a Free Preview
Pages 157 to 184 are not shown in this preview.
You're Reading a Free Preview
Pages 195 to 264 are not shown in this preview.

Activity (8)

You've already reviewed this. Edit your review.
1 hundred reads
1 thousand reads
dcabo liked this
saibaldas liked this
Dolan McElmurry liked this
ali liked this
rasguli liked this
humblejoe liked this

You're Reading a Free Preview

Download
/*********** DO NOT ALTER ANYTHING BELOW THIS LINE ! ************/ var s_code=s.t();if(s_code)document.write(s_code)//-->