Bad Data Handbook

Cleaning Up The Data So You Can Get Back To Work

Nonfiction, Computers, Database Management
Author: Q. Ethan McCallum ISBN: 9781449324971
Publisher: O'Reilly Media Publication: November 7, 2012
Imprint: O'Reilly Media Language: English

What is bad data? Some people consider it a technical phenomenon, like missing values or malformed records, but bad data includes a lot more. In this handbook, data expert Q. Ethan McCallum has gathered 19 colleagues from every corner of the data arena to reveal how they’ve recovered from nasty data problems.

From cranky storage to poor representation to misguided policy, there are many paths to bad data. Bottom line? Bad data is data that gets in the way. This book explains effective ways to get around it.

Among the many topics covered, you’ll discover how to:

  • Test drive your data to see if it’s ready for analysis
  • Work spreadsheet data into a usable form
  • Handle encoding problems that lurk in text data
  • Develop a successful web-scraping effort
  • Use NLP tools to reveal the real sentiment of online reviews
  • Address cloud computing issues that can impact your analysis effort
  • Avoid policies that create data analysis roadblocks
  • Take a systematic approach to data quality analysis
