Search Books
Make: Lego and Arduino Proj… Civic Apps Competition Hand…

Bad Data Handbook: Cleaning Up The Data So You Can Get Back To Work

Author Q. Ethan McCallum
Publisher O'Reilly Media
Category Computers
📄 Viewing lite version Full site ›
🌎 Shop on Amazon — choose country
34.74 39.99 USD
🛒 Buy New on Amazon 🇺🇸 🏷 Buy Used — $6.40

✓ Usually ships in 24 hours

Share:
Book Details
ISBN / ASIN1449321887
ISBN-139781449321888
AvailabilityUsually ships in 24 hours
Sales Rank388,871
CategoryComputers
MarketplaceUnited States 🇺🇸

Description

What is bad data? Some people consider it a technical phenomenon, like missing values or malformed records, but bad data includes a lot more. In this handbook, data expert Q. Ethan McCallum has gathered 19 colleagues from every corner of the data arena to reveal how they’ve recovered from nasty data problems.

From cranky storage to poor representation to misguided policy, there are many paths to bad data. Bottom line? Bad data is data that gets in the way. This book explains effective ways to get around it.

Among the many topics covered, you’ll discover how to:

  • Test drive your data to see if it’s ready for analysis
  • Work spreadsheet data into a usable form
  • Handle encoding problems that lurk in text data
  • Develop a successful web-scraping effort
  • Use NLP tools to reveal the real sentiment of online reviews
  • Address cloud computing issues that can impact your analysis effort
  • Avoid policies that create data analysis roadblocks
  • Take a systematic approach to data quality analysis
The Good Web Site Guide 2006: The Completely Revised, …
View
The Pentium Microprocessor
View
Advanced Intel Microprocessors: 80286, 80386, And 80486
View
Differential Equations: Matrices and Models
View
Digital Experiments: Emphasizing Troubleshooting (Merr…
View
Data Structures for Computer Information Systems
View
The Little LISPer, Third Edition
View
Inside Networks
View
Computer Graphics Using Open GL (2nd Edition)
View