Search Books

Text Processing Basics

Author Arun Jagota
📄 Viewing lite version Full site ›
🌎 Shop on Amazon — choose country
Price not listed
🛒 Buy New on Amazon 🇺🇸
Share:
Book Details
Author(s)Arun Jagota
ISBN / ASINB00HBF3TVO
ISBN-13978B00HBF3TV3
Sales Rank759,120
MarketplaceUnited States 🇺🇸

Description

This book covers basic concepts of text processing with an emphasis on methods used in information retrieval, document matching and clustering, and natural language processing (NLP). It starts by defining the concepts of tokenization, n-grams, shingles, and text similarity measures. It then introduces core problems in NLP and in text matching and clustering, followed by algorithms for them. Covered problems and algorithms include those for text clustering, text classification, topic modeling, sequence modeling, and information extraction.