https://www.ebooknetworking.net/books_detail-B000RR4ASY.html

Text summarization using a trainable summarizer and latent semantic analysis [An article from: Information Processing and Management]

Name: Text summarization using a trainable summarizer and latent semantic analysis [An article from: Information Processing and Management]
Author: J.Y. Yeh, H.R. Ke, W.P. Yang, I.H. Meng
ISBN: 978B000RR4AS0

AuthorJ.Y. Yeh, H.R. Ke, W.P. Yang, I.H. Meng

PublisherElsevier

Shop on Amazon — choose your country

🇺🇸 USA 🇨🇦 Canada 🇬🇧 UK 🇩🇪 Germany 🇫🇷 France 🇮🇳 India

Buy New on Amazon 🇬🇧

Book Details

Author(s)J.Y. Yeh, H.R. Ke, W.P. Yang, I.H. Meng

PublisherElsevier

ISBN / ASINB000RR4ASY

ISBN-13978B000RR4AS0

MarketplaceUnited Kingdom 🇬🇧

Description

This digital document is a journal article from Information Processing and Management, published by Elsevier in 2005. The article is delivered in HTML format and is available in your Amazon.com Media Library immediately after purchase. You can view it with any web browser.

Description:
This paper proposes two approaches to address text summarization: modified corpus-based approach (MCBA) and LSA-based T.R.M. approach (LSA+T.R.M.). The first is a trainable summarizer, which takes into account several features, including position, positive keyword, negative keyword, centrality, and the resemblance to the title, to generate summaries. Two new ideas are exploited: (1) sentence positions are ranked to emphasize the significances of different sentence positions, and (2) the score function is trained by the genetic algorithm (GA) to obtain a suitable combination of feature weights. The second uses latent semantic analysis (LSA) to derive the semantic matrix of a document or a corpus and uses semantic sentence representation to construct a semantic text relationship map. We evaluate LSA+T.R.M. both with single documents and at the corpus level to investigate the competence of LSA in text summarization. The two novel approaches were measured at several compression rates on a data corpus composed of 100 political articles. When the compression rate was 30%, an average f-measure of 49% for MCBA, 52% for MCBA+GA, 44% and 40% for LSA+T.R.M. in single-document and corpus level were achieved respectively.