Theory and Application of Text-representing Centroids
- Autor:innen:
- |
- Reihe:
- Informatik/ Kommunikation, Band 863
- Verlag:
- 2019
Zusammenfassung
Centroid terms are single, descriptive words that semantically and topically characterise text documents and thus can act as their very compact representation in automated text processing tasks that strongly rely on the semantic similarity of texts. Algorithms to classify and cluster them make use of this information. In this book, the novel, brain- and physicsinspired concept of centroid terms is introduced and deeply discussed. Furthermore, their unique properties and practical usage in major natural language processing and text mining tasks are covered. In this regard, a new graph-based method for their fast calculation is presented as well. In contrast to methods relying on the bag-of-words model, the derived centroid distance measure can uncover
a topical relationship between texts even when their wording differs. As centroid terms can also represent short texts, the presented first fully integrated, P2P-based web search engine, called “WebEngine”, therefore makes heavy use of...
Schlagworte
Publikation durchsuchen
Bibliographische Angaben
- Copyrightjahr
- 2019
- ISBN-Print
- 978-3-18-386310-5
- ISBN-Online
- 978-3-18-686310-2
- Verlag
- VDI Verlag, Düsseldorf
- Reihe
- Informatik/ Kommunikation
- Band
- 863
- Sprache
- Deutsch
- Seiten
- 144
- Produkttyp
- Monographie
Inhaltsverzeichnis
- Titelei/Inhaltsverzeichnis Teilzugriff Seiten I - VI Download Kapitel (PDF)
- a `Librarian of the Web' really needed? Kein Zugriff Seiten 1 - 6 H. Unger
- Centroid Terms as Text Representatives Kein Zugriff Seiten 7 - 26 M. M. Kubek, H. Unger
- Spreading Activation: A Fast Calculation Method for Text Centroids Kein Zugriff Seiten 27 - 38 M. M. Kubek, T. Böhme, H. Unger
- Empiric Experiments with Text-representing Centroids Kein Zugriff Seiten 39 - 54 M. M. Kubek, T. Böhme, H. Unger
- Towards a Librarian of the Web Kein Zugriff Seiten 55 - 78 M. M. Kubek
- Concept Supporting a Resilient, Fault-tolerant and Decentralised Search Kein Zugriff Seiten 79 - 90 H. Unger, M. M. Kubek
- Associative Ring Memory to Support Decentralised Search Kein Zugriff Seiten 91 - 106 H. Unger, M. M. Kubek
- The WebEngine – A Fully Integrated, Decentralised Web Search Engine Kein Zugriff Seiten 107 - 120 M. M. Kubek, H. Unger
- Evolving Text Centroids Kein Zugriff Seiten 121 - 130 H. Unger, M. M. Kubek
- Addendum Kein Zugriff Seiten 131 - 139
- Authors Kein Zugriff Seiten 140 - 144





