Information Retrieval Models: Foundations & Relationships

· Springer Nature
电子书
141
评分和评价未经验证  了解详情

关于此电子书

Information Retrieval (IR) models are a core component of IR research and IR systems. The past decade brought a consolidation of the family of IR models, which by 2000 consisted of relatively isolated views on TF-IDF (Term-Frequency times Inverse-Document-Frequency) as the weighting scheme in the vector-space model (VSM), the probabilistic relevance framework (PRF), the binary independence retrieval (BIR) model, BM25 (Best-Match Version 25, the main instantiation of the PRF/BIR), and language modelling (LM). Also, the early 2000s saw the arrival of divergence from randomness (DFR). Regarding intuition and simplicity, though LM is clear from a probabilistic point of view, several people stated: "It is easy to understand TF-IDF and BM25. For LM, however, we understand the math, but we do not fully understand why it works." This book takes a horizontal approach gathering the foundations of TF-IDF, PRF, BIR, Poisson, BM25, LM, probabilistic inference networks (PIN's), and divergence-based models. The aim is to create a consolidated and balanced view on the main models. A particular focus of this book is on the "relationships between models." This includes an overview over the main frameworks (PRF, logical IR, VSM, generalized VSM) and a pairing of TF-IDF with other models. It becomes evident that TF-IDF and LM measure the same, namely the dependence (overlap) between document and query. The Poisson probability helps to establish probabilistic, non-heuristic roots for TF-IDF, and the Poisson parameter, average term frequency, is a binding link between several retrieval models and model parameters. Table of Contents: List of Figures / Preface / Acknowledgments / Introduction / Foundations of IR Models / Relationships Between IR Models / Summary & Research Outlook / Bibliography / Author's Biography / Index

作者简介

Thomas Roelleke holds a Dr rer nat (Ph.D.) and a Diplom der Ingenieur-Informatik (MSc in Engineering & Computer Science) of the University of Dortmund. After school education in Meschede, Germany, he attended the b.i.b., the Nixdorf Computer school for professions in informatics, in Paderborn. Nixdorf Computer awarded him a sales and management trainee program, after which he was appointed as product consultant in the Unix/DB/4GL marketing of Nixdorf Computer. He studied Diplom-Ingenieur-Informatik at the University of Dortmund (UniDo), and was later a lecturer/researcher at UniDo. His research focused on probabilistic reasoning and knowledge representations, hypermedia retrieval, and the integration of retrieval and database technologies. His lecturing included information/database systems, object-oriented design and programming, and software engineering. He obtained his Ph.D. in 1999 for the thesis titled "POOL: A probabilistic object-oriented logic for the representation and retrieval of complex objects - a model for hypermedia retrieval." Since 1999, he has been working as a strategic IT consultant, founder and director of small businesses, research fellow, and lecturer at the Queen Mary University of London (QMUL). Research contributions include a probabilistic relational algebra (PRA), a probabilistic object-oriented logic (POOL), the relational Bayes, a matrix-based framework for IR, a parallel derivation of IR models, a probabilistic interpretation of the BM25-TF based on "semi-subsumed" event occurrences, and theoretical studies of retrieval models. Thomas Roelleke lives in England, in a village in the middle between buzzy London and beautiful East Anglia.

为此电子书评分

欢迎向我们提供反馈意见。

如何阅读

智能手机和平板电脑
只要安装 AndroidiPad/iPhone 版的 Google Play 图书应用,不仅应用内容会自动与您的账号同步,还能让您随时随地在线或离线阅览图书。
笔记本电脑和台式机
您可以使用计算机的网络浏览器聆听您在 Google Play 购买的有声读物。
电子阅读器和其他设备
如果要在 Kobo 电子阅读器等电子墨水屏设备上阅读,您需要下载一个文件,并将其传输到相应设备上。若要将文件传输到受支持的电子阅读器上,请按帮助中心内的详细说明操作。