| New Releases | | • | Multiagent Systems: Algorithmic, Game-Theoretic, and Logical Foundations | | • | Finding Out About: A Cognitive Perspective on Search Engine Technology and the WWW | | • | Computational Forensics: Second International Workshop, IWCF 2008, Washington, DC, USA, August 7-8, 2008, Proceedings (Lecture Notes in Computer Science) | | • | Machine Translation: Its Scope and Limits | | • | Machine Learning for Multimodal Interaction: 5th International Workshop, MLMI 2008, Utrecht, The Netherlands, September 8-10, 2008, Proceedings (Lecture Notes in Computer Science) |
|
|
|
|
Information Retrieval: Algorithms and Heuristics (The Information Retrieval Series)(2nd Edition) | 
enlarge | Authors: David A. Grossman, Ophir Frieder Publisher: Springer Category: Book
List Price: $59.95 Buy New: $38.00 You Save: $21.95 (37%)
New (22) Used (6) from $36.99
Avg. Customer Rating: 8 reviews Sales Rank: 221340
Media: Paperback Edition: 2nd Number Of Items: 1 Pages: 332 Shipping Weight (lbs): 0.8 Dimensions (in): 9.4 x 6.2 x 0.7
ISBN: 1402030045 Dewey Decimal Number: 004 EAN: 9781402030048 ASIN: 1402030045
Publication Date: December 20, 2004 Availability: Usually ships in 1-2 business days Shipping: International shipping available
|
| Editorial Reviews:
Product Description
Interested in how an efficient search engine works? Want to know what algorithms are used to rank resulting documents in response to user requests? The authors answer these and other key information retrieval design and implementation questions. This book is not yet another high level text. Instead, algorithms are thoroughly described, making this book ideally suited for both computer science students and practitioners who work on search-related applications. As stated in the foreword, this book provides a current, broad, and detailed overview of the field and is the only one that does so. Examples are used throughout to illustrate the algorithms. The authors explain how a query is ranked against a document collection using either a single or a combination of retrieval strategies, and how an assortment of utilities are integrated into the query processing scheme to improve these rankings. Methods for building and compressing text indexes, querying and retrieving documents in multiple languages, and using parallel or distributed processing to expedite the search are likewise described. This edition is a major expansion of the one published in 1998. Besides updating the entire book with current techniques, it includes new sections on language models, cross-language information retrieval, peer-to-peer processing, XML search, mediators, and duplicate document detection.
|
| Customer Reviews: Read 3 more reviews...
Easily the best intro to the IR field - Simple, very readable, practical and directly applicable May 21, 2008 Easily the best introduction to the field of IR. Its hallmarks are: 1. very readable (concepts presented in layman terms) and directly applicable (you can literally read and apply the concepts in the real world) 2. excellent survey of the field with an comprehensive compedium of references for further reading (surveys, topical and detailed references) 3. the only book with latest information on IR strategies and utilities - so far (May 2008)
For the learnings you will get out of this book - it is a real BARGAIN. You could easily spend hours and hours of your time just trying to figure out what to read. Its a gem. Its not expensive. Buy it->implement it->realize success from it! buy it now :).
To improve your understanding from a novice level to an intermediatre level, would recommend the following books: A. Introduction 1st book: Information Retrieval (this book) David Grossman and Ophir Frieder B. Modern Information Retrieval by Ricardo Baeza-Yates (more technical and deeper) C. Managing Gigabytes: Compressing and Indexing Documents and Images by by Ian H. Witten, Alistair Moffat, Timothy C. Bell (best for implementing and a decent overview) D. Programming Collective Intelligence: Building Smart Web 2.0 Applications by Toby Segaran (algos/pseudo-code, good utilities and concise summaries) F. Mining the web, by Chakrabarti (excellent intro the field of web mining, with some excellent chapters on IR) G. Books by Salton (vector models and fuzzy sets) H. IR book by C. J. Rijsbergen (probabilistic models; available online for free) I. the book by Lesk et. al. J. Follow-up by reading a ton of papers available on citeseer or via google
Good luck!
Poor content and presentation March 18, 2008 3 out of 4 found this review helpful
This book presents information retrieval in an incomprehensible fashion. The content appears to have been cut and pasted from diverse unrelated sources with no effort put into massaging the parts into a coherent whole. There are several mistakes, typos, and wrong formulas throughout the book. Often concepts and abbreviation pop out without any context. Loose statements that are vague and hard to understand are made in several places. The book also fails to use a consistent style in its presentation. Overall, it does little to explain anything in Information Retrieval.
A good alternative to "Modern Information Retrieval" March 1, 2008 This is a very clear and current book on information storage and retrieval. If you are assigned this book as a textbook in a class, then the book is going to make the task of understanding the material much easier. All of the algorithms are clearly explained and the background material in probability is clearly outlined with good examples and figures. However, I still think I prefer Modern Information Retrieval for the theory of information storage and retrieval. It's out of print, but you can easily find it used and just like in this book, all of the background mathematics is outlined in regards to the algorithms and tasks at hand. The other book I'd recommend is Programming Collective Intelligence: Building Smart Web 2.0 Applications. That book is on the cutting edge of using information retrieval techniques for web applications with plenty of code examples. Even if you go with this book over Modern Information Retrieval as a main source of instruction, Programming Collective Intelligence is a good book about information retrieval in combination with artificial intelligence as it is applied to the web.
Excellent coverage of IR topics February 8, 2008 2 out of 2 found this review helpful
This book provides an excellent blend of theoretical and practical knowledge of the IR field, particularly for those of us with a computer science background, yet no practical working experience in IR. In my opinion, the math is an essential part of expressing the concepts more formally, so it was refreshing to see the authors incorporate just enough formulae, but no more. This book is not going to provide you with a set of recipes for building an indexing or search engine, nor would I expect it do so. However, it does give you an idea of how such engines might be built. Further, I found this book to be a necessary prerequisite for other practitioner-oriented texts, such as Lucene in Action (In Action series). Anyone delving into this field for the first time and attempting to use libraries like Lucene may find it difficult to fully exploit its capabilities without a firm understanding of the theoretical underpinnings of IR.
Mistakes in Bayes explanations January 12, 2008 0 out of 2 found this review helpful
Contains a bad mathematical mistake in section 2.2.1 on page 22. The probability P(win|sunny,good-shortstop) cannot be derived from P(win|sunny) and P(win|good-shortstop). It can take any value, even zero. Suppose, shortstop is a vampire and plays good only on cloudy weather, and on sunny weather he always leads his team to defeat. It doesn't contradict to having positive P(win|sunny) and P(win|good-shortstop).
|
|
|
Copyright 2008 - RailroadBookstore.com
| |