Apache Solr 7.5 install (techproducts tutorial) 1

Solr (pronounced “solar”) is an open source enterprise search platform, written in Java, from the Apache Lucene project. Its major features include full-text search, hit highlighting, faceted search, real-time indexing, dynamic clustering, database integration, NoSQL features[2] and rich document (e.g., Word, PDF) handling. Providing distributed search and index replication, Solr is designed for scalability and […]

Information Retrieval chp1 (Information Retrieval, Manning, Raghavan, Schütze) 2

Ad hoc retrieval Let us now consider a more realistic scenario, simultaneously using the opportunity to introduce some terminology and notation. We will refer to the group of documents over which we perform retrieval as the (document) collection . It is sometimes also referred to as a corpus (a body of texts) Suppose each document […]

Information Retrieval chp1 (Information Retrieval, Manning, Raghavan, Schütze) 1

Boolean retrieval The meaning of the term information retrieval can be very broad. Just getting a credit card out of your wallet so that you can type in the card number is a form of information retrieval. However, as an academic field of study, information retrieval might be defined thus: Information retrieval (IR) is finding […]