SearchDatabase
目录
Lux, Mathias, and Savvas A. Chatzichristofis. “Lire: lucene image retrieval: an extensible java cbir library.” Proceedings of the 16th ACM international conference on Multimedia. 2008. cited by 396.
Paper: LIRe
Summary
- LiRe(Lucence Image Retrieval) is a light weight open source java library for content based image retrival.
- provides common and state of the art global image features and offers means for indexing and retrieval.
- images features:
- color histograms in RGB and HSV space;
- MPEG-7 descriptors scalable color, color layout and edge histogram;
- the Tamura texture features coarseness, contrast and directionality;
- color and edge directivity descriptor, CEDD;
- Fuzzy color and texture histogram, FCTH;
- Auto color correlation feature defined by Huang et.
Indexing
: the signatures or vectors extracted by the feature implementations are wrapped int the documents as text, and add to the lucene index;Search
: ImageSearcher either takes the given query feature or extracts the feature from a query image, then reads documents from the index sequentially and compares them to the query image.
Yang, Peilin, Hui Fang, and Jimmy Lin. “Anserini: Enabling the use of Lucene for information retrieval research.” Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2017.
Paper: Anserini
Summary
- Lucene can handle heterogeneous web collections at scale, but lacks systematic support for evaluation over standard test collections.
- introduces Anserini, a new information retrieval toolkit that aims to provide the best of both worlds, to better align information retrieval practice and research.
- focused on
scalable, multi-threaded inverted indexing
to handle modern web-scale collections, streamlined IR evaluation for ad hoc retrieval on standard test collecitions, and an extensible architecture for multi-stage ranking; - Multi-threaded indexing(wrapper): lucene only provides access to a collection of indexing components that researchers need to assemble together to build and end-to-end indexer, eg. write from scatch custom document processing pipelines, code for managing individual indexing threads, and implementations of load balancing and synchronization procedures.
- Streamlined IR evaluation: parsers for different query formats, a unified driver program for ad hoc experiments that outputs standard trec_eval format.