Skip to content

Under the hood the METAREP search engine

jcvi edited this page Sep 14, 2010 · 10 revisions

METAREP uses the open source enterprise search platform Solr/Lucene for extremely fast querying of large metagenomics datatsets. As of now, we have indexed 68 million documents distributed over 330 index files. Much larger index volumes can be handled as shown by HAthi Trust, a digital library, which currently indexes 227 terabytes of data. Our current Solr server setup peaks at a query response performance of 3,000 search requests per second (benchmark was carried out for a 8 million document index). To improve query response time, METAREP can be configured to run on two Solr servers in a index replication and load balancing set-up.

Clone this wiki locally