Category Archives: Faceting

Sampling methods for heuristic faceting

Initial experiments with heuristic faceting in Solr were encouraging: Using just a sample of the result set, it was possible to get correct facet results for large result sets, reducing processing time by an order of magnitude. Alas, further experimentation … Continue reading

Posted in eskildsen, Faceting, Low-level, open source, Performance, Solr | Leave a comment

Dubious guesses, counted correctly

We do have a bit of a performance challenge with heavy faceting on large result sets in our Solr based Net Archive Search. The usual query speed is < 2 seconds, but if the user requests aggregations based on large … Continue reading

Posted in eskildsen, Faceting, Low-level, open source, Performance, Solr | 1 Comment

Heuristically correct top-X facets

For most searches in our Net Archive, we have acceptable response time, due to the use of sparse faceting with Solr. Unfortunately as well as expectedly, some of the searches are slow. Response times in minutes slow, if we’re talking … Continue reading

Posted in eskildsen, Faceting, Hacking, Low-level, Performance, Solr | 1 Comment

Alternative counter tracking

Warning: Bit-fiddling ahead. The initial driver for implementing Sparse Faceting was to have extraction-time scale with the result set size, instead of with the total number of unique values in the index. From a performance point of view, this works … Continue reading

Posted in eskildsen, Faceting, Hacking, Low-level, Performance, Solr | Leave a comment

Measuring N-plane time & space

This article explores the boundaries of the experimental sparse faceting code, both in terms of processing time and in terms of space requirements. The code base has just been updated and the new features are available as a Solr 4.8 … Continue reading

Posted in eskildsen, Faceting, Hacking, Low-level, Performance, Solr, Uncategorized | Leave a comment

Facet filtering

In generation 2 of our net archive search we plan to experiment with real time graphs: We would like to visualize links between resources and locate points of interest based on popularity. Our plan is to use faceting with Solr … Continue reading

Posted in eskildsen, Faceting, Hacking, Low-level, Performance, Solr | Leave a comment

N-plane packed counters for faceting

Faceting in Solr works well out of the box up to some millions of unique values in the facet field. There is a small performance penalty linear to the number of unique values, which begins to show after some point. … Continue reading

Posted in eskildsen, Faceting, Hacking, Low-level, Performance, Solr | 1 Comment