Monthly Archives: September 2014

Sparse facet caching

As explained in "Ten times faster", distributed faceting in standard Solr is two-phase: Each shard performs standard faceting and returns the top limit*1.5+10 terms. The merger calculates the top limit terms. Standard faceting is a two-step process: For each term … Continue reading

Posted in eskildsen, Faceting, Hacking, Low-level, open source, Performance, Solr | 3 Comments

Even sparse faceting is limited

Recently, Andy Jackson from the UK Web Archive discovered a ginormous Pit Of Pain with Solr distributed faceting, where some response times reached 10 minutes. The culprit is facet.limit=100 (the number of returned values for each facet is 100), as the … Continue reading

Posted in eskildsen, Faceting, Low-level, Performance, Solr | 3 Comments

A few Highlights from the Digital Libraries #DL2014 conference in London

This post is written at the DL2014 conference in London in September, where it’s warm and sunny 🙂 My visit started at the Digital Preservation Sustainability on the EU Policy Level workshop on Monday. This workshop had an interesting panel … Continue reading

Posted in Conference, Presentations | 1 Comment

Small scale sparse faceting

While sparse faceting has a profound effect on response time in our web-archive, we are a bit doubtful about the number of multi-billion-document Solr indexes out there. Luckily we also have our core index at Statsbiblioteket, which should be … Continue reading

Posted in eskildsen, Faceting, Performance, Solr, Uncategorized | 5 Comments