And we proudly present: The Deregifier! · 2006-08-23 00:26 by Wladimir Palant
After years of regexp mania we now have to go back to simple filters — starting with Adblock Plus 0.7 those are more efficient than regular expressions. This doesn’t mean that regular expressions got any slower, it’s that simple filters got very fast. There are also other advantages a dozen simple filters have over one long regexp. They are simply easier to read, “effective filter” in the blockable items tooltip presents more relevant information (one rule instead of the whole regexp), we get hit counts for every filter that are also more relevant, and we can deativate every simple filter separately if it causes problems.
Yet a number of regular expressions already exist in various filter lists, e.g. in Filterset.G. Splitting them up again is a boring, time-consuming and error-prone job. To make it somewhat simpler I wrote a new web tool: The Deregifier. It gets a filter list as input and tries to translate regular expressions back in simple filters. Of course not all regular expressions can be translated, the effects of the Adblock list optimizer can always be reversed however. As to Filterset.G: the tool managed to get rid of 24 regexps, increasing filter matching performance by 35%. 25 regular expressions are still left, however two of those couldn’t be converted simply because they contained dots in them that G forgot to escape.
Commenting is closed for this article.