Mirrors/uBlock - uBlock - Git.je (Gitea)

Mirrors/uBlock

mirror of https://github.com/gorhill/uBlock.git synced 2024-11-17 16:02:33 +01:00

Author	SHA1	Message	Date
Raymond Hill	c3bc2c741d	Add support for `cname` type and `denyallow` option This concerns the static network filtering engine. Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/943 * * * New static network filter type: `cname` By default, network requests which are result of resolving a canonical name are subject to filtering. This filtering can be bypassed by creating exception filters using the `cname` option. For example: @@$cname The filter above tells the network filtering engine to except network requests which fulfill all the following conditions: - network request is blocked - network request is that of an unaliased hostname Filter list authors are discouraged from using exception filters of `cname` type, unless there no other practical solution such that maintenance burden become the greater issue. Of course, such exception filters should be as narrow as possible, i.e. apply to specific domain, etc. * * New static network filter option: `denyallow` The purpose of `denyallow` is bring default-deny/allow-exceptionally ability into static network filtering arsenal. Example of usage: $3p,script, \ denyallow=x.com\|y.com \ domain=a.com\|b.com The above filter tells the network filtering engine that when the context is `a.com` or `b.com`, block all 3rd-party scripts except those from `x.com` and `y.com`. Essentially, the new `denyallow` option makes it easier to implement default-deny/allow-exceptionally in static filter lists, whereas before this had to be done with unwieldy regular expressions[1], or through the mix of broadly blocking filters along with exception filters[2]. [1] https://hg.adblockplus.org/ruadlist/rev/f362910bc9a0 [2] Typically filters which pattern are of the form `\|http://`	2020-03-15 12:23:25 -04:00
Raymond Hill	3621792f16	Rework/remove remnant of code dependent on localStorage Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/899	2020-02-23 12:18:45 -05:00
Raymond Hill	b784b7d569	Support loading of benchmark dataset in published versions New advanced setting: `benchmarkDatasetURL` Default value: `unset` To specify a URL from where the benchmark dataset will be fetched. This allows to launch benchmark operations from within published versions of uBO, rather than from just a locally built version.	2020-02-21 08:06:52 -05:00
Raymond Hill	5ccf435754	Add `edge-scheme` to default whitelist directives Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/879	2020-02-20 16:43:56 -05:00
Raymond Hill	2b0316440e	First draft of popup panel for Firefox Preview First draft of changes as discussed with Firefox Preview people. In order to allow testing/evaluating these changes, the new advanced setting `uiFlavor` has been added. Default to `unset`; and can currently only be set to `fenix`. The new setting takes effect at launch only. This new setting is not to be mentioned in official documentation for now. This is ongoing work, not open to external feedback.	2020-01-25 09:24:59 -05:00
Raymond Hill	d0738c0835	Visually distinguish canonical names in popup panel Further fine-tuning support for canonical names. Aliased canonical names will be rendered blue in the dynamic filtering pane of the popup panel.	2019-12-31 16:36:51 -05:00
Raymond Hill	91e702cebb	Enable CNAME uncloaking by default Advanced setting `cnameAliasList` has been removed. New advanced settings: cnameUncloak: Boolean Default value: true Description: Whether to CNAME-uncloak hostnames. cnameIgnoreExceptions: Boolean Default value: true Description: Whether to bypass the uncloaking of network requests which were excepted by filters/rules. This is necessary so as to avoid undue breakage by having exception filters being rendered useless as a result of CNAME-uncloaking. For example, `google-analytics.com` uncloaks to `www-google-analytics.l.google.com` and both hostnames appear in Peter Lowe's list, which means exception filters for `google-analytics.com` (to fix site breakage) would be rendered useless as the uncloaking would cause the network request to be ultimately blocked.	2019-12-01 12:05:49 -05:00
Raymond Hill	a16e4161de	Fine tune hostname uncloaking through CNAME-lookup Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/780 Related commit: - https://github.com/gorhill/uBlock/commit/3a564c199260 This adds two new advanced settings: - cnameIgnoreRootDocument - Default to `true` - Tells uBO to skip CNAME-lookup for root document. - cnameReplayFullURL - Default to `false` - Tells uBO whether to replay the whole URL or just the origin part of it. Replaying only the origin part is meant to lower undue breakage and improve performance by avoiding repeating the pattern-matching of the whole URL -- which pattern-matching was most likely already accomplished with the original request. This commit is meant to explore enabling CNAME-lookup by default for the next stable release while: - Eliminating a development burden by removing the need to create a new filtering syntax to deal with undesirable CNAME-cloaked hostnames - Eliminating a filter list maintainer burden by removing the need to find/deal with all base domains which engage in undesirable CNAME-cloaked hostnames The hope is that the approach implemented in this commit should require at most a few unbreak rules with no further need for special filtering syntax or filter list maintance efforts.	2019-11-23 13:07:23 -05:00
Raymond Hill	3a564c1992	Add ability to uncloak CNAME records Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/780 New webext permission added: `dns`, which purpose is to allow an extension to fetch the DNS record of specific hostnames, reference documentation: https://developer.mozilla.org/en-US/docs/Mozilla/Add-ons/WebExtensions/API/dns The webext API `dns` is available in Firefox 60+ only. The new API will enable uBO to "uncloak" the actual hostname used in network requests. The ability is currently disabled by default for now -- this is only a first commit related to the above issue to allow advanced users to immediately use the new ability. Four advanced settings have been created to control the uncloaking of actual hostnames: cnameAliasList: a space-separated list of hostnames. Default value: unset => empty list. Special value: * => all hostnames. A space-separated list of hostnames => this tells uBO to "uncloak" the hostnames in the list will. cnameIgnoreList: a space-separated list of hostnames. Default value: unset => empty list. Special value: * => all hostnames. A space-separated list of hostnames => this tells uBO to NOT re-run the network request through uBO's filtering engine with the CNAME hostname. This is useful to exclude commonly used actual hostnames from being re-run through uBO's filtering engine, so as to avoid pointless overhead. cnameIgnore1stParty: boolean. Default value: true. Whether uBO should ignore to re-run a network request through the filtering engine when the CNAME hostname is 1st-party to the alias hostname. cnameMaxTTL: number of minutes. Default value: 120. This tells uBO to clear its CNAME cache after the specified time. For efficiency purpose, uBO will cache alias=>CNAME associations for reuse so as to reduce calls to `browser.dns.resolve`. All the associations will be cleared after the specified time to ensure the map does not grow too large and too ensure uBO uses up to date CNAME information.	2019-11-19 12:05:33 -05:00
Raymond Hill	085a8cdbcc	Fine tune cosmetic filtering badge-related code Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/756 As per feedback: - https://github.com/uBlockOrigin/uBlock-issues/issues/756#issuecomment-549128106	2019-11-03 09:38:36 -05:00
Raymond Hill	571db71318	Fine tune cosmetic filtering badge-related code Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/756 As per various feedbacks: Added an advanced setting to keep the original behavior, which can be potentially costly CPU-wise on some sites: popupCosmeticFilterBadgeSlow Default to `false`. Set to `true` to restore original method of surveying the number of elements hidden as a result of applying cosmetic filtering. As suggested by <https://github.com/gwarser>, skip descendant of nodes which have been found to be a match in order to potentially increase the number of nodes which can be surveyed in the alloted time.	2019-11-02 19:03:07 -04:00
Raymond Hill	a69b301d81	Fine-tune new bidi-trie code Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/761	2019-10-29 10:26:34 -04:00
Raymond Hill	5cc797fb47	Add WASM implementation for BidiTrieContainer.matches() Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/761	2019-10-28 13:57:35 -04:00
Raymond Hill	d7b2d31180	Harden compiled/selfie format change detection at launch Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/759 This commit adds code to rely less on the state of the cache storage to decide whether filter lists should be re-compiled or whether the selfie is currently valid at launch time when a change in compiled/selfie format is detected.	2019-10-27 11:49:05 -04:00
Raymond Hill	7971b22385	Expand bidi-trie usage in static network filtering engine Related issues: - https://github.com/uBlockOrigin/uBlock-issues/issues/761 - https://github.com/uBlockOrigin/uBlock-issues/issues/528 The previous bidi-trie code could only hold filters which are plain pattern, i.e. no wildcard characters, and which had no origin option (`domain=`), right and/or left anchor, and no `csp=` option. Example of filters that could be moved into a bidi-trie data structure: &ad_box_ /w/d/capu.php?z=$script,third-party \|\|liveonlinetv247.com/images/muvixx-150x50-watch-now-in-hd-play-btn.gif Examples of filters that could NOT be moved to a bidi-trie: -adap.$domain=~l-adap.org /tsc.php?&ses= \|\|ibsrv.net/forumsponsor$domain=[...] @@\|\|imgspice.com/jquery.cookie.js\|$script \|\|view.atdmt.com^/iview/$third-party \|\|postimg.cc/image/$csp=[...] Ideally the filters above should be able to be moved to a bidi-trie since they are basically plain patterns, or at least partially moved to a bidi-trie when there is only a single wildcard (i.e. made of two plain patterns). Also, there were two distinct bidi-tries in which plain-pattern filters can be moved to: one for patterns without hostname anchoring and another one for patterns with hostname-anchoring. This was required because the hostname-anchored patterns have an extra condition which is outside the bidi-trie knowledge. This commit expands the number of filters which can be stored in the bidi-trie, and also remove the need to use two distinct bidi-tries. - Added ability to associate a pattern with an integer in the bidi-trie [1]. - The bidi-trie match code passes this externally provided integer when calling an externally provided method used for testing extra conditions that may be present for a plain pattern found to be matching in the bidi-trie. - Decomposed existing filters into smaller logical units: - FilterPlainLeftAnchored => FilterPatternPlain + FilterAnchorLeft - FilterPlainRightAnchored => FilterPatternPlain + FilterAnchorRight - FilterExactMatch => FilterPatternPlain + FilterAnchorLeft + FilterAnchorRight - FilterPlainHnAnchored => FilterPatternPlain + FilterAnchorHn - FilterWildcard1 => FilterPatternPlain + [ FilterPatternLeft or FilterPatternRight ] - FilterWildcard1HnAnchored => FilterPatternPlain + [ FilterPatternLeft or FilterPatternRight ] + FilterAnchorHn - FilterGenericHnAnchored => FilterPatternGeneric + FilterAnchorHn - FilterGenericHnAndRightAnchored => FilterPatternGeneric + FilterAnchorRight + FilterAnchorHn - FilterOriginMixedSet => FilterOriginMissSet + FilterOriginHitSet - Instances of FilterOrigin[...], FilterDataHolder can also be added to a composite filter to represent `domain=` and `csp=` options. - Added a new filter class, FilterComposite, for filters which are a combination of two or more logical units. A FilterComposite instance is a match when all* filters composing it are a match. Since filters are now encoded into combination of smaller units, it becomes possible to extract the FilterPatternPlain component and store it in the bidi-trie, and use the integer as a handle for the remaining extra conditions, if any. Since a single pattern in the bidi-trie may be a component for different filters, the associated integer points to a sequence of extra conditions, and a match occurs as soon as one of the extra conditions (which may itself be a sequence of conditions) is fulfilled. Decomposing filters which are currently single instance into sequences of smaller logical filters means increasing the storage and CPU overhead when evaluating such filters. The CPU overhead is compensated by the fact that more filters can now moved into the bidi-trie, where the first match is efficiently evaluated. The extra conditions have to be evaluated if and only if there is a match in the bidi-trie. The storage overhead is compensated by the bidi-trie's intrinsic nature of merging similar patterns. Furthermore, the storage overhead is reduced by no longer using JavaScript array to store collection of filters (which is what FilterComposite is): the same technique used in [2] is imported to store sequences of filters. A sequence of filters is a sequence of integer pairs where the first integer is an index to an actual filter instance stored in a global array of filters (`filterUnits`), while the second integer is an index to the next pair in the sequence -- which means all sequences of filters are encoded in one single array of integers (`filterSequences` => Uint32Array). As a result, a sequence of filters can be represented by one single integer -- an index to the first pair -- regardless of the number of filters in the sequence. This representation is further leveraged to replace the use of JavaScript array in FilterBucket [3], which used a JavaScript array to store collection of filters. Doing so means there is no more need for FilterPair [4], which purpose was to be a lightweight representation when there was only two filters in a collection. As a result of the above changes, the map of `token` (integer) => filter instance (object) used to associate tokens to filters or collections of filters is replaced with a more efficient map of `token` (integer) to filter unit index (integer) to lookup a filter object from the global `filterUnits` array. Another consequence of using one single global array to store all filter instances means we can reuse existing instances when a logical filter instance is parameter-less, which is the case for FilterAnchorLeft, FilterAnchorRight, FilterAnchorHn, the index to these single instances is reused where needed. `urlTokenizer` now stores the character codes of the scanned URL into a bidi-trie buffer, for reuse when string matching methods are called. New method: `tokenHistogram()`, used to generate histograms of occurrences of token extracted from URLs in built-in benchmark. The top results of the "miss" histogram are used as "bad tokens", i.e. tokens to avoid if possible when compiling filter lists. All plain pattern strings are now stored in the bidi-trie memory buffer, regardless of whether they will be used in the trie proper or not. Three methods have been added to the bidi-trie to test stored string against the URL which is also stored in then bidi-trie. FilterParser is now instanciated on demand and released when no longer used. *** [1] `135a45a878/src/js/strie.js (L120)` [2] `e94024d350` [3] `135a45a878/src/js/static-net-filtering.js (L1630)` [4] `135a45a878/src/js/static-net-filtering.js (L1566)`	2019-10-21 08:15:58 -04:00
Raymond Hill	4bf6503f0a	Store `csp=` filters into main data structure This commits make it so that `csp=` filters are now stored in the same data structures as all other static network filters rather than being stored in a separate one. This internal change is motivated by the wish to bring session filters to the static network filtering engine, as has already been done for the static extended filtering engine in the following commit: `59c9a34d34`	2019-09-28 11:30:26 -04:00
Raymond Hill	59c9a34d34	Add ability to quickly create exceptions in logger This is a feature under development, hidden behind a new advanced setting, `filterAuthorMode` which default to `false`. Ability to point-and-click to create temporary exception filters for static extended filters (i.e. cosmetic, scriptlet & html filters) from within the summary pane in the logger. The button to toggle on/off temporary exception filter is labeled `#@#`. The created exceptions are temporary and will be lost when restarting uBO, or manually toggling off the exception filters. Creating temporary exception filters does not cause the filter lists to reloaded, and thus there is no overhead in creating/removing these temporary exception filters.	2019-09-24 17:05:03 -04:00
Raymond Hill	010635acd6	Add support for `ping` static filter option Related issue: - https://github.com/gorhill/uBlock/issues/1493 Documentation: - https://help.eyeo.com/adblockplus/how-to-write-filters#type-options Test page: - https://testpages.adblockplus.org/en/filters/ping Additionally, network requests of type `beacon` will be mapped to `ping` by the static filtering engine.	2019-09-22 09:11:55 -04:00
Raymond Hill	eb871ae558	Fix regression in selfie destruction code Related commit: - `915687fddb (diff-73ef8c4664f2ec8c02320d50b2908efdR1100-R1113)` Since selfie destruction is now deferred so as to coallesce burst of call to destroy(), the selfie load code must mind whether there is a pending destruction in order to decide whether the selfie can be safely loaded. Related feedback: - `23c4c80136 (commitcomment-35179834)`	2019-09-21 19:24:47 -04:00
Raymond Hill	23c4c80136	Add support for `elemhide` (through `specifichide`) Related documentation: - https://help.eyeo.com/en/adblockplus/how-to-write-filters#element-hiding Related feedback/discussion: - https://www.reddit.com/r/uBlockOrigin/comments/d6vxzj/ The `elemhide` filter option as per ABP semantic is now supported. Previously uBO would consider `elemhide` to be an alias of `generichide`. The support of `elemhide` is through the convenient conversion of `elemhide` option into existing `generichide` option and new `specifichide` option. The purpose of the new `specifichide` filter option is to disable all specific cosmetic filters, i.e. those who target a specific site. Additionally, for convenience purpose, the filter options `generichide`, `specifichide` and `elemhide` can be aliased using the shorter forms `ghide`, `shide` and `ehide` respectively.	2019-09-21 11:30:38 -04:00
Raymond Hill	917f3620e0	Revisit element picker arguments code No need to store mouse coordinates in background page, thus no need to post mouse coordinates information for every click. Rename/group element picker arguments and popup arguments separately.	2019-09-18 12:17:45 -04:00
Raymond Hill	0051f3b5c7	Work toward modernizing code base: promisification Swathes of code have been converted to use Promises/async/await. More left to do. Related commits: - `eec53c0154` - `915687fddb` - `55cc0c6997` - `e27328f931`	2019-09-16 16:17:48 -04:00
Raymond Hill	93f438f55e	Add advanced setting for extension reload on update Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/717 Related feedback: - https://github.com/uBlockOrigin/uBlock-issues/issues/717#issuecomment-530275655 New advanced setting: `extensionUpdateForceReload` Default value: `false` If set to `true`, the extension will unconditionally reload when an update is available; otherwise the extension will reload only when being explicitly disabled then enabled, or when the browser is restarted.	2019-09-11 08:00:55 -04:00
Raymond Hill	bcf5ac1fee	Add advanced setting to control logger popup type Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/663 The advanced setting `loggerPopupType` has been added, to control the type of window to be used when the logger is launched as a separate window. The default value is `popup`, it can be changed to any of the values documented at: https://developer.mozilla.org/en-US/docs/Mozilla/Add-ons/WebExtensions/API/windows/CreateType	2019-09-06 11:41:07 -04:00
Raymond Hill	5ad809c07d	Code review color badge code Related commit: - `07c950f1e5` Cache [blocking mode, color] pair for fast lookup in subsequent calls.	2019-08-19 09:00:53 -04:00
Raymond Hill	07c950f1e5	Review icon badge color management Related commit & feedback: - `7ff750eaf6` The color value for the icon badge is now "attached" to the blocking profile value. Additionally, as per feedback, `3p` rules will be relaxing before master JavaScript switch rules.	2019-08-11 13:55:39 -04:00
Raymond Hill	7ff750eaf6	Reflect blocking mode in badge color of toolbar icon Related feedback: - https://www.reddit.com/r/uBlockOrigin/comments/cmh910/ Additionally, the `3p` rule has been made distinct from `3p-script`/`3p-frame` for the purpose of "Relax blocking mode" command. The badge color will hint at the current blocking mode. There are four colors for the four following blocking modes: - JavaScript wholly disabled - All 3rd parties blocked - 3rd-party scripts and frames blocked - None of the above The default badge color will be used when JavaScript is not wholly disabled and when there are no rules for `3p`, `3p-script` or `3p-frame`. A new advanced setting has been added to let the user choose the badge colors for the various blocking modes, `blockingProfileColors`. The value must be a sequence of 4 valid CSS color values that match 6 hexadecimal digits prefixed with`#` -- anything else will be ignored.	2019-08-10 10:57:24 -04:00
Raymond Hill	048bfd251c	Add ability to bypass browser cache when fetching a resource Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/682#issuecomment-515197130 The following advanced setting has been added: updateAssetBypassBrowserCache Default to `false`. If set to `true`, uBO will ensure the browser cache is bypassed when fetching a remote resource. This is for the convenience of filter list maintainers who may want to test the latest version of their lists when fetched from their remote location.	2019-07-26 09:52:11 -04:00
Raymond Hill	10fe9fe656	Allow setting `assetsBootstrapLocation` from admin settings Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/666	2019-07-18 10:53:08 -04:00
Raymond Hill	f930da7ad6	Fix regression of reverse-lookup of scriptlet filters in logger Related commit: - `5552d6717d`	2019-07-05 11:44:40 -04:00
Raymond Hill	1fb9845c35	Remove useless code	2019-07-04 14:10:23 -04:00
Raymond Hill	0ba9a35818	Convert more resources as immutable Related commit: - `152cea2dfe`	2019-07-03 14:33:06 -04:00
Raymond Hill	6c34b3c3c9	Use "relax" instead of "toggle" Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/371	2019-06-27 08:16:18 -04:00
Raymond Hill	693687fd74	Add keyboard support for toggling down blocking profile Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/371 By default, no specific keyboard shortcut is predefined, this will have to be assigned by the user. The command name in English is "Toggle blocking profile". The default behavior is to toggle down according to one of the following scenarios. a) If script execution is disabled through the no-scripting switch, the no-scripting switch will be locally toggled so as to allow script execution. The page will be automatically reloaded. b) If script execution is not blocked but the 3rd-party script and/or frame cells are blocked, local no-op rules will be set so as to no longer block 3rd-party scripts and/or frames. The page will be automatically reloaded. Given this, it may take more than one toggle down command to reach the lowest blocking profile, which is one where JavaScript execution is not blocked and 3rd-party scripts and frames resources block rules, if any, are bypassed with local no-op rules. TODO: At this point, I haven't yet decided whether toggling from the lowest profile should restore the original highest blocking profile.	2019-06-26 07:47:14 -04:00
Raymond Hill	9065bbdd48	Code review of whitelisting-related code - Use `Map()` instead of `{}` for internal data structure - Export as array of directives instead of as a string	2019-06-25 11:57:14 -04:00
Raymond Hill	cfc2ce333d	Implement bidirectional plain-string trie The bidirectional trie allows storing the right and left parts of a string into a trie given a pivot position. Releated issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/528 Additionally, the mandatory token-at-index-0 rule for FilterPlainHnAnchored has been lifted, thus allowing the engine to pick a potentially better token at any position in the filter string. *** TODO: Eventually rename `strie.js` to `biditrie.js`. TODO: Fix dump() method, it currently only show the right-hand side of a filter string.	2019-06-18 19:16:39 -04:00
Raymond Hill	72d9758faa	Ensure the "Filter lists" pane is in sync with update status Related issue: - https://github.com/gorhill/uBlock/issues/2394 Additionally, I added a new advanced setting to control how long after launch an auto-update session should be started -- value is in seconds: autoUpdateDelayAfterLaunch 180	2019-05-19 18:31:12 -04:00
Raymond Hill	1caff7429e	Add optional support for generic procedural cosmetic filters Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/131 The new advanced setting and its default value is: allowGenericProceduralFilters false Whenever this setting is toggled, the user is responsible of forcing a reload of all filter lists so as to allow uBO to process differently any existing generic procedural cosmetic filters.	2019-05-18 18:57:32 -04:00
Raymond Hill	3cf71835c4	Set default delay for creating selfie to 3 minutes Related discussion: - https://www.reddit.com/r/uBlockOrigin/comments/bq49zi/	2019-05-18 14:43:44 -04:00
Raymond Hill	f7bbc80717	Improve "Whitelist pane"; remove now useless built-in switch rule Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/214 Built-in whitelist directives are now rendered differently than user-defined whitelist directives. Also, removing a built-in whitelist directive will only cause that directive to be commented out, so that users do not have to remember built-in directives should they want to bring them back. Related issue: https://github.com/uBlockOrigin/uBlock-issues/issues/494 The built-in per-site switch rule `no-scripting: behind-the-scene false` has been removed, it should not ever be needed since there will always be a valid root context for main- and sub-frames.	2019-05-18 14:20:05 -04:00
Raymond Hill	0ca44b847c	Avoid duplicated strings in filterOrigin w/ new approach The new approach is simpler and should benefit selfie serialization/unserialization. This renders stringDeduplicater obsolete -- it has been removed.	2019-05-17 10:13:58 -04:00
Raymond Hill	93f80eedfa	Refactor runtime storage of specific cosmetic filters This was a TODO item: - `07cbae66a4/src/js/cosmetic-filtering.js (L375)` µBlock.staticExtFilteringEngine.HostnameBasedDB has been re-factored to accomodate the storing of specific cosmetic filters. As a result of this refactoring: - Memory usage has been further decreased - Performance of selector retrieval marginally improved - New internal representation opens the door to use a specialized version of HNTrie, which should further improve performance/memory usage	2019-05-14 08:52:34 -04:00
Raymond Hill	915c1f1f3c	Report resources blocked by `csp=` option in logger Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/552	2019-05-11 10:40:34 -04:00
Raymond Hill	12bdd01595	Ensure "Ignore generic cosmetic filters" sticks on Fennec Related issue: - https://www.reddit.com/r/uBlockOrigin/comments/blkudl/ The setting was not sticking at first-install time.	2019-05-11 09:04:13 -04:00
Raymond Hill	0e4fbefd07	Remove unecessary `null` placeholders FilterOriginHitSet et al. The `null` placeholder are not necessary, we can just use default arguments instead, and add the HNTrieContainer references if and only if they are instanciated.	2019-05-01 18:54:11 -04:00
Raymond Hill	adabb56dc9	Do not store impossible to match filters in HNTrie Consider the two following filters: example.com www.example.com This commit make it so that if the first filter is already present in a given HNTrie, the second filter will not be stored, since HNTrie will _always_ return the first filter as a match whenever the hostname to match is example.com or any subdomain of example.com. The detection of such pointless filters is virtually free when adding a hostname to an HNTrie instance (given how data is stored in the trie), so in practice no overhead is incurred to detect such pointless filters. The ability to ignore impossible to match filters in HNTrie instances will _especially_ benefit those using large hosts files. Examples of how this helps using real configurations: - Default lists: 444 filters out of 100,382 were ignored as a result of this commit. - Default lists + "Energized Ultimate Protection": 283,669 filters out of 903,235 were ignored as a result of this commit. Side note: There was no measurable difference between the two configurations above in the performance of the matching algorithm as reported by the built-in benchmark tool.	2019-04-29 13:15:16 -04:00
Raymond Hill	ac58b8e688	Make token hashes fit within a 32-bit integer The staticNetFilteringEngine uses token hashes to store/lookup filters into Map objects. Before this commit, the tokens were encoded into token hashes as JS numbers (not exceeding MAX_SAFE_INTEGER) using at most the 8 first characters of the token. With this commit, token hashes are now restricted to fit into 32-bit integers, and are derived from at most the 7 first characters. This improves filter look-up performance as per built-in benchmark().	2019-04-28 10:15:15 -04:00
Raymond Hill	96dce22218	Increase resolution of known-token lookup table Related commit: - `69a43e07c4` Using 32 bits of token hash rather than just the 16 lower bits does help discard more unknown tokens. Using the default filter lists, the known-token lookup table is populated by 12,276 entries, out of 65,536, thus making the case that theoretically there is a lot of possible tokens which can be discarded. In practice, running the built-in staticNetFilteringEngine.benchmark() with default filter lists, I find that 1,518,929 tokens were skipped out of 4,441,891 extracted tokens, or 34%.	2019-04-27 08:18:01 -04:00
Raymond Hill	69a43e07c4	Ignore unknown tokens in urlTokenizer.getTokens() Given that all tokens extracted from one single URL are potentially iterated multiple times in a single URL-matching cycle, it pays to ignore extracted tokens which are known to not be used anywhere in the static filtering engine. The gain in processing a single network request in the static filtering engine can become especially high when dealing with long and random-looking URLs, which URLs have a high likelihood of containing a majority of tokens which are known to not be in use.	2019-04-26 17:14:00 -04:00
Raymond Hill	19ece97b0c	Leverage compile-time token information in new fitler classes Related commit: - `99390390fc` The token information available at compile time can be stored in the filter to be used at match() time. This allows the use of startsWith() rather than a more costly indexOf() call as a first quick test to detect mismatches.	2019-04-26 11:16:47 -04:00

1 2 3 4 5