Mirrors/uBlock - uBlock - Git.je (Gitea)

Mirrors/uBlock

mirror of https://github.com/gorhill/uBlock.git synced 2024-11-17 07:52:42 +01:00

Author	SHA1	Message	Date
Raymond Hill	41876336db	Add trusted-source support for privileged scriptlets At the moment, the only filter lists deemed from a "trusted source" are uBO-specific filter lists (i.e. "uBlock filters -- ..."), and the user's own filters from "My filters". A new scriptlet which can only be used by filter lists from trusted sources has been introduced: `sed.js`. The new `sed.js` scriptlet provides the ability to perform text-level substitutions. Usage: example.org##+js(sed, nodeName, pattern, replacement, ...) `nodeName` The name of the node for which the text content must be substituted. Valid node names can be found at: https://developer.mozilla.org/en-US/docs/Web/API/Node/nodeName `pattern` A string or regex to find in the text content of the node as the target of substitution. `replacement` The replacement text. Can be omitted if the goal is to delete the text which matches the pattern. Cannot be omitted if extra pairs of parameters have to be used (see below). Optionally, extra pairs of parameters to modify the behavior of the scriptlet: `condition, pattern` A string or regex which must be found in the text content of the node in order for the substitution to occur. `sedCount, n` This will cause the scriptlet to stop after n instances of substitution. Since a mutation oberver is used by the scriptlet, it's advised to stop it whenever it becomes pointless. Default to zero, which means the scriptlet never stops. `tryCount, n` This will cause the scriptlet to stop after n instances of mutation observer run (regardless of whether a substitution occurred). Default to zero, which means the scriptlet never stops. `log, 1` This will cause the scriptlet to output information at the console, useful as a debugging tool for filter authors. The logging ability is supported only in the dev build of uBO. Examples of usage: example.com##+js(sed, script, /devtoolsDetector\.launch\(\)\;/, , sedCount, 1) example.com##+js(sed, #text, /^Advertisement$/)	2023-05-21 14:16:56 -04:00
Raymond Hill	236fb3ad59	Add scriptlet dependencies to reduce code duplication	2023-03-26 09:13:17 -04:00
Raymond Hill	18a84d2819	Refactor scriptlets injection code Builtin scriptlets are no longer parsed as text-based resources, they are imported as JS functions, and `toString()` is used to obtain text-based representation of a scriptlet. Scriptlet parameters are now passed as function call arguments rather than by replacing text-based occurrences of `{{i}}`. The arguments are always string values (see below for exception). Support for argument as Object has been added. This opens the door to have scriptlets using named arguments rather than positional arguments, and hence easier to extend functionality of existing scriptlets. Example: example.com##+js(scriplet, { "prop": "adblock", "value": false, "log": true }) Compatibility with user-provided scriptlets has been preserved. User-provided scriptlets can benefit some of the changes: Use the form `function(..){..}` instead of `(function(..){..})();` in order to received scriptlet arguments as part of function call -- instead of using `{{i}}`. If using the form `function(..){..}`, you can choose to receive an Object as argument -- just be sure that your scriptlet's parameter is valid JSON notation.	2023-03-24 14:05:18 -04:00
Raymond Hill	4564e3a9b8	Add redirectable resource noop.css, as suggested Related feedback: - https://github.com/uBlockOrigin/uAssets/issues/16391#issuecomment-1396316194	2023-01-23 16:39:46 -05:00
Raymond Hill	0971025b21	Use Blob URLs to reliably inject scriptlets Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/235 Fixed as suggested by <https://github.com/evilpie>, to safely bypass a page's own CSP.	2022-12-11 10:08:10 -05:00
Raymond Hill	985ea24e82	[mv3] Add support for redirect= filters This adds support for `redirect=` filters. As with `removeparam=` filters, `redirect=` filters can only be enforced when the default filtering mode is set to Optimal or Complete, since these filters require broad host permissions to be enforced by the DNR engine. `redirect-rule=` filters are not supported since there is no corresponding DNR syntax. Additionally, fixed the dropping of whole network filters even though those filters are still useful despite not being completely enforceable -- for example a filter with a single (unsupported) domain using entity syntax in its `domain=` option should not be wholly dropped when there are other valid domains in the list.	2022-10-16 12:05:24 -04:00
Raymond Hill	067e128163	Patch google-ima shim script for proper integration into uBO Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/2158 Additionally, added firing of CONTENT_RESUME_REQUESTED event in start() method.	2022-09-11 11:03:47 -04:00
Raymond Hill	a559f5f271	Add experimental mv3 version This create a separate Chromium extension, named "uBO Minus (MV3)". This experimental mv3 version supports only the blocking of network requests through the declarativeNetRequest API, so as to abide by the stated MV3 philosophy of not requiring broad "read/modify data" permission. Accordingly, the extension should not trigger the warning at installation time: Read and change all your data on all websites The consequences of being permission-less are the following: - No cosmetic filtering (##) - No scriptlet injection (##+js) - No redirect= filters - No csp= filters - No removeparam= filters At this point there is no popup panel or options pages. The default filterset correspond to the default filterset of uBO proper: Listset for 'default': https://ublockorigin.github.io/uAssets/filters/badware.txt https://ublockorigin.github.io/uAssets/filters/filters.txt https://ublockorigin.github.io/uAssets/filters/filters-2020.txt https://ublockorigin.github.io/uAssets/filters/filters-2021.txt https://ublockorigin.github.io/uAssets/filters/filters-2022.txt https://ublockorigin.github.io/uAssets/filters/privacy.txt https://ublockorigin.github.io/uAssets/filters/quick-fixes.txt https://ublockorigin.github.io/uAssets/filters/resource-abuse.txt https://ublockorigin.github.io/uAssets/filters/unbreak.txt https://easylist.to/easylist/easylist.txt https://easylist.to/easylist/easyprivacy.txt https://malware-filter.gitlab.io/malware-filter/urlhaus-filter-online.txt https://pgl.yoyo.org/adservers/serverlist.php?hostformat=hosts&showintro=1&mimetype=plaintext The result of the conversion of the filters in all these filter lists is as follow: Ruleset size for 'default': 22245 Good: 21408 Maybe good (regexes): 127 redirect-rule= (discarded): 458 csp= (discarded): 85 removeparams= (discarded): 22 Unsupported: 145 The fact that the number of DNR rules are far lower than the number of network filters reported in uBO comes from the fact that lists-to-rulesets converter does its best to coallesce filters into minimal set of rules. Notably, the DNR's requestDomains condition property allows to create a single DNR rule out of all pure hostname-based filters. Regex-based rules are dynamically added at launch time since they must be validated as valid DNR regexes through isRegexSupported() API call. At this point I consider being permission-less the limiting factor: if broad "read/modify data" permission is to be used, than there is not much point for an MV3 version over MV2, just use the MV2 version if you want to benefit all the features which can't be implemented without broad "read/modify data" permission. To locally build the MV3 extension: make mv3 Then load the resulting extension directory in the browser using the "Load unpacked" button. From now on there will be a uBlock0.mv3.zip package available in each release.	2022-09-06 13:47:52 -04:00
Raymond Hill	c521479ef9	Add 0.5s mp3 redirectable resource Command used to generate 0.5s mp3 file: ffmpeg -ar 48000 -t 0.5 -f s16le -acodec pcm_s16le -ac 2 -i /dev/zero -acodec libmp3lame -aq 4 noop-0.5s.mp3 Related filter issue: - https://github.com/uBlockOrigin/uAssets/issues/14231	2022-08-14 13:06:00 -04:00
Raymond Hill	ef25f30b30	Squashed commit of the following: commit 34a290bdd62013591b17efbd2320698b95925c00 Author: Yuki2718 <58900598+Yuki2718@users.noreply.github.com> Date: Mon Feb 7 19:14:02 2022 +0900 update last commit commit f34ffbcc3d78bc98ee43b015f0ad0dae9d99720e Author: Yuki2718 <58900598+Yuki2718@users.noreply.github.com> Date: Mon Feb 7 19:05:17 2022 +0900 Improve and rename canrunads.js Related issue: - https://github.com/AdguardTeam/Scriptlets/issues/190 Related commit: - `e8bfc9a031`	2022-02-07 07:05:44 -05:00
Raymond Hill	e8bfc9a031	Add canrunads.js as redirectable resource Related discussion: - `036e13101e (commitcomment-65068100)`	2022-01-29 08:40:47 -05:00
Raymond Hill	f4824bd0d9	Add shim for FingerprintJS (aka Fingerprint v3) Related issue: - https://github.com/uBlockOrigin/uAssets/issues/11408	2022-01-21 08:38:48 -05:00
Raymond Hill	d17d634b7c	Define new nobab scriptlet Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/1863 As per internal discussion with team, best to have a simpler scriplet, and which is hard-coded to work only on a specific set of domains -- only those seen used by BAB.	2021-12-08 12:10:18 -05:00
Raymond Hill	33a18c3a1e	Convert fingerprint2.js scriptlet into a redirectable resource As per internal discussion with volunteer filter list maintainers.	2021-09-18 10:55:22 -04:00
Raymond Hill	f8daea085b	Remove `assets` dependency from redirect engine Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/1664 This change allows to add the redirect engine into the nodejs package. The purpose of the redirect engine is to resolve a redirect token into a path to a local resource, to be used by the caller as wished.	2021-08-02 09:23:48 -04:00
Raymond Hill	cb72211795	Move orphanizeString() into text-utils module Another small step toward the goal of reducing dependency on `µb`. Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/1664 text-iterators module has been renamed text-utils to better reflect its content.	2021-07-31 08:38:33 -04:00
Raymond Hill	62b6826dd5	Further modularize uBO's codebase Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/1664 Modularization is a necessary step toward possibly publishing a more complete nodejs package to allow using uBO's filtering capabilities outside of the uBO extension. Additionally, as per feedback, remove undue usage of console output as per feedback: - https://github.com/uBlockOrigin/uBlock-issues/issues/1664#issuecomment-888451032	2021-07-28 19:48:38 -04:00
Raymond Hill	22022f636f	Modularize codebase with export/import Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/1664 The changes are enough to fulfill the related issue. A new platform has been added in order to allow for building a NodeJS package. From the root of the project: ./tools/make-nodejs This will create new uBlock0.nodejs directory in the ./dist/build directory, which is a valid NodeJS package. From the root of the package, you can try: node test This will instantiate a static network filtering engine, populated by easylist and easyprivacy, which can be used to match network requests by filling the appropriate filtering context object. The test.js file contains code which is typical example of usage of the package. Limitations: the NodeJS package can't execute the WASM versions of the code since the WASM module requires the use of fetch(), which is not available in NodeJS. This is a first pass at modularizing the codebase, and while at it a number of opportunistic small rewrites have also been made. This commit requires the minimum supported version for Chromium and Firefox be raised to 61 and 60 respectively.	2021-07-27 17:26:04 -04:00
Raymond Hill	e03bb99f57	Add neutered replacement script for mixpanel Related discussion: - https://www.reddit.com/r/uBlockOrigin/comments/oicch9/ The new replacement script contains the smallest API possible to resolve the reported case. Please report instances where it's not sufficient to unbreak a site, in which case I will extend the neutered API to address these cases on an on-demand basis.	2021-07-13 07:58:31 -04:00
Raymond Hill	8cd2a1d263	Make googletagmanager_gtm.js an alias of google-analytics_analytics.js Related feedback: - https://ilakovac.com/teespring-ublock-issue/ The surrogate script googletagmanager_gtm.js was essentially a subset of surrogate script google-analytics_analytics.js. This commit makes it a plain alias so that the whole GA API -- often expected by clients of GTM -- is properly stubbed.	2021-05-18 11:08:20 -04:00
Raymond Hill	40c145d76a	Fix handling of cname-aliased URLs in click-to-load widget Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/1455	2021-01-22 09:37:12 -05:00
Raymond Hill	1669d122df	Add resource for noop VMAP Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/1425 The resource content is a copy/paste of AdGuard's code: - `bc5eec1989/src/redirects/static-redirects.yml (L134)`	2020-12-29 09:05:28 -05:00
Raymond Hill	4ba3adc28c	Fix comment	2020-12-28 07:07:04 -05:00
Raymond Hill	d910111d4a	Fix parsing of trailing resource Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/1419	2020-12-28 07:03:52 -05:00
Raymond Hill	187f1831f0	Allow more local resources to be redirected as data: URIs Related feedback: - https://github.com/uBlockOrigin/uBlock-issues/issues/1388#issuecomment-748625280	2020-12-20 11:54:24 -05:00
Raymond Hill	24755d4300	Fix broken alias `nostif` Related feedback: - `ba11a70013 (r45030152)` Regression from: - `ba11a70013`	2020-12-11 10:34:33 -05:00
Raymond Hill	0b5f53923f	Add basic compatibility with ABP's `rewrite` option Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/857 The recognized resources are: - abp-resource:blank-mp3 - abp-resource:blank-js ABP's tokens are excluded from auto-complete so as to not get in the way of uBO's filter list maintainers.	2020-12-09 08:16:28 -05:00
Raymond Hill	904aa87e2a	Fix various regression in behavior of `redirect-rule=` Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/1388 Fixed the special `none` redirect resource no longer being enforced. Fixed the enforcement of `important` redirect rules over exceptions and non-important ones.	2020-12-07 11:12:41 -05:00
Raymond Hill	26dc7a1490	Minor review of redirect-related code Notably, I finally settled for implicit priority of 0, but now negative priority values are allowed.	2020-12-02 08:18:55 -05:00
Raymond Hill	eae7cd58fe	Add support for `match-case` option; fine-tune behavior of `redirect=` `match-case` ------------ Related issue: - https://github.com/uBlockOrigin/uAssets/issues/8280#issuecomment-735245452 The new filter option `match-case` can be used only for regex-based filters. Using `match-case` with any other sort of filters will cause uBO to discard the filter. `redirect=` ----------- Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/1366 `redirect=` filters with unresolvable resource token at runtime will be discarded. Additionally, the implicit priority is now set to 1 (was 0). The idea is to allow custom `redirect=` filters to be used strictly as fallback `redirect=` filters in case another `redirect=` filter is not picked up. For example, one might create a `redirect=click2load.html:0` filter, to be taken if and only if the blocked resource is not already being redirected by another "official" filter in one of the enabled filter lists.	2020-11-28 11:26:28 -05:00
Raymond Hill	8985376b00	Fix timing issue with cached redirection to web accessible resources Reported internally by @gwarser. In rare occasion, a timing issue could cause uBO to redirect to a web accessible resource meant to be used for another network request. This is a regression introduced with the following commit: - `2e5d32e967` Additionally, I identified another issue which would cause cached redirection to fail when a cache entry with redirection to a web accessible resource was being reused, an issue which could especially affect pages which are generated dynamically (i.e. without full page reload).	2020-11-10 10:43:26 -05:00
Raymond Hill	157cef6034	Re-classify `redirect=` option as a modifier option This commit moves the parsing, compiling and enforcement of the `redirect=` and `redirect-rule=` network filter options into the static network filtering engine as modifier options -- just like `csp=` and `queryprune=`. This solves the two following issues: - https://github.com/gorhill/uBlock/issues/3590 - https://github.com/uBlockOrigin/uBlock-issues/issues/1008#issuecomment-716164214 Additionally, `redirect=` option is not longer afflicted by static network filtering syntax quirks, `redirect=` filters can be used with any other static filtering modifier options, can be excepted using `@@` and can be badfilter-ed. Since more than one `redirect=` directives could be found to apply to a single network request, the concept of redirect priority is introduced. By default, `redirect=` directives have an implicit priority of 0. Filter authors can declare an explicit priority by appending `:[integer]` to the token of the `redirect=` option, for example: \|\|example.com/*.js$1p,script,redirect=noopjs:100 The priority dictates which redirect token out of many will be ultimately used. Cases of multiple `redirect=` directives applying to a single blocked network request are expected to be rather unlikely. Explicit redirect priority should be used if and only if there is a case of redirect ambiguity to solve.	2020-11-03 09:15:26 -05:00
Raymond Hill	5468b92643	Built-in redirect token `none` must be seen as valid Related feedback: - `1727585faa (commitcomment-43787843)`	2020-11-02 04:52:47 -05:00
Raymond Hill	2e5d32e967	Fine tune code related to click-to-load feature The redirectable resource has been renamed `click2load.html`, so as to avoid uses of dash characters and to also allow for future different click-to-load resources.	2020-10-10 08:36:30 -04:00
Raymond Hill	5916920985	Add support for click-to-load of embedded frames Additionally, as a requirement to support click-to-load feature, redirected resources will from now on no longer be collapsed. Related issues: - https://github.com/gorhill/uBlock/issues/2688 - https://github.com/gorhill/uBlock/issues/3619 - https://github.com/gorhill/uBlock/issues/1899 This new feature should considered in its draft stage and it needs to be fine-tuned as per feedback. Important: Only embedded frames can be converted into click-to-load widgets, as only these can be properly shieded from access by page content. Examples of usage: \|\|youtube.com/embed/$3p,frame,redirect=clicktoload \|\|scribd.com/embeds/$3p,frame,redirect=clicktoload \|\|player.vimeo.com/video/$3p,frame,redirect=clicktoload	2020-10-09 13:50:54 -04:00
Raymond Hill	79ccd23ccf	Also remove references to remove scriptlets Related commit: - `7c22a31294`	2020-08-06 11:40:18 -04:00
Raymond Hill	d49a9dce66	Fix spurious rejection of some AdGuard redirect filters Lines in AdGuard filter lists have trailing `\r` characters, and these caused the redirect engine compile code to reject as invalid the redirect token. This is trivially fixed by trimming the raw option strings before parsing it in the redirect engine.	2020-07-13 09:33:38 -04:00
Raymond Hill	c9cfd62c21	Add auto-completion capability for filter options Related commit: - `3e72a47c1f` Use ctrl-space to auto-complete filter options and `redirect=` resources in _"My filters"_ pane.	2020-06-15 19:05:39 -04:00
Raymond Hill	3e72a47c1f	Add support for auto-completion in _My filters_ pane This commit adds CodeMirror's auto-completion capability to the _My filters_ pane. Currently, auto-completion is available for scriptlet tokens: pressing ctrl-space while the text cursor is positioned where a scriptlet token should appear will cause auto-completion to kick-in. In case of ambiguity, CodeMirror's widget to pick a specific scriptlet will appear.	2020-06-15 09:15:13 -04:00
Raymond Hill	cb5437b161	Support redirect rules with no pattern Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/977 No pattern will imply `*` for the redirect destination part of the rule.	2020-06-14 15:09:35 -04:00
Raymond Hill	f842ab6d3c	Add new scriptlet to allow blocking Amazon's apstag.js Related issues: - https://github.com/NanoMeow/QuickReports/issues/3717 - https://www.reddit.com/r/uBlockOrigin/comments/ghjqph/ The specific issue on the mentioned site is that the site's code expect `window.apstag.fetchBids` to call client-supplied function. The new scriptlet defuse this by calling the client code with an empty array.	2020-05-11 07:57:14 -04:00
Raymond Hill	6259f88598	Add an alias for window.open-defuser scriptlet As per request from filter list maintainers. The alias is `nowoif`, in line with other such defusing scriplets.	2020-04-27 11:24:41 -04:00
Raymond Hill	1d51927d2e	Fix handling of end-anchor in redirect patterns Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/872 An end-anchor was treated as literal `\|` in the redirect pattern to match instead of as a end-of-string condition.	2020-02-01 12:47:17 -05:00
Raymond Hill	7971b22385	Expand bidi-trie usage in static network filtering engine Related issues: - https://github.com/uBlockOrigin/uBlock-issues/issues/761 - https://github.com/uBlockOrigin/uBlock-issues/issues/528 The previous bidi-trie code could only hold filters which are plain pattern, i.e. no wildcard characters, and which had no origin option (`domain=`), right and/or left anchor, and no `csp=` option. Example of filters that could be moved into a bidi-trie data structure: &ad_box_ /w/d/capu.php?z=$script,third-party \|\|liveonlinetv247.com/images/muvixx-150x50-watch-now-in-hd-play-btn.gif Examples of filters that could NOT be moved to a bidi-trie: -adap.$domain=~l-adap.org /tsc.php?&ses= \|\|ibsrv.net/forumsponsor$domain=[...] @@\|\|imgspice.com/jquery.cookie.js\|$script \|\|view.atdmt.com^/iview/$third-party \|\|postimg.cc/image/$csp=[...] Ideally the filters above should be able to be moved to a bidi-trie since they are basically plain patterns, or at least partially moved to a bidi-trie when there is only a single wildcard (i.e. made of two plain patterns). Also, there were two distinct bidi-tries in which plain-pattern filters can be moved to: one for patterns without hostname anchoring and another one for patterns with hostname-anchoring. This was required because the hostname-anchored patterns have an extra condition which is outside the bidi-trie knowledge. This commit expands the number of filters which can be stored in the bidi-trie, and also remove the need to use two distinct bidi-tries. - Added ability to associate a pattern with an integer in the bidi-trie [1]. - The bidi-trie match code passes this externally provided integer when calling an externally provided method used for testing extra conditions that may be present for a plain pattern found to be matching in the bidi-trie. - Decomposed existing filters into smaller logical units: - FilterPlainLeftAnchored => FilterPatternPlain + FilterAnchorLeft - FilterPlainRightAnchored => FilterPatternPlain + FilterAnchorRight - FilterExactMatch => FilterPatternPlain + FilterAnchorLeft + FilterAnchorRight - FilterPlainHnAnchored => FilterPatternPlain + FilterAnchorHn - FilterWildcard1 => FilterPatternPlain + [ FilterPatternLeft or FilterPatternRight ] - FilterWildcard1HnAnchored => FilterPatternPlain + [ FilterPatternLeft or FilterPatternRight ] + FilterAnchorHn - FilterGenericHnAnchored => FilterPatternGeneric + FilterAnchorHn - FilterGenericHnAndRightAnchored => FilterPatternGeneric + FilterAnchorRight + FilterAnchorHn - FilterOriginMixedSet => FilterOriginMissSet + FilterOriginHitSet - Instances of FilterOrigin[...], FilterDataHolder can also be added to a composite filter to represent `domain=` and `csp=` options. - Added a new filter class, FilterComposite, for filters which are a combination of two or more logical units. A FilterComposite instance is a match when all* filters composing it are a match. Since filters are now encoded into combination of smaller units, it becomes possible to extract the FilterPatternPlain component and store it in the bidi-trie, and use the integer as a handle for the remaining extra conditions, if any. Since a single pattern in the bidi-trie may be a component for different filters, the associated integer points to a sequence of extra conditions, and a match occurs as soon as one of the extra conditions (which may itself be a sequence of conditions) is fulfilled. Decomposing filters which are currently single instance into sequences of smaller logical filters means increasing the storage and CPU overhead when evaluating such filters. The CPU overhead is compensated by the fact that more filters can now moved into the bidi-trie, where the first match is efficiently evaluated. The extra conditions have to be evaluated if and only if there is a match in the bidi-trie. The storage overhead is compensated by the bidi-trie's intrinsic nature of merging similar patterns. Furthermore, the storage overhead is reduced by no longer using JavaScript array to store collection of filters (which is what FilterComposite is): the same technique used in [2] is imported to store sequences of filters. A sequence of filters is a sequence of integer pairs where the first integer is an index to an actual filter instance stored in a global array of filters (`filterUnits`), while the second integer is an index to the next pair in the sequence -- which means all sequences of filters are encoded in one single array of integers (`filterSequences` => Uint32Array). As a result, a sequence of filters can be represented by one single integer -- an index to the first pair -- regardless of the number of filters in the sequence. This representation is further leveraged to replace the use of JavaScript array in FilterBucket [3], which used a JavaScript array to store collection of filters. Doing so means there is no more need for FilterPair [4], which purpose was to be a lightweight representation when there was only two filters in a collection. As a result of the above changes, the map of `token` (integer) => filter instance (object) used to associate tokens to filters or collections of filters is replaced with a more efficient map of `token` (integer) to filter unit index (integer) to lookup a filter object from the global `filterUnits` array. Another consequence of using one single global array to store all filter instances means we can reuse existing instances when a logical filter instance is parameter-less, which is the case for FilterAnchorLeft, FilterAnchorRight, FilterAnchorHn, the index to these single instances is reused where needed. `urlTokenizer` now stores the character codes of the scanned URL into a bidi-trie buffer, for reuse when string matching methods are called. New method: `tokenHistogram()`, used to generate histograms of occurrences of token extracted from URLs in built-in benchmark. The top results of the "miss" histogram are used as "bad tokens", i.e. tokens to avoid if possible when compiling filter lists. All plain pattern strings are now stored in the bidi-trie memory buffer, regardless of whether they will be used in the trie proper or not. Three methods have been added to the bidi-trie to test stored string against the URL which is also stored in then bidi-trie. FilterParser is now instanciated on demand and released when no longer used. *** [1] `135a45a878/src/js/strie.js (L120)` [2] `e94024d350` [3] `135a45a878/src/js/static-net-filtering.js (L1630)` [4] `135a45a878/src/js/static-net-filtering.js (L1566)`	2019-10-21 08:15:58 -04:00
Raymond Hill	e27328f931	Work toward modernizing code base: promisification Swathes of code have been converted to use Promises/async/await. More left to do. In the process, a regression affecting the fix to <https://github.com/uBlockOrigin/uBlock-issues/issues/682> has been fixed.	2019-09-15 07:58:28 -04:00
Raymond Hill	3eeaba45d9	Cherry-picked `ac7825c789`	2019-09-07 08:31:32 -04:00
Raymond Hill	e0b8cf24d1	Clear internal cache when loading redirect rules Related commit: - `3e5c9e00ab` This fix a regression: newly added redirect rules could end up not being taken into account unless uBO was restarted.	2019-08-24 13:48:50 -04:00
Raymond Hill	6c73bd78f4	Fix regression when generating `data` URI in redirect engine Related feedback: - https://www.reddit.com/r/uBlockOrigin/comments/cpxm1v/ Put back erroneously removed code which enable to generate a `data` URI from already encoded resources.	2019-08-16 13:45:07 -04:00
Raymond Hill	68ae847ba3	Add support for AdGuard's `mp4` filter option Related discussion: - https://github.com/uBlockOrigin/uBlock-issues/issues/701#issuecomment-520884196 The `mp4` filter option will be converted to `redirect=noopmp4-1s` internally, and `media` type will be assumed.	2019-08-13 12:30:11 -04:00
Raymond Hill	3e5c9e00ab	Add support for AdGuard's `empty` option Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/701 The filter option `empty` is converted to `redirect=empty` by uBO internally; however unlike when the `redirect=` option is used expressly, the `empty` option does not require a resource type. When `empty` is used, only network requests which are meant to return a text response will be redirected to an empty response body by uBO -- so `empty` will not work for resources such as images, media, or other binary resources.	2019-08-13 08:16:21 -04:00

1 2 3