uBlock

mirror of https://github.com/gorhill/uBlock.git synced 2024-11-17 16:02:33 +01:00

Author	SHA1	Message	Date
Raymond Hill	d1895d4749	Another round of fine-tuning `queryprune=` syntax Related discussions: - https://github.com/uBlockOrigin/uBlock-issues/issues/1356#issuecomment-732411286 - https://github.com/AdguardTeam/CoreLibs/issues/1384 Changes: Negation character is `~` (instead of `!`). Drop special anchor character `\|` -- leading `\|` will be supported until no such filter is present in uBO's own filter lists. For example, instance of `queryprune=\|ad` will have to be replaced with `queryprune=/^ad/` (or `queryprune=ad` if the name of the parameter to remove is exactly `ad`). Align semantic with that of AdGuard's `removeparam=`, except that specifying multiple `\|`-separated names is not supported.	2020-11-29 11:02:40 -05:00
Raymond Hill	dac8d6becb	Fix broken token extraction Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/1367 Regression from: - `6ac09a2856` Need to mind wildcards adjacent to extracted token.	2020-11-29 07:38:15 -05:00
Raymond Hill	eae7cd58fe	Add support for `match-case` option; fine-tune behavior of `redirect=` `match-case` ------------ Related issue: - https://github.com/uBlockOrigin/uAssets/issues/8280#issuecomment-735245452 The new filter option `match-case` can be used only for regex-based filters. Using `match-case` with any other sort of filters will cause uBO to discard the filter. `redirect=` ----------- Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/1366 `redirect=` filters with unresolvable resource token at runtime will be discarded. Additionally, the implicit priority is now set to 1 (was 0). The idea is to allow custom `redirect=` filters to be used strictly as fallback `redirect=` filters in case another `redirect=` filter is not picked up. For example, one might create a `redirect=click2load.html:0` filter, to be taken if and only if the blocked resource is not already being redirected by another "official" filter in one of the enabled filter lists.	2020-11-28 11:26:28 -05:00
Raymond Hill	c6d0204b23	Remove requirement for presence of type with `redirect=` option Related issue: - https://github.com/gorhill/uBlock/issues/3590 Since the `redirect=` option was refactored into a modifier filter, presence of a type (`script`, `xhr`, etc.) is no longer a requirement.	2020-11-28 08:52:18 -05:00
Raymond Hill	ab5ab8575c	Avoid re-assigning asset cache registry at launch Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/1365	2020-11-28 08:28:20 -05:00
Raymond Hill	c959fd6cd9	Fix comment	2020-11-27 16:01:34 -05:00
Raymond Hill	6ac09a2856	Add ability to parse `removeparam=` as `queryprune=` Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/1356 Related commit: - `bde3164eb4` It is not possible to achieve perfect compatiblity at this point, but reasonable compatibility should be achieved for a majority of instances of `removeparam=`. Notable differences: -------------------- uBO always matches in a case insensitive manner, there is no need to ask for case-insensitivity, and no need to use uppercase characters in `queryprune=` values. uBO does not escape special regex characters since the `queryprune=` values are always assumed to be literal regex expression (leaving out the documented special characters). This means `removeparam=` with characters which are special regex characters won't be properly translated and are unlikely to work properly in uBO. For example, the `queryprune` value of a filter such as `$removeparam=__xts__[0]` internally become the literal regex `/__xts__[0]/`, and consequently would not match a query parameter such as `...?__xts__[0]=...`. Notes: ------ Additionally, for performance reason, when uBO encounter a pattern-less `queryprune=` (or `removeparam=`) filter, it will try to extract a valid pattern from the `queryprune=` value. For instance, the following filter: $queryprune=utm_campaign Will be translated internally into: utm_campaign$queryprune=utm_campaign The logger will reflect this internal translation.	2020-11-26 09:34:12 -05:00
Raymond Hill	80413dff83	Fix forgotton instances of `1P`/`3P` Related commit: - `60d5b85e41` Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/1362	2020-11-26 05:43:14 -05:00
Raymond Hill	60d5b85e41	Rename `1P`/`3P` tp `strict1p`/`strict3p` as suggested Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/1362	2020-11-26 05:09:46 -05:00
Raymond Hill	57013c16e6	Fix compilation of blocking counterpart of `redirect=` filters Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/1358	2020-11-25 09:36:12 -05:00
Raymond Hill	e45949417b	Magic compile/selfie numbers need to increased Related commit: - `bde3164eb4`	2020-11-23 10:26:15 -05:00
Raymond Hill	43cb63f80a	Fix parsing of `queryprune=*` in static filtering parser	2020-11-23 08:47:29 -05:00
Raymond Hill	bde3164eb4	Add support for `1P`, `3P`, `header=` filter options and other changes New filter options ================== Strict partyness: `1P`, `3P` ---------------------------- The current options 1p/3p are meant to "weakly" match partyness, i.e. a network request is considered 1st-party to its context as long as both the context and the request share the same base domain. The new partyness options are meant to check for strict partyness, i.e. a network request will be considered 1st-party if and only if both the context and the request share the same hostname. For examples: - context: `www.example.org` - request: `www.example.org` - `1p`: yes, `1P`: yes - `3p`: no, `3P`: no - context: `www.example.org` - request: `subdomain.example.org` - `1p`: yes, `1P`: no - `3p`: no, `3P`: yes - context: `www.example.org` - request: `www.example.com` - `1p`: no, `1P`: no - `3p`: yes, `3P`: yes The strict partyness options will be visually emphasized in the editor so as to prevent mistakenly using `1P` or `3P` where weak partyness is meant to be used. Filter on response headers: `header=` ------------------------------------- Currently experimental and under evaluation. Disabled by default, enable by toggling `filterOnHeaders` to `true` in advanced settings. Ability to filter network requests according to whether a specific response header is present and whether it matches or does not match a specific value. For example: $1p,3P,script,header=via:1\.1\s+google The above filter is meant to block network requests which fullfill all the following conditions: - is weakly 1st-party to the context - is not strictly 1st-party to the context - is of type `script` - has a response HTTP header named `via`, which value matches the regular expression `1\.1\s+google`. The matches are always performed in a case-insensitive manner. The header value is assumed to be a literal regular expression, except for the following special characters: - to anchor to start of string, use leading `\|`, not `^` - to anchor to end of string, use trailing `\|`, not `$` - to invert the test, use a leading `!` To block a network request if it merely contains a specific HTTP header is just a matter of specifying the header name without a header value: $1p,3P,script,header=via Generic exception filters can be used to disable specific block `header=` filters, i.e. `@@$1p,3P,script,header` will override the block `header=` filters given as example above. Dynamic filtering's `allow` rules override block `headers=` filters. Important: It is key that filter authors use as many narrowing filter options as possible when using the `header=` option, and the `header=` option should be used ONLY when other filter options are not sufficient. More documentation justifying the purpose of `header=` option will be provided eventually if ever it is decided to move it from experimental to stable status. To be decided: to restrict usage of this filter option to only uBO's own filter lists or "My filters". Changes ======= Fine tuning `queryprune=` ------------------------- The following changes have been implemented: The special value `` (i.e. `queryprune=*`) means "remove all query parameters". If the `queryprune=` value is made only of alphanumeric characters (including `_`), the value will be internally converted to regex equivalent `^value=`. This ensures a better future compatibility with AdGuard's `removeparam=`. If the `queryprune=` value starts with `!`, the test will be inverted. This can be used to remove all query parameters EXCEPT those who match the specified value. Other ----- The legacy code to test for spurious CSP reports has been removed. This is no longer an issue ever since uBO redirects to local resources through web accessible resources. Notes ===== The following new and recently added filter options are not compatible with Chromium's manifest v3 changes: - `queryprune=` - `1P` - `3P` - `header=`	2020-11-23 08:22:43 -05:00
Raymond Hill	daf464b3c3	Add support to auto-complete values of domain lists The auto-complete feature in the _"My filters"_ pane will use hostname/domain from the set of opened tabs to assist in entering values for `domain=` option. This also works for the implict `domain=` option ṗrepending static extended filters.	2020-11-21 09:57:54 -05:00
Raymond Hill	8d3c4916b0	Skip trying to find effective context for `about:srcdoc` frames `about:srcdoc` frames are their own origin, trying to use the origin of the parent context causes an exception to be thrown when accessing location.href.	2020-11-21 09:51:14 -05:00
Raymond Hill	13f6bdae37	Improve representation of modifier filters in logger As per feedback from filter list maintainers.	2020-11-20 07:14:02 -05:00
Raymond Hill	ab98cd46b1	Bring back action/state highlighting in _"My rules"_	2020-11-20 05:34:56 -05:00
Raymond Hill	b1c55b3de9	Emphasize entity portion of hostnames in _"My rules"_	2020-11-19 11:33:09 -05:00
Raymond Hill	38cecddcd1	Improve zapper's detection of scroll-locked documents	2020-11-18 14:11:36 -05:00
Raymond Hill	ee2fd45f00	Ensure we do not extract truncated URL for Homepage directive Related feedback: - `b12e0e05ea (commitcomment-44309540)`	2020-11-18 12:14:23 -05:00
Raymond Hill	b12e0e05ea	Extract `Homepage` URL from a list when present Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/1346 Additionally, fixed a case of filter list being compiled twice at subscription time.	2020-11-18 10:02:22 -05:00
Raymond Hill	d87a3b950f	Sort on base domains rather than TLDs in "My rules" pane Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/1293	2020-11-18 08:01:00 -05:00
Raymond Hill	a683297931	Fix type assignment in logger page Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/1349	2020-11-17 11:18:59 -05:00
Raymond Hill	e360e90d1e	Fix invalid support URL in document-blocked page Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/1345	2020-11-15 10:19:09 -05:00
Raymond Hill	46d7f8a70c	Fine tune click-to-load widget Notably, add clickable link to open the widget in its own tab. Also, allows the URL to be text- selected so that it becomes possible to use the selection in a browser contextual menu's "Open in a new tab" option.	2020-11-14 08:34:47 -05:00
Raymond Hill	5cf9bcf27c	Fine tune code of optimizeOriginHitTests() Related commit: - `b265f2644d`	2020-11-14 07:28:51 -05:00
Raymond Hill	4afb3dc149	Allow `domain=` with entity values into pre-test buckets Related commit: - `b265f2644d` Filters which have `domain=` option with an entity value will no longer be prevented from joining pre-test buckets.	2020-11-14 07:04:21 -05:00
Raymond Hill	56cd238ad4	Disable auto activation of dark theme in next release Until a fully usable dark theme is available. uBO's incomplete dark theme can still be forced by setting advanced setting `uiTheme` to `dark`.	2020-11-13 12:15:29 -05:00
Raymond Hill	eb8433cb19	Enable cloud storage compression by default in next release Related commit: - `d8b6b31eca`	2020-11-13 12:14:06 -05:00
Raymond Hill	2cfeaddbed	Fine tune various static filtering code Notably, make `queryprune` option available only to filter list authors, until there are guards against bad filters in some future and until the option syntax and behavior is fully settled. Instances of `queryprune` in filter lists will be compiled, however instances of `queryprune` in _"My filters"_ will be ignored unless users indicated they are a filter list author.	2020-11-13 09:23:25 -05:00
Raymond Hill	525d7b1b3b	Fine tune port connection code Related commit: - `a223031b98`	2020-11-13 08:32:51 -05:00
Raymond Hill	02b4d149e3	Do not skip querypruning when no-strict-blocking is true Related issue: - https://github.com/uBlockOrigin/uBlock-issues/issues/1341	2020-11-13 08:30:43 -05:00
Raymond Hill	a223031b98	Work around Firefox's `data:` favIconUrl leak Related issue: - https://bugzilla.mozilla.org/show_bug.cgi?id=1652925	2020-11-12 12:14:59 -05:00
Raymond Hill	280dd8ddd6	Fix picker use of extraneous `body` in suggested filter Related feedback: - https://www.reddit.com/r/uBlockOrigin/comments/jregqx/	2020-11-11 09:39:07 -05:00
Raymond Hill	cfb050f521	Detect bad queryprune values `queryprune=` values are used as literal regex value after converting leading/trailing `\|` into `^`/`$`.	2020-11-11 08:15:39 -05:00
Raymond Hill	8cc3779fb3	Last commit changes compiled format	2020-11-11 08:15:27 -05:00
Raymond Hill	0e851c035e	Revisit realm & action bits The important bit is now considered an action bit so that there is no more a need for the `important` property in the parser. The modify bit is now considered a realm bit. When the modify bit is set, the action bits become available to be used to further narrow the realm. This could be useful in the future if we want to spread the population of modifier filters across different buckets.	2020-11-11 07:53:46 -05:00
Raymond Hill	32eca67154	Reuse one instance of domain option iterator Reusing the same iterator instance for all cases of `domain=` option parsing should reduce memory churning. Additonally, fine tune regex used to extract valid token from regex-based filters to increase likelihood of being able to extract a valid token.	2020-11-10 12:49:46 -05:00
Raymond Hill	8985376b00	Fix timing issue with cached redirection to web accessible resources Reported internally by @gwarser. In rare occasion, a timing issue could cause uBO to redirect to a web accessible resource meant to be used for another network request. This is a regression introduced with the following commit: - `2e5d32e967` Additionally, I identified another issue which would cause cached redirection to fail when a cache entry with redirection to a web accessible resource was being reused, an issue which could especially affect pages which are generated dynamically (i.e. without full page reload).	2020-11-10 10:43:26 -05:00
Raymond Hill	76ef4811a3	Fix `queryprune` for tabless requests Related feedback: - https://github.com/uBlockOrigin/uBlock-issues/issues/760#issuecomment-724693549	2020-11-10 08:58:39 -05:00
Raymond Hill	0196993828	Use buffer-like approach for filterUnits array filterUnits is now treated as a buffer which is pre-allocated and which will grow in chunks so as to minimize memory allocations. Entries are never released, just null-ed. Additionally, move urlTokenizer into the static network filtering engine, since it's not used anywhere else.	2020-11-09 06:54:51 -05:00
Raymond Hill	db4f02199d	Convert filterSequences into a const variable Making filterSequences constant allows to no longer mind how the array is accessed in loops.	2020-11-08 16:00:24 -05:00
Raymond Hill	50da6706a4	Code review of static network filtering engine - Convert this.categories Map() into an array; - Fix case of potentially using an invalid UintArray32 (regression from latest changes)	2020-11-08 13:50:36 -05:00
Raymond Hill	96bfe3c9a7	Convert filterUnits into a const variable Making filterUnits constant allows to no longer mind how the array is accessed in loops.	2020-11-08 10:30:47 -05:00
Raymond Hill	cb91d167d1	Fine tune static network filtering engine code Notably, defer the post-load optimization operations to a few seconds after the filters have been all loaded in memory -- this is not a critical step for the filtering engine to work properly, hence this can be delayed in order to ensure readiness as soon as possible.	2020-11-07 13:25:01 -05:00
Raymond Hill	efea83a825	Incrementally improve static filtering parser Most notably, the `denyallow=` option now requires the presence of a valid `domain=` option to not be rejected. Using `denyallow=` without narrowing down using the `domain=` option leads to catastrophic blocking behvior, hence the requirement for a valid `domain=` option.	2020-11-07 13:20:02 -05:00
Raymond Hill	1d679143d2	Enable origin-hit coalescing optimisation for modifier filters Related commit: - `b265f2644d` The optimization in the commit above was meant to improve the performance of lookup operations of modifier filters, but I forgot to enable the optimisation for that class of filters. This means this commit brings another significant performance gain on top of the previous commit, as shown by the built-in benchmark. Additionally a few minor code rearrangements.	2020-11-06 18:24:46 -05:00
Raymond Hill	b265f2644d	Coallesce origin hit filters into their own bucket Performance-related work. There is a fair number of filters which can't be tokenized in uBO's own filter lists. Majority of those filters also declare a `domain=` option, examples: $script,redirect-rule=noopjs,domain=... $script,3p,domain=...,denyallow=... $frame,3p,domain=... Such filters can be found in uBO's asset viewer using the following search expression: /^\?\$[^\n]*?domain=/ Some filter buckets will contain many of those filters, for instance one of the bucket holding untokenizable `redirect=` filters has over 170 entries, which must be all visited when collating all matching `redirect=` filters. When a bucket contains many such filters, I found that it's worth to extract all the non-negated hostname values from `domain=` options into a single hntrie and perform a pre-test at match() time to find out whether the current origin of a network request matches any one of the collected hostnames, so as to avoid iterating through all the filters. Since there is rarely a match() for vast majority of network requests with `domain=` option, this pre-test saves a good amount of work, and this is measurable with the built-in benchmark.	2020-11-06 12:04:03 -05:00
Raymond Hill	19331f1ab5	Fine tune latest changes for performance Related commits: - `157cef6034` - `1e2eb037e5`	2020-11-04 07:50:51 -05:00
Raymond Hill	157cef6034	Re-classify `redirect=` option as a modifier option This commit moves the parsing, compiling and enforcement of the `redirect=` and `redirect-rule=` network filter options into the static network filtering engine as modifier options -- just like `csp=` and `queryprune=`. This solves the two following issues: - https://github.com/gorhill/uBlock/issues/3590 - https://github.com/uBlockOrigin/uBlock-issues/issues/1008#issuecomment-716164214 Additionally, `redirect=` option is not longer afflicted by static network filtering syntax quirks, `redirect=` filters can be used with any other static filtering modifier options, can be excepted using `@@` and can be badfilter-ed. Since more than one `redirect=` directives could be found to apply to a single network request, the concept of redirect priority is introduced. By default, `redirect=` directives have an implicit priority of 0. Filter authors can declare an explicit priority by appending `:[integer]` to the token of the `redirect=` option, for example: \|\|example.com/*.js$1p,script,redirect=noopjs:100 The priority dictates which redirect token out of many will be ultimately used. Cases of multiple `redirect=` directives applying to a single blocked network request are expected to be rather unlikely. Explicit redirect priority should be used if and only if there is a case of redirect ambiguity to solve.	2020-11-03 09:15:26 -05:00

1 2 3 4 5 ...

2248 Commits