Mike Fährmann
|
ccb413df71
|
[wikimedia] support 'pidgi.net' and 'bulbapedia.bulbagarden.net' (#5205, #5206)
|
2024-02-17 17:35:10 +01:00 |
|
Mike Fährmann
|
7033cc14e9
|
[vsco] add 'space' extractor (#5202)
|
2024-02-17 01:54:05 +01:00 |
|
Mike Fährmann
|
770aec922d
|
[fapachi] ignore empty entries
|
2024-02-16 22:43:37 +01:00 |
|
Mike Fährmann
|
c9efccc959
|
[tests] update extractor results
|
2024-02-16 22:42:06 +01:00 |
|
Mike Fährmann
|
c413834dfc
|
[bluesky] extend tests
|
2024-02-16 16:30:02 +01:00 |
|
Mike Fährmann
|
ee7c054855
|
[bluesky] add 'search' extractor (#4438)
Both https://bsky.app/search?q=QUERY and https://bsky.app/search/QUERY
are recognized as search URLs, where QUERY gets forwarded unmodified as
'q' parameter for app.bsky.feed.searchPosts.
User searches are not supported yet.
|
2024-02-16 15:58:47 +01:00 |
|
Mike Fährmann
|
91e5c4fdfe
|
[bluesky] add 'avatar' and 'background' extractors (#4438)
|
2024-02-16 15:41:19 +01:00 |
|
Mike Fährmann
|
24c1317e0d
|
[batoto] fix crash when manga/chapter contains a '-' (#5200)
|
2024-02-16 00:10:08 +01:00 |
|
Mike Fährmann
|
0abd9723af
|
[bluesky] add 'metadata' option (#4438)
allow extracting 'user' metadata and
make 'facets' extraction optional
|
2024-02-15 23:30:16 +01:00 |
|
Mike Fährmann
|
7e036ea290
|
[bluesky] add 'depth' option (#4438)
and reduce default depth and parentHeight values
|
2024-02-15 22:26:05 +01:00 |
|
Mike Fährmann
|
42335ea880
|
[zerochan] fix skipping every other post
|
2024-02-15 02:51:01 +01:00 |
|
Mike Fährmann
|
c97b92cc35
|
[fanbox] add 'home' and 'supporting' extractors (#5138)
|
2024-02-14 23:25:39 +01:00 |
|
Mike Fährmann
|
04e4ffc64c
|
[deviantart] combine 'png' option with 'quality' (#4846)
"quality": "png" to download PNGs instead of JPEGs
|
2024-02-14 22:07:29 +01:00 |
|
Mike Fährmann
|
9cc4ec2c58
|
[deviantart] add 'png' option (#4846)
|
2024-02-14 01:03:15 +01:00 |
|
Mike Fährmann
|
966c8608e6
|
[deviantart] move image content extraction into separate function
|
2024-02-14 00:30:06 +01:00 |
|
Mike Fährmann
|
61a50da086
|
merge #5195: [pornpics] support multiple 'channel' values
i.e. change 'channel' from string to list
use '{channel[0]}' to get the old behavior
|
2024-02-13 23:54:10 +01:00 |
|
Mike Fährmann
|
1d1ffe3317
|
[pornpics] update 'channel' extraction & add test
change 'channel' to a list, since extracting both 'channel' and
'channels' does not really work with text.extract_from()
|
2024-02-13 23:48:46 +01:00 |
|
cc1234
|
32472d7d6c
|
Add support for multi channels
|
2024-02-13 18:34:04 +00:00 |
|
Mike Fährmann
|
139ff3f6ab
|
[kemonoparty] add 'posts' extractor (#5194)
|
2024-02-13 15:41:34 +01:00 |
|
Mike Fährmann
|
814ad9321e
|
[deviantart] skip locked/blurred posts (#4567, #5193)
|
2024-02-13 14:15:12 +01:00 |
|
Mike Fährmann
|
f7f8ef8684
|
[twitter] support communities (#4913)
|
2024-02-13 01:30:23 +01:00 |
|
Mike Fährmann
|
8f27f43d4d
|
[tests] implement explicitly disabling auth
|
2024-02-13 00:08:27 +01:00 |
|
Mike Fährmann
|
cae77e85f8
|
[twitter] update query hashes
... as well as 'variables' and 'features' values
also remove unused legacy API code
|
2024-02-12 23:19:13 +01:00 |
|
Mike Fährmann
|
06cb518d97
|
[bunkr] fix extraction (#5088, #5151, #5153)
- remove legacy code
- map legacy domains to bunkr.sk
- use input URL domain for newer domains
- update tests (some files got slightly modified or deleted)
|
2024-02-11 22:36:03 +01:00 |
|
Mike Fährmann
|
dcc6e3f65c
|
merge #5134: [bunkr] add new bunkr domains (#5130)
|
2024-02-11 21:10:06 +01:00 |
|
Mike Fährmann
|
4641937ca3
|
[imagetwist] add 'gallery' extractor (#5190)
|
2024-02-11 18:41:02 +01:00 |
|
Mike Fährmann
|
fde82ab0ce
|
[imagechest] add 'user' extractor (#5143)
|
2024-02-11 18:38:33 +01:00 |
|
Mike Fährmann
|
4474cea31b
|
merge #5187: [skeb] add 'num' and 'count' metadata fields
|
2024-02-10 19:36:59 +01:00 |
|
Mike Fährmann
|
4cfceb23cb
|
[skeb] rename 'data' -> 'file' & add tests
|
2024-02-10 19:35:50 +01:00 |
|
Mike Fährmann
|
44a1a66dac
|
merge #5186: Fix filename formatting silently failing under certain circumstances
|
2024-02-10 19:22:41 +01:00 |
|
Mike Fährmann
|
c83d0a1596
|
[weibo] add 'gifs' option (#5183)
|
2024-02-10 18:17:07 +01:00 |
|
blankie
|
f9a8e8cacf
|
[skeb] add 'num' and 'count' metadata fields
|
2024-02-10 21:51:23 +11:00 |
|
blankie
|
909830f8ea
|
fix filename formatting silently failing under certain circumstances
|
2024-02-10 21:18:57 +11:00 |
|
Mike Fährmann
|
af61d2b037
|
[wikimedia] combine most wikimedia.org sites (#1443)
add wikidata.org and wikivoyage.org
|
2024-02-10 03:00:58 +01:00 |
|
Mike Fährmann
|
c7d17f1111
|
[bluesky] extract 'hashtags', 'mentions', and 'uris' metadata (#4438)
|
2024-02-10 00:01:55 +01:00 |
|
Mike Fährmann
|
55bbd49a0e
|
[bluesky] download images in original resolution (#4438)
at least up to 2000 px
|
2024-02-09 21:33:33 +01:00 |
|
Mike Fährmann
|
6414dc6bca
|
[idolcomplex] fix pagination for tags containing ':' (#5171)
|
2024-02-09 17:51:08 +01:00 |
|
Mike Fährmann
|
5c2a2321a2
|
[bluesky] update refresh token after using it (#4438)
|
2024-02-08 22:33:34 +01:00 |
|
Mike Fährmann
|
9c10be54fb
|
[bluesky] add 'following' extractor (#4438)
|
2024-02-08 21:58:17 +01:00 |
|
Mike Fährmann
|
86ce35d6a1
|
[bluesky] simplify 'pattern'
|
2024-02-08 21:28:21 +01:00 |
|
Mike Fährmann
|
da292ded4e
|
[bluesky] add 'list' extractor (#4438)
|
2024-02-08 21:24:07 +01:00 |
|
Mike Fährmann
|
004bf7bb38
|
[bluesky] add 'feed' extractor (#4438)
|
2024-02-08 21:01:44 +01:00 |
|
Mike Fährmann
|
6aea818d4e
|
[bluesky] allow using DIDs as user handles (#4438)
|
2024-02-08 20:15:54 +01:00 |
|
Mike Fährmann
|
aee5580c62
|
[idolcomplex] extract 'id_alnum' metadata (#5171)
|
2024-02-08 18:29:54 +01:00 |
|
Mike Fährmann
|
cf7d6be2d4
|
[bluesky] initial support (#4438, #4708, #4722, #5047)
|
2024-02-07 19:09:33 +01:00 |
|
Mike Fährmann
|
6ef143ea31
|
[idolcomplex] support alphanumeric post IDs (#5171)
|
2024-02-07 14:57:13 +01:00 |
|
Mike Fährmann
|
6e928300bc
|
[flickr] handle non-JSON errors (#5131)
|
2024-02-06 21:22:10 +01:00 |
|
Mike Fährmann
|
90ac6d7375
|
[wikimedia] use '/api.php' as default API path
|
2024-02-06 00:36:51 +01:00 |
|
Mike Fährmann
|
d7823b9f81
|
[pinterest] fix section URLs for boards with /?# in name (#5104)
|
2024-02-05 15:54:06 +01:00 |
|
Mike Fährmann
|
de752eb7b1
|
[naverwebtoon] support '/webtoon/' paths for all comics (#5123)
|
2024-02-04 21:38:46 +01:00 |
|