mirror of https://github.com/mikf/gallery-dl.git synced 2024-11-23 11:12:40 +01:00
Commit Graph

4437 Commits

Author SHA1 Message Date
Mike Fährmann
bab13402df
[redgifs] update 'search' URL pattern (#4115) 2023-05-30 15:42:31 +02:00
Mike Fährmann
5a6fd8027d
[redgifs] support galleries (#4021) 2023-05-30 15:02:18 +02:00
Mike Fährmann
0ad59c92b1
[blogger] download files from 'lh*.googleusercontent.com' (#4070) 2023-05-28 19:58:20 +02:00
Mike Fährmann
ffed7efb6f
[pixiv] use BASE_PATTERN 2023-05-28 18:06:47 +02:00
Mike Fährmann
b286efefcc
[pixiv] add 'novel-bookmark' extractor (#4111) 2023-05-28 16:30:17 +02:00
Mike Fährmann
5283db1aae
release version 1.25.5 2023-05-27 17:08:25 +02:00
Mike Fährmann
28f6487c64
[instagram] add 'metadata' option (#3107) 2023-05-27 15:51:13 +02:00
Mike Fährmann
8cf13f8696
merge #4104: [lensdump] add lensdump.com extractors 2023-05-26 23:42:02 +02:00
Mike Fährmann
58f7480d46
[lensdump] update
- update docs/supportedsites.md
- add GPL2 header
- use BASE_PATTERN
- improve LensdumpImageExtractor
2023-05-26 23:39:17 +02:00
Mike Fährmann
3516fdae74
[kemonoparty] fix kemono and coomer logins using the same cache
(#4098)
2023-05-26 13:35:02 +02:00
chio0hai
d5300cf381 [lensdump] subcategory 2023-05-26 03:51:42 -04:00
chio0hai
82ba6bfdc0 [lensdump] f-string fix 2023-05-26 03:46:12 -04:00
chio0hai
9b2326e4e1 [lensdump] add lensdump.com extractor 2023-05-26 03:22:23 -04:00
Mike Fährmann
a5d0b03bde
[ytdl] fix crash due to removed 'no_color' attribute
8417f26b8a
2023-05-25 23:11:49 +02:00
Mike Fährmann
148bdc04a4
merge #2719: [jpgfish] add 'jpgfish' extractors 2023-05-25 23:04:46 +02:00
Mike Fährmann
609c4f3e07
[jpgfish] simplify and improve 2023-05-25 22:58:42 +02:00
Mike Fährmann
2b1f875ef4
[jpgchurch] update to 'jpgfish' 2023-05-25 18:58:51 +02:00
Mike Fährmann
3d29c42142
[mangaread] fix 'tags' extraction 2023-05-25 17:04:20 +02:00
Mike Fährmann
5f86527cbe
merge #2781: [mangaread] Add Mangaread extractor 2023-05-25 17:03:28 +02:00
Mike Fährmann
cdc6549fd2
merge #3329: [8muses] Add 'parts' to album data
and fix 'album[url]'
2023-05-25 16:31:29 +02:00
Mike Fährmann
ad760429b1
[8muses] update 2023-05-25 16:25:53 +02:00
Mike Fährmann
d0184fddcf
[twitter] optimize '_extract_twitpic()'
- use findall instead of finditer
- store URLs in a dict to discard duplicates
2023-05-25 15:18:49 +02:00
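The two points in the commit above (findall instead of finditer, a dict to discard duplicates) can be sketched roughly as follows. This is an illustration only: the function name and the regular expression are assumptions, not the code from the repository.

```python
import re

# Hypothetical pattern; the real extractor may match TwitPic URLs differently.
TWITPIC_PATTERN = re.compile(r"https?://twitpic\.com/\w+")


def unique_twitpic_urls(text):
    """Return TwitPic URLs found in 'text', in order, without duplicates."""
    # findall() returns plain strings, so no match objects are created.
    # A dict keeps insertion order and silently drops repeated keys.
    return list(dict.fromkeys(TWITPIC_PATTERN.findall(text)))


print(unique_twitpic_urls(
    "a http://twitpic.com/abc b http://twitpic.com/xyz c http://twitpic.com/abc"
))
```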
Mike Fährmann
3dc862c7fc
merge #3796: [twitter] extract TwitPic URLs in text (#3792) 2023-05-25 14:59:07 +02:00
Mike Fährmann
243de697b9
merge #3976: [reddit] support cross-posted media (#887, #3586) 2023-05-25 13:16:25 +02:00
Mike Fährmann
f8c4c5eef9
[reddit] simplify and add tests 2023-05-25 13:15:11 +02:00
thatfuckingbird
822a77d846 [danbooru] add support for booru.borvar.art instance 2023-05-24 19:06:40 +02:00
Mike Fährmann
f3cca50b9e
[mangadex] update links to API docs 2023-05-24 12:32:13 +02:00
Mike Fährmann
65a9f4b124
merge #3950: [misskey] add 'favorite' extractor 2023-05-23 22:18:32 +02:00
Mike Fährmann
c76f0f3a1b
[misskey] update
- rename to 'MisskeyFavoriteExtractor'
- add 'access-token' option to docs
- add test URLs for other instances
- simplify 'pattern'
2023-05-23 22:17:17 +02:00
Mike Fährmann
3fca455b82
[pixiv] add 'embeds' option (#1241) 2023-05-23 12:14:06 +02:00
Mike Fährmann
d1f2ef3b7b
[imagechest] update
- don't load HTML page when using API
- restructure some code
- add more methods to ImagechestAPI
2023-05-22 22:42:16 +02:00
Mike Fährmann
856f6c10cd
allow for GalleryExtractors to skip loading gallery_url 2023-05-22 22:29:30 +02:00
Mike Fährmann
4fc9675d48
[fanbox] skip 404ed or otherwise invalid posts (#4088) 2023-05-22 18:49:06 +02:00
Mike Fährmann
69865dcc05
[formatter] implement slicing strings as bytes (#4087)
prefixing a slice '[10:30]' with a lowercase b '[b10:30]' encodes
the string to bytes in filesystem encoding before applying the slice
2023-05-22 18:30:45 +02:00
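A minimal sketch of the byte-slicing behaviour described above, assuming that a partial multi-byte character left at a slice boundary is simply dropped. The helper name and its `encoding` parameter are hypothetical and only serve the illustration.

```python
import sys


def slice_as_bytes(text, start, stop, encoding=None):
    """Slice 'text' by byte offsets rather than character offsets."""
    # Default to the filesystem encoding, as the commit message describes.
    encoding = encoding or sys.getfilesystemencoding()
    raw = text.encode(encoding)
    # Assumption: bytes of a character cut in half by the slice are ignored.
    return raw[start:stop].decode(encoding, "ignore")


# Each of these characters takes 3 bytes in UTF-8, so 7 bytes hold two
# complete characters plus one dangling byte that gets discarded.
print(slice_as_bytes("日本語テキスト", 0, 7, "utf-8"))
```

Slicing by bytes is mainly useful for keeping filenames under a filesystem's byte limit, where a character-based slice like '[10:30]' could still produce a name that is too long.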
Mike Fährmann
56b8b8cd36
[pixiv] support short novel URLs
https://www.pixiv.net/n/<ID>
2023-05-21 14:26:30 +02:00
Mike Fährmann
e6f55d1555
[imagechest] add API support and 'access-token' option (#4065) 2023-05-20 17:00:27 +02:00
Mike Fährmann
77abcf5ab3
[gofile] automatically fetch 'website-token' by default
the hardcoded token changed yet again
2023-05-20 17:00:27 +02:00
Mike Fährmann
e3fed9bd17
[tcbscans] update domain to 'tcbscans.com' (#4080) 2023-05-20 13:59:35 +02:00
Mike Fährmann
a83983c651
[instagram] add 'order-posts' option (#4017, #3993) 2023-05-18 22:50:12 +02:00
Mike Fährmann
d680623db3
[instagram] add 'order-files' option (#4017, #3993) 2023-05-18 22:34:33 +02:00
Naatie
f9b7a033e0 [misskey] refactor misskey extractor 2023-05-18 20:15:59 +07:00
Naatie
04dbfd994e [misskey] add my favorites extractor 2023-05-18 20:15:59 +07:00
Mike Fährmann
82a12d6126
[nsfwalbum] detect placeholder images
patch by an anonymous contributor
2023-05-17 15:18:10 +02:00
Mike Fährmann
011e4607c3
[poipiku] extract full 'descriptions' (#4066)
don't cut it off after the first line
2023-05-15 23:16:31 +02:00
Mike Fährmann
5037013e2b
[gofile] update 'website-token' (#4056) 2023-05-15 14:59:44 +02:00
Mike Fährmann
6b6bb4be73
[weibo] require numeric IDs to have length >= 10 (#4059) 2023-05-14 18:45:37 +02:00
Mike Fährmann
494acabd38
[danbooru] refactor pagination logic (#4002)
- only use 'b<ID>' when no other order is specified
- support 'a<ID>' when using 'order:id' as tag
2023-05-14 18:39:59 +02:00
Mike Fährmann
fd0e1ffd6e
[danbooru] improve 75666cf9 (#4002)
Search for direct post IDs instead of trying to
replicate the same results as the initial request.
2023-05-14 16:35:31 +02:00
Mike Fährmann
e41e45ff6b
[gofile] add basic password support (#4056) 2023-05-13 15:46:29 +02:00
Mike Fährmann
790dd365e1
[postprocessor:exec] support tilde expansion for 'command'
https://github.com/mikf/gallery-dl/issues/146#issuecomment-1544733532
2023-05-12 16:34:17 +02:00
Mike Fährmann
2e6cea95db
[cookies] update logging behavior (#4050)
- only show the same warning/error once
- simplify and capitalize logging messages
2023-05-12 16:34:17 +02:00
Mike Fährmann
20dc13f832
[pixiv] initial 'novel' support (#1241, #4044)
supported URLs are
- https://www.pixiv.net/novel/show.php?id=<ID>
- https://www.pixiv.net/novel/series/<ID>
- https://www.pixiv.net/en/users/<ID>/novels
2023-05-12 16:34:08 +02:00
Mike Fährmann
c698c3de44
[newgrounds] add default delay between requests (#4046) 2023-05-11 16:04:37 +02:00
Mike Fährmann
708f478d15
[danbooru][e621] add 'date' metadata field (#4047) 2023-05-11 15:59:42 +02:00
Mike Fährmann
306e13a4d4
release version 1.25.4 2023-05-07 20:57:36 +02:00
Mike Fährmann
35c23a2fd8
merge #4031: [mangadex] add 'status' and 'tags' metadata 2023-05-07 20:16:23 +02:00
Mike Fährmann
2266fc8cc5
[mangadex] update and extend test results 2023-05-07 20:14:49 +02:00
Janne Alaranta
1ce5dc9e18 fix whitespaces 2023-05-07 18:47:04 +03:00
Janne Alaranta
13dedae09f add status and tags info to mangadex extractor 2023-05-07 18:35:02 +03:00
Mike Fährmann
be0fa94b2e
[imagechest] load all images when a 'Load More' button is present
(#4028)
2023-05-07 14:46:48 +02:00
Mike Fährmann
7eadcbea70
[4chanarchives] add end condition for 'board' extractor (#4012) 2023-05-06 20:52:45 +02:00
Mike Fährmann
1406f7125f
[4chanarchives] add 'thread' and 'board' extractors (#4012) 2023-05-06 20:45:57 +02:00
Mike Fährmann
285391df43
add '-C' as short option for '--cookies'
and put cookie options into their own section
2023-05-06 20:45:44 +02:00
Mike Fährmann
b9b1cdd71b
add '--cookies-export' command-line option 2023-05-06 17:37:49 +02:00
Mike Fährmann
d12dd3813c
[imgur] fix internal image/album URLs
URLs from "link" attributes of newer images/albums were all returned
as 'https://imgur.com/gallery/...' instead of the expected format,
causing them to be ignored.
2023-05-06 15:13:38 +02:00
Mike Fährmann
8520de57f0
[imgur] add 'favorite-folder' extractor (#4016) 2023-05-06 15:10:13 +02:00
Mike Fährmann
4c1f3b2160
[cookies] simplify '_mac_absolute_time_to_posix()'
hardcode UNIX timestamp of 2001-01-01
2023-05-06 15:10:13 +02:00
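The simplification above relies on the fact that Mac absolute time counts seconds from 2001-01-01 00:00:00 UTC, which is 978307200 seconds after the Unix epoch. A sketch of such a conversion, with a function name chosen for illustration rather than copied from the repository:

```python
from datetime import datetime, timezone

# Unix timestamp of 2001-01-01 00:00:00 UTC, the Mac absolute time epoch.
MAC_EPOCH_OFFSET = 978307200


def mac_absolute_time_to_posix(timestamp):
    """Convert seconds since 2001-01-01 UTC to seconds since 1970-01-01 UTC."""
    return int(timestamp) + MAC_EPOCH_OFFSET


# Sanity check: the constant matches what datetime computes.
assert datetime(2001, 1, 1, tzinfo=timezone.utc).timestamp() == MAC_EPOCH_OFFSET
print(mac_absolute_time_to_posix(0))  # 978307200
```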
Mike Fährmann
a14b63d941
support selecting a domain for '--cookies-from-browser'
for example 'gallery-dl --cookies-from-browser firefox/twitter.com'
2023-05-06 15:09:49 +02:00
Mike Fährmann
3ca5dac8b6
extend 'cookies-update' functionality
Allow writing cookies to a different file than a given cookies.txt,
making it possible to export cookies imported with --cookies-from-browser

To convert browser cookies to cookies.txt format:
  gallery-dl --cookies-fr chromium \
             -o cookies-update=cookies.txt \
             --no-download \
             http://example.org/file.jpg
2023-05-04 15:10:47 +02:00
Mike Fährmann
bc6d65d203
implement 'Extractor.config_deprecated()'
a version of 'Extractor.config()'
that logs a warning when using a deprecated option name
2023-05-04 10:49:14 +02:00
Mike Fährmann
850df34c31
remove '&' from URL patterns part 2
follow-up on 968d3e8465
2023-05-03 20:26:25 +02:00
Mike Fährmann
4d415376d1
[pinterest] fix 'pin.it' extractor
it really was just the single '/' at the end of the url_shortener URL
2023-05-03 20:05:10 +02:00
Mike Fährmann
657b6a9100
[pinterest] update endpoint for related board pins 2023-05-03 18:41:09 +02:00
Mike Fährmann
79f47f98dd
[nana] remove module
permanently gone since 2023-03-13
2023-05-03 18:05:53 +02:00
Mike Fährmann
0e74df1de8
[420chan] remove module
offline since 2022-06-01
2023-05-03 17:46:21 +02:00
Mike Fährmann
7499fa7075
[exhentai] remove and update sad panda check
there hasn't been a sad panda in several years
2023-05-03 17:39:49 +02:00
Mike Fährmann
076380e079
remove '*' indicating keyword-only arguments
they are kind of unnecessary and
cause a non-insignificant function call overhead (~10%)
2023-05-02 22:23:33 +02:00
Mike Fährmann
0c46758a93
[foolslide] remove 'sensescans.com'
group moved to mangadex
https://mangadex.org/group/1071e71d-cc55-4fa6-81d1-4b5913a2fde5/sense-scans
2023-05-02 20:09:04 +02:00
Mike Fährmann
a08fdfac6e
[foolfuuka] add 'archive.palanq.win' 2023-05-02 19:58:55 +02:00
Mike Fährmann
1870df8b23
[foolfuuka] remove 'tokyochronos.net' 2023-05-02 19:25:50 +02:00
Mike Fährmann
ef4e2d8178
[foolfuuka] remove 'archive.alice.al' 2023-05-02 19:23:26 +02:00
Mike Fährmann
57cf942bb1
[config] include exception type in error message 2023-05-02 18:36:13 +02:00
Mike Fährmann
aa731c4298
[ytdl] run yt-dlp tests with latest code from master (#3989)
Only use PyPI version for Python 3.6, since that's no longer supported
by the current codebase.
2023-05-01 16:42:57 +02:00
Mike Fährmann
6a860876bc
release version 1.25.3 2023-04-30 18:35:03 +02:00
Mike Fährmann
b12dad8df5
[pixiv] fix 'pixivision' extraction 2023-04-30 15:35:32 +02:00
Mike Fährmann
5fb7107f2b
[imxto] fix 'gallery' extraction
support both single and double quotes
2023-04-30 15:23:13 +02:00
Mike Fährmann
15d7c5a199
[behance] 'items()' -> 'values()'
we only need 'size', 'name' is unnecessary
2023-04-30 13:53:51 +02:00
Mike Fährmann
61a65d5bb9
[ytdl] fix crash due to --geo-bypass deprecation (#3975) 2023-04-29 17:25:38 +02:00
Mike Fährmann
0fb580135d
[behance] fix extraction (#3980) 2023-04-29 16:18:35 +02:00
Alexandru Vasilescu
d4f8b2fe22 fix: linter issues 2023-04-28 13:45:23 +03:00
Alexandru Vasilescu
1b918bd937 fix(extractor): fix extraction for cross-posted reddit videos and galleries 2023-04-28 13:13:25 +03:00
Mike Fährmann
215028a462
[manganelo] match more minor version separators (#3972) 2023-04-27 13:12:11 +02:00
Mike Fährmann
c182094ebf
merge #3748: [downloader:http] add 'consume-content' option 2023-04-26 23:03:18 +02:00
thatfuckingbird
9f76783ac0 [pixiv] allow sorting by popularity (requires pixiv premium) 2023-04-26 22:49:29 +02:00
Mike Fährmann
7865067d19
[shimmie2] add generic extractors for Shimmie2 sites (#3734)
add support for
- loudbooru.com       (#3734)
- booru.cavemanon.xyz (#3734)
- giantessbooru.com   (#943)
- tentaclerape.net
2023-04-26 19:20:44 +02:00
Mike Fährmann
28419bf45a
[itchio] add 'game' extractor (#3923) 2023-04-26 19:20:43 +02:00
Mike Fährmann
3905f05f00
[postprocessor:metadata] support putting keys in quotes
for mode 'modify' and 'delete'
based on fe41a2b1
2023-04-25 14:30:18 +02:00
Mike Fährmann
7459e4abce
[postprocessor:metadata] fix traversing more than 1 level deep
for mode 'modify' and 'delete'
2023-04-25 14:17:25 +02:00
Mike Fährmann
5297ee0cd9
[tumblr] add 'day' extractor (#3951) 2023-04-24 22:01:47 +02:00
Mike Fährmann
de670bd7de
[tumblr] update pagination logic (#2191) 2023-04-24 20:07:10 +02:00
ClosedPort22
6f4a843fba
[downloader:http] release connection before logging messages
This allows connections to be properly released when using 'actions'
feature.
2023-04-24 23:59:36 +08:00
Mike Fährmann
98c9fdb414
[deviantart] revert e9353c63; retry downloads with private token 2023-04-23 21:10:16 +02:00
Mike Fährmann
5d7435e803
[nitter] extract user IDs from encoded banner URLs
still requires a banner to be present to begin with
2023-04-23 19:13:27 +02:00
Mike Fährmann
7f25cab56e
[sankaku] support post URLs with MD5 hashes (#3952) 2023-04-23 16:46:40 +02:00
Mike Fährmann
a05120412a
[oauth] catch exception from 'webbrowser.get()' (#3947)
It raises an exception instead of returning None
when no runnable browser is available.
2023-04-23 15:00:09 +02:00
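The behaviour noted above can be handled as in the following sketch: the standard library's `webbrowser.get()` raises `webbrowser.Error` when it cannot find a runnable browser, so the caller has to catch it. The wrapper name is hypothetical.

```python
import webbrowser


def get_browser_or_none():
    """Return a browser controller, or None if no runnable browser exists."""
    try:
        return webbrowser.get()
    except webbrowser.Error:
        # Raised instead of returning None when nothing usable is found.
        return None


browser = get_browser_or_none()
if browser is None:
    print("No browser available; open the authorization URL manually.")
```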
Mike Fährmann
3fc2223893
merge #3935: [reddit] match 'preview.redd.it' URLs 2023-04-21 20:09:20 +02:00
Mike Fährmann
1d505b39f8
[twitter] support 'profile-conversation' entries (#3938) 2023-04-21 15:08:50 +02:00
Mike Fährmann
aaf58a1259
[imgur] document 'client-id' option (#3937) 2023-04-21 15:08:50 +02:00
Mike Fährmann
202f5d86a7
[reddit] ignore 'id-max' value "zik0zj"/2147483647
(#3939, #3862, #3697, #3606, #3546, #3521, #3412)
2023-04-21 15:08:50 +02:00
Mike Fährmann
8586ee81be
[nana] fix 'keyword' tests 2023-04-21 15:08:50 +02:00
ClosedPort22
cd4bfb0dd1
[reddit] match 'preview.redd.it' URLs 2023-04-20 15:54:09 +08:00
Mike Fährmann
faca32a850
[sankaku] sanitize 'date:…' tags (#1790) 2023-04-19 20:09:11 +02:00
Mike Fährmann
6f1e34ec69
[vipergirls] add 'thread' and 'post' extractors
(#731, #2720, #3812)
2023-04-19 15:28:26 +02:00
Mike Fährmann
81bd2af83e
[2chen] update domain to sturdychan.help 2023-04-19 13:54:44 +02:00
Mike Fährmann
f500b45b5e
[twitter] improve 480bc34e
only check for double user assignment where necessary
2023-04-18 20:50:23 +02:00
Mike Fährmann
5b635f2317
[imxto] add 'gallery' extractor (#1289) 2023-04-17 20:49:09 +02:00
Mike Fährmann
359e31e462
[nozomi] update file URLs (#3925)
Static images are now only available in WebP format over the 'w'
subdomain. GIFs also got their own 'g' subdomain.
2023-04-17 15:42:42 +02:00
Mike Fährmann
2dfd4a3de2
[imagefap] extract 'categories' metadata and fix empty 'tags' 2023-04-17 14:49:50 +02:00
Mike Fährmann
480bc34e54
[twitter] do not overwrite previously assigned users (#3922) 2023-04-16 17:30:43 +02:00
Mike Fährmann
02ec5bb8e5
[imagefap] extract 'description' metadata (#3905) 2023-04-16 17:02:16 +02:00
Mike Fährmann
842f964c49
release version 1.25.2 2023-04-15 22:31:10 +02:00
Mike Fährmann
d253a3c542
merge #3841: [urlshortener] add support for bit.ly & t.co 2023-04-15 18:08:21 +02:00
Mike Fährmann
5e63942b37
[urlshortener] update 2023-04-15 18:06:06 +02:00
Mike Fährmann
2edcdee32f
[downloader:http] add MIME type and signature for .heic files
(#3915)
https://github.com/strukturag/libheif/issues/83
2023-04-15 17:09:22 +02:00
Mike Fährmann
c45f09d2a8
[imagechest] fix extraction (#3914) 2023-04-14 20:06:59 +02:00
Mike Fährmann
2cd4411ff8
[nitter] extract videos from 'source' elements (#3912) 2023-04-14 19:00:56 +02:00
Mike Fährmann
9501579279
[sexcom] fix fetching HD videos 2023-04-13 15:40:53 +02:00
Mike Fährmann
a2f7274eae
[sexcom] fix pagination (#3906) 2023-04-13 15:39:15 +02:00
Mike Fährmann
e9353c63d6
[deviantart] keep using private access tokens
for deviations returned from a private API call

also fixes a bug from 0a7eee3e where '_pagination()'
would never switch from unspecified (None) to private access token
2023-04-13 14:46:06 +02:00
Mike Fährmann
e70af6a550
[hentaifoundry] do not update filters when cookies are provided 2023-04-13 14:16:53 +02:00
Mike Fährmann
9c29c904c7
[mastodon] try to get account IDs without access token
Try to query the public '/api/v1/accounts/lookup' endpoint
and fall back to '/v1/accounts/search' if it returns an error.

'/api/v1/accounts/lookup' is available since Mastodon v3.4.0.
The version of an instance can be found at '/api/v1/instance'.
2023-04-13 14:03:23 +02:00
Mike Fährmann
1614c5c4bf
[generic] write regular expressions without 'x' flags 2023-04-10 20:45:23 +02:00
Mike Fährmann
d84a617273
[hentaifoundry] fix setting content filters (#3887) 2023-04-09 18:04:49 +02:00
ClosedPort22
875485313f
[urlshortener] force HTTPS 2023-04-09 18:19:52 +08:00
Mike Fährmann
0a7eee3ee0
[deviantart] add 'public' option 2023-04-08 23:04:34 +02:00
Mike Fährmann
f5a59c4170
[twitter] add 'date_bookmarked' metadata (#3816) 2023-04-06 20:16:25 +02:00
Mike Fährmann
1c1f6fdc80
[twitter] fix regression from 160335ad
Tweets from 'homeConversation' or 'conversationthread' entries do not
contain a 'sortIndex' field. Accessing it raises a KeyError and would
erroneously get them labeled as 'deleted'.
2023-04-06 19:22:48 +02:00
Mike Fährmann
160335ad44
[twitter] add 'date_liked' metadata for liked Tweets (#3816) 2023-04-06 18:33:45 +02:00
Mike Fährmann
6d850ce629
[twitter] calculate 'date' from Tweet IDs
20 times faster than parsing 'created_at'
2023-04-05 22:29:14 +02:00
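Deriving a date from a Tweet ID works because Tweet IDs are Snowflake IDs: the bits above the lowest 22 hold the number of milliseconds since the Twitter epoch (1288834974657 ms after the Unix epoch). The sketch below illustrates the idea; the function name is an assumption, and IDs created before Snowflake was introduced in late 2010 do not follow this layout.

```python
from datetime import datetime, timezone

TWITTER_EPOCH_MS = 1288834974657  # Snowflake epoch in Unix milliseconds


def tweet_id_to_date(tweet_id):
    """Return the creation time encoded in a Snowflake Tweet ID (UTC)."""
    unix_ms = (tweet_id >> 22) + TWITTER_EPOCH_MS
    # Integer division drops the milliseconds and avoids float rounding.
    return datetime.fromtimestamp(unix_ms // 1000, timezone.utc)


print(tweet_id_to_date(1541815603606036480))  # 2022-06-28 16:07:40+00:00
```

A bit shift and an addition replace parsing a date string such as 'created_at', which is presumably where the speedup comes from.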
Mike Fährmann
25949bd767
merge #3871: [hotleak] Fix downloading of creators whose name starts with a category name 2023-04-04 16:24:20 +02:00
Mike Fährmann
dbe06cdba1
[twitter] warn about 'withheld' Tweets and users (#3864) 2023-04-04 16:15:08 +02:00
Mike Fährmann
3cc1dd1572
[twitter] update query hashes 2023-04-03 23:20:20 +02:00
Mike Fährmann
3846ce0de5
[twitter] update to bookmark timeline v2 (#3859) 2023-04-03 22:46:12 +02:00
Mike Fährmann
34699fbf64
[deviantart:search] detect login redirects (#3860) 2023-04-03 19:37:12 +02:00
Mike Fährmann
e6cb92864a
[twitter] allow setting custom features per API endpoint 2023-04-03 16:18:31 +02:00
Balgden
4b141cce66
Fix indentation 2023-04-03 13:44:14 +00:00
Balgden
bbc5977121
Fix line length 2023-04-03 13:38:42 +00:00
Balgden
ffd30abcb3
[hotleak] Fix downloading of creators whose name starts with a category name
E.g. `hot4lexi` would start downloading the `hot` section by mistake

This happened because the regex had a negative lookahead for the category names, but didn't ensure that they were followed by either end-of-string or a slash.
2023-04-03 13:30:27 +00:00
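The lookahead problem from the commit above can be reproduced with a small sketch. The category list and both patterns are assumptions made for illustration; they are not the extractor's actual regular expressions.

```python
import re

# Hypothetical category names; the real extractor may use a different list.
CATEGORIES = "hot|creators|videos|photos"

# Rejects any name that merely starts with a category name.
BROKEN = re.compile(r"/(?!" + CATEGORIES + r")([^/?#]+)")

# Rejects a name only if a category name is followed by end-of-string or '/'.
FIXED = re.compile(r"/(?!(?:" + CATEGORIES + r")(?:$|/))([^/?#]+)")

print(BROKEN.match("/hot4lexi"))          # None: creator wrongly rejected
print(FIXED.match("/hot4lexi").group(1))  # hot4lexi
print(FIXED.match("/hot"))                # None: still treated as a category
```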
Mike Fährmann
5ca9d55595
merge #3870: [blogger] update 'sub' regex to get the highest resolution url 2023-04-03 14:47:18 +02:00
Mike Fährmann
fd7ce4c081
merge #3868: [shopify] fix 'collection' extractor 2023-04-03 14:44:46 +02:00
Mike Fährmann
135ac9c302
merge #3854: [twitter] fix: graphql_timeline_v2_bookmark_timeline cannot be null 2023-04-03 14:37:42 +02:00
enduser420
bbb1e34c34 [blogger] update sub regex 2023-04-03 12:43:58 +05:30
enduser420
96e3dd2128 [shopify] fix 'collection' extractor 2023-04-03 12:19:09 +05:30
Mike Fährmann
ac97aca99c
[realbooru] fix extraction
get file URLs from HTML pages
2023-04-02 20:45:16 +02:00
Mike Fährmann
75666cf9c3
[danbooru] reduce API requests for fetching extended 'metadata'
Instead of using one additional API request per post object (N+1),
this requires only one request per 200-post batch.
2023-04-02 20:11:52 +02:00
ClosedPort22
775d2ac999
[downloader:http] improve error logging when releasing connection 2023-03-31 20:08:38 +08:00
Amer Jazaerli
bebbff6578
fix: graphql_timeline_v2_bookmark_timeline cannot be null
twitter: 400 Bad Request (The following features cannot be null: graphql_timeline_v2_bookmark_timeline)
2023-03-31 00:06:49 +02:00
ClosedPort22
71b26adb9b
[urlshortener] add tinyurl.com as an example 2023-03-29 13:37:26 +08:00
Mike Fährmann
421db26aff
[bunkr] update domain to 'bunkr.la' 2023-03-28 20:10:36 +02:00
ClosedPort22
9e2a945013
[urlshortener] add support for bit.ly & t.co 2023-03-29 00:06:41 +08:00
Mike Fährmann
82f83c18e8
release version 1.25.1 2023-03-25 21:30:05 +01:00
Mike Fährmann
9b5e7ce8b9
[hiperdex] fix extraction 2023-03-25 18:18:27 +01:00
Mike Fährmann
89a67c45e0
[nitter] support nitter.it (#3819) 2023-03-25 13:29:22 +01:00
Mike Fährmann
88f29a751d
[nitter] skip broadcasts
instead of downloading an "Unsupported feature" HTML page
2023-03-25 13:09:24 +01:00
Mike Fährmann
1e013eba5a
[nitter] fix extraction for instances without user banners 2023-03-25 12:50:40 +01:00
Mike Fährmann
d94aa1ee02
[gelbooru] fix --range for favorites (#3704) 2023-03-23 22:58:13 +01:00
Mike Fährmann
1f82b00b8f
[gelbooru] fix and improve --range for pools 2023-03-23 18:22:46 +01:00
ClosedPort22
1a977f0f62
[downloader:http] handle exceptions in 'validate'
This isn't strictly necessary for 'exhentai.py', but it improves
efficiency when the adapter is reused
2023-03-23 19:57:13 +08:00
Mike Fährmann
197882cf12
[twitter] add 'hashtag' extractor (#3783) 2023-03-22 22:20:40 +01:00
Mike Fährmann
082d55de16
fix circular reference detection for -K 2023-03-21 23:46:36 +01:00
Mike Fährmann
2ab66ad899
update -K output to include quotes around keys 2023-03-21 22:28:04 +01:00
Mike Fährmann
fe41a2b159
[formatter] support putting keys in quotes
i.e. obj["key"] or obj['key']
as in f-strings
2023-03-21 22:06:54 +01:00
Mike Fährmann
46fdf46f21
[formatter] support loading an f-string from a template file
"\fTF ~/path/to/file.txt"
2023-03-20 22:05:33 +01:00
Mike Fährmann
1a4d4a799b
[formatter] support filesystem paths for \fM 2023-03-20 22:01:33 +01:00
Mike Fährmann
9789ebac52
[naverwebtoon] fix extraction (#3729) 2023-03-19 17:08:58 +01:00
Mike Fährmann
72f1f16eb2
[weibo] support 'mix_media_info' entries (#3793) 2023-03-18 15:19:25 +01:00
ClosedPort22
d4fb4ff47f
[twitter] extract TwitPic URLs in text (#3792)
also ignore previously seen URLs
2023-03-18 21:19:24 +08:00
Mike Fährmann
00f0233b28
[postprocessor:metadata] add 'skip' option (#3786) 2023-03-17 23:30:11 +01:00
Mike Fährmann
2bb937014f
[twitter] fall back to legacy /media endpoint when not logged in 2023-03-17 20:54:35 +01:00
Mike Fährmann
b68094d326
[twitter] support 'note_tweet's 2023-03-17 19:36:07 +01:00
Mike Fährmann
3dcabc97ed
[twitter] update API endpoints and parameters 2023-03-17 19:25:53 +01:00
Mike Fährmann
a1ca2404f9
add 'globals' instead of overwriting the default (#3773) 2023-03-16 18:37:00 +01:00
Mike Fährmann
dcb8af659a
[gelbooru] extract favorites without needing cookies (#3704)
TODO: fix --range
2023-03-15 19:21:35 +01:00
Mike Fährmann
b756dc13aa
[gelbooru] warn about missing cookies for favorites (#3704)
and add docstring so it shows up in --list-extractors
2023-03-15 14:58:55 +01:00
Mike Fährmann
17bd053d94
[hiperdex] fix extraction (#3768) 2023-03-15 14:28:03 +01:00
Mike Fährmann
f7ce33c85c
[output] set 'errors=replace' for output streams (#3765)
fixes regression from e480a933
2023-03-14 13:30:04 +01:00
Mike Fährmann
a14a2d6e59
release version 1.25.0 2023-03-11 21:05:28 +01:00
ClosedPort22
fcaeaf539c
[downloader:http] handle exceptions while consuming content 2023-03-11 21:36:37 +08:00
Mike Fährmann
4235d412c4
implement 'actions'
continuation of d37e7f48
but more versatile and extendable

Example:

"actions": [
    # change debug messages to info
    ["debug", "level ~info"],

    # change exit status to a non-zero value
    ["info:^No results for", "status |= 1"],

    # exit with status 2 on 429
    ["warning:429", "exit 2"],

    # restart extractor when no cookies found
    ["warning:^[Nn]o .*cookies", "restart"]
]
2023-03-10 22:08:10 +01:00
Mike Fährmann
817fc0fbd1
[nitter] remove nitter.pussthecat.org
"Shutdown"
2023-03-09 23:48:39 +01:00
Mike Fährmann
67ec91cdbd
[downloader:http] change '_http_retry' to accept a Python function
and rename '_http_retry_codes' to '_http_retry'

(#3569)
2023-03-09 23:30:15 +01:00
Mike Fährmann
175822e065
merge #3738: [generic] add tests 2023-03-09 22:26:20 +01:00
Mike Fährmann
4883420e67
[generic] revert pattern change 2023-03-09 22:25:23 +01:00
ClosedPort22
df77271438
[downloader:http] add 'consume-content' option
* fix connection not being released when the response is neither
  successful nor retried
* add the ability to consume the HTTP response body instead of closing
  the connection

reference:

https://docs.python-requests.org/en/latest/user/advanced/#body-content-workflow
2023-03-09 21:07:10 +08:00
Mike Fährmann
9037128315
[twitter] fix some 'original' retweets not downloading (#3744) 2023-03-08 18:33:19 +01:00
Mike Fährmann
ea3d95e7e8
merge #3740: [deviantart] add support for fxdeviantart.com URLs 2023-03-08 17:30:06 +01:00
Mike Fährmann
9abcb2b6e5
update headers and ciphers for '"browser": "chrome"' 2023-03-08 17:19:59 +01:00
ClosedPort22
c489aecb3e
[deviantart] add support for fxdeviantart.com URLs
fxdeviantart.com is a service that fixes embeds on Discord, similar to
fxtwitter.com
2023-03-07 16:04:17 +08:00
ClosedPort22
34a7fab0e2
[generic] add support for IDNs
(internationalized domain name)
2023-03-06 22:42:36 +08:00
Mike Fährmann
c9a7345228
[newgrounds] prevent archive ID overlap (#3681)
add an 'i' and 'a' prefix to image and audio files
(/art/view/, /audio/listen/)
since their numeric ID may conflict with movies and other media
2023-03-06 15:03:49 +01:00