1
0
mirror of https://github.com/mikf/gallery-dl.git synced 2024-11-22 10:42:34 +01:00
Commit Graph

5579 Commits

Author SHA1 Message Date
Shelvacu
f8e707b92c [philomena] switch default ponybooru filter to get everything by default
The system filter mislabeled "Everything" hides 4 tags https://ponybooru.org/filters/2

There are [many public filters that don't hide anything](https://ponybooru.org/filters?fq=spoilered_count%3A0%2C+hidden_count%3A0), I just picked [the oldest one](https://ponybooru.org/filters/3).
2024-11-07 20:08:42 -08:00
Mike Fährmann
ce90566c56
[pinterest] detect video/audio by block content (#6421)
story blocks from search/board results do not always contain a 'type'
2024-11-05 15:55:24 +01:00
Mike Fährmann
a9a9f3a180
[pinterest] support 'story_pin_music_block' blocks (#6421) 2024-11-05 15:55:24 +01:00
Mike Fährmann
0b3ddd01af
[hiperdex] update domain to 'hipertoon.com' (#6420)
and fix 'description' extraction
2024-11-05 15:54:42 +01:00
Mike Fährmann
9afbe91f82
[rule34xyz] add 'format' option (#1078) 2024-11-05 15:45:52 +01:00
Mike Fährmann
51b16d078b
[rule34xyz] ensure 'files' keys are strings (#1078)
fixes -K/--list-keywords
2024-11-05 09:34:17 +01:00
Mike Fährmann
cf0817e17b
[cookies] try decryption with empty password as fallback
https://github.com/yt-dlp/yt-dlp/pull/6897
2024-11-05 08:34:24 +01:00
Mike Fährmann
d2db7060e2
[cookies] support chromium table version 24 (#6162)
https://github.com/yt-dlp/yt-dlp/pull/11425
2024-11-04 19:12:50 +01:00
Mike Fährmann
d77f5154a5
[aes] fix GCM pad length calculation
https://github.com/yt-dlp/yt-dlp/pull/11438
2024-11-03 21:25:45 +01:00
Mike Fährmann
390b8ddd3e
[common] emit logging messages for --write-pages files 2024-11-03 20:38:33 +01:00
Mike Fährmann
cb0d8cae77
merge #6227: [everia] add support (#1067, #2472, #4091) 2024-11-03 17:52:17 +01:00
Mike Fährmann
cea062ffc5
[everia] update
- implement general _pagination method
- simplify code
- adjust URL patterns
- update test results
2024-11-03 17:51:04 +01:00
missionfloyd
d31a3b5da3
[everia.club] Add support
- Unescape title and URL
- Add tags and categories metadata
    Lookup tag id with API instead of downloading tag page
- Add category extractor
- Add tests
- Rename EveriaExtractor to EveriaPostExtractor
- Fix EveriaPostExtractor example
- Lookup tags/categories by post id
- Add date extractor
- Remove leftover pages parameter
- Add error handling for invalid dates.
- Add filename numbering
    Parse date
- Rename extract() to images()
- Remove html import
- Fix search/date URLs with page number
- Fix tag/category search
- Fix post extractor
- Fix tag, category extractors
- Fix search extractor
- Only load first page once
- Fix date extractor
- Fix tests
- Clean up search extractor
2024-11-03 14:09:07 +01:00
Mike Fährmann
9b59af8d8d
[instagram] fix using numeric cursor values (#6414) 2024-11-03 12:03:01 +01:00
Mike Fährmann
d787c0c4ea
[rule34xyz] add support (#1078, #4960) 2024-11-03 10:12:26 +01:00
Mike Fährmann
7c0d2ca07d
[rule34vault] update
- implement 'tags' categorization
- don't use 'totalCount' for pagination end
- update tests
2024-11-03 09:59:25 +01:00
Mike Fährmann
d5fa1d6aba
[sankaku] improve tag categorization code
translate tag type ID to name for each category
instead of for each tag
2024-11-03 09:21:39 +01:00
Delphox
565dc5b43b
[bluesky] match fxbsky.app and vxbsky.app 2024-11-02 16:00:43 -03:00
Mike Fährmann
93adfbe935
merge #6410: [bluesky] match common bluesky embed fixes 2024-11-02 18:28:07 +01:00
Mike Fährmann
cd47425ccc
[bluesky] fix downloads from non-bsky PDSs (#6406) 2024-11-02 18:22:34 +01:00
Mike Fährmann
9deed87340
[bluesky] add 'author["instance"]' metadata (#4438) 2024-11-02 17:37:11 +01:00
Delphox
80c7246732
[bluesky] match cbsky.app, bskye.app, bskyx.app and bsyy.app urls 2024-11-02 13:04:32 -03:00
Mike Fährmann
99fe2b1f55
[bluesky] support 'main.bsky.dev' URLs (#4438) 2024-11-02 15:33:31 +01:00
Mike Fährmann
b92edb4614
[boosty] update default video format list (#2387) 2024-10-31 20:55:32 +01:00
Mike Fährmann
b321696182
[pixiv] get ugoira frame extension from 'meta_single_page' (#6056)
saves 1-3 HEAD requests
2024-10-31 19:47:05 +01:00
Mike Fährmann
f79e57b71e
[dl:ytdl] change 'forward-cookies' default to 'true' (#6401, #6348)
revert dba87ca99e
2024-10-31 17:35:08 +01:00
Mike Fährmann
24408cd450
[patreon] use legacy mobile UA when no 'session_id' is set 2024-10-31 17:26:33 +01:00
Mike Fährmann
e3fbd6825b
[twitter] remove cookies migration workaround
revert 141efc2ad3
2024-10-31 17:10:13 +01:00
Mike Fährmann
6f54328a39
[hitomi] update
- remove f-strings
- fix flake8 warnings
- move tests to test/results/hitomi.py
2024-10-29 16:56:52 +01:00
space-nuko
f170d73ffc
[hitomi] add 'index' and 'search' extractors
- Support hitomi.la multiple tag searches
- Support hitomi.la index searches
- Fix tests
2024-10-29 16:55:52 +01:00
Allen
0f94fa9015
[tumblr] search extractor minimal styling changes 2024-10-29 13:06:23 +01:00
Mike Fährmann
57e19a63a5
[tumblrgallery] fix file downloads (#6391)
do not send "https://tumblrgallery.xyz/" Referer headers by default
2024-10-29 08:38:59 +01:00
Mike Fährmann
655e42dc92
merge #6240: [rule34vault] add support (#5708) 2024-10-28 22:31:05 +01:00
Mike Fährmann
a974c30057
[rule34vault] update
- use more of BooruExtractor's interface
- use BASE_PATTERN
- define generic pagination method
- parse 'date' metadata
- make tags & extended metadata optional
2024-10-28 22:28:23 +01:00
ssdaniel24
3d0263b3ab
[rule34vault] Added initial support for rule34vault.com
- Added playlists support for rule34vault
- Added support for posts in rule34vault
- Fixed supported sites with script
- Fixed posts pattern in rule34vault
- Added tests for rule34vault
- Clean
- Fixed lint warnings
2024-10-28 22:26:47 +01:00
Mike Fährmann
4a6e5af28e
[noop] insert noop extractor when loading cookies without URL
allows writing --cookies-from-browser results with just
gallery-dl --cookies-from-browser BROWSER --cookies-export DEST
2024-10-28 19:45:32 +01:00
Mike Fährmann
5de8576ff6
[noop] add 'noop' extractor 2024-10-28 19:45:24 +01:00
Mike Fährmann
76f855d87b
[output] use ANSI colors only when stream is a TTY 2024-10-28 19:36:42 +01:00
Mike Fährmann
ee61256054
[output] define and use global TTY_STD... values 2024-10-28 14:59:14 +01:00
Mike Fährmann
e93b8efe7b
[output] make it easier to disable output to stdout
allow 'output.mode' to be set to any falsy value
instead of requiring it to be the string "null"
2024-10-28 14:49:52 +01:00
Mike Fährmann
2bbae0e98d
merge #6389: [reddit] support user profile share links 2024-10-28 14:46:18 +01:00
Mike Fährmann
604879d1a3
[reddit] simplify
- use 'self.groups' to access matched values
- use 'str.format()' to build URL
2024-10-28 14:44:15 +01:00
inty
625fe0efce [reddit] support user profile share links 2024-10-28 14:58:06 +03:00
Mike Fährmann
10c076e7f2
[saint] add 'album' and 'media' extractors (#4405, #6324) 2024-10-27 22:27:30 +01:00
Mike Fährmann
3cbf15a107
[8chan] don't make network requests in '_init()' (#6387) 2024-10-27 19:36:49 +01:00
Mike Fährmann
6693ae19e8
[civitai] add 'metadata' option (#6383) 2024-10-27 15:46:00 +01:00
Mike Fährmann
a4791f5243
[bluesky] add 'hashtag' extractor (#4438)
https://github.com/mikf/gallery-dl/issues/4438#issuecomment-2439979958
2024-10-27 13:59:46 +01:00
Mike Fährmann
8dce52471a
[pixiv] restructure file extraction code 2024-10-27 09:01:12 +01:00
Mike Fährmann
ae448f8ca3
[pixiv] fix/improve PHPSESSID warning code 2024-10-27 08:59:46 +01:00
Mike Fährmann
cc07c81063
[mangadex] apply 'lang' only to chapter results (#6372) 2024-10-27 07:55:14 +01:00
Mike Fährmann
7e43c9292d
release version 1.27.7 2024-10-25 18:59:39 +02:00
Mike Fährmann
8f396cfc57
[bluesky] add 'quoted' option (#6323) 2024-10-25 17:22:33 +02:00
Mike Fährmann
d34e2d56aa
[mangadex] use '/manga' endpoint to get manga by author (#6372)
allows for more than 10 manga to be returned
2024-10-24 20:07:11 +02:00
Mike Fährmann
0fd98f67ba
[mangadex] add 'author' extractor (#6372) 2024-10-24 14:57:17 +02:00
Mike Fährmann
c243a7b060
[lensdump] fix extraction (#6313)
- support custom sort orders & query parameters
- rewrite pagination logic
2024-10-24 13:59:39 +02:00
Mike Fährmann
8f0b0b0735
[wikimedia] fix non-English Fandom/wiki.gg articles (#6370) 2024-10-24 13:00:02 +02:00
Mike Fährmann
bce3c4b424
[bunkr] try different domain when encountering a CF challenge page
(#6344, #6352, #6368)
2024-10-23 21:17:01 +02:00
Mike Fährmann
75674944f0
[pixiv] implement 'sanity_level' workaround for user artworks results
(#4327, #5435, #6339)
2024-10-22 20:38:00 +02:00
Mike Fährmann
b08da4ffc7
[reddit] add 'embeds' option (#6357) 2024-10-22 17:06:54 +02:00
Mike Fährmann
33161da121
[pixiv] add 'captions' option (#4327)
make extra requests for empty captions independent of 'sanity'
2024-10-22 16:31:37 +02:00
Mike Fährmann
2d15e64ea9
[pixiv] fix exception when processing deleted sanity_level work (#6339)
https://github.com/mikf/gallery-dl/issues/6339#issuecomment-2427350524
2024-10-21 21:13:26 +02:00
Mike Fährmann
9a2bccd0d9
[bunkr] update to new site layout (#6344, #6352) 2024-10-21 14:17:28 +02:00
Mike Fährmann
66aa514c25
[scrolller] add initial support (#295, #3418, #5051) 2024-10-21 14:17:18 +02:00
Mike Fährmann
eacd4479a2
[patreon] support '/c/' before creator name in URLs (#6348)
and avoid '/CREATOR/posts' redirects
2024-10-21 14:11:19 +02:00
Mike Fährmann
cff3d37551
[civitai] update "My Reactions" handling (#6263)
- use "reactions" subcategory
- raise exception when no api-key or cookies are present
2024-10-20 14:56:56 +02:00
Mike Fährmann
10c56a561d
add '--print' and '--print-to-file' command-line options 2024-10-20 10:06:04 +02:00
Mike Fährmann
2e1dab3036
[pp] add 'error' event 2024-10-19 20:30:34 +02:00
Mike Fährmann
5d984f35aa
[pinterest] support 'story' pins (#6188, #6078, #4229) 2024-10-19 17:47:31 +02:00
Mike Fährmann
5477ed181d
[pinterest] move file extraction into separate method 2024-10-18 20:55:20 +02:00
Mike Fährmann
9751b716ef
merge #6328: [vk] add 'offset' option 2024-10-17 21:21:52 +02:00
Mike Fährmann
0e4e40c9d2
[vk] document 'offset', add '--range' support 2024-10-17 21:20:21 +02:00
Mike Fährmann
4667833195
[util] add 'std' object to global eval namespace (#6330)
allows accessing standard library modules (and other external modules)
in a more straightforward manner than '__import__(...)'

* std.os.getcwd()
* std["os"].getcwd()
2024-10-17 21:08:07 +02:00
Mike Fährmann
d3dcc44bd1
use child fallbacks only when a non-user error occurs (#6329) 2024-10-17 08:04:41 +02:00
Mike Fährmann
4b31601dc0
[civitai] fix "My Reactions" results (#6263)
requires 'api-key' or authenticated cookies to work
2024-10-16 23:18:00 +02:00
Mike Fährmann
69a75b1de2
[civitai] add extractors for global 'models' and 'images' (#6310) 2024-10-16 23:00:51 +02:00
Mike Fährmann
a37ac3c748
[civitai] add 'uuid' metadata field & use as archive format (#6326) 2024-10-16 19:56:09 +02:00
Mike Fährmann
a78f33d0fc
[civitai] fix 'model' file download URLs for tRPC API 2024-10-16 15:18:28 +02:00
Mike Fährmann
1824267447
[dl:ytdl] implement explicit HLS/DASH handling
add '_ytdl_manifest' to specify a manifest type to process
2024-10-16 15:16:21 +02:00
Sergey Kolesnik
67be377c77
Update vk.py: add "offset" option 2024-10-16 02:07:58 +03:00
Mike Fährmann
67547a659f
[cookies] convert Chromium expires_utc values to Unix timestamps 2024-10-15 08:31:46 +02:00
Mike Fährmann
36883e458e
use 'v[0] == "c"' instead of 'v.startswith("c")' 2024-10-15 08:24:06 +02:00
Mike Fährmann
d68bb78f44
use 'True if COND else False' for bool conversions
faster than 'bool(COND)', especially on older Pythons < 3.11
2024-10-14 20:57:47 +02:00
Mike Fährmann
c148e200a4
[bunkr] send proper Referer headers for file downloads (#6319) 2024-10-14 16:28:24 +02:00
Mike Fährmann
6506afb257
[bunkr] support 'bunkr.ps' URLs 2024-10-14 16:27:09 +02:00
Mike Fährmann
4358799833
[newgrounds] support page numbers in URLs (#6320)
https://USER.newgrounds.com/art/?page=5
https://USER.newgrounds.com/art/page/5
2024-10-14 14:25:09 +02:00
Mike Fährmann
d7e34e1dc3
[8chan] automatically determine TOS cookie name (#6318) 2024-10-14 14:25:09 +02:00
Mike Fährmann
93265db9b3
[8chan] support '/last/' thread URLs (#6318) 2024-10-14 14:25:09 +02:00
Mike Fährmann
c7f0d8945b
[pixiv] make retrieving ugoira metadata non-fatal (#6297)
update to 6ae3a5cdb0
catch all exceptions instead of just StopException
2024-10-14 14:25:09 +02:00
Mike Fährmann
3946fe5ac4
[cookies] return loaded cookies as list
don't set_cookie() them immediately into a CookieJar
also, give some more consistent names to chrome/chromium functions
2024-10-14 14:24:27 +02:00
Mike Fährmann
39b7d748f6
[urlgalleries] fix extraction
accept HTTP status 500 as valid pages
2024-10-11 20:16:32 +02:00
Mike Fährmann
1023b2d85d
[exception] implement constructing HttpError from response 2024-10-11 19:57:38 +02:00
Mike Fährmann
6d8d882dbf
[common] allow request() to accept all HTTP status codes
by passing Ellipsis/... as 'fatal' argument
2024-10-11 19:49:16 +02:00
Mike Fährmann
1df09292b5
consistent __exit__() argument names 2024-10-11 16:43:28 +02:00
Mike Fährmann
bca9a1a1e5
release version 1.27.6 2024-10-11 14:31:17 +02:00
Mike Fährmann
f8bb0fd6cb
[pixiv][danbooru] set '_ugoira_original' for ugoira results 2024-10-11 13:19:45 +02:00
Mike Fährmann
5dd17910d1
[deviantart] fix sta.sh folder extraction 2024-10-11 09:51:20 +02:00
Mike Fährmann
1f5f04e673
[pixiv] make '_request_ajax()' non-fatal 2024-10-11 07:55:02 +02:00
Mike Fährmann
bc954eb7e4
[civitai] update default archive format (#6302) 2024-10-10 20:29:07 +02:00
Mike Fährmann
0f7d032773
[shimmie2] use 'self.groups' to access matched URL values 2024-10-10 19:01:30 +02:00
Mike Fährmann
09d4c281b6
[shimmie2] remove 'loudbooru.com' 2024-10-10 18:32:42 +02:00