1
0
mirror of https://github.com/mikf/gallery-dl.git synced 2024-11-25 12:12:34 +01:00
Commit Graph

1081 Commits

Author SHA1 Message Date
Mike Fährmann
ef8b1bc56e
[twitter] extract 'type' metadata (#6111) 2024-08-31 13:16:51 +02:00
Mike Fährmann
510ca36b35
[tests] fix bug when running tests in a certain order
test_ytdl -> test_downloader -> test_extractor
would cause a test failure in Python <3.6 related to youtube_dl imports
2024-08-31 09:42:30 +02:00
Mike Fährmann
17f5ba43a8
[pp:rename] add 'rename' post processor (#5846, #6044)
renames previously downloaded files to a different filename format
2024-08-30 18:21:36 +02:00
Mike Fährmann
e464c955b3
merge #6104: [generic] better directory names 2024-08-30 11:47:09 +02:00
Allen
a552fb5bcd
[generic] better directory formatting/md 2024-08-29 22:30:48 +02:00
Mike Fährmann
127aa45834
[ytdl] ignore SyntaxErrors when trying to import a module 2024-08-29 19:28:08 +02:00
Mike Fährmann
4b286e80fd
merge #6050: [wikimedia] add 'wiki' extractor 2024-08-25 09:38:24 +02:00
Mike Fährmann
65cae11b54
merge #6077: [flickr] fix bug introduced in e92a9ae3 2024-08-25 09:30:07 +02:00
Mike Fährmann
47e7be31a5
[flickr] add 'contexts' and 'exif' test 2024-08-25 09:28:22 +02:00
Mike Fährmann
4b94b7d477
[pp:metadata] add 'include' and 'exclude' options (#6058) 2024-08-19 21:58:57 +02:00
Mike Fährmann
80c423a444
[cyberdrop] fix extraction 2024-08-19 13:37:14 +02:00
Mike Fährmann
6af26a424a
[hitomi] extract 'extension_original' metadata (#6049) 2024-08-18 12:56:50 +02:00
ClosedPort22
33d2ddd9fb
[wikimedia] add 'wiki' extractor 2024-08-18 13:20:59 +08:00
Mike Fährmann
8ea75202ed
[bunkr] fix file downloads (#6037) 2024-08-17 19:25:19 +02:00
Mike Fährmann
ef4c1b4fc5
[sankaku] restore old 'tags' format (#6043)
lowercase + words separated by underscores
2024-08-17 19:25:19 +02:00
Mike Fährmann
78ae0ba9f7
[formatter] implement 'A' format specifier (#6036) 2024-08-16 20:34:23 +02:00
Mike Fährmann
01e8433889
fix tests & syntax for older Python versions 2024-08-15 18:16:55 +02:00
Mike Fährmann
0bbdeaaa50
[bunkr] support 'bunkr:' URL prefix (#6017)
and support 'bunkr.ax' by default
2024-08-14 19:00:59 +02:00
Mike Fährmann
f0de1685ae
[batoto] extract 'chapter_url' (#5562)
i.e. the chapter number found in the full chapter URL
2024-08-13 20:57:33 +02:00
Mike Fährmann
5208c0d28a
[util] extend CustomNone with an __index__ method (#6009)
- Make it compatible with functions expecting integer arguments
- Simplify and reuse some method definitions
2024-08-13 16:05:54 +02:00
Mike Fährmann
ad73789256
[util] extend CustomNone with arithmetic operators (#6007) 2024-08-13 09:41:12 +02:00
Mike Fährmann
6193c8eabc
merge #5997: [batoto] improve chapter info regex (#5988)
… and add 'chapter_string' metadata
2024-08-12 17:54:17 +02:00
Mike Fährmann
224821bd21
[batoto] add test, improve 'info' handling 2024-08-12 17:52:28 +02:00
Mike Fährmann
8cd625fe2b
merge #5983: [pixiv] use mobile API for series 2024-08-12 11:53:37 +02:00
Mike Fährmann
b33a53968d
[pixiv] incorporate into '_pagination' 2024-08-12 11:50:41 +02:00
Mike Fährmann
785e6f2911
[tumblr] fix 401 Unauthorized for likes when using api-key (#5994)
fixes regression introduced in 540eaa5a
2024-08-12 09:09:59 +02:00
Mike Fährmann
d1fe81dab0
merge #5970: [bunkr] support 'bunkr.ci' and 'bunkrrr.org' 2024-08-10 12:26:39 +02:00
Mike Fährmann
ad6ed72307
[bunkr] add tests for new domains 2024-08-10 12:25:43 +02:00
Mike Fährmann
8d1805e336
[hentaicosplays] support 'hentai-cosplay-xxx.com' (#5959) 2024-08-10 12:14:18 +02:00
Mike Fährmann
6e7da6310c
[behance] fix video extraction (#5965)
a lot slower than before since each video now requires an extra HTTP
request and 'sleep-request' is set to 2s-4s by default.

it now also requires ytdl.
2024-08-10 11:06:54 +02:00
Mike Fährmann
c07a074418
[furaffinity] remove end of system messages 2024-08-07 18:39:34 +02:00
Mike Fährmann
846512f6cd
[furaffinity] add 'submissions' extractor (#5954) 2024-08-07 16:34:32 +02:00
Mike Fährmann
b4733b7ea8
[fanbox] fix pagination (#5949, #5951, #5956) 2024-08-07 10:16:14 +02:00
Mike Fährmann
605f6fba2c
[pp:metadata] implement 'base-directory' option (#5262, #5728) 2024-08-07 10:15:06 +02:00
Mike Fährmann
575e5df539
[hotleak] fix AttributeError (#5950)
fixes regression introduced in 0432e057
2024-08-07 08:26:23 +02:00
Mike Fährmann
bb07313227
[deviantart] try to work around journal/status API changes (#5916)
The new /user/profile/posts endpoint is documented to
"return all journals & status updates for a given user in a single feed"
but it does not do this.

There are usually several or even all status updates missing.
2024-08-03 13:17:27 +02:00
Mike Fährmann
f87783ff26
[cien] add 'files' option (#2885) 2024-08-03 11:37:14 +02:00
Mike Fährmann
f68b0efc1b
[wallpapercave] support 'album' listings (#5925) 2024-08-02 16:13:09 +02:00
Mike Fährmann
a5be680596
[directlink] extend recognized file extensions (#5924)
bmp, svg, avif, heic, psd, pdf, m4v, mov, wav, mp3, zip, rar, 7z, swf
2024-08-02 16:13:09 +02:00
Mike Fährmann
9be7896f78
[koharu] fix 'count' for 'cbz' downloads (#5893) 2024-08-02 16:13:03 +02:00
Mike Fährmann
42388dc819
[tests] fix 'pattern' not being compiled before running a test
fixes regression introduced in 3fa74ca4
2024-08-01 12:35:01 +02:00
Mike Fährmann
c372242a06
[koharu] add 'favorite' extractor (#5893) 2024-08-01 12:33:18 +02:00
Mike Fährmann
2bf76461ce
[deviantart:following] use OAuth API endpoint (#2511) 2024-07-31 17:21:39 +02:00
Mike Fährmann
095f278d6f
[vsco] add 'include' option (#5911) 2024-07-31 12:32:04 +02:00
Mike Fährmann
ff58683b76
[koharu] send necessary headers for image downloads (#5893) 2024-07-30 19:39:50 +02:00
Mike Fährmann
399ba85841
[fallenangels] remove module 2024-07-30 17:33:16 +02:00
Mike Fährmann
84eefeebd6
[sankaku] match URLs with 'www' subdomain (#5907) 2024-07-30 17:05:22 +02:00
Mike Fährmann
279854cd9e
[agnph] implement 'tags' option (#5284) 2024-07-30 14:16:19 +02:00
Mike Fährmann
ca3e7a5e5a
[koharu] add 'date', simplify 'tags' (#5893) 2024-07-30 12:38:19 +02:00
Mike Fährmann
9c91935fe4
merge #5904: [dynastyscans] extract chapter 'tags' 2024-07-29 21:17:13 +02:00
enduser420
5a36da2968 [dynastyscans] extract chapter 'tags' 2024-07-29 16:50:47 +05:30
Mike Fährmann
8a6e208605
[zerochan] fix 'Invalid control character' errors (#5892) 2024-07-29 11:24:17 +02:00
Mike Fährmann
aa6d00613f
[cien] initial support (#2885, #4103, #5240) 2024-07-28 19:27:12 +02:00
Mike Fährmann
65d7cccaf9
merge #5899: [redgifs] support URLs with numeric IDs (#5898) 2024-07-28 12:35:57 +02:00
Mike Fährmann
4e245c94a8
[redgifs] add test for numeric ID 2024-07-28 12:33:28 +02:00
Mike Fährmann
c9aeedeafd
[koharu] add 'gallery' and 'search' extractors (#5893, #4707) 2024-07-28 12:22:18 +02:00
Mike Fährmann
226ead728e
[agnph] add 'tag' and 'post' extractors (#5284, #5890) 2024-07-27 12:17:47 +02:00
Mike Fährmann
d7a2c73274
[util] let a CustomNone instance be equal to itself 2024-07-26 20:56:01 +02:00
Mike Fährmann
b5e141ed6e
[sankakucomplex] update domain to 'news.sankakucomplex.com' 2024-07-26 20:39:55 +02:00
Mike Fährmann
f321272b7c
[ytdl] fix --cookies-from-browser option parsing 2024-07-25 17:45:25 +02:00
Mike Fährmann
5207a0c2e0
[zerochan] implement 'tags' option (#5874)
allow splitting tags into separate lists by category
2024-07-23 10:21:33 +02:00
Mike Fährmann
1aadc29c5b
[zerochan] fix 'source' extraction 2024-07-23 09:34:44 +02:00
Mike Fährmann
5f6d20c595
[tests] remove internal extractor check
revert 60a2fefedd.
2024-07-22 18:40:02 +02:00
Mike Fährmann
3eba1f7c29
[tests] load results from ${GDL_TEST_RESULTS} (#5262) 2024-07-22 18:35:50 +02:00
Mike Fährmann
db9833c28a
merge #5870: [aryion] add 'favorite' extractor (#4511) 2024-07-21 12:37:47 +02:00
Mike Fährmann
156a70bec0
[aryion] update favorite extractor
- add test case
- add docs/supportedsites entry
- add custom directory_fmt and archive_fmt
- remove constructor
- appease flake8
2024-07-21 12:34:06 +02:00
Mike Fährmann
727e53f513
[bunkr] support 'bunkr.fi' URLs (#5872) 2024-07-21 06:41:50 +02:00
Mike Fährmann
287a7d13cf
[sankaku] implement 'notes' extraction (#5865) 2024-07-18 20:44:49 +02:00
Mike Fährmann
026e0b97db
merge #5824: [furaffinity] add 'folders' and 'thumbnail' (#1284) 2024-07-18 01:41:15 +02:00
Mike Fährmann
60a2fefedd
[tests] restrict 'test_unique_pattern_matches' to internal extractors
https://github.com/mikf/gallery-dl/issues/5262#issuecomment-2231649983
2024-07-17 23:35:41 +02:00
Mike Fährmann
3fa74ca4d7
[tests] enable test results for external extractors (#5262)
https://github.com/mikf/gallery-dl/issues/5262#issuecomment-2231649983
2024-07-17 22:29:09 +02:00
Mike Fährmann
f7383a56f8
wrap filters/conditionals in a try-except block
allows accessing undefined fields without exception or locals().get(…)
but hides mistakes/typos/etc by evaluating to False without feedback

performance loss compared to the previous version without try-except
is negligible (~20ns for me)
2024-07-12 22:51:11 +02:00
Mike Fährmann
c83c812a1e
[instagram][twitter] rename 'profile' to 'info' (#5262, #3623) 2024-07-11 00:22:39 +02:00
Mike Fährmann
21831eba1e
[tests] completely ignore '#auth' for 'only_matching' tests 2024-07-11 00:20:10 +02:00
Mike Fährmann
1527ad79e2
[tests] fix syntax for Python < 3.6
no f-strings
2024-07-06 18:39:59 +02:00
Mike Fährmann
da9916c01f
[pp:metadata] implement format strings for 'directory' (#5728) 2024-07-06 03:08:59 +02:00
Mike Fährmann
8f3f061daf
[hentainexus] fix error for spread pages (#5827) 2024-07-05 21:36:29 +02:00
Nicholas Bishop
f43bccb5be [furaffinity] Add 'thumbnail' (#1284) and 'folders' properties
Retrieve 'thumbnail' and 'folders' properties for each post.
'thumbnail' (#1284):
 - Preview image used for search results, writing posts, music, etc.
 - Filename format: <post_id>@600-<directory_containing_full_image>.jpg
'Folders' (related to #1817):
 - A list of all gallery folders containing this post
 - Folder name format: [<folder_category> - ]<folder_name>
 - Only works on new layout; old layout does not show folders, so list will be empty

A test is included for each property.
2024-07-04 15:41:14 -04:00
Mike Fährmann
d10bfa9065
[vipergirls] improve 'thread' URL pattern
allow for query parameters and fragments at tne end of URLs
2024-06-29 17:38:57 +02:00
Mike Fährmann
44896b0296
[instagram] add 'profile' extractor (#5262)
https://github.com/mikf/gallery-dl/issues/5262#issuecomment-2188915210
2024-06-28 22:54:07 +02:00
Mike Fährmann
51fdfbe6fc
[erome] extract 'date' metadata (#5796) 2024-06-27 21:07:23 +02:00
Mike Fährmann
8f50c04af2
[formatter] implement 'X' format specifier (#5770) 2024-06-21 20:56:19 +02:00
enduser420
5adbfe526d [tcbscans] support other domains 2024-06-21 21:29:11 +05:30
Mike Fährmann
f58b0e6fc7
[twitter] ignore 'Unavailable' media (#5736)
… including geo-restricted content.

add 'unavailable' option to allow re-enabling them again
2024-06-21 00:15:10 +02:00
Mike Fährmann
8452d04a33
[fanbox] handle KeyError for no longer existing plans (#5759)
return the plan of the next higher tier instead
2024-06-21 00:13:25 +02:00
Mike Fährmann
ae3706286a
[speakerdeck] inherit from GalleryExtractor 2024-06-15 21:56:51 +02:00
Mike Fährmann
9c65db2a92
consistent 'with open(…) as fp:' syntax 2024-06-14 01:22:00 +02:00
Mike Fährmann
c6fc0281e8
[newgrounds] extend 'format' option (#5709)
- check more extensions for original formats (mp4, webm, m4v, mov, mkv)
- allow specifying which extensions and recoded formats to check
2024-06-12 20:46:45 +02:00
Mike Fährmann
86f0c3baaf
[szurubooru] support empty tag searches (#5711) 2024-06-11 20:25:06 +02:00
Mike Fährmann
2e11b6e756
[nijie] support downloading videos (#5707, #5617) 2024-06-08 22:55:28 +02:00
Mike Fährmann
f160859c5c
[hitomi] extract 'title_jpn' metadata (#5706) 2024-06-08 00:05:19 +02:00
Mike Fährmann
9abeab5ecf
[shimmie2] support 'vidya.pics' (#5632) 2024-06-06 15:08:56 +02:00
Mike Fährmann
7614bc458e
[util] extend CustomNone with comparison operators 2024-06-05 16:49:30 +02:00
Mike Fährmann
1ce5de0290
[formatter] implement 'C' format specifier (#5647)
to apply a conversion after ':' or
to apply multiple conversions

for example {tags:CSl} or {tags:J - /Cl}
to convert list to string and lowercase it
2024-06-05 16:49:29 +02:00
Mike Fährmann
9b99d2c886
[philomena] support downloading SVG files (#5643) 2024-06-05 16:48:51 +02:00
Mike Fährmann
8fce9ea6d5
[hentainexus] restore module (#5275)
revert 97641cd151
2024-06-05 16:48:25 +02:00
Mike Fährmann
4d11cd9ffb
[vichan] remove 'wikieat.club'
redirects to some non-vichan site
2024-06-02 18:16:24 +02:00
Mike Fährmann
31133b97fb
[nitter] remove instances 2024-06-02 18:15:53 +02:00
Mike Fährmann
ce228ee163
[photobucket] remove module
had been broken for years and the new site is payed access only
2024-06-02 01:40:31 +02:00
Mike Fährmann
009aa90c3f
[tests] update extractor results
at least some of them
2024-06-01 20:28:04 +02:00
Mike Fährmann
020050ea8b
merge #5641: [pixeldrain] add support for single file album download 2024-05-25 23:43:43 +02:00
Mike Fährmann
154a890399
[pixeldrain] integrate into 'album' extractor 2024-05-25 23:42:23 +02:00
HornyQT
24e70b956b [pixeldrain] add support for single file album download 2024-05-25 16:06:50 +02:00
Mike Fährmann
0761b22a7f
[hiperdex] update domain to 'hiperdex.top' (#5635) 2024-05-24 17:13:10 +02:00
Mike Fährmann
f651b3b6ab
merge #5601: [twitter] match '/video/' Tweet URLs 2024-05-17 22:49:12 +02:00
Mike Fährmann
7f1ed909d5
[imgur] match gallery/album/image URLs with title slugs (#5593) 2024-05-17 22:44:37 +02:00
Delphox
8ba73e2ec9
[twitter] match /video/ tweet urls 2024-05-17 16:50:51 -03:00
Mike Fährmann
2ee9ffeed6
merge #5568: [furaffinity] match 'xfuraffinity' URLs 2024-05-09 19:20:12 +02:00
Delphox
11109d5bad
[furaffinity] match xfuraffinity.com 2024-05-08 12:15:47 -03:00
Mike Fährmann
699592498b
[tests] use random port number for local HTTP server
… and explicitly bind to 127.0.0.1 instead of all interfaces
2024-05-02 22:54:15 +02:00
Mike Fährmann
bd8e4797e5
[vsco] add 'avatar' extractor (#5341) 2024-05-02 18:12:19 +02:00
Mike Fährmann
d0cead105b
[formatter] allow dots etc in '…' literals (#5539)
don't parse fields starting with '

this disables the ability to directly apply […] to '…' literals,
but that's not really useful anyway and can still be done with _lit
2024-05-02 17:43:24 +02:00
Mike Fährmann
8ed70b3256
[tests] mark tests with missing auth as 'only_matching'
… instead of skipping them completely
2024-05-01 16:00:07 +02:00
Mike Fährmann
3cf5366143
[mastodon] add support for card images 2024-05-01 16:00:07 +02:00
Mike Fährmann
9b1995dda3
[mastodon] add 'favorite', 'list', and 'hashtag' extractors (#5529) 2024-05-01 15:59:34 +02:00
Mike Fährmann
6c57958806
merge #5511: [twitter] [furaffinity] match fixvx.com and fxfuraffinity/fxraffinity.net URLs 2024-04-25 22:00:19 +02:00
Delphox
1886721d82
update tests 2024-04-25 13:28:30 -03:00
Mike Fährmann
cd241bea0a
[downloader:http] add MIME type and signature for .m4v files (#5505) 2024-04-25 01:01:35 +02:00
Mike Fährmann
068ccfe0b3
[tests] allow filtering extractor result tests by URL or comment
python test_results.py twitter:+/i/web/
python test_results.py twitter:~twitpic
2024-04-19 23:02:55 +02:00
Mike Fährmann
c9d3b5e5d9
[pixiv] change 'sanity_level' debug message to a warning (#5180) 2024-04-19 16:41:31 +02:00
Mike Fährmann
257e9fb435
[gelbooru] improve pagination logic for meta tags (#5478)
similar to 494acabd38
2024-04-15 23:14:48 +02:00
Mike Fährmann
e02d2ff45d
[tapas] add 'creator' extractor (#5306) 2024-04-11 23:41:50 +02:00
Mike Fährmann
35d4a706ae
[pixiv:novel] add 'covers' option (#5373) 2024-04-11 22:27:49 +02:00
Mike Fährmann
b57051719f
[wikimedia] support wiki.gg wikis 2024-04-09 19:24:01 +02:00
Mike Fährmann
40c1a8e471
[wikimedia] fix exception for files with empty 'metadata' 2024-04-09 19:12:15 +02:00
Mike Fährmann
0e730ba980
[pp:mtime] do not overwrite '_mtime' for None values (#5439) 2024-04-07 02:33:19 +02:00
Mike Fährmann
647a87d17c
[twitter] match '/photo/' Tweet URLs (#5443)
fixes regression introduced in 40c05535
2024-04-06 17:56:21 +02:00
Mike Fährmann
40bd145637
remove 'contextlib' imports 2024-04-06 16:59:09 +02:00
Mike Fährmann
9a8403917a
restore LD_LIBRARY_PATH for PyInstaller builds (#5421) 2024-04-06 16:58:33 +02:00
Mike Fährmann
095e5ded6f
[reddit] support comment embeds (#5366) 2024-04-01 23:35:42 +02:00
Mike Fährmann
64948f2c09
[foolfuuka] improve 'board' pattern & support pages (#5408) 2024-04-01 22:31:25 +02:00
Mike Fährmann
ef0c90414c
[wikimedia] suppress exception for entries without 'imageinfo' (#5384) 2024-03-26 15:33:26 +01:00
Mike Fährmann
9cce461627
[kemonoparty] add 'announcements' option (#5262)
https://github.com/mikf/gallery-dl/issues/5262#issuecomment-2015919188
2024-03-26 15:20:14 +01:00
Mike Fährmann
72ac2c750d
[kemonoparty:favorite] support 'sort' and 'order' query params (#5375) 2024-03-26 02:27:36 +01:00
Mike Fährmann
d1d017ab5d
merge #5372: [twitter] match Tweet URLs with query parameters (#5371)
fixes regression introduced in 40c05535
2024-03-25 22:01:46 +01:00
fireattack
423599ce95
[twitter] fix pattern for single tweet (#5371)
- Add optional slash
- Update tests to include some non-standard tweet URLs
2024-03-25 21:57:35 +01:00
Mike Fährmann
15a4bc2584
[kemonoparty] fix KeyError for empty files (#5368) 2024-03-24 02:21:38 +01:00
Mike Fährmann
31e7ca73b6
[gelbooru] add 'order-posts' option for favorites (#5220) 2024-03-23 13:30:09 +01:00
Mike Fährmann
55e8fdad29
[tests] use 'datetime.timezone.utc' instead of 'datetime.UTC'
'datetime.UTC' was added in Python 3.11
and is not defined in older versions.
2024-03-22 18:16:24 +01:00
Mike Fährmann
4b6f47e571
[pornhub:gif] extract 'viewkey' and 'timestamp' metadata (#4463)
https://github.com/mikf/gallery-dl/issues/4463#issuecomment-2014550302
2024-03-22 18:00:20 +01:00
Mike Fährmann
7a7dc442a0
[tests] update extractor results 2024-03-22 17:57:04 +01:00
Mike Fährmann
1d6260f456
[bunkr] remove 'description' metadata
album descriptions are no longer available on album pages
and the previous code erroneously returned just '0'
2024-03-22 02:14:41 +01:00
Mike Fährmann
32262a048b
[idolcomplex] fix metadata extraction
- replace legacy 'id' vales with alphanumeric ones, since the former are
  no longer available
- approximate 'vote_average', since the real value is no longer
  available
- fix 'vote_count'
2024-03-22 01:43:05 +01:00
Mike Fährmann
ddb2edfd32
[formatter] fix local DST datetime offsets for ':O'
'O' would get the *current* local UTC offset and apply it to all
'datetime' objects it gets applied to.
This would result in a wrong offset if the current offset includes
DST and the target 'datetime' does not or vice-versa.

'O' now determines the correct local UTC offset while respecting DST for
each individual 'datetime'.
2024-03-21 20:45:46 +01:00
Mike Fährmann
da6ba60331
[bluesky] add 'instance' metadata field (#4438) 2024-03-18 17:36:16 +01:00
Mike Fährmann
718c870430
[tests] show full path for nested values
'user.name' instead of just 'name' when testing for
"user": { … , "name": "…", … }
2024-03-18 17:36:16 +01:00
Mike Fährmann
26bc2d55f4
[hiperdex] update URL patterns & fix 'manga' metadata (#5340) 2024-03-18 17:36:16 +01:00
Mike Fährmann
8e694d85c4
[twitter] add 'birdwatch' metadata field (#5317)
should probably get a better name,
but this is what it's called internally by Twitter
2024-03-18 17:36:02 +01:00
Mike Fährmann
b8e7be225c
merge #5333: [imagefap] fix folder extractor 2024-03-15 23:46:43 +01:00
Herp
99c53f7fa8
Fix imagefap extrcator 2024-03-15 23:44:25 +01:00