1
0
mirror of https://github.com/mikf/gallery-dl.git synced 2024-11-23 03:02:50 +01:00
Commit Graph

3915 Commits

Author SHA1 Message Date
Wiiplay123
6eb62f2140
Combine lh*(-**).googleusercontent.com URL regex into one line.
Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2024-01-20 15:53:11 -06:00
Wiiplay123
a6fed628dd
[blogger] Fix lh*.googleusercontent.com forward slash bug, add support for lh*-**.googleusercontent.com
Some URLs use "lh(number)-(locale).googleusercontent.com" format, so I added support for those.

Also, "lh(number).googleusercontent.com" formats were broken because the regex was looking for a second forward slash.

Examples:
lh7.googleusercontent.com
lh7-us.googleusercontent.com
2024-01-20 15:07:52 -06:00
Mike Fährmann
6f8592eaff
[hbrowse] remove from modules list 2024-01-20 18:25:38 +01:00
Mike Fährmann
acc94ac187
[realbooru] fix extraction
revert ac97aca99c
2024-01-20 17:56:07 +01:00
Mike Fährmann
9599151118
[issuu] fix extraction 2024-01-20 16:44:48 +01:00
Mike Fährmann
9ca6117c67
[hbrowse] remove module
website gone
2024-01-20 02:53:44 +01:00
Mike Fährmann
375eefb886
[chevereto] remove 'pixl.li'
"Pixl is closing down"
"All images will be deleted January 1st."
2024-01-20 02:21:40 +01:00
Mike Fährmann
321861af7e
[erome] fix 'count' metadata 2024-01-20 00:26:41 +01:00
Mike Fährmann
b41d9bf616
[paheal] fix 'source' metadata 2024-01-19 22:24:39 +01:00
Mike Fährmann
b0a441f1e3
[nitter] remove 'nitter.lacontrevoie.fr'
"Fermeture de Nitter / Closing down Nitter"
2024-01-19 19:34:16 +01:00
Mike Fährmann
a1c1e80f67
[giantessbooru] update domain 2024-01-19 14:21:56 +01:00
Mike Fährmann
2007cb2f59
[tests] check extractor category values 2024-01-19 14:21:09 +01:00
Mike Fährmann
fc4e737f67
[wikimedia] include 'sha1' in default filenames 2024-01-19 03:08:43 +01:00
Mike Fährmann
44f2c15a04
[wikimedia] handle 'File:' paths 2024-01-19 03:05:45 +01:00
Mike Fährmann
93b4120e77
[gelbooru] support 'all' and empty tag (#5076) 2024-01-18 21:49:33 +01:00
Mike Fährmann
a416d4c3d5
[sankaku] support post URLs with alphanumeric IDs (#5073) 2024-01-18 16:23:14 +01:00
Mike Fährmann
ea553a1d55
[wikimedia] generalize (#1443)
- support mediawiki.org
- support mariowiki.com (#3660)

- combine code into a single extractor
  (use prefix as subcategory)
- handle non-wiki instances
- unescape titles
2024-01-18 15:36:16 +01:00
Mike Fährmann
89066844f4
add 'config_instance' method
to allow for a more streamlined access to BaseExtractor instance options
2024-01-18 03:20:36 +01:00
Mike Fährmann
c3c1635ef3
[wikimedia] update
- rewrite using BaseExtractor
- support most Wiki* domains
- update docs/supportedsites
- add tests
2024-01-17 00:08:06 +01:00
Ailothaen
221f54309c
[wikimedia] Improved archive identifiers 2024-01-16 02:32:32 +01:00
Ailothaen
e33056adcd
[wikimedia] Add Wikipedia/Wikimedia extractor 2024-01-16 02:32:25 +01:00
Mike Fährmann
3d68eda4ab
[kemonoparty] add 'revision_hash' metadata (#4706, #4727, #5013)
A SHA1 hexdigest of other relevant metadata fields like
title, content, file and attachment URLs.

This value does NOT reflect which revisions are listed on the website.
Neither does 'edited' or any other metadata field (combinations).
2024-01-16 00:38:10 +01:00
Mike Fährmann
799a8206ad
merge #5061: [webtoons] extract more metadata
- author_name
- comic_name
- episode_name
- username
2024-01-15 18:27:12 +01:00
Mike Fährmann
8ffa0cd3c8
[webtoons] small optimization
don't extract the entire 'author_area' and
avoid creating a second 'text.extract_from()' object
2024-01-15 18:24:47 +01:00
Mike Fährmann
59cf4b3884
merge #4444: [2ch] add 'thread' and 'board' extractors (#1009, #3540) 2024-01-15 17:50:34 +01:00
Mike Fährmann
90b382304a
[deviantart] fix KeyError: 'premium_folder_data' (#5063) 2024-01-15 17:30:03 +01:00
Mike Fährmann
4cedf378d5
[deviantart] fix AttributeError for URLs without username (#5065)
caused by 4f367145
2024-01-15 16:28:57 +01:00
Mike Fährmann
68196589c4
[2ch] update
- simplify extractor code
- more metadata
- add tests
2024-01-15 04:09:05 +01:00
hunter-gatherer8
6c4abc982e
[2ch] add 'thread' and 'board' extractors
- [2ch] add thread extractor
- [2ch] add board extractor
- [2ch] add new entry to supported sites
2024-01-15 03:51:03 +01:00
blankie
bb446b1598
[webtoons] extract more metadata 2024-01-14 19:26:49 +11:00
Mike Fährmann
355b909f46
merge #5041: [steamgriddb] add support (#5033) 2024-01-13 00:59:15 +01:00
Mike Fährmann
71e2c3e5a2
merge #5037: [hatenablog] add support (#5036) 2024-01-13 00:57:21 +01:00
blankie
9f53daabb8
[hatenablog] implement additional suggestion 2024-01-13 10:43:25 +11:00
blankie
293f1559df
[hatenablog] implement suggestions 2024-01-13 10:42:22 +11:00
blankie
65f42442f5
[steamgriddb] implement another suggestion 2024-01-13 10:12:15 +11:00
blankie
8995fd5f01
[steamgriddb] implement suggestions 2024-01-13 09:55:39 +11:00
Mike Fährmann
2dcfb012ea
[patreon] download 'm3u8' manifests with ytdl 2024-01-12 02:33:27 +01:00
Mike Fährmann
1c68b7df01
[patreon] fix KeyError (#5048) 2024-01-11 17:56:47 +01:00
Mike Fährmann
2191e29e14
[nijie] fix image URL for single image posts (#5049) 2024-01-11 05:07:38 +01:00
Mike Fährmann
bbf96753e2
[gelbooru] only log "Incomplete API response" for favorites (#5045) 2024-01-10 17:27:46 +01:00
Mike Fährmann
39904c9e4e
[deviantart:avatar] add 'formats' option (#4995) 2024-01-10 17:13:34 +01:00
Mike Fährmann
5c43098a1a
[twitter] revert to using 'media' timeline by default (#4953)
This reverts commit a94f944148.
2024-01-09 23:19:39 +01:00
Mike Fährmann
5f9a98cf0f
[deviantart:avatar] fix exception when 'comments' are enabled (#4995) 2024-01-09 20:04:46 +01:00
Mike Fährmann
887ade30a5
[batoto] support more mirror domains (#5042) 2024-01-09 18:02:49 +01:00
Mike Fährmann
0a382a5092
[batoto] improve 'manga_id' extraction (#5042) 2024-01-09 17:25:04 +01:00
blankie
100966b122
[steamgriddb] fix linting error 2024-01-09 17:18:31 +11:00
blankie
2ccb7d3bd3
[steamgriddb] add support 2024-01-09 17:12:56 +11:00
Mike Fährmann
ec958a26bc
[fuskator] make metadata extraction non-fatal (#5039)
- prevent KeyErrors
- prevent HTTP redirect
- return file URLs as list
2024-01-08 19:18:12 +01:00
blankie
2cfe788f93
[hatenablog] fix extractor naming errors 2024-01-09 01:42:57 +11:00
blankie
be6949c55d
[hatenablog] fix linting error 2024-01-09 01:36:52 +11:00
blankie
61f3b2f820
[hatenablog] add support 2024-01-09 01:29:47 +11:00
Mike Fährmann
657ed93a22
[batoto] improve v2 manga URL pattern
and add tests
2024-01-07 22:23:30 +01:00
Mike Fährmann
50eef1b5cc
merge #5029: [pixiv] update App API headers 2024-01-07 16:27:13 +01:00
Mike Fährmann
33f228756a
[mangadex] add 'list' extractor (#5025)
supports listing manga and chapters from list feed
2024-01-07 02:59:35 +01:00
Mike Fährmann
db8de13537
[vk] transform image URLs to non-blurred versions (#5017)
apply the same filter from before d85e66bc
2024-01-07 00:19:10 +01:00
Se AKi
d0d199414f modify useragent of pixiv 2024-01-06 23:15:15 +09:00
Mike Fährmann
cbfb7bfdf1
[gelbooru] display error for invalid API responses (#4903) 2024-01-06 14:28:35 +01:00
Mike Fährmann
c25bdbae91
[komikcast] fix 'manga' extractor (#5027) 2024-01-06 14:19:44 +01:00
Mike Fährmann
8e1a2b5446
[komikcast] update domain to 'komikcast.lol' (#5027) 2024-01-06 02:16:43 +01:00
Mike Fährmann
a441249ea2
merge #4979: [batoto] add 'chapter' and 'manga' extractors (#1434, #2111) 2024-01-06 01:53:26 +01:00
Mike Fährmann
b11c352d66
[bato] rename to 'batoto'
to use the same category name as the previous bato.to site
2024-01-06 01:49:34 +01:00
Mike Fährmann
3aa24c3744
[bato] simplify and update 2024-01-06 01:10:04 +01:00
Mike Fährmann
11150a7d72
[nudecollect] remove module 2024-01-05 21:32:04 +01:00
Mike Fährmann
c158927c38
merge #5016: [zzup] add 'gallery' extractor (#4517, #4604, #4659, #4863) 2024-01-05 21:25:46 +01:00
Mike Fährmann
e61f016465
[szurubooru] support 'snootbooru.com' (#5023) 2024-01-05 17:56:39 +01:00
Mike Fährmann
b4bcf40278
[weibo] fix AttributeError in 'user' extractor (#5022)
yet another bug caused by a383eca7
2024-01-05 17:18:33 +01:00
Mike Fährmann
0ab0a10d2d
[jpgfish] update domain 2024-01-05 02:27:20 +01:00
enduser420
0f30136109 [zzup] add 'gallery' extractor 2024-01-04 21:38:59 +05:30
Mike Fährmann
a86775f617
[gelbooru] fix 'favorite' extractor (#4903)
lots of +1/-1 and </<= mistakes
2024-01-04 15:17:49 +01:00
Mike Fährmann
7eaf648f2e
[fanbox] add 'metadata' option (#4921)
extracts 'plan' and extended 'user' metadata
2024-01-04 15:01:33 +01:00
Mike Fährmann
4f3671458e
[deviantart] add 'avatar' and 'background' extractors (#4995) 2024-01-03 00:07:55 +01:00
Mike Fährmann
9fa4f54c24
[twitter] raise error for invalid 'strategy' values (#4953) 2024-01-01 22:58:42 +01:00
Mike Fährmann
516c69297d
[manganelo] fix extraction & recognize '.to' TLDs (#5005) 2024-01-01 22:05:21 +01:00
Mike Fährmann
63f649cd92
[idolcomplex] fix extraction & update URL patterns (#5002) 2024-01-01 17:38:32 +01:00
Mike Fährmann
b6903a4c90
[nijie] add 'count' metadata field
https://github.com/mikf/gallery-dl/issues/146#issuecomment-1812849102
2023-12-30 22:25:59 +01:00
Mike Fährmann
b93b351db9
merge #4962: [poringa] add support (#4675) 2023-12-30 20:39:35 +01:00
Mike Fährmann
9f21c839ad
[poringa] improvements and fixes
- add 'num' and 'count' metadata fields
- prevent crash for "private" posts
- prevent crash when there's no 'main-info'
- update tests
2023-12-30 20:37:09 +01:00
Mike Fährmann
00d83d9588
[rule34us] add fallback for 'video-cdn1' videos (#4985) 2023-12-30 18:33:46 +01:00
Mike Fährmann
085411f3f1
[rule34] recognize URLs with 'www' subdomain (#4984) 2023-12-30 16:07:56 +01:00
Mike Fährmann
9f5051e4ed
merge #4981: [pinterest] add 'count' metadata field 2023-12-30 15:53:35 +01:00
bug-assassin
f6ce870885 Better variable names 2023-12-28 17:25:15 -05:00
bug-assassin
3553025584 Removed f-strings 2023-12-28 17:07:41 -05:00
Mike Fährmann
f36dafad06
improve 'include' handling (#4982)
- remove spaces when given as string
- warn about invalid vales
2023-12-28 19:07:04 +01:00
blankie
375f2db4c2
[pinterest] add count metadata field 2023-12-28 01:07:04 +11:00
Antonio
e348da7a06 [poringa] add support 2023-12-27 00:07:23 -06:00
bug-assassin
2c3f171d65 Fix python 3.5 linting issue 2023-12-26 23:55:47 -05:00
bug-assassin
06ff1d3a3c Replace text.extract with extr 2023-12-26 23:47:30 -05:00
bug-assassin
9c1ce28f68 [bato] Added mangatoto alias 2023-12-26 23:44:27 -05:00
bug-assassin
663b8d789a Fix linting 2023-12-26 23:41:37 -05:00
bug-assassin
74c225f94e [bato] add support 2023-12-26 22:33:33 -05:00
Mike Fährmann
f9544194c0
[paheal] restore 'extension' metadata (#4976) 2023-12-26 16:09:26 +01:00
Mike Fährmann
77d46e6f0c
[lynxchan] update 'bbw-chan' domain (#4970) 2023-12-25 15:29:05 +01:00
Mike Fährmann
766316e436
[imagechest] fix loading more than 10 images in a gallery (#4469) 2023-12-25 15:14:43 +01:00
Mike Fährmann
108c978073
merge #4919: [postmill] add support (#4917) 2023-12-23 15:23:56 +01:00
blankie
8a42ea736a
[postmill] implement suggestions 2023-12-23 13:28:36 +11:00
Mike Fährmann
c184454efb
[shimmie2] small optimizations
- unroll/remove loop
- avoid copy
2023-12-22 20:29:22 +01:00
Mike Fährmann
7cd0211cc9
[shimmie2] autodetect single or double quotes 2023-12-22 15:25:28 +01:00
Mike Fährmann
2a60645095
[deviantart] set 'is_original' for intermediary URLs to 'false' 2023-12-22 14:49:10 +01:00
Mike Fährmann
01bb75f6cb
merge #4945: {shimmie2[ support 'rule34hentai.net' (#861, #4789) 2023-12-22 00:10:26 +01:00
Mike Fährmann
79e4606893
[rule34hentai] cleanup
- fix using 'self._posts_rule34hentai'
- fix 'file_url' for posts
- update docs/supportedsites
- add tests
2023-12-22 00:01:36 +01:00