1
0
mirror of https://github.com/mikf/gallery-dl.git synced 2024-11-23 19:22:32 +01:00
Commit Graph

3026 Commits

Author SHA1 Message Date
Mike Fährmann
220cfe244e
[deviantart] get original files for GIF previews (#1731) 2021-07-30 00:20:53 +02:00
Mike Fährmann
7a0da4f93f
[newgrounds] add 'format' option (closes #1729) 2021-07-29 19:11:20 +02:00
Mike Fährmann
223a4e79cd
[newgrounds] fix using 'category-tranfer' (#1274) 2021-07-29 15:54:04 +02:00
Mike Fährmann
4e95cef6d2
[nsfwalbum] retry backend requests (fixes #1733) 2021-07-29 15:29:37 +02:00
Mike Fährmann
6c11105587
[bbc] improve image dimensions (#1706)
download the 1920xN versions instead of 976x549
2021-07-29 03:22:15 +02:00
Mike Fährmann
57c1a86082
[bbc] support multi-page gallery listings (closes #1730) 2021-07-28 22:42:33 +02:00
Mike Fährmann
2792ed6e4b
implement 'util.format_value()' 2021-07-26 02:11:22 +02:00
Mike Fährmann
552032d4fb
release version 1.18.2 2021-07-23 17:39:19 +02:00
Mike Fährmann
486474800f
[kemonoparty] skip duplicated patreon files (closes #1689)
this behavior can be disabled with the 'patreon-skip-file' option
2021-07-23 16:38:57 +02:00
Mike Fährmann
da7297c0b9
[comicvine] add extractor (closes #1712) 2021-07-23 16:17:06 +02:00
Mike Fährmann
e4788fa663
[bbc] add 'gallery' and 'programme' extractors (closes #1706) 2021-07-22 20:37:05 +02:00
Mike Fährmann
13d4045a8a
add 'archive-prefix' option (#1711) 2021-07-20 20:21:33 +02:00
Mike Fährmann
c3b5c88b04
update extractor test results 2021-07-20 20:21:33 +02:00
Mike Fährmann
9e42cd58ea
replace ChainPredicate class with 'functools.partial' 2021-07-20 20:21:32 +02:00
Mike Fährmann
1b2f9050fb
rename all instances of 'kwds' to 'kwdict' 2021-07-20 20:21:19 +02:00
Mike Fährmann
3868ec02d1
[pururin] update domain and fix extraction 2021-07-18 04:11:59 +02:00
Mike Fährmann
b89a44090f
[naverwebtoon] fix comic extraction 2021-07-18 03:41:54 +02:00
Mike Fährmann
4048eabeab
allow 'extractor.modules' to be a string
with comma-separated module names
2021-07-18 00:34:04 +02:00
Mike Fährmann
c8e678a5b4
[instagram] fix extraction of '/explore/tags/' posts
(closes #1666)
2021-07-18 00:21:18 +02:00
Mike Fährmann
a6a51f207d
[moebooru] fix 'tags' ending with a '+' when logged in (#1702) 2021-07-16 15:42:03 +02:00
Mike Fährmann
f5b097165e
[ytdl] transfer YoutubeDL objects to downloader (#1680)
allows specifying downloader-specific options per subcategory
but overwrites all downloader.ytdl settings
2021-07-16 15:40:54 +02:00
Mike Fährmann
06e69ea79a
[ytdl] actually set options for YoutubeDL objects (#1680)
I somehow managed to remove the options parameter for
the YoutubeDL constructor in 9a849cdf without noticing ...
2021-07-16 15:05:55 +02:00
Mike Fährmann
fc19010808
[downloader:ytdl] fix 'outtmpl' setting for yt_dlp (#1680)
yt_dlp supports multiple outtmpl settings for different file types and
uses its 'outtmpl_dict' for that.
2021-07-16 15:05:16 +02:00
Mike Fährmann
dfe1f490e9
[mangadex] use custom User-Agent header (#1535) 2021-07-15 16:39:32 +02:00
Mike Fährmann
36a2aff363
[vk] improve metadata extraction and URL pattern (fixes #1691)
- always fetch all user metadata
- use 'user[name]' for directory names if available
2021-07-15 00:43:42 +02:00
Mike Fährmann
b9783403d9
add 'url-metadata' option (#1659, #1073) 2021-07-14 03:08:49 +02:00
Mike Fährmann
e622e004f0
[ytdl] improve module imports (#1680)
Apply 'extractor.ytdl.module' for every URL, not just the first.
2021-07-14 03:08:00 +02:00
Mike Fährmann
e95f99882f
extend 'parent-metadata' functionality (#1687, #1651, #1364) 2021-07-14 02:53:41 +02:00
Mike Fährmann
193401ce3b
[ytdl] "fix" cookie transfer between session and ytdl (#1680)
requests' CookieJar class is not quite compatible with the standard
http.cookiejar.CookieJar used by youtube_dl
2021-07-12 18:50:25 +02:00
Mike Fährmann
9a849cdf61
[ytdl] allow setting 'module' for subcategories (#1680) 2021-07-12 18:47:12 +02:00
Mike Fährmann
dff0da60f9
[ytdl] add 'generic' option (#1680) 2021-07-11 23:48:18 +02:00
Mike Fährmann
d3da96142a
[ytdl] support cookies + username&password (#1680) 2021-07-11 22:51:57 +02:00
Mike Fährmann
36ac2197db
[ytdl] add extractor for sites supported by youtube-dl
(#1680, #878)

Can be used by prefixing any URL with 'ytdl:',
or by setting 'extractor,ytdl.enabled' to 'true'.
2021-07-10 20:55:47 +02:00
Mike Fährmann
64240c8d42
[imagevenue] fix extraction
(closes #1677)
2021-07-09 20:13:18 +02:00
Mike Fährmann
d287d2eb88
[kemonoparty] parse 'o' query parameters (#1674) 2021-07-09 18:29:50 +02:00
Mike Fährmann
8b036778e3
[kemonoparty] add 'max-posts' option (#1674) 2021-07-09 18:19:02 +02:00
Mike Fährmann
5612ca31c2
[hitomi] fix image URLs (closes #1679) 2021-07-09 18:01:49 +02:00
Mike Fährmann
8ecca3af58
[pixiv] add extractor for 'pixivision' articles (#1672) 2021-07-07 16:27:16 +02:00
Mike Fährmann
312a28e78a
[mastodon] add 'replies' option (#1669) 2021-07-07 00:59:02 +02:00
Mike Fährmann
513c491cea
[mastodon] reset 'params' after first pagination iteration
otherwise query parameters in 'params' get specified twice the second
time around - once from the 'links["next"]' URL and once from 'params'
itself.
2021-07-07 00:07:18 +02:00
Mike Fährmann
a1f5b78039
[mastodon] add 'reblogs' option (#1669) 2021-07-06 23:27:32 +02:00
Mike Fährmann
317ecc8180
use HTML tables in docs/formatting.md 2021-07-05 23:29:03 +02:00
Mike Fährmann
5f1b13d1a5
release version 1.18.1 2021-07-04 22:37:19 +02:00
Mike Fährmann
21c2da454f
update extractor test results 2021-07-04 22:00:32 +02:00
Mike Fährmann
7f591c78cb
[mangafox] cleanup 2021-07-04 03:21:02 +02:00
FollieHiyuki
4763bc1e4e
Add MangaExtractor for mangafox (#1633) 2021-07-03 22:53:21 +02:00
Mike Fährmann
b519bf567c
[hiperdex] use domain from input URL 2021-07-02 23:23:42 +02:00
Mike Fährmann
93d356712c
[mastodon] implement 'text-posts' option (#1569)
similar to Twitter's 'text-tweets'
2021-07-02 22:12:41 +02:00
Mike Fährmann
414bdc95a3
[twitter] set 'retweet_id' for original retweets (#1481) 2021-07-02 21:50:37 +02:00
Mike Fährmann
5323c1c73a
[twitter] ensure guest tokens are returned as string (#1665) 2021-07-01 14:35:53 +02:00
Mike Fährmann
9ee45f3617
[kemonoparty] warn about missing DDoS-GUARD cookies 2021-06-28 23:34:58 +02:00
Mike Fährmann
344aab3fb7
[seisoparty] warn about missing DDoS-GUARD cookies 2021-06-28 23:33:21 +02:00
Mike Fährmann
035562bd11
[twitter] remove old-style URLs from image fallback lists 2021-06-28 16:25:24 +02:00
Mike Fährmann
daf821b8b6
[seisoparty] use user names instead of IDs by default (#1635) 2021-06-27 22:57:20 +02:00
Mike Fährmann
e4db1bad14
[seisoparty] also extract files hosted on 'cdn-2' servers (#1635) 2021-06-27 22:55:09 +02:00
Mike Fährmann
267bbf5996
[mangasee] add 'chapter' and 'manga' extractors 2021-06-27 02:03:03 +02:00
Mike Fährmann
fad4918208
[deviantart] use UUIDs in internal folder/collection URLs 2021-06-26 00:56:57 +02:00
Mike Fährmann
64986f9435
fix depth counter in UrlJob
regression from adf4d661

It would either stop at the first level (-g) or go infinitely deep (-G)
Going down to for example level 3 with -ggg didn't work.
2021-06-26 00:30:03 +02:00
Mike Fährmann
0179581340
add 'T' format string conversion (#1646)
to convert 'date'/datetime to timestamp
2021-06-25 22:35:45 +02:00
Mike Fährmann
f74cf52e2b
[seisoparty] add 'user' and 'post' extractors (#1635) 2021-06-25 18:40:11 +02:00
Mike Fährmann
759735fb02
[kemonoparty] fix 'username' extraction (fixes #1652)
The site's <title> content changed from

<title>NAME | Kemono</title>

to

<title>
    NAME | Kemono
</title>
2021-06-25 15:35:20 +02:00
Mike Fährmann
befe635022
cache parsed Formatter functions 2021-06-22 19:46:04 +02:00
Mike Fährmann
a416e54765
[directlink] manually encode Referer URLs (fixes #1647)
Trying to send a non-latin-1-encodable header raises an exception,
so we encode the Referer value ourselves with 'errors=ignore'.
2021-06-21 20:28:19 +02:00
Mike Fährmann
8bdeb2a6dd
[webtoons] match arbitrary language codes (closes #1643) 2021-06-21 19:25:28 +02:00
Mike Fährmann
79b7ee2712
use 'functools.partial' in '_build_cleanfunc' when possible
makes calls to the returned function a slight bit faster (~10%)
2021-06-20 23:34:41 +02:00
Mike Fährmann
ceaf7fd989
optimize 'base-directory' initialization and usage
apply 'clean_path()' only once
2021-06-20 21:35:43 +02:00
Mike Fährmann
2ca011dfa8
add 'kwdict' argument to PathFormat.build_filename() 2021-06-20 20:26:38 +02:00
Mike Fährmann
fd00d47116
implement conditional directories (#1394)
They work the same way as conditional filenames (84d2e640), e.g.

"directory": {
    "score >= 20": ["high score"],
    "score >= 5" : ["mid score"],
    ""           : ["{category}", "default"]
}
2021-06-20 20:09:35 +02:00
Mike Fährmann
ee1064a2b2
release version 1.18.0 2021-06-19 21:26:42 +02:00
Mike Fährmann
4adc44df69
[furaffinity] improve metadata extraction (fixes #1630)
Fetch 'title' and 'artist' metadata from a different location,
since for posts with an empty title the <title> element is
completely empty and does not contain the artist's name.
2021-06-19 03:29:00 +02:00
Mike Fährmann
e98fa01c44
[hitomi] update image URL code (fixes #1637) 2021-06-18 16:44:22 +02:00
Mike Fährmann
e9ab97396f
[kemonoparty] update default filenames and archive IDs (#1514)
Add an enumeration index so that attachments and regular files with the
same filename still get downloaded and not counted as duplicate files
(even though for patreon posts they usually are)

This invalidates all previously generated archive IDs.
To keep using old names and IDs, set
'filename' to "{id}_{title}_{filename}.{extension}" and
'archive-format' to "{service}_{user}_{id}_{filename}.{extension}".
2021-06-18 16:32:45 +02:00
Mike Fährmann
fb4b4725ba
[hiperdex] match 'hiperdex2.com' URLs
still doesn't properly work due to Cloudflare CAPTCHA and IUAM page
2021-06-18 00:50:11 +02:00
Mike Fährmann
95bc1139e0
[instagram] update query hashes 2021-06-17 22:38:38 +02:00
Mike Fährmann
23018a46f6
[instagram] fix login (fixes #1631) 2021-06-17 22:28:42 +02:00
Mike Fährmann
cac0110d8b
[redgifs] update API server address (fixes #1632)
napi.redgifs.com -> api.redgifs.com
2021-06-16 23:07:25 +02:00
Mike Fährmann
0d2961ae81
[500px] remove last query hash entry
forgot to include this in b56e2450
2021-06-16 23:00:45 +02:00
Mike Fährmann
7273cf8536
[pixiv] support fetching privately followed users (fixes #1628) 2021-06-16 19:56:09 +02:00
Mike Fährmann
e60962f7e5
[philomena] improve tag escapes handling (fixes #1629) 2021-06-16 18:47:08 +02:00
Mike Fährmann
d8908ca577
[unsplash] update collections URL pattern (fixes #1627) 2021-06-15 15:44:45 +02:00
Mike Fährmann
9ed13703cc
[sankaku] handle empty tags (fixes #1617) 2021-06-14 16:20:10 +02:00
Mike Fährmann
b56e245094
[500px] update GraphQL queries
500px changed its method from query hashes to sending the entire query
string for every request.
2021-06-14 16:13:08 +02:00
Mike Fährmann
e9993451c3
remove Python version checks 2021-06-12 00:20:59 +02:00
Mike Fährmann
40eb8025cc
require an argument for --clear-cache 2021-06-12 00:15:25 +02:00
Mike Fährmann
a751afdfb3
[twitter] change some defaults
- 'retweets' option: true -> false
- 'quoted' option  : true -> false

  i.e. disable downloading tweets from other user's timelines by default

- search directory:
    '["{category}", "Search", "{search}"]' ->
    '["{category}", "{user[name]}"]'

  i.e. change it to the same as other twitter extractors (#1308)
2021-06-11 21:26:11 +02:00
Mike Fährmann
4e4ca3c330
[deviantart] pin API version (#1611)
'/gallery/folders' in the newest version doesn't include subfolders.

It probably only needs the right query parameter to do so, but that
doesn't seem to be documented anywhere.
2021-06-11 00:19:44 +02:00
Mike Fährmann
d09bc5bd34
[subscribestar] improve attachment filenames (#1609) 2021-06-10 17:09:13 +02:00
Mike Fährmann
2986bf63bf
[mangafox] update URL pattern (fixes #1608)
also accept non-numeric volume labels, e.g. vTBD
2021-06-10 14:56:59 +02:00
Mike Fährmann
53dab5c289
[mangadex] revert chapter handling (#1535)
Spawn a new ChapterExtractor for each individual chapter
instead of handling them directly with a MangaExtractor.

Doing it that way broke too many features like
--chapter-filter, --chapter-range, --zip, etc.
2021-06-09 21:54:40 +02:00
Mike Fährmann
def0148582
restructure code in PathFormat constructor 2021-06-08 18:05:07 +02:00
Mike Fährmann
84d2e64024
combine conditional filenames into filename option (#1394) 2021-06-08 18:00:06 +02:00
Mike Fährmann
1197ee2c20
[mangadex] add extractor for a user's followed feed (#1535) 2021-06-08 02:45:36 +02:00
Mike Fährmann
07c8adbd8b
[mangadex] implement login with username & password (#1535) 2021-06-08 02:12:57 +02:00
Mike Fährmann
b93cbe6720
[postprocessor:zip] run 'finalize()' regardless of 'filter' (#1460) 2021-06-07 22:15:36 +02:00
Mike Fährmann
3e332eaf53
[mangadex] update to API v5 (#1535) 2021-06-07 01:29:42 +02:00
Mike Fährmann
ff8d0d7280
[postprocessor:metadata] ensure reliable tags order (fixes #1598) 2021-06-05 14:49:14 +02:00
Mike Fährmann
04f4f9badb
[oauth] prevent exceptions when reporting errors (#1603) 2021-06-05 14:28:26 +02:00
Mike Fährmann
dfe1e09df0
add '-T/--terminate' command-line option (#1399) 2021-06-05 14:14:48 +02:00
Mike Fährmann
6ecb710a7f
add '-P/--postprocessor' command-line option 2021-06-05 14:14:47 +02:00
Mike Fährmann
83fc4c1098
update post processor config capabilities
This change makes it possible to specify just the name of a post processor
in the "postprocessors" list instead of a dict with all of its options.
The options for it will then be taken from inside the "postprocessor"
block similar to "extractor", "downloader", or "output" blocks.

This makes it possible to for example override the default settings for
--write-metadata by specifying a custom "metadata" block, or to set a
custom post processor block ("cbz") and then use it by referencing just
its name in "postprocessors" lists.

{
    "postprocessor":
    {
        "metadata": {
            "name": "metadata",
            "event": "post",
            "filename": "{tweet_id|post_id|id}.json"
        },
        "cbz": {
            "name"       : "zip",
            "compression": "store",
            "extension"  : "cbz"
        }
    }
}
2021-06-05 14:11:16 +02:00