Configuration
#############

| Configuration files for *gallery-dl* use a JSON-based file format.
| For a (more or less) complete example with options set to their default values,
  see `gallery-dl.conf <gallery-dl.conf>`__.
| For a configuration file example with more involved settings and options,
  see `gallery-dl-example.conf <gallery-dl-example.conf>`__.
|

This file lists all available configuration options and their descriptions.

Contents
========

1) `Extractor Options`_
2) `Extractor-specific Options`_
3) `Downloader Options`_
4) `Output Options`_
5) `Postprocessor Options`_
6) `Miscellaneous Options`_
7) `API Tokens & IDs`_

Extractor Options
=================

Each extractor is identified by its ``category`` and ``subcategory``.
The ``category`` is the lowercase site name without any spaces or special
characters, which is usually just the module name
(``pixiv``, ``danbooru``, ...).
The ``subcategory`` is a lowercase word describing the general functionality
of that extractor (``user``, ``favorite``, ``manga``, ...).

Each one of the following options can be specified on multiple levels of the
configuration tree:

================== =======
Base level:        ``extractor.<option-name>``
Category level:    ``extractor.<category>.<option-name>``
Subcategory level: ``extractor.<category>.<subcategory>.<option-name>``
================== =======

A value on a "deeper" (more specific) level overrides a value of the same name
on a shallower level. Setting the ``extractor.pixiv.filename`` value, for example, lets
you specify a general filename pattern for all the different pixiv extractors.
Using the ``extractor.pixiv.user.filename`` value lets you override this
general pattern specifically for ``PixivUserExtractor`` instances.
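
The three levels can be sketched as nested JSON objects, as shown here. The format strings and the keys they use (``id``, ``title``, ``num``) are illustrative assumptions; the keys actually available depend on the extractor.

.. code:: json

    {
        "extractor": {
            "filename": "{id}.{extension}",
            "pixiv": {
                "filename": "{id}_{title}.{extension}",
                "user": {
                    "filename": "{id}_p{num}.{extension}"
                }
            }
        }
    }

In this sketch, ``PixivUserExtractor`` instances would use the innermost pattern, other pixiv extractors the category-level one, and every other extractor the base-level one.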

The ``category`` and ``subcategory`` of all extractors are included in the
output of ``gallery-dl --list-extractors``. For a specific URL these values
can also be determined by using the ``-K``/``--list-keywords`` command-line
option (see the example below).

extractor.*.filename
--------------------
Type
    * ``string``
    * ``object`` (`condition` -> `format string`_)
Example
    .. code:: json

        "{manga}_c{chapter}_{page:>03}.{extension}"

    .. code:: json

        {
            "extension == 'mp4'": "{id}_video.{extension}",
            "'nature' in title" : "{id}_{title}.{extension}",
            ""                  : "{id}_default.{extension}"
        }

Description
    A `format string`_ to build filenames for downloaded files with.

    If this is an ``object``, it must contain Python expressions mapping to the
    filename format strings to use.
    These expressions are evaluated in the order specified in Python 3.6+
    and in an undetermined order in Python 3.4 and 3.5.

    The available replacement keys depend on the extractor used. A list
    of keys for a specific one can be acquired by calling *gallery-dl*
    with the ``-K``/``--list-keywords`` command-line option.
    For example:

    .. code::

        $ gallery-dl -K http://seiga.nicovideo.jp/seiga/im5977527
        Keywords for directory names:
        -----------------------------
        category
          seiga
        subcategory
          image

        Keywords for filenames:
        -----------------------
        category
          seiga
        extension
          None
        image-id
          5977527
        subcategory
          image

    Note: Even if the value of the ``extension`` key is missing or
    ``None``, it will be filled in later when the file download is
    starting. This key is therefore always available to provide
    a valid filename extension.

extractor.*.directory
---------------------
Type
    * ``list`` of ``strings``
    * ``object`` (`condition` -> `format strings`_)
Example
    .. code:: json

        ["{category}", "{manga}", "c{chapter} - {title}"]

    .. code:: json

        {
            "'nature' in content": ["Nature Pictures"],
            "retweet_id != 0"    : ["{category}", "{user[name]}", "Retweets"],
            ""                   : ["{category}", "{user[name]}"]
        }

Description
    A list of `format strings`_ to build target directory paths with.

    If this is an ``object``, it must contain Python expressions mapping to the
    list of format strings to use.

    Each individual string in such a list represents a single path
    segment, which will be joined together and appended to the
    base-directory_ to form the complete target directory path.
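
As a minimal sketch of how the path segments combine with the base directory (the ``{manga}`` and ``{chapter}`` keys are assumptions and only exist for extractors that provide them):

.. code:: json

    {
        "extractor": {
            "base-directory": "./gallery-dl/",
            "directory": ["{category}", "{manga}", "c{chapter}"]
        }
    }

With these values, files would be expected to end up in a directory of the form ``./gallery-dl/<category>/<manga>/c<chapter>/``.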

extractor.*.base-directory
--------------------------
Type
    |Path|_
Default
    ``"./gallery-dl/"``
Description
    Directory path used as base for all download destinations.

extractor.*.parent-directory
----------------------------
Type
    ``bool``
Default
    ``false``
Description
    Use an extractor's current target directory as
    `base-directory <extractor.*.base-directory_>`__
    for any spawned child extractors.

extractor.*.parent-metadata
---------------------------
Type
    * ``bool``
    * ``string``
Default
    ``false``
Description
    If ``true``, overwrite any metadata provided by a child extractor
    with its parent's.

    | If this is a ``string``, store the parent's metadata in each child's
      metadata under a field named after said string.
    | For example with ``"parent-metadata": "_p_"``:

    .. code:: json

        {
            "id": "child-id",
            "_p_": {"id": "parent-id"}
        }

extractor.*.parent-skip
-----------------------
Type
    ``bool``
Default
    ``false``
Description
    Share the number of skipped downloads between parent and child extractors.

extractor.*.path-restrict
-------------------------
Type
    * ``string``
    * ``object`` (`character` -> `replacement character(s)`)
Default
    ``"auto"``
Example
    * ``"/!? (){}"``
    * ``{" ": "_", "/": "-", "|": "-", ":": "_-_", "*": "_+_"}``
Description
    | A string of characters to be replaced with the value of
      `path-replace <extractor.*.path-replace_>`__
    | or an object mapping invalid/unwanted characters to their replacements
    | for generated path segment names.

    Special values:

    * ``"auto"``: Use characters from ``"unix"`` or ``"windows"``
      depending on the local operating system
    * ``"unix"``: ``"/"``
    * ``"windows"``: ``"\\\\|/<>:\"?*"``
    * ``"ascii"``: ``"^0-9A-Za-z_."`` (only ASCII digits, letters, underscores, and dots)
    * ``"ascii+"``: ``"^0-9@-[\\]-{ #-)+-.;=!}~"`` (all ASCII characters except the ones not allowed by Windows)

    Implementation Detail: For ``strings`` with length >= 2, this option uses a
    `Regular Expression Character Set <https://www.regular-expressions.info/charclass.html>`__,
    meaning that:

    * using a caret ``^`` as first character inverts the set
    * character ranges are supported (``0-9a-z``)
    * ``]``, ``-``, and ``\`` need to be escaped as
      ``\\]``, ``\\-``, and ``\\\\`` respectively
      to use them as literal characters
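
The character-set rules above can be sketched with a single value that combines an inverted set, ranges, and an escaped literal hyphen. The chosen set is only an illustration, not a recommended setting.

.. code:: json

    {
        "extractor": {
            "path-restrict": "^0-9A-Za-z_.\\-",
            "path-replace": "_"
        }
    }

Read as a character set, this should replace every character that is *not* an ASCII digit, letter, underscore, dot, or hyphen with ``_``.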

extractor.*.path-replace
------------------------
Type
    ``string``
Default
    ``"_"``
Description
    The replacement character(s) for
    `path-restrict <extractor.*.path-restrict_>`__

extractor.*.path-remove
-----------------------
Type
    ``string``
Default
    ``"\u0000-\u001f\u007f"`` (ASCII control characters)
Description
    Set of characters to remove from generated path names.

    Note: In a string with 2 or more characters, ``[]^-\`` need to be
    escaped with backslashes, e.g. ``"\\[\\]"``

extractor.*.path-strip
----------------------
Type
    ``string``
Default
    ``"auto"``
Description
    Set of characters to remove from the end of generated path segment names
    using `str.rstrip() <https://docs.python.org/3/library/stdtypes.html#str.rstrip>`_.

    Special values:

    * ``"auto"``: Use characters from ``"unix"`` or ``"windows"``
      depending on the local operating system
    * ``"unix"``: ``""``
    * ``"windows"``: ``". "``

extractor.*.path-extended
-------------------------
Type
    ``bool``
Default
    ``true``
Description
    On Windows, use `extended-length paths <https://learn.microsoft.com/en-us/windows/win32/fileio/maximum-file-path-limitation>`__
    prefixed with ``\\?\`` to work around the 260-character path length limit.

extractor.*.extension-map
-------------------------
Type
    ``object`` (`extension` -> `replacement`)
Default
    .. code:: json

        {
            "jpeg": "jpg",
            "jpe" : "jpg",
            "jfif": "jpg",
            "jif" : "jpg",
            "jfi" : "jpg"
        }

Description
    A JSON ``object`` mapping filename extensions to their replacements.

extractor.*.skip
----------------
Type
    * ``bool``
    * ``string``
Default
    ``true``
Description
    Controls the behavior when downloading files that have been
    downloaded before, i.e. a file with the same filename already
    exists or its ID is in a `download archive <extractor.*.archive_>`__.

    * ``true``: Skip downloads
    * ``false``: Overwrite already existing files

    * ``"abort"``: Stop the current extractor run
    * ``"abort:N"``: Skip downloads and stop the current extractor run
      after ``N`` consecutive skips

    * ``"terminate"``: Stop the current extractor run, including parent extractors
    * ``"terminate:N"``: Skip downloads and stop the current extractor run,
      including parent extractors, after ``N`` consecutive skips

    * ``"exit"``: Exit the program altogether
    * ``"exit:N"``: Skip downloads and exit the program
      after ``N`` consecutive skips

    * ``"enumerate"``: Add an enumeration index to the beginning of the
      filename extension (``file.1.ext``, ``file.2.ext``, etc.)
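
A minimal sketch of the ``"abort:N"`` form combined with a download archive; the archive path is a hypothetical placeholder.

.. code:: json

    {
        "extractor": {
            "archive": "~/gallery-dl/archive.sqlite3",
            "skip": "abort:3"
        }
    }

Under this setting, an extractor run would be expected to stop once three files in a row have been skipped, which is a common way to end an update run after reaching already-downloaded content.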

extractor.*.sleep
-----------------
Type
    |Duration|_
Default
    ``0``
Description
    Number of seconds to sleep before each download.

extractor.*.sleep-extractor
---------------------------
Type
    |Duration|_
Default
    ``0``
Description
    Number of seconds to sleep before handling an input URL,
    i.e. before starting a new extractor.

extractor.*.sleep-request
-------------------------
Type
    |Duration|_
Default
    ``0``
Description
    Minimum time interval in seconds between each HTTP request
    during data extraction.
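
The three sleep options can be combined. The sketch below assumes that a plain number of seconds is an accepted |Duration|_ value; the concrete numbers are arbitrary.

.. code:: json

    {
        "extractor": {
            "sleep": 2,
            "sleep-extractor": 5,
            "sleep-request": 0.5
        }
    }

Here ``sleep`` applies before each download, ``sleep-extractor`` before each input URL, and ``sleep-request`` between HTTP requests during data extraction.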

extractor.*.username & .password
--------------------------------
Type
    ``string``
Default
    ``null``
Description
    The username and password to use when attempting to log in to
    another site.

    Specifying a username and password is required for

    * ``nijie``

    and optional for

    * ``aibooru`` (*)
    * ``aryion``
    * ``atfbooru`` (*)
    * ``danbooru`` (*)
    * ``e621`` (*)
    * ``e926`` (*)
    * ``exhentai``
    * ``idolcomplex``
    * ``imgbb``
    * ``inkbunny``
    * ``kemonoparty``
    * ``mangadex``
    * ``mangoxo``
    * ``pillowfort``
    * ``sankaku``
    * ``seisoparty``
    * ``subscribestar``
    * ``tapas``
    * ``tsumino``
    * ``twitter``
    * ``vipergirls``
    * ``zerochan``

    These values can also be specified via the
    ``-u/--username`` and ``-p/--password`` command-line options or
    by using a |.netrc|_ file. (see Authentication_)

    (*) The password value for these sites should be
    the API key found in your user profile, not the actual account password.
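
A sketch of per-site credentials set on the category level. All values are placeholders; for ``danbooru``, one of the sites marked with (*), the ``password`` field holds the API key.

.. code:: json

    {
        "extractor": {
            "danbooru": {
                "username": "my-user-name",
                "password": "my-api-key"
            },
            "twitter": {
                "username": "my-user-name",
                "password": "my-account-password"
            }
        }
    }

Because credentials stored this way sit in a plain-text file, the command-line options or a |.netrc|_ file mentioned above may be preferable on shared machines.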

extractor.*.netrc
-----------------
Type
    ``bool``
Default
    ``false``
Description
    Enable the use of |.netrc|_ authentication data.

extractor.*.cookies
-------------------
Type
    * |Path|_
    * ``object`` (`name` -> `value`)
    * ``list``
Description
    Source to read additional cookies from. This can be

    * The |Path|_ to a Mozilla/Netscape format cookies.txt file

      .. code:: json

        "~/.local/share/cookies-instagram-com.txt"

    * An ``object`` specifying cookies as name-value pairs

      .. code:: json

        {
            "cookie-name": "cookie-value",
            "sessionid"  : "14313336321%3AsabDFvuASDnlpb%3A31",
            "isAdult"    : "1"
        }

    * A ``list`` with up to 5 entries specifying a browser profile.

      * The first entry is the browser name
      * The optional second entry is a profile name or an absolute path to a profile directory
      * The optional third entry is the keyring to retrieve passwords for decrypting cookies from
      * The optional fourth entry is a (Firefox) container name (``"none"`` for only cookies with no container)
      * The optional fifth entry is the domain to extract cookies for. Prefix it with a dot ``.`` to include cookies for subdomains. Has no effect when also specifying a container.

      .. code:: json

        ["firefox"]
        ["firefox", null, null, "Personal"]
        ["chromium", "Private", "kwallet", null, ".twitter.com"]

extractor.*.cookies-update
--------------------------
Type
    * ``bool``
    * |Path|_
Default
    ``true``
Description
    Export session cookies in cookies.txt format.

    * If this is a |Path|_, write cookies to the given file path.

    * If this is ``true`` and `extractor.*.cookies`_ specifies the |Path|_
      of a valid cookies.txt file, update its contents.
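
A sketch of the second case, where cookies are read from a cookies.txt file and written back to it. The ``instagram`` category name is an assumption taken from the file name used in the `extractor.*.cookies`_ example.

.. code:: json

    {
        "extractor": {
            "instagram": {
                "cookies": "~/.local/share/cookies-instagram-com.txt",
                "cookies-update": true
            }
        }
    }

Setting ``cookies-update`` to a different |Path|_ instead of ``true`` would leave the source file untouched and write the session cookies to that path.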

extractor.*.proxy
-----------------
Type
    * ``string``
    * ``object`` (`scheme` -> `proxy`)
Example
    .. code:: json

        "http://10.10.1.10:3128"

    .. code:: json

        {
            "http" : "http://10.10.1.10:3128",
            "https": "http://10.10.1.10:1080",
            "http://10.20.1.128": "http://10.10.1.10:5323"
        }

Description
    Proxy (or proxies) to be used for remote connections.

    * If this is a ``string``, it is the proxy URL for all
      outgoing requests.
    * If this is an ``object``, it is a scheme-to-proxy mapping to
      specify different proxy URLs for each scheme.
      It is also possible to set a proxy for a specific host by using
      ``scheme://host`` as key.
      See `Requests' proxy documentation`_ for more details.

    Note: If a proxy URL does not include a scheme,
    ``http://`` is assumed.

extractor.*.source-address
--------------------------
Type
    * ``string``
    * ``list`` with 1 ``string`` and 1 ``integer`` as elements
Example
    * ``"192.168.178.20"``
    * ``["192.168.178.20", 8080]``
Description
    Client-side IP address to bind to.

    | Can be either a simple ``string`` with just the local IP address
    | or a ``list`` with IP and explicit port number as elements.

extractor.*.user-agent
----------------------
Type
    ``string``
Default
    ``"Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:109.0) Gecko/20100101 Firefox/115.0"``
Description
    User-Agent header value to be used for HTTP requests.

    Setting this value to ``"browser"`` will try to automatically detect
    and use the User-Agent used by the system's default browser.

    Note: This option has no effect on
    `pixiv`, `e621`, and `mangadex`
    extractors, as these need specific values to function correctly.

extractor.*.browser
-------------------
Type
    ``string``
Default
    * ``"firefox"`` for ``patreon``, ``mangapark``, and ``mangasee``
    * ``null`` everywhere else
Example
    * ``"chrome:macos"``
Description
    Try to emulate a real browser (``firefox`` or ``chrome``)
    by using their default HTTP headers and TLS ciphers for HTTP requests.

    Optionally, the operating system used in the ``User-Agent`` header can be
    specified after a ``:`` (``windows``, ``linux``, or ``macos``).

    Note: ``requests`` and ``urllib3`` only support HTTP/1.1, while a real
    browser would use HTTP/2.

extractor.*.referer
-------------------
Type
    * ``bool``
    * ``string``
Default
    ``true``
Description
    Send `Referer <https://developer.mozilla.org/en-US/docs/Web/HTTP/Headers/Referer>`__
    headers with all outgoing HTTP requests.

    If this is a ``string``, send it as Referer
    instead of the extractor's ``root`` domain.

extractor.*.headers
-------------------
Type
    ``object`` (`name` -> `value`)
Default
    .. code:: json

        {
            "User-Agent"     : "<extractor.*.user-agent>",
            "Accept"         : "*/*",
            "Accept-Language": "en-US,en;q=0.5",
            "Accept-Encoding": "gzip, deflate",
            "Referer"        : "<extractor.*.referer>"
        }

Description
    Additional `HTTP headers <https://developer.mozilla.org/en-US/docs/Web/HTTP/Headers>`__
    to be sent with each HTTP request.

    To disable sending a header, set its value to ``null``.
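
As a small sketch, the following would change one header and disable another. The ``Accept-Language`` value is an arbitrary illustration, and it is an assumption that headers not listed here keep their default values.

.. code:: json

    {
        "extractor": {
            "headers": {
                "Accept-Language": "de-DE,de;q=0.8",
                "Referer": null
            }
        }
    }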

extractor.*.ciphers
-------------------
Type
    ``list`` of ``strings``
Example
    .. code:: json

        ["ECDHE-ECDSA-AES128-GCM-SHA256",
         "ECDHE-RSA-AES128-GCM-SHA256",
         "ECDHE-ECDSA-CHACHA20-POLY1305",
         "ECDHE-RSA-CHACHA20-POLY1305"]

Description
    List of TLS/SSL cipher suites in
    `OpenSSL cipher list format <https://www.openssl.org/docs/manmaster/man1/openssl-ciphers.html>`__
    to be passed to
    `ssl.SSLContext.set_ciphers() <https://docs.python.org/3/library/ssl.html#ssl.SSLContext.set_ciphers>`__.

extractor.*.keywords
--------------------
Type
    ``object`` (`name` -> `value`)
Example
    ``{"type": "Pixel Art", "type_id": 123}``
Description
    Additional name-value pairs to be added to each metadata dictionary.

extractor.*.keywords-default
----------------------------
Type
    any
Default
    ``"None"``
Description
    Default value used for missing or undefined keyword names in
    `format strings`_.

extractor.*.url-metadata
------------------------
Type
    ``string``
Default
    ``null``
Description
    Insert a file's download URL into its metadata dictionary as the given name.

    For example, setting this option to ``"gdl_file_url"`` will cause a new
    metadata field with name ``gdl_file_url`` to appear, which contains the
    current file's download URL.
    This can then be used in `filenames <extractor.*.filename_>`_,
    with a ``metadata`` post processor, etc.

extractor.*.path-metadata
-------------------------
Type
    ``string``
Default
    ``null``
Description
    Insert a reference to the current
    `PathFormat <https://github.com/mikf/gallery-dl/blob/v1.24.2/gallery_dl/path.py#L27>`__
    data structure into metadata dictionaries as the given name.

    For example, setting this option to ``"gdl_path"`` would make it possible
    to access the current file's filename as ``"{gdl_path.filename}"``.

extractor.*.http-metadata
-------------------------
Type
    ``string``
Default
    ``null``
Description
    Insert an ``object`` containing a file's HTTP headers and
    ``filename``, ``extension``, and ``date`` parsed from them
    into metadata dictionaries as the given name.

    For example, setting this option to ``"gdl_http"`` would make it possible
    to access the current file's ``Last-Modified`` header as ``"{gdl_http[Last-Modified]}"``
    and its parsed form as ``"{gdl_http[date]}"``.

extractor.*.version-metadata
----------------------------
Type
    ``string``
Default
    ``null``
Description
    Insert an ``object`` containing gallery-dl's version info into
    metadata dictionaries as the given name.

    The content of the object is as follows:

    .. code:: json

        {
            "version"         : "string",
            "is_executable"   : "bool",
            "current_git_head": "string or null"
        }

extractor.*.category-transfer
-----------------------------
Type
    ``bool``
Default
    Extractor-specific
Description
    Transfer an extractor's (sub)category values to all child
    extractors spawned by it, to let them inherit their parent's
    config options.

extractor.*.blacklist & .whitelist
----------------------------------
Type
    ``list`` of ``strings``
Default
    ``["oauth", "recursive", "test"]`` + current extractor category
Example
    ``["imgur", "redgifs:user", "*:image"]``
Description
    A list of extractor identifiers to ignore (or allow)
    when spawning child extractors for unknown URLs,
    e.g. from ``reddit`` or ``plurk``.

    Each identifier can be

    * A category or basecategory name (``"imgur"``, ``"mastodon"``)
    * | A (base)category-subcategory pair, where both names are separated by a colon (``"redgifs:user"``).
      | Both names can be a `*` or left empty, matching all possible names (``"*:image"``, ``":user"``).

    Note: Any ``blacklist`` setting will automatically include
    ``"oauth"``, ``"recursive"``, and ``"test"``.
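
A sketch of a ``whitelist`` for one of the extractors named above, reusing the identifiers from the example. Whether these particular identifiers are useful depends on the URLs found in practice.

.. code:: json

    {
        "extractor": {
            "reddit": {
                "whitelist": ["imgur", "redgifs:user", "*:image"]
            }
        }
    }

With this setting, child extractors spawned from ``reddit`` would be limited to ``imgur``, the ``user`` subcategory of ``redgifs``, and any extractor with an ``image`` subcategory.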

extractor.*.archive
-------------------
Type
    |Path|_
Default
    ``null``
Example
    ``"$HOME/.archives/{category}.sqlite3"``
Description
    File to store IDs of downloaded files in. Downloads of files
    already recorded in this archive file will be
    `skipped <extractor.*.skip_>`__.

    The resulting archive file is not a plain text file but an SQLite3
    database, as either lookup operations are significantly faster or
    memory requirements are significantly lower when the
    number of stored IDs gets reasonably large.

    Note: Archive files that do not already exist get generated automatically.

    Note: Archive paths support regular `format string`_ replacements,
    but be aware that using external inputs for building local paths
    may pose a security risk.

extractor.*.archive-format
--------------------------
Type
    ``string``
Example
    ``"{id}_{offset}"``
Description
    An alternative `format string`_ to build archive IDs with.

extractor.*.archive-prefix
--------------------------
Type
    ``string``
Default
    ``"{category}"``
Description
    Prefix for archive IDs.

extractor.*.archive-pragma
--------------------------
Type
    ``list`` of ``strings``
Example
    ``["journal_mode=WAL", "synchronous=NORMAL"]``
Description
    A list of SQLite ``PRAGMA`` statements to run during archive initialization.

    See `<https://www.sqlite.org/pragma.html>`__
    for available ``PRAGMA`` statements and further details.
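
The archive options are typically set together. The sketch below only reuses the example values given for `extractor.*.archive`_ and this option.

.. code:: json

    {
        "extractor": {
            "archive": "$HOME/.archives/{category}.sqlite3",
            "archive-pragma": ["journal_mode=WAL", "synchronous=NORMAL"]
        }
    }

This would keep one archive database per site category and apply both ``PRAGMA`` statements whenever such a database is initialized.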

extractor.*.postprocessors
--------------------------
Type
    ``list`` of |Postprocessor Configuration|_ objects
Example
    .. code:: json

        [
            {
                "name": "zip",
                "compression": "store"
            },
            {
                "name": "exec",
                "command": ["/home/foobar/script", "{category}", "{image_id}"]
            }
        ]

Description
    A list of `post processors <Postprocessor Configuration_>`__
    to be applied to each downloaded file in the specified order.

    | Unlike other options, a |postprocessors|_ setting at a deeper level
      does not override any |postprocessors|_ setting at a lower level.
    | Instead, all post processors from all applicable |postprocessors|_
      settings get combined into a single list.

    For example

    * an ``mtime`` post processor at ``extractor.postprocessors``,
    * a ``zip`` post processor at ``extractor.pixiv.postprocessors``,
    * and using ``--exec``

    will run all three post processors - ``mtime``, ``zip``, ``exec`` -
    for each downloaded ``pixiv`` file.
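
The configuration-file part of that example can be sketched as follows. The ``zip`` options are copied from the example above, and the ``exec`` post processor is not shown because it comes from the ``--exec`` command-line option.

.. code:: json

    {
        "extractor": {
            "postprocessors": [
                {"name": "mtime"}
            ],
            "pixiv": {
                "postprocessors": [
                    {"name": "zip", "compression": "store"}
                ]
            }
        }
    }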

extractor.*.postprocessor-options
---------------------------------
Type
    ``object`` (`name` -> `value`)
Example
    .. code:: json

        {
            "archive": null,
            "keep-files": true
        }

Description
    Additional `Postprocessor Options`_ that get added to each individual
    `post processor object <Postprocessor Configuration_>`__
    before initializing it and evaluating filters.

extractor.*.retries
-------------------
Type
    ``integer``
Default
    ``4``
Description
    Maximum number of times a failed HTTP request is retried before
    giving up, or ``-1`` for infinite retries.

extractor.*.retry-codes
-----------------------
Type
    ``list`` of ``integers``
Example
    ``[404, 429, 430]``
Description
    Additional `HTTP response status codes <https://developer.mozilla.org/en-US/docs/Web/HTTP/Status>`__
    to retry an HTTP request on.

    ``2xx`` codes (success responses) and
    ``3xx`` codes (redirection messages)
    will never be retried and always count as success,
    regardless of this option.

    ``5xx`` codes (server error responses) will always be retried,
    regardless of this option.
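
A sketch combining this option with `extractor.*.retries`_; the numbers are arbitrary illustrations.

.. code:: json

    {
        "extractor": {
            "retries": 10,
            "retry-codes": [429, 430]
        }
    }

Here a request answered with status ``429`` or ``430`` would be retried up to ten times, in addition to the ``5xx`` responses that are always retried.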

extractor.*.timeout
-------------------
Type
    ``float``
Default
    ``30.0``
Description
    Amount of time (in seconds) to wait for a successful connection
    and response from a remote server.

    This value gets internally used as the |timeout|_ parameter for the
    |requests.request()|_ method.

extractor.*.verify
------------------
Type
    * ``bool``
    * ``string``
Default
    ``true``
Description
    Controls whether to verify SSL/TLS certificates for HTTPS requests.

    If this is a ``string``, it must be the path to a CA bundle to use
    instead of the default certificates.

    This value gets internally used as the |verify|_ parameter for the
    |requests.request()|_ method.

extractor.*.download
--------------------
Type
    ``bool``
Default
    ``true``
Description
    Controls whether to download media files.

    Setting this to ``false`` won't download any files, but all other
    functions (`postprocessors`_, `download archive`_, etc.)
    will be executed as normal.

extractor.*.fallback
--------------------
Type
    ``bool``
Default
    ``true``
Description
    Use fallback download URLs when a download fails.
extractor.*.image-range
-----------------------
2020-09-26 13:33:46 +02:00
Type
2022-12-27 18:21:12 +01:00
* `` string ``
* `` list `` of `` strings ``
Examples
2020-09-26 13:33:46 +02:00
* `` "10-20" ``
* `` "-5, 10, 30-50, 100-" ``
2022-12-27 18:21:12 +01:00
* `` "10:21, 30:51:2, :5, 100:" ``
* `` ["-5", "10", "30-50", "100-"] ``
2020-09-26 13:33:46 +02:00
Description
2022-12-27 18:21:12 +01:00
Index range(s) selecting which files to download.
These can be specified as
* index: `` 3 `` (file number 3)
* range: `` 2-4 `` (files 2, 3, and 4)
* `slice <https://docs.python.org/3/library/functions.html#slice> `__ : `` 3:8:2 `` (files 3, 5, and 7)
| Arguments for range and slice notation are optional
and will default to begin (`` 1 `` ) or end (`` sys.maxsize `` ) if omitted.
| For example `` 5- `` , `` 5: `` , and `` 5:: `` all mean "Start at file number 5".
2018-10-08 23:08:11 +02:00
2022-12-27 18:21:12 +01:00
Note: The index of the first file is `` 1 `` .
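The notation described above can be illustrated with a short Python sketch. This is not gallery-dl's actual parser: the helper names `` parse_ranges `` and `` is_selected `` are made up for this example, and the code only assumes the semantics stated in this section (1-based indices, inclusive ranges, Python-style slices with an exclusive end).

```python
import sys


def parse_ranges(spec):
    """Turn an image-range value into a list of Python range objects."""
    if isinstance(spec, str):
        spec = spec.split(",")
    ranges = []
    for part in spec:
        part = part.strip()
        if not part:
            continue
        if ":" in part:
            # Slice notation "start:stop:step"; the stop value is exclusive.
            fields = part.split(":")
            fields += [""] * (3 - len(fields))
            start = int(fields[0]) if fields[0] else 1
            stop = int(fields[1]) if fields[1] else sys.maxsize
            step = int(fields[2]) if fields[2] else 1
        elif "-" in part:
            # Range notation "first-last"; both ends are inclusive.
            first, _, last = part.partition("-")
            start = int(first) if first else 1
            stop = int(last) + 1 if last else sys.maxsize
            step = 1
        else:
            # A single index.
            start = int(part)
            stop = start + 1
            step = 1
        ranges.append(range(start, stop, step))
    return ranges


def is_selected(index, ranges):
    """Return True if the 1-based file index falls into any range."""
    return any(index in r for r in ranges)


ranges = parse_ranges("3:8:2")
print([i for i in range(1, 10) if is_selected(i, ranges)])  # [3, 5, 7]
```

With this sketch, `` "5-" `` , `` "5:" `` , and `` "5::" `` all produce the same open-ended range starting at file number 5.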
2018-10-08 23:08:11 +02:00
extractor.*.chapter-range
-------------------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
Description
Like `image-range <extractor.*.image-range_> `__ ,
but applies to delegated URLs like manga chapters, etc.
2018-10-08 23:08:11 +02:00
extractor.*.image-filter
------------------------
2020-09-26 13:33:46 +02:00
Type
2022-12-21 20:36:46 +01:00
* `` string ``
* `` list `` of `` strings ``
Examples
2020-09-26 13:33:46 +02:00
* `` "re.search(r'foo(bar)+', description)" ``
* `` ["width >= 1200", "width/height > 1.2"] ``
2020-09-26 13:33:46 +02:00
Description
Python expression controlling which files to download.
2022-12-21 20:36:46 +01:00
A file only gets downloaded when *all* of the given expressions evaluate to `` True `` .
Available values are the filename-specific ones listed by `` -K `` or `` -j `` .
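As a rough illustration of how such expressions behave, the following Python sketch evaluates one or more filter strings against a metadata dictionary. It is a simplified model, not gallery-dl's implementation: `` build_filter `` is a hypothetical helper, and the sample metadata values are invented.

```python
import re


def build_filter(expressions):
    """Return a predicate that is true only if every expression is truthy."""
    if isinstance(expressions, str):
        expressions = [expressions]
    compiled = [compile(expr, "<image-filter>", "eval") for expr in expressions]

    def check(metadata):
        # Metadata fields act as local variables of each expression;
        # the "re" module is made available as a global name.
        namespace = {"re": re}
        return all(eval(code, namespace, metadata) for code in compiled)

    return check


# Invented metadata for a single file
sample = {"width": 1920, "height": 1080, "description": "a foobarbar picture"}

wide = build_filter(["width >= 1200", "width/height > 1.2"])
print(wide(sample))  # True
```

A file whose metadata fails any one of the listed expressions would be skipped under this model.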
2018-10-08 23:08:11 +02:00
extractor.*.chapter-filter
--------------------------
2020-09-26 13:33:46 +02:00
Type
2022-12-21 20:36:46 +01:00
* `` string ``
* `` list `` of `` strings ``
Examples
2020-09-26 13:33:46 +02:00
* `` "lang == 'en'" ``
* `` ["language == 'French'", "10 <= chapter < 20"] ``
2020-09-26 13:33:46 +02:00
Description
Like `image-filter <extractor.*.image-filter_> `__ ,
but applies to delegated URLs like manga chapters, etc.
2018-10-08 23:08:11 +02:00
2018-03-16 11:49:49 +01:00
2019-06-29 22:48:59 +02:00
extractor.*.image-unique
------------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` false ``
Description
Ignore image URLs that have been encountered before during the
current extractor run.
2019-06-29 22:48:59 +02:00
extractor.*.chapter-unique
--------------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` false ``
Description
Like `image-unique <extractor.*.image-unique_> `__ ,
but applies to delegated URLs like manga chapters, etc.
2019-06-29 22:48:59 +02:00
2019-07-16 23:08:27 +02:00
extractor.*.date-format
-----------------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
Default
`` "%Y-%m-%dT%H:%M:%S" ``
Description
Format string used to parse `` string `` values of
`date-min` and `date-max` .
2019-07-16 23:08:27 +02:00
2020-09-26 13:33:46 +02:00
See |strptime|_ for a list of formatting directives.
2019-07-16 23:08:27 +02:00
2022-12-20 17:30:46 +01:00
Note: Despite its name, this option does **not** control how
`` {date} `` metadata fields are formatted.
To format those values differently from the default
`` %Y-%m-%d %H:%M:%S `` , put |strptime|_ formatting directives
after a colon `` : `` , for example `` {date:%Y%m%d} `` .
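The difference between parsing and formatting can be shown with plain Python. The sketch below uses the standard `` datetime `` module and `` str.format `` as a stand-in for gallery-dl's own format-string handling, so it only demonstrates the directives themselves; the date value is an arbitrary example.

```python
from datetime import datetime

# Default value of the date-format option
DATE_FORMAT = "%Y-%m-%dT%H:%M:%S"

# Parsing: a string as it might be given for date-min or date-max
date_min = datetime.strptime("2023-01-15T08:30:00", DATE_FORMAT)

# Formatting: directives after the colon control the output
compact = "{date:%Y%m%d}".format(date=date_min)
default = "{date:%Y-%m-%d %H:%M:%S}".format(date=date_min)

print(compact)  # 20230115
print(default)  # 2023-01-15 08:30:00
```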
2019-07-16 23:08:27 +02:00
2018-11-16 18:02:24 +01:00
2017-06-18 22:16:26 +02:00
Extractor-specific Options
==========================
2018-11-16 18:02:24 +01:00
2018-03-14 14:03:53 +01:00
extractor.artstation.external
-----------------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` false ``
Description
Try to follow external URLs of embedded players.
2018-03-14 14:03:53 +01:00
2022-11-23 22:00:18 +01:00
extractor.artstation.max-posts
------------------------------
Type
`` integer ``
Default
`` null ``
Description
Limit the number of posts/projects to download.
2022-11-23 21:45:20 +01:00
extractor.artstation.search.pro-first
-------------------------------------
Type
`` bool ``
Default
`` true ``
Description
Enable the "Show Studio and Pro member artwork first" checkbox
when retrieving search results.
2020-06-26 22:05:02 +02:00
extractor.aryion.recursive
--------------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` true ``
Description
Controls the post extraction strategy.
2020-06-26 22:05:02 +02:00
2020-09-26 13:33:46 +02:00
* `` true `` : Start on users' main gallery pages and recursively
descend into subfolders
* `` false `` : Get posts from "Latest Updates" pages
2020-06-26 22:05:02 +02:00
2021-07-30 01:09:32 +02:00
extractor.bbc.width
-------------------
Type
2022-12-20 17:30:46 +01:00
`` integer ``
2021-07-30 01:09:32 +02:00
Default
`` 1920 ``
Description
Specifies the requested image width.
This value must be divisible by 16 and gets rounded down otherwise.
The maximum possible value appears to be `` 1920 `` .
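The rounding rule can be written out as a small Python sketch. The function name `` effective_width `` is hypothetical, and capping at `` 1920 `` is an assumption based on the observation above, not confirmed behavior.

```python
def effective_width(width, maximum=1920):
    """Round down to a multiple of 16 (capping at 1920 is an assumption)."""
    width = min(width, maximum)
    return width - width % 16


print(effective_width(1000))  # 992
print(effective_width(1920))  # 1920
```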
2020-01-23 22:42:56 +01:00
extractor.blogger.videos
------------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` true ``
Description
Download embedded videos hosted on https://www.blogger.com/
2020-01-23 22:42:56 +01:00
2022-05-10 12:17:59 +02:00
extractor.cyberdrop.domain
--------------------------
Type
`` string ``
Default
2022-12-11 17:34:34 +01:00
`` null ``
2022-05-10 12:17:59 +02:00
Example
`` "cyberdrop.to" ``
Description
Specifies the domain used by `` cyberdrop `` regardless of input URL.
Setting this option to `` "auto" ``
uses the same domain as a given input URL.
2021-08-08 19:18:41 +02:00
extractor.danbooru.external
---------------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` false ``
Description
2021-08-08 19:18:41 +02:00
For unavailable or restricted posts,
follow the `` source `` and download from there if possible.
2019-08-31 21:46:49 +02:00
2021-06-20 22:41:41 +02:00
2023-02-04 21:20:38 +01:00
extractor.danbooru.ugoira
-------------------------
Type
`` bool ``
Default
`` false ``
Description
Controls the download target for Ugoira posts.
* `` true `` : Original ZIP archives
* `` false `` : Converted video files
extractor.[Danbooru].metadata
-----------------------------
2021-04-13 23:41:30 +02:00
Type
2023-01-13 16:20:01 +01:00
* `` bool ``
* `` string ``
* `` list `` of `` strings ``
2021-04-13 23:41:30 +02:00
Default
`` false ``
2023-01-13 16:20:01 +01:00
Example
* `` "replacements,comments,ai_tags" ``
* `` ["replacements", "comments", "ai_tags"] ``
2021-04-13 23:41:30 +02:00
Description
Extract additional metadata
(notes, artist commentary, parent, children, uploader).
2021-04-13 23:41:30 +02:00
2023-01-13 16:20:01 +01:00
It is possible to specify a custom list of metadata includes.
See `available_includes <https://github.com/danbooru/danbooru/blob/2cf7baaf6c5003c1a174a8f2d53db010cf05dca7/app/models/post.rb#L1842-L1849> `__
for possible field names. `` aibooru `` also supports `` ai_metadata `` .
2023-04-02 20:11:52 +02:00
Note: This requires 1 additional HTTP request per 200-post batch.
2019-08-31 21:46:49 +02:00
2021-06-20 22:41:41 +02:00
2023-07-26 14:01:16 +02:00
extractor.[Danbooru].threshold
------------------------------
2022-12-16 09:15:36 +01:00
Type
2022-12-20 17:30:46 +01:00
* `` string ``
* `` integer ``
2022-12-16 09:15:36 +01:00
Default
2022-12-17 14:06:47 +01:00
`` "auto" ``
2022-12-16 09:15:36 +01:00
Description
2022-12-17 14:06:47 +01:00
Stop paginating over API results if the length of a batch of returned
posts is less than the specified number. Defaults to the per-page limit
of the current instance, which is 200.
2022-12-17 14:06:47 +01:00
Note: Changing this setting is normally not necessary. When the value is
greater than the per-page limit, gallery-dl will stop after the first
batch. The value cannot be less than 1.
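The stopping rule can be sketched in Python as follows. Both `` paginate `` and `` make_fake_api `` are hypothetical helpers written for this illustration; the fake API simply serves consecutive integers in place of real posts.

```python
def paginate(fetch_page, threshold):
    """Yield posts page by page until a batch is shorter than threshold."""
    page = 1
    while True:
        batch = fetch_page(page)
        yield from batch
        if len(batch) < threshold:
            return
        page += 1


def make_fake_api(total, per_page):
    """Stand-in for an API holding `total` posts (illustration only)."""
    calls = []

    def fetch_page(page):
        calls.append(page)
        start = (page - 1) * per_page
        return list(range(start, min(start + per_page, total)))

    return fetch_page, calls


fetch, calls = make_fake_api(total=450, per_page=200)
posts = list(paginate(fetch, threshold=200))
print(len(posts), calls)  # 450 [1, 2, 3]
```

In this model, a threshold of `` 201 `` against a per-page limit of 200 ends the loop after the first batch, which matches the note above.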
2022-12-16 09:15:36 +01:00
2021-01-07 18:05:32 +01:00
extractor.derpibooru.api-key
----------------------------
Type
`` string ``
Default
`` null ``
Description
Your `Derpibooru API Key <https://derpibooru.org/registrations/edit> `__ ,
to use your account's browsing settings and filters.
extractor.derpibooru.filter
---------------------------
Type
`` integer ``
Default
`` 56027 `` (`Everything <https://derpibooru.org/filters/56027> `_ filter)
Description
The content filter ID to use.
Setting an explicit filter ID overrides any default filters and can be used
to access 18+ content without an `API Key <extractor.derpibooru.api-key_> `_ .
See `Filters <https://derpibooru.org/filters> `_ for details.
2021-09-07 21:16:49 +02:00
extractor.deviantart.auto-watch
-------------------------------
Type
`` bool ``
Default
`` false ``
Description
Automatically watch users when encountering "Watchers-Only Deviations"
(requires a `refresh-token <extractor.deviantart.refresh-token_> `_ ).
extractor.deviantart.auto-unwatch
---------------------------------
Type
`` bool ``
Default
`` false ``
Description
After watching a user through `auto-watch <extractor.deviantart.auto-watch_> `_ ,
unwatch that user at the end of the current extractor run.
2021-08-29 20:56:34 +02:00
extractor.deviantart.comments
-----------------------------
Type
`` bool ``
Default
`` false ``
Description
Extract `` comments `` metadata.
2019-06-10 21:05:25 +02:00
extractor.deviantart.extra
--------------------------
2020-09-26 13:33:46 +02:00
Type
2021-03-19 16:24:23 +01:00
`` bool ``
2020-09-26 13:33:46 +02:00
Default
2021-03-19 16:24:23 +01:00
`` false ``
2020-09-26 13:33:46 +02:00
Description
2021-03-19 16:24:23 +01:00
Download extra Sta.sh resources from
description texts and journals.
2021-03-06 21:31:28 +01:00
2021-03-19 16:24:23 +01:00
Note: Enabling this option also enables deviantart.metadata_.
2019-06-10 21:05:25 +02:00
2017-07-12 17:05:31 +02:00
extractor.deviantart.flat
-------------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` true ``
Description
Select the directory structure created by the Gallery- and
Favorite-Extractors.
2017-07-12 17:05:31 +02:00
2020-09-26 13:33:46 +02:00
* `` true `` : Use a flat directory structure.
* `` false `` : Collect a list of all gallery-folders or
favorites-collections and transfer any further work to other
extractors (`` folder `` or `` collection `` ), which will then
create individual subdirectories for each of them.
Note: Going through all gallery folders will miss
deviations that aren't in any folder.
2017-07-12 17:05:31 +02:00
2019-05-29 23:50:05 +02:00
extractor.deviantart.folders
----------------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` false ``
Description
Provide a `` folders `` metadata field that contains the names of all
folders a deviation is present in.
2019-05-29 23:50:05 +02:00
2020-09-26 13:33:46 +02:00
Note: Gathering this information requires a lot of API calls.
Use with caution.
2019-05-29 23:50:05 +02:00
2022-10-06 22:47:14 +02:00
extractor.deviantart.group
--------------------------
Type
`` bool ``
Default
`` true ``
Description
Check whether the profile name in a given URL
belongs to a group or a regular user.
2019-11-06 23:57:12 +01:00
extractor.deviantart.include
----------------------------
2020-09-26 13:33:46 +02:00
Type
2022-12-20 17:30:46 +01:00
* `` string ``
* `` list `` of `` strings ``
2020-09-26 13:33:46 +02:00
Default
`` "gallery" ``
Example
2022-12-20 17:30:46 +01:00
* `` "favorite,journal,scraps" ``
* `` ["favorite", "journal", "scraps"] ``
2020-09-26 13:33:46 +02:00
Description
A (comma-separated) list of subcategories to include
when processing a user profile.
2019-11-06 23:57:12 +01:00
2020-09-26 13:33:46 +02:00
Possible values are
`` "gallery" `` , `` "scraps" `` , `` "journal" `` , `` "favorite" `` , `` "status" `` .
2019-11-06 23:57:12 +01:00
2022-12-20 17:30:46 +01:00
It is possible to use `` "all" `` instead of listing all values separately.
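The accepted forms of this value (comma-separated string, list, or `` "all" `` ) can be modeled with a short Python sketch. `` resolve_include `` is a hypothetical helper, and dropping unknown names is an assumption made for this example rather than documented behavior.

```python
DEVIANTART_SUBCATEGORIES = ("gallery", "scraps", "journal", "favorite", "status")


def resolve_include(value, available=DEVIANTART_SUBCATEGORIES):
    """Normalize an include value to a list of subcategory names."""
    if isinstance(value, str):
        value = value.split(",")
    names = [name.strip() for name in value]
    if "all" in names:
        return list(available)
    # Assumption: names outside the documented set are dropped.
    return [name for name in names if name in available]


print(resolve_include("favorite,journal,scraps"))
print(resolve_include("all"))
```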
2019-11-06 23:57:12 +01:00
2018-07-16 18:14:41 +02:00
extractor.deviantart.journals
-----------------------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
Default
`` "html" ``
Description
2023-01-17 14:16:25 +01:00
Selects the output format for textual content. This includes journals,
literature and status updates.
2018-07-16 18:14:41 +02:00
2020-09-26 13:33:46 +02:00
* `` "html" `` : HTML with (roughly) the same layout as on DeviantArt.
* `` "text" `` : Plain text with image references and HTML tags removed.
2023-01-17 14:16:25 +01:00
* `` "none" `` : Don't download textual content.
2018-07-16 18:14:41 +02:00
2023-09-22 16:57:28 +02:00
extractor.deviantart.jwt
------------------------
Type
`` bool ``
Default
2023-09-24 14:45:34 +02:00
`` true ``
2023-09-22 16:57:28 +02:00
Description
Update `JSON Web Tokens <https://jwt.io/> `__ (the `` token `` URL parameter)
of otherwise non-downloadable, low-resolution images
to be able to download them in full resolution.
2017-10-09 23:20:17 +02:00
extractor.deviantart.mature
---------------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` true ``
Description
Enable mature content.
2017-10-07 13:07:34 +02:00
2020-09-26 13:33:46 +02:00
This option simply sets the |mature_content|_ parameter for API
calls to either `` "true" `` or `` "false" `` and does not do any other
form of content filtering.
2017-10-07 13:07:34 +02:00
2019-03-21 14:46:47 +01:00
extractor.deviantart.metadata
-----------------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` false ``
Description
Request extended metadata for deviation objects to additionally provide
`` description `` , `` tags `` , `` license `` and `` is_watching `` fields.
2019-03-21 14:46:47 +01:00
2017-10-09 23:20:17 +02:00
extractor.deviantart.original
-----------------------------
2020-09-26 13:33:46 +02:00
Type
2022-12-20 17:30:46 +01:00
* `` bool ``
* `` string ``
2020-09-26 13:33:46 +02:00
Default
`` true ``
Description
Download original files if available.
2017-06-18 22:16:26 +02:00
2020-09-26 13:33:46 +02:00
Setting this option to `` "images" `` only downloads original
files if they are images and falls back to preview versions for
everything else (archives, etc.).
2017-06-18 22:16:26 +02:00
2022-04-18 18:08:01 +02:00
extractor.deviantart.pagination
-------------------------------
Type
`` string ``
Default
`` "api" ``
Description
Controls when to stop paginating over API results.
* `` "api" `` : Trust the API and stop when `` has_more `` is `` false `` .
* `` "manual" `` : Disregard `` has_more `` and only stop when a batch of results is empty.
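The two modes differ only in their stop condition, which the following Python sketch spells out. The function `` should_stop `` is hypothetical, and the `` results `` key used for the batch of items is an assumption about the response layout; only `` has_more `` is taken from the description above.

```python
def should_stop(mode, response):
    """Decide whether pagination ends after this API response."""
    results = response.get("results", [])  # key name is an assumption
    if mode == "manual":
        # Ignore has_more and stop only on an empty batch.
        return not results
    # "api": trust the has_more flag reported by the API.
    return not response.get("has_more", False)


page = {"has_more": False, "results": ["deviation-1", "deviation-2"]}
print(should_stop("api", page))     # True
print(should_stop("manual", page))  # False
```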
2023-04-08 22:52:13 +02:00
extractor.deviantart.public
---------------------------
Type
`` bool ``
Default
`` true ``
Description
Use a public access token for API requests.
Disable this option to *force* using a private token for all requests
when a `refresh token <extractor.deviantart.refresh-token_> `__ is provided.
2023-09-24 17:36:05 +02:00
extractor.deviantart.quality
----------------------------
Type
`` integer ``
Default
`` 100 ``
Description
JPEG quality level of newer images for which
an original file download is not available.
Note: Only has an effect when `deviantart.jwt <extractor.deviantart.jwt_> `__ is disabled.
2018-07-25 12:52:36 +02:00
extractor.deviantart.refresh-token
----------------------------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
Default
`` null ``
Description
The `` refresh-token `` value you get from
`linking your DeviantArt account to gallery-dl <OAuth_> `__ .
2018-07-25 12:52:36 +02:00
2020-09-26 13:33:46 +02:00
Using a `` refresh-token `` allows you to access private or otherwise
not publicly available deviations.
2019-10-13 23:01:57 +02:00
2020-09-26 13:33:46 +02:00
Note: The `` refresh-token `` becomes invalid
`after 3 months <https://www.deviantart.com/developers/authentication#refresh> `__
or whenever your `cache file <cache.file_> `__ is deleted or cleared.
2018-07-25 12:52:36 +02:00
2018-07-14 11:52:21 +02:00
extractor.deviantart.wait-min
-----------------------------
2020-09-26 13:33:46 +02:00
Type
`` integer ``
Default
`` 0 ``
Description
Minimum wait time in seconds before API requests.
2018-07-14 11:52:21 +02:00
2023-02-04 21:20:38 +01:00
extractor.[E621].metadata
-------------------------
Type
* `` bool ``
* `` string ``
* `` list `` of `` strings ``
Default
`` false ``
Example
* `` "notes,pools" ``
* `` ["notes", "pools"] ``
Description
Extract additional metadata (notes, pool metadata) if available.
Note: This requires 0-2 additional HTTP requests per post.
extractor.[E621].threshold
--------------------------
Type
* `` string ``
* `` integer ``
Default
`` "auto" ``
Description
Stop paginating over API results if the length of a batch of returned
posts is less than the specified number. Defaults to the per-page limit
of the current instance, which is 320.
Note: Changing this setting is normally not necessary. When the value is
greater than the per-page limit, gallery-dl will stop after the first
batch. The value cannot be less than 1.
2020-07-17 19:25:19 +02:00
extractor.exhentai.domain
-------------------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
Default
`` "auto" ``
Description
* `` "auto" `` : Use `` e-hentai.org `` or `` exhentai.org ``
depending on the input URL
* `` "e-hentai.org" `` : Use `` e-hentai.org `` for all URLs
* `` "exhentai.org" `` : Use `` exhentai.org `` for all URLs
2020-07-17 19:25:19 +02:00
2023-08-23 12:54:45 +02:00
extractor.exhentai.fav
----------------------
Type
`` string ``
Example
`` "4" ``
Description
After downloading a gallery,
add it to your account's favorites as the given category number.
Note: Set this to `` "favdel" `` to remove galleries from your favorites.
Note: This will remove any Favorite Notes when applied
to already favorited galleries.
2021-12-16 22:29:04 +01:00
extractor.exhentai.limits
-------------------------
Type
`` integer ``
Default
`` null ``
Description
Sets a custom image download limit and
stops extraction when it gets exceeded.
2021-02-22 22:59:51 +01:00
extractor.exhentai.metadata
---------------------------
Type
2021-02-26 17:53:27 +01:00
`` bool ``
2021-02-22 22:59:51 +01:00
Default
2021-02-26 17:53:27 +01:00
`` false ``
2021-02-22 22:59:51 +01:00
Description
2021-02-26 17:53:27 +01:00
Load extended gallery metadata from the
`API <https://ehwiki.org/wiki/API#Gallery_Metadata> `_ .
2021-02-22 22:59:51 +01:00
2021-02-26 17:53:27 +01:00
Adds `` archiver_key `` , `` posted `` , and `` torrents `` .
Makes `` date `` and `` filesize `` more precise.
2021-02-22 22:59:51 +01:00
2017-06-18 22:16:26 +02:00
extractor.exhentai.original
---------------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` true ``
Description
Download full-sized original images if available.
2017-06-18 22:16:26 +02:00
2021-12-16 22:29:04 +01:00
extractor.exhentai.source
-------------------------
Type
`` string ``
Default
`` "gallery" ``
Description
Selects an alternative source to download files from.
* `` "hitomi" `` : Download the corresponding gallery from `` hitomi.la ``
2021-04-25 19:39:13 +02:00
extractor.fanbox.embeds
-----------------------
Type
2022-12-20 17:30:46 +01:00
* `` bool ``
* `` string ``
2021-04-25 19:39:13 +02:00
Default
`` true ``
Description
Control behavior on embedded content from external sites.
* `` true `` : Extract embed URLs and download them if supported
(videos are not downloaded).
* `` "ytdl" `` : Like `` true `` , but let `youtube-dl`_ handle video
extraction and download for YouTube, Vimeo and SoundCloud embeds.
* `` false `` : Ignore embeds.
2017-10-12 23:37:28 +02:00
extractor.flickr.access-token & .access-token-secret
----------------------------------------------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
Default
`` null ``
Description
The `` access_token `` and `` access_token_secret `` values you get
from `linking your Flickr account to gallery-dl <OAuth_> `__ .
2017-06-18 22:16:26 +02:00
2023-07-01 19:19:39 +02:00
extractor.flickr.exif
---------------------
Type
`` bool ``
Default
`` false ``
Description
Fetch `` exif `` and `` camera `` metadata for each photo.
Note: This requires 1 additional API call per photo.
2023-06-26 16:49:48 +02:00
extractor.flickr.metadata
-------------------------
Type
* `` bool ``
* `` string ``
* `` list `` of `` strings ``
Default
`` false ``
Example
* `` "license,last_update,machine_tags" ``
* `` ["license", "last_update", "machine_tags"] ``
Description
Extract additional metadata
(license, date_taken, original_format, last_update, geo, machine_tags, o_dims).
It is possible to specify a custom list of metadata includes.
See `the extras parameter <https://www.flickr.com/services/api/flickr.people.getPhotos.html> `__
in `Flickr API docs <https://www.flickr.com/services/api/> `__
for possible field names.
2019-05-14 18:12:02 +02:00
extractor.flickr.videos
-----------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` true ``
Description
Extract and download videos.
2017-06-18 22:16:26 +02:00
2017-06-20 16:20:28 +02:00
extractor.flickr.size-max
--------------------------
2020-09-26 13:33:46 +02:00
Type
2022-12-20 17:30:46 +01:00
* `` integer ``
* `` string ``
2020-09-26 13:33:46 +02:00
Default
`` null ``
Description
Sets the maximum allowed size for downloaded images.
2017-06-20 16:20:28 +02:00
2020-09-26 13:33:46 +02:00
* If this is an `` integer `` , it specifies the maximum image dimension
(width and height) in pixels.
* If this is a `` string `` , it should be one of Flickr's format specifiers
(`` "Original" `` , `` "Large" `` , ... or `` "o" `` , `` "k" `` , `` "h" `` ,
`` "l" `` , ...) to use as an upper limit.
2017-06-20 16:20:28 +02:00
2017-12-21 21:42:40 +01:00
2021-01-19 19:09:29 +01:00
extractor.furaffinity.descriptions
----------------------------------
Type
`` string ``
Default
`` "text" ``
Description
Controls the format of `` description `` metadata fields.
* `` "text" `` : Plain text with HTML tags removed
* `` "html" `` : Raw HTML content
2021-08-08 18:53:02 +02:00
extractor.furaffinity.external
------------------------------
Type
`` bool ``
Default
`` false ``
Description
Follow external URLs linked in descriptions.
2020-02-12 21:39:43 +01:00
extractor.furaffinity.include
-----------------------------
2020-09-26 13:33:46 +02:00
Type
2022-12-20 17:30:46 +01:00
* `` string ``
* `` list `` of `` strings ``
2020-09-26 13:33:46 +02:00
Default
`` "gallery" ``
Example
2022-12-20 17:30:46 +01:00
* `` "scraps,favorite" ``
* `` ["scraps", "favorite"] ``
2020-09-26 13:33:46 +02:00
Description
A (comma-separated) list of subcategories to include
when processing a user profile.
2020-02-12 21:39:43 +01:00
2020-09-26 13:33:46 +02:00
Possible values are
`` "gallery" `` , `` "scraps" `` , `` "favorite" `` .
2020-02-12 21:39:43 +01:00
2022-12-20 17:30:46 +01:00
It is possible to use `` "all" `` instead of listing all values separately.
2020-02-12 21:39:43 +01:00
2022-02-11 00:00:01 +01:00
extractor.furaffinity.layout
----------------------------
Type
`` string ``
Default
`` "auto" ``
Description
Selects which site layout to expect when parsing posts.
* `` "auto" `` : Automatically differentiate between `` "old" `` and `` "new" ``
* `` "old" `` : Expect the *old* site layout
* `` "new" `` : Expect the *new* site layout
2022-07-18 18:46:31 +02:00
extractor.gelbooru.api-key & .user-id
-------------------------------------
Type
`` string ``
Default
`` null ``
Description
Values from the API Access Credentials section found at the bottom of your
`Account Options <https://gelbooru.com/index.php?page=account&s=options> `__
page.
2021-12-29 22:45:07 +01:00
extractor.generic.enabled
-------------------------
Type
`` bool ``
Default
`` false ``
Description
Match **all** URLs not otherwise supported by gallery-dl,
even ones without a `` generic: `` prefix.
2022-03-29 17:31:57 +02:00
extractor.gofile.api-token
--------------------------
Type
`` string ``
Default
`` null ``
Description
API token value found at the bottom of your `profile page <https://gofile.io/myProfile> `__ .
If not set, a temporary guest token will be used.
2022-06-01 12:59:52 +02:00
extractor.gofile.website-token
------------------------------
Type
`` string ``
Description
API token value used during API requests.
2023-05-20 16:58:21 +02:00
An invalid or not up-to-date value
will result in `` 401 Unauthorized `` errors.
2022-06-01 12:59:52 +02:00
2023-05-20 16:58:21 +02:00
Keeping this option unset will use an extra HTTP request
to attempt to fetch the current value used by gofile.
2022-06-01 12:59:52 +02:00
2022-03-29 17:31:57 +02:00
extractor.gofile.recursive
--------------------------
Type
`` bool ``
Default
`` false ``
Description
Recursively download files from subfolders.
2020-09-24 00:48:08 +02:00
extractor.hentaifoundry.include
-------------------------------
2020-09-26 13:33:46 +02:00
Type
2022-12-20 17:30:46 +01:00
* `` string ``
* `` list `` of `` strings ``
2020-09-26 13:33:46 +02:00
Default
2020-10-11 18:44:46 +02:00
`` "pictures" ``
2020-09-26 13:33:46 +02:00
Example
2022-12-20 17:30:46 +01:00
* `` "scraps,stories" ``
* `` ["scraps", "stories"] ``
2020-09-26 13:33:46 +02:00
Description
A (comma-separated) list of subcategories to include
when processing a user profile.
2020-09-24 00:48:08 +02:00
2020-09-26 13:33:46 +02:00
Possible values are
`` "pictures" `` , `` "scraps" `` , `` "stories" `` , `` "favorite" `` .
2020-09-24 00:48:08 +02:00
2022-12-20 17:30:46 +01:00
It is possible to use `` "all" `` instead of listing all values separately.
2020-09-24 00:48:08 +02:00
2022-02-03 22:51:10 +01:00
extractor.hitomi.format
-----------------------
Type
`` string ``
Default
`` "webp" ``
Description
Selects which image format to download.
Available formats are `` "webp" `` and `` "avif" `` .
`` "original" `` will try to download the original `` jpg `` or `` png `` versions,
but is most likely going to fail with `` 403 Forbidden `` errors.
2023-05-20 16:21:11 +02:00
extractor.imagechest.access-token
---------------------------------
Type
`` string ``
Description
Your personal Image Chest access token.
These tokens allow using the API instead of having to scrape HTML pages,
providing more detailed metadata
(`` date `` , `` description `` , etc.).
See https://imgchest.com/docs/api/1.0/general/authorization
for instructions on how to generate such a token.
2023-04-21 14:22:26 +02:00
extractor.imgur.client-id
-------------------------
Type
`` string ``
Description
Custom Client ID value for API requests.
2017-06-18 22:16:26 +02:00
extractor.imgur.mp4
-------------------
2020-09-26 13:33:46 +02:00
Type
2022-12-20 17:30:46 +01:00
* `` bool ``
* `` string ``
2020-09-26 13:33:46 +02:00
Default
`` true ``
Description
Controls whether to choose the GIF or MP4 version of an animation.
2017-06-18 22:16:26 +02:00
2020-09-26 13:33:46 +02:00
* `` true `` : Follow Imgur's advice and choose MP4 if the
`` prefer_video `` flag in an image's metadata is set.
* `` false `` : Always choose GIF.
* `` "always" `` : Always choose MP4.
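The three settings amount to a small decision table, sketched here in Python. `` choose_format `` is a hypothetical function; `` prefer_video `` stands for the flag from an image's metadata mentioned above.

```python
def choose_format(mp4, prefer_video):
    """Pick "mp4" or "gif" for an animation based on the mp4 option."""
    if mp4 == "always":
        return "mp4"
    if mp4 and prefer_video:
        # true: follow Imgur's prefer_video hint
        return "mp4"
    # false, or true without the prefer_video hint
    return "gif"


print(choose_format(True, prefer_video=False))      # gif
print(choose_format("always", prefer_video=False))  # mp4
```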
2017-06-20 16:20:28 +02:00
2020-07-24 17:50:32 +02:00
extractor.inkbunny.orderby
--------------------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
Default
`` "create_datetime" ``
Description
Value of the `` orderby `` parameter for submission searches.
2020-07-24 17:50:32 +02:00
2020-09-26 13:33:46 +02:00
(See `API#Search <https://wiki.inkbunny.net/wiki/API#Search> `__
for details)
2020-07-24 17:50:32 +02:00
2022-09-26 22:05:29 +02:00
extractor.instagram.api
-----------------------
Type
`` string ``
Default
2022-11-17 17:15:38 +01:00
`` "rest" ``
2022-09-26 22:05:29 +02:00
Description
Selects which API endpoints to use.
2022-11-17 17:15:38 +01:00
* `` "rest" `` : REST API - higher-resolution media
* `` "graphql" `` : GraphQL API - lower-resolution media
2022-09-26 22:05:29 +02:00
2020-12-20 23:20:32 +01:00
extractor.instagram.include
---------------------------
2020-09-26 13:33:46 +02:00
Type
2022-12-20 17:30:46 +01:00
* `` string ``
* `` list `` of `` strings ``
2020-09-26 13:33:46 +02:00
Default
2020-12-20 23:20:32 +01:00
`` "posts" ``
Example
2022-12-20 17:30:46 +01:00
* `` "stories,highlights,posts" ``
* `` ["stories", "highlights", "posts"] ``
2020-09-26 13:33:46 +02:00
Description
2020-12-20 23:20:32 +01:00
A (comma-separated) list of subcategories to include
when processing a user profile.
Possible values are
`` "posts" `` ,
`` "reels" `` ,
`` "tagged" `` ,
`` "stories" `` ,
`` "highlights" `` ,
`` "avatar" `` .
2020-12-20 23:20:32 +01:00
2022-12-20 17:30:46 +01:00
It is possible to use `` "all" `` instead of listing all values separately.
2019-09-21 23:38:20 +02:00
2023-05-27 15:51:13 +02:00
extractor.instagram.metadata
----------------------------
Type
`` bool ``
Default
`` false ``
Description
Provide extended `` user `` metadata even when referring to a user by ID,
e.g. `` instagram.com/id:12345678 `` .
Note: This metadata is always available when referring to a user by name,
e.g. `` instagram.com/USERNAME `` .
2023-05-18 22:34:33 +02:00
extractor.instagram.order-files
-------------------------------
Type
`` string ``
Default
`` "asc" ``
Description
Controls the order in which files of each post are returned.
* `` "asc" `` : Same order as displayed in a post
* `` "desc" `` : Reverse of the order displayed in a post
* `` "reverse" `` : Same as `` "desc" ``
Note: This option does *not* affect `` {num} `` .
To enumerate files in reverse order, use `` count - num + 1 `` .
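The expression `` count - num + 1 `` maps the first file to the highest number and the last file to `` 1 `` . A minimal Python illustration, with `` reverse_index `` as a hypothetical helper name:

```python
def reverse_index(num, count):
    """Map a 1-based file number to its position counted from the end."""
    return count - num + 1


count = 4
print([reverse_index(num, count) for num in range(1, count + 1)])  # [4, 3, 2, 1]
```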
2023-05-18 22:50:04 +02:00
extractor.instagram.order-posts
-------------------------------
Type
`` string ``
Default
`` "asc" ``
Description
Controls the order in which posts are returned.
* `` "asc" `` : Same order as displayed
* `` "desc" `` : Reverse of the displayed order
* `` "id" `` or `` "id_asc" `` : Ascending order by ID
* `` "id_desc" `` : Descending order by ID
* `` "reverse" `` : Same as `` "desc" ``
Note: This option only affects `` highlights `` .
2022-03-19 15:22:13 +01:00
extractor.instagram.previews
----------------------------
Type
`` bool ``
Default
`` false ``
Description
Download video previews.
2019-12-19 17:15:41 +01:00
extractor.instagram.videos
--------------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` true ``
Description
Download video files.
2019-12-19 17:15:41 +01:00
2022-06-20 19:47:53 +02:00
extractor.itaku.videos
----------------------
Type
`` bool ``
Default
`` true ``
Description
Download video files.
2021-11-03 22:52:15 +01:00
extractor.kemonoparty.comments
------------------------------
2021-11-03 22:52:15 +01:00
Type
`` bool ``
Default
`` false ``
Description
Extract `` comments `` metadata.
2022-12-20 17:30:46 +01:00
Note: This requires 1 additional HTTP request per post.
2021-11-03 22:52:15 +01:00
2022-03-24 11:58:38 +01:00
extractor.kemonoparty.duplicates
--------------------------------
Type
`` bool ``
Default
`` false ``
Description
Controls how to handle duplicate files in a post.
* `` true `` : Download duplicates
* `` false `` : Ignore duplicates
2021-11-20 23:36:16 +01:00
extractor.kemonoparty.dms
-------------------------
Type
`` bool ``
Default
`` false ``
Description
Extract a user's direct messages as `` dms `` metadata.
2022-08-18 18:01:42 +02:00
extractor.kemonoparty.favorites
-------------------------------
2022-08-18 18:01:42 +02:00
Type
`` string ``
Default
`` "artist" ``
Description
Determines the type of favorites to be downloaded.
Available types are `` artist `` and `` post `` .
2021-11-17 19:59:24 +01:00
extractor.kemonoparty.files
---------------------------
Type
`` list `` of `` strings ``
Default
2021-11-29 02:15:44 +01:00
`` ["attachments", "file", "inline"] ``
2021-11-17 19:59:24 +01:00
Description
Determines the type and order of files to be downloaded.
Available types are `` file `` , `` attachments `` , and `` inline `` .
2021-11-17 19:59:24 +01:00
2021-07-09 18:19:02 +02:00
extractor.kemonoparty.max-posts
-------------------------------
Type
`` integer ``
Default
`` null ``
Description
Limit the number of posts to download.
2021-05-14 19:54:16 +02:00
extractor.kemonoparty.metadata
------------------------------
Type
`` bool ``
Default
`` false ``
Description
Extract `` username `` metadata.
2020-07-12 23:06:42 +02:00
extractor.khinsider.format
--------------------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
Default
`` "mp3" ``
Description
The name of the preferred file format to download.
2020-07-12 23:06:42 +02:00
2020-09-26 13:33:46 +02:00
Use `` "all" `` to download all available formats,
or a (comma-separated) list to select multiple formats.
2020-07-12 23:06:42 +02:00
2020-09-26 13:33:46 +02:00
If the selected format is not available,
the first in the list gets chosen (usually `mp3` ).
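For illustration, a comma-separated value could look like the fragment below. It assumes that the album in question offers files under the format name `` flac `` ; that name is an assumption and not taken from this document.

.. code:: json

    {
        "extractor": {
            "khinsider": {
                "format": "flac,mp3"
            }
        }
    }

If neither format is available, the fallback described above would apply.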
2020-07-12 23:06:42 +02:00
2022-05-10 12:17:59 +02:00
extractor.lolisafe.domain
-------------------------
Type
`` string ``
Default
2022-12-11 17:34:34 +01:00
`` null ``
2022-05-10 12:17:59 +02:00
Description
Specifies the domain used by a `` lolisafe `` extractor
regardless of input URL.
Setting this option to `` "auto" ``
uses the same domain as a given input URL.
2021-08-12 15:12:42 +02:00
extractor.luscious.gif
----------------------
2021-08-12 15:12:42 +02:00
Type
`` bool ``
Default
`` false ``
Description
Format in which to download animated images.
Use `` true `` to download animated images as GIFs and `` false ``
to download as MP4 videos.
2021-02-28 01:31:50 +01:00
extractor.mangadex.api-server
-----------------------------
Type
`` string ``
Default
`` "https://api.mangadex.org" ``
Description
The server to use for API requests.
2021-10-05 19:46:48 +02:00
extractor.mangadex.api-parameters
---------------------------------
Type
2023-01-06 13:07:33 +01:00
`` object `` (`name` -> `value` )
2021-10-05 19:46:48 +02:00
Example
`` {"order[updatedAt]": "desc"} ``
Description
Additional query parameters to send when fetching manga chapters.
(See `/manga/{id}/feed <https://api.mangadex.org/docs/swagger.html#/Manga/get-manga-id-feed> `__
and `/user/follows/manga/feed <https://api.mangadex.org/docs/swagger.html#/Feed/get-user-follows-manga-feed> `__ )
2021-10-05 19:46:48 +02:00
2021-06-20 22:41:41 +02:00
extractor.mangadex.lang
-----------------------
Type
2023-07-26 17:14:12 +02:00
* `` string ``
* `` list `` of `` strings ``
2021-06-20 22:41:41 +02:00
Example
2023-07-26 17:14:12 +02:00
* `` "en" ``
* `` "fr,it" ``
* `` ["fr", "it"] ``
2021-06-20 22:41:41 +02:00
Description
`ISO 639-1 <https://en.wikipedia.org/wiki/ISO_639-1> `__ language codes
to filter chapters by.
2021-10-05 19:46:48 +02:00
extractor.mangadex.ratings
--------------------------
Type
`` list `` of `` strings ``
Default
`` ["safe", "suggestive", "erotica", "pornographic"] ``
Description
List of acceptable content ratings for returned chapters.
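The `` mangadex `` options above can be combined in one category block. The sketch below reuses only values that appear in the examples and defaults of this section; the particular combination is hypothetical.

.. code:: json

    {
        "extractor": {
            "mangadex": {
                "lang": ["fr", "it"],
                "ratings": ["safe", "suggestive"],
                "api-parameters": {"order[updatedAt]": "desc"}
            }
        }
    }

With these settings, only French and Italian chapters rated `` safe `` or `` suggestive `` should be returned, with the feed requested in descending `` updatedAt `` order.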
2023-07-02 15:07:22 +02:00
extractor.mangapark.source
--------------------------
Type
* `` string ``
* `` integer ``
Example
* `` "koala:en" ``
* `` 15150116 ``
Description
Select chapter source and language for a manga.
| The general syntax is `` "<source name>:<ISO 639-1 language code>" `` .
| Both are optional, meaning `` "koala" `` , `` "koala:" `` , `` ":en" `` ,
or even just `` ":" `` are possible as well.
Specifying the numeric `` ID `` of a source is also supported.
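As a small example, the string form from above would be placed in a configuration file roughly like this. The source name `` koala `` is taken from the example values and may not exist for every manga.

.. code:: json

    {
        "extractor": {
            "mangapark": {
                "source": "koala:en"
            }
        }
    }

To select a source by its numeric ID instead, the string would be replaced by a bare integer such as `` 15150116 `` .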
2023-01-06 06:20:41 +01:00
extractor.[mastodon].access-token
---------------------------------
Type
`` string ``
Default
`` null ``
Description
The `` access-token `` value you get from `linking your account to
gallery-dl <OAuth_>`__.
Note: gallery-dl comes with built-in tokens for `` mastodon.social `` ,
`` pawoo `` and `` baraag `` . For other instances, you need to obtain an
`` access-token `` in order to use usernames in place of numerical
user IDs.
2023-01-06 06:20:41 +01:00
extractor.[mastodon].reblogs
----------------------------
2021-07-06 23:13:58 +02:00
Type
`` bool ``
Default
`` false ``
Description
2021-07-07 00:56:42 +02:00
Fetch media from reblogged posts.
2023-01-06 06:20:41 +01:00
extractor.[mastodon].replies
----------------------------
2021-07-07 00:56:42 +02:00
Type
`` bool ``
Default
`` true ``
Description
Fetch media from replies to other posts.
2021-07-06 23:13:58 +02:00
2023-01-06 06:20:41 +01:00
extractor.[mastodon].text-posts
-------------------------------
2021-07-02 22:12:41 +02:00
Type
`` bool ``
Default
`` false ``
Description
Also emit metadata for text-only posts without media content.
2023-05-23 22:15:20 +02:00
extractor.[misskey].access-token
--------------------------------
Type
`` string ``
Description
Your access token, necessary to fetch favorited notes.
2023-03-02 15:26:19 +01:00
extractor.[misskey].renotes
---------------------------
2023-03-02 15:26:19 +01:00
Type
`` bool ``
Default
`` false ``
Description
Fetch media from renoted notes.
extractor.[misskey].replies
---------------------------
2023-03-02 15:26:19 +01:00
Type
`` bool ``
Default
`` true ``
Description
Fetch media from replies to other notes.
2023-10-12 21:32:41 +02:00
extractor.[moebooru].pool.metadata
----------------------------------
Type
`` bool ``
Default
`` false ``
Description
Extract extended `` pool `` metadata.
Note: Not supported by all `` moebooru `` instances.
2021-01-19 17:43:55 +01:00
extractor.newgrounds.flash
--------------------------
Type
`` bool ``
Default
`` true ``
Description
Download original Adobe Flash animations instead of pre-rendered videos.
2021-07-29 19:11:20 +02:00
extractor.newgrounds.format
---------------------------
Type
`` string ``
Default
`` "original" ``
Example
`` "720p" ``
Description
Selects the preferred format for video downloads.
If the selected format is not available,
the next smaller one gets chosen.
2019-11-22 23:20:21 +01:00
extractor.newgrounds.include
----------------------------
2020-09-26 13:33:46 +02:00
Type
2022-12-20 17:30:46 +01:00
* `` string ``
* `` list `` of `` strings ``
2020-09-26 13:33:46 +02:00
Default
`` "art" ``
Example
2022-12-20 17:30:46 +01:00
* `` "movies,audio" ``
* `` ["movies", "audio"] ``
2020-09-26 13:33:46 +02:00
Description
A (comma-separated) list of subcategories to include
when processing a user profile.
2019-11-22 23:20:21 +01:00
2020-09-26 13:33:46 +02:00
Possible values are
2022-09-24 12:34:37 +02:00
`` "art" `` , `` "audio" `` , `` "games" `` , `` "movies" `` .
2019-11-22 23:20:21 +01:00
2022-12-20 17:30:46 +01:00
It is possible to use `` "all" `` instead of listing all values separately.
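A possible `` newgrounds `` block, shown only as a sketch: it selects two subcategories using the list form, prefers 720p video, and skips original Flash files. All values come from the option descriptions above; whether this combination suits a given profile is an assumption.

.. code:: json

    {
        "extractor": {
            "newgrounds": {
                "include": ["movies", "audio"],
                "format": "720p",
                "flash": false
            }
        }
    }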
2019-11-22 23:20:21 +01:00
2020-09-24 23:26:33 +02:00
extractor.nijie.include
-----------------------
2020-09-26 13:33:46 +02:00
Type
2022-12-20 17:30:46 +01:00
* `` string ``
* `` list `` of `` strings ``
2020-09-26 13:33:46 +02:00
Default
`` "illustration,doujin" ``
Description
A (comma-separated) list of subcategories to include
when processing a user profile.
2020-09-24 23:26:33 +02:00
2020-09-26 13:33:46 +02:00
Possible values are
2022-05-01 17:45:38 +02:00
`` "illustration" `` , `` "doujin" `` , `` "favorite" `` , `` "nuita" `` .
2020-09-24 23:26:33 +02:00
2022-12-20 17:30:46 +01:00
It is possible to use `` "all" `` instead of listing all values separately.
2020-09-24 23:26:33 +02:00
2022-11-26 11:23:03 +01:00
extractor.nitter.quoted
-----------------------
Type
`` bool ``
Default
`` false ``
Description
Fetch media from quoted Tweets.
2022-11-25 19:53:28 +01:00
extractor.nitter.retweets
-------------------------
Type
`` bool ``
Default
`` false ``
Description
Fetch media from Retweets.
2022-11-24 22:56:01 +01:00
extractor.nitter.videos
-----------------------
Type
2022-12-20 17:30:46 +01:00
* `` bool ``
* `` string ``
2022-11-24 22:56:01 +01:00
Default
`` true ``
Description
Control video download behavior.
* `` true `` : Download videos
* `` "ytdl" `` : Download videos using `youtube-dl`_
* `` false `` : Skip video Tweets
2017-06-20 16:20:28 +02:00
extractor.oauth.browser
-----------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` true ``
Description
Controls how a user is directed to an OAuth authorization page.
2017-06-20 16:20:28 +02:00
2020-09-26 13:33:46 +02:00
* `` true `` : Use Python's |webbrowser.open()|_ method to automatically
open the URL in the user's default browser.
* `` false `` : Ask the user to copy & paste a URL from the terminal.
2017-06-18 22:16:26 +02:00
2020-05-25 22:19:58 +02:00
extractor.oauth.cache
---------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` true ``
Description
Store tokens received during OAuth authorizations
in `cache <cache.file_> `__ .
2020-05-25 22:19:58 +02:00
2022-08-14 17:08:01 +02:00
extractor.oauth.host
--------------------
Type
`` string ``
Default
`` "localhost" ``
Description
Host name / IP address to bind to during OAuth authorization.
2020-02-09 13:45:44 +01:00
extractor.oauth.port
--------------------
2020-09-26 13:33:46 +02:00
Type
`` integer ``
Default
`` 6414 ``
Description
Port number to listen on during OAuth authorization.
2020-02-09 13:45:44 +01:00
2022-12-20 17:30:46 +01:00
Note: All redirects will go to port `` 6414 `` , regardless
of the port specified here. You'll have to manually adjust the
port number in your browser's address bar when using a different
port than the default.
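On a machine without a graphical browser, the `` oauth `` options might be set as in the following sketch. It keeps the default host and port, so the redirect caveat above does not come into play; the scenario itself is an assumption.

.. code:: json

    {
        "extractor": {
            "oauth": {
                "browser": false,
                "cache": true,
                "host": "localhost",
                "port": 6414
            }
        }
    }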
2020-02-09 13:45:44 +01:00
2022-06-04 16:05:49 +02:00
extractor.paheal.metadata
-------------------------
Type
`` bool ``
Default
`` false ``
Description
Extract additional metadata (`` source `` , `` uploader `` ).
Note: This requires 1 additional HTTP request per post.
2021-10-17 04:14:58 +02:00
extractor.patreon.files
-----------------------
Type
`` list `` of `` strings ``
Default
2022-03-06 17:07:13 +01:00
`` ["images", "image_large", "attachments", "postfile", "content"] ``
2021-10-17 04:14:58 +02:00
Description
Determines the type and order of files to be downloaded.
Available types are
2022-03-06 17:07:13 +01:00
`` postfile `` , `` images `` , `` image_large `` , `` attachments `` , and `` content `` .
2021-10-17 04:14:58 +02:00
2019-01-21 19:55:05 +01:00
extractor.photobucket.subalbums
-------------------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` true ``
Description
Download subalbums.
2019-01-21 19:55:05 +01:00
2021-05-17 01:38:00 +02:00
extractor.pillowfort.external
-----------------------------
Type
`` bool ``
Default
`` false ``
Description
Follow links to external sites, e.g. Twitter.
2021-05-17 02:57:02 +02:00
extractor.pillowfort.inline
---------------------------
Type
`` bool ``
Default
`` true ``
Description
Extract inline images.
2021-01-25 00:38:19 +01:00
extractor.pillowfort.reblogs
----------------------------
Type
`` bool ``
Default
`` false ``
Description
Extract media from reblogged posts.
2023-01-04 16:40:03 +01:00
extractor.pinterest.domain
--------------------------
Type
`` string ``
Default
`` "auto" ``
Description
2023-01-06 13:07:33 +01:00
Specifies the domain used by `` pinterest `` extractors.
2023-01-04 16:40:03 +01:00
Setting this option to `` "auto" ``
uses the same domain as a given input URL.
2020-06-16 14:41:05 +02:00
extractor.pinterest.sections
----------------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` true ``
Description
Include pins from board sections.
2020-06-16 14:41:05 +02:00
2020-12-21 16:09:06 +01:00
extractor.pinterest.videos
--------------------------
Type
`` bool ``
Default
`` true ``
Description
Download from video pins.
2022-05-01 21:12:23 +02:00
extractor.pixiv.include
-----------------------
2020-09-26 13:33:46 +02:00
Type
2022-05-01 21:12:23 +02:00
* `` string ``
* `` list `` of `` strings ``
2020-09-26 13:33:46 +02:00
Default
2022-05-01 21:12:23 +02:00
`` "artworks" ``
Example
* `` "avatar,background,artworks" ``
* `` ["avatar", "background", "artworks"] ``
2020-09-26 13:33:46 +02:00
Description
2022-05-01 21:12:23 +02:00
A (comma-separated) list of subcategories to include
when processing a user profile.
2020-03-09 21:17:16 +01:00
2022-05-01 21:12:23 +02:00
Possible values are
2023-06-19 15:01:53 +02:00
`` "artworks" `` ,
`` "avatar" `` ,
`` "background" `` ,
`` "favorite" `` ,
`` "novel-user" `` ,
`` "novel-bookmark" `` .
2020-03-09 21:17:16 +01:00
2022-05-01 21:12:23 +02:00
It is possible to use `` "all" `` instead of listing all values separately.
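The list form of the example above would sit in a configuration file roughly as follows. This is a sketch; the selection of subcategories is arbitrary.

.. code:: json

    {
        "extractor": {
            "pixiv": {
                "include": ["avatar", "background", "artworks"]
            }
        }
    }

The equivalent string form would be `` "avatar,background,artworks" `` .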
2022-04-21 13:53:02 +02:00
2023-01-07 23:12:36 +01:00
extractor.pixiv.refresh-token
-----------------------------
Type
`` string ``
Description
The `` refresh-token `` value you get
from running `` gallery-dl oauth:pixiv `` (see OAuth_) or
by using a third-party tool like
`gppt <https://github.com/eggplants/get-pixivpy-token> `__ .
2023-05-23 12:14:06 +02:00
extractor.pixiv.embeds
----------------------
Type
`` bool ``
Default
`` false ``
Description
Download images embedded in novels.
2023-06-01 13:07:20 +02:00
extractor.pixiv.novel.full-series
---------------------------------
Type
`` bool ``
Default
`` false ``
Description
When downloading a novel that is part of a series,
download all novels of that series.
2022-10-16 15:32:31 +02:00
extractor.pixiv.metadata
------------------------
2021-05-14 20:30:28 +02:00
Type
`` bool ``
Default
`` false ``
Description
Fetch extended `` user `` metadata.
2023-01-07 23:12:36 +01:00
extractor.pixiv.metadata-bookmark
---------------------------------
Type
`` bool ``
Default
`` false ``
Description
For works bookmarked by
`your own account <extractor.pixiv.refresh-token_> `__ ,
fetch bookmark tags as `` tags_bookmark `` metadata.
Note: This requires 1 additional API call per bookmarked post.
2021-01-17 16:37:07 +01:00
extractor.pixiv.work.related
----------------------------
Type
`` bool ``
Default
`` false ``
Description
Also download related artworks.
2021-04-27 23:51:37 +02:00
extractor.pixiv.tags
--------------------
2021-03-05 17:18:51 +01:00
Type
2021-04-27 23:51:37 +02:00
`` string ``
2021-03-05 17:18:51 +01:00
Default
2021-04-27 23:51:37 +02:00
`` "japanese" ``
2021-03-05 17:18:51 +01:00
Description
2021-05-07 20:41:54 +02:00
Controls the `` tags `` metadata field.
2021-04-27 23:51:37 +02:00
2021-05-07 20:41:54 +02:00
* `"japanese"` : List of Japanese tags
2021-04-27 23:51:37 +02:00
* `"translated"` : List of translated tags
2021-05-07 20:41:54 +02:00
* `"original"` : Unmodified list with both Japanese and translated tags
2021-03-05 17:18:51 +01:00
2017-06-18 22:16:26 +02:00
extractor.pixiv.ugoira
----------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` true ``
Description
Download Pixiv's Ugoira animations or ignore them.
2017-06-18 22:16:26 +02:00
2020-09-26 13:33:46 +02:00
These animations come as a `` .zip `` file containing all
animation frames in JPEG format.
2020-02-14 01:22:16 +01:00
2020-09-26 13:33:46 +02:00
Use a `ugoira` post processor to convert them
to watchable videos. (Example__)
2017-06-18 22:16:26 +02:00
2020-06-12 18:25:17 +02:00
.. __: https://github.com/mikf/gallery-dl/blob/v1.12.3/docs/gallery-dl-example.conf#L9-L14
2020-02-14 01:22:16 +01:00
2017-06-18 22:16:26 +02:00
2021-05-24 17:49:46 +02:00
extractor.pixiv.max-posts
-------------------------
Type
`` integer ``
Default
`` 0 ``
Description
When downloading galleries, this sets the maximum number of posts to get.
A value of `` 0 `` means no limit.
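Several of the `` pixiv `` options above could be combined as in this sketch. The limit of 100 posts is an arbitrary illustrative number, not a recommended value.

.. code:: json

    {
        "extractor": {
            "pixiv": {
                "tags": "translated",
                "ugoira": true,
                "max-posts": 100
            }
        }
    }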
2019-04-14 21:52:23 +02:00
extractor.plurk.comments
------------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` false ``
Description
Also search Plurk comments for URLs.
2019-04-14 21:52:23 +02:00
2021-08-12 15:12:42 +02:00
extractor.reactor.gif
---------------------
2021-08-12 15:12:42 +02:00
Type
`` bool ``
Default
`` false ``
Description
Format in which to download animated images.
Use `` true `` to download animated images as GIFs and `` false ``
to download as MP4 videos.
2019-05-27 22:24:48 +02:00
extractor.readcomiconline.captcha
---------------------------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
Default
`` "stop" ``
Description
Controls how to handle redirects to CAPTCHA pages.
2019-05-27 22:24:48 +02:00
2020-09-26 13:33:46 +02:00
* `` "stop" `` : Stop the current extractor run.
* `` "wait" `` : Ask the user to solve the CAPTCHA and wait.
2019-05-27 22:24:48 +02:00
2022-04-15 18:10:37 +02:00
extractor.readcomiconline.quality
---------------------------------
Type
`` string ``
Default
`` "auto" ``
Description
Sets the `` quality `` query parameter of issue pages. (`` "lq" `` or `` "hq" `` )
`` "auto" `` uses the quality parameter of the input URL
or `` "hq" `` if not present.
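Both `` readcomiconline `` options take plain strings. A fragment that waits for the user on CAPTCHA redirects and always requests high-quality pages could look like this (an illustrative combination only):

.. code:: json

    {
        "extractor": {
            "readcomiconline": {
                "captcha": "wait",
                "quality": "hq"
            }
        }
    }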
2017-06-18 22:16:26 +02:00
extractor.reddit.comments
-------------------------
2020-09-26 13:33:46 +02:00
Type
`` integer ``
Default
`` 0 ``
Description
The value of the `` limit `` parameter when loading
a submission and its comments.
This number (roughly) specifies the total number of comments
retrieved with the first API call.
2017-06-18 22:16:26 +02:00
2020-09-26 13:33:46 +02:00
Reddit's internal default and maximum values for this parameter
appear to be 200 and 500 respectively.
2017-06-18 22:16:26 +02:00
2020-09-26 13:33:46 +02:00
The value `` 0 `` ignores all comments and significantly reduces the
time required when scanning a subreddit.
2017-06-18 22:16:26 +02:00
2017-06-23 16:14:51 +02:00
extractor.reddit.morecomments
-----------------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` false ``
Description
Retrieve additional comments by resolving the `` more `` comment
stubs in the base comment tree.
2017-06-23 16:14:51 +02:00
2023-01-07 23:12:36 +01:00
Note: This requires 1 additional API call for every 100 extra comments.
2017-06-23 16:14:51 +02:00
2017-10-12 23:37:28 +02:00
extractor.reddit.date-min & .date-max
-------------------------------------
2020-09-26 13:33:46 +02:00
Type
|Date|_
Default
`` 0 `` and `` 253402210800 `` (timestamp of |datetime.max|_)
Description
Ignore all submissions posted before/after this date.
2017-07-04 19:34:34 +02:00
2017-10-12 23:37:28 +02:00
extractor.reddit.id-min & .id-max
---------------------------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
Example
`` "6kmzv2" ``
Description
Ignore all submissions posted before/after the submission with this ID.
2017-06-18 22:16:26 +02:00
2023-09-22 18:10:44 +02:00
extractor.reddit.previews
-------------------------
Type
`` bool ``
Default
`` true ``
Description
For failed downloads from external URLs / child extractors,
download Reddit's preview image/video if available.
2017-06-18 22:16:26 +02:00
extractor.reddit.recursion
--------------------------
2020-09-26 13:33:46 +02:00
Type
`` integer ``
Default
`` 0 ``
Description
Reddit extractors can recursively visit other submissions
linked to in the initial set of submissions.
This value sets the maximum recursion depth.
2017-06-18 22:16:26 +02:00
2020-09-26 13:33:46 +02:00
Special values:
2021-03-20 01:31:12 +01:00
2020-09-26 13:33:46 +02:00
* `` 0 `` : Recursion is disabled
* `` -1 `` : Infinite recursion (don't do this)
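To show how the `` reddit `` options above fit together, here is a hedged sketch. It assumes that a |Date|_ value may be given as a Unix timestamp, as the defaults of `` date-min `` and `` date-max `` suggest; `` 1577836800 `` corresponds to 2020-01-01 00:00:00 UTC. The other numbers are arbitrary.

.. code:: json

    {
        "extractor": {
            "reddit": {
                "comments": 200,
                "morecomments": false,
                "date-min": 1577836800,
                "recursion": 1
            }
        }
    }

With these values, comments are loaded with a limit of 200, `` more `` stubs are left unresolved, older submissions are ignored, and linked submissions are followed one level deep.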
2017-06-18 22:16:26 +02:00
extractor.reddit.refresh-token
------------------------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
Default
`` null ``
Description
The `` refresh-token `` value you get from
`linking your Reddit account to gallery-dl <OAuth_> `__ .
2017-06-23 16:14:51 +02:00
2020-09-26 13:33:46 +02:00
Using a `` refresh-token `` allows you to access private or otherwise
not publicly available subreddits, given that your account is
authorized to do so,
but requests to the Reddit API are going to be rate limited
at 600 requests every 10 minutes/600 seconds.
2017-06-18 22:16:26 +02:00
2020-01-31 23:45:02 +01:00
extractor.reddit.videos
-----------------------
2020-09-26 13:33:46 +02:00
Type
2022-12-20 17:30:46 +01:00
* `` bool ``
* `` string ``
2020-09-26 13:33:46 +02:00
Default
`` true ``
Description
Control video download behavior.
2020-01-31 23:45:02 +01:00
2020-09-26 13:33:46 +02:00
* `` true `` : Download videos and use `youtube-dl`_ to handle
HLS and DASH manifests
* `` "ytdl" `` : Download videos and let `youtube-dl`_ handle all of
video extraction and download
2023-03-06 12:18:25 +01:00
* `` "dash" `` : Extract DASH manifest URLs and use `youtube-dl`_
to download and merge them. (*)
2020-09-26 13:33:46 +02:00
* `` false `` : Ignore videos
2020-01-31 23:45:02 +01:00
2023-03-06 12:18:25 +01:00
(*)
This saves 1 HTTP request per video
and may be able to download otherwise deleted videos,
but it will not always get the best video quality available.
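Because this option accepts either a boolean or a string, the value is written with or without quotes accordingly. A sketch that hands video handling over to `youtube-dl`_ entirely:

.. code:: json

    {
        "extractor": {
            "reddit": {
                "videos": "ytdl"
            }
        }
    }

Replacing `` "ytdl" `` with the unquoted `` false `` would skip videos instead.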
2020-01-31 23:45:02 +01:00
2020-06-12 18:25:17 +02:00
extractor.redgifs.format
------------------------
2020-09-26 13:33:46 +02:00
Type
2021-10-22 22:47:29 +02:00
* `` string ``
2022-12-20 17:30:46 +01:00
* `` list `` of `` strings ``
2020-09-26 13:33:46 +02:00
Default
2021-11-04 21:31:20 +01:00
`` ["hd", "sd", "gif"] ``
2020-09-26 13:33:46 +02:00
Description
2021-10-22 22:47:29 +02:00
List of names of the preferred animation formats, which can be
`` "hd" `` ,
`` "sd" `` ,
`` "gif" `` ,
`` "thumbnail" `` ,
`` "vthumbnail" `` , or
`` "poster" `` .
2020-06-12 18:25:17 +02:00
2021-10-22 22:47:29 +02:00
If a selected format is not available, the next one in the list will be
tried until an available format is found.
If the format is given as `` string `` , it will be extended with
`` ["hd", "sd", "gif"] `` . Use a list with one element to
restrict it to only one possible format.
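The difference between the string and list forms is easy to miss, so here is a sketch of the restrictive variant. Given as a one-element list, `` sd `` should be the only format tried, with no fallback to the default list.

.. code:: json

    {
        "extractor": {
            "redgifs": {
                "format": ["sd"]
            }
        }
    }

Writing `` "format": "sd" `` instead would, per the description above, still allow the default formats to be tried.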
2020-06-12 18:25:17 +02:00
2022-09-30 19:55:48 +02:00
extractor.sankaku.refresh
-------------------------
Type
`` bool ``
Default
`` false ``
Description
Refresh download URLs before they expire.
2020-10-30 00:53:11 +01:00
extractor.sankakucomplex.embeds
-------------------------------
Type
`` bool ``
Default
`` false ``
Description
Download video embeds from external sites.
extractor.sankakucomplex.videos
-------------------------------
Type
`` bool ``
Default
`` true ``
Description
Download videos.
2022-07-29 16:32:00 +02:00
extractor.skeb.article
----------------------
Type
`` bool ``
Default
`` false ``
Description
Download article images.
2022-02-28 22:42:15 +01:00
extractor.skeb.sent-requests
----------------------------
2022-02-28 22:42:15 +01:00
Type
`` bool ``
Default
`` false ``
Description
Download sent requests.
2021-11-23 21:16:42 +01:00
extractor.skeb.thumbnails
-------------------------
Type
`` bool ``
Default
`` false ``
Description
Download thumbnails.
2022-09-26 15:17:28 +02:00
extractor.skeb.search.filters
-----------------------------
Type
2022-12-20 17:30:46 +01:00
* `` string ``
* `` list `` of `` strings ``
2022-09-26 15:17:28 +02:00
Default
`` ["genre:art", "genre:voice", "genre:novel", "genre:video", "genre:music", "genre:correction"] ``
Example
`` "genre:music OR genre:voice" ``
Description
Filters used during searches.
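This option lives on the subcategory level, so it is nested one step deeper than a category-level option. A sketch using the example value from above, assuming the usual `` extractor.<category>.<subcategory>.<option-name> `` layout:

.. code:: json

    {
        "extractor": {
            "skeb": {
                "search": {
                    "filters": "genre:music OR genre:voice"
                }
            }
        }
    }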
2019-03-10 15:20:35 +01:00
extractor.smugmug.videos
------------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` true ``
Description
Download video files.
2019-03-10 15:20:35 +01:00
2023-03-01 18:20:37 +01:00
extractor.[szurubooru].username & .token
----------------------------------------
Type
`` string ``
Description
Username and login token of your account to access private resources.
To generate a token, visit `` /user/USERNAME/list-tokens ``
and click `` Create Token `` .
2018-12-26 14:29:30 +01:00
extractor.tumblr.avatar
-----------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` false ``
Description
Download blog avatars.
2018-12-26 14:29:30 +01:00
2019-07-16 23:08:27 +02:00
extractor.tumblr.date-min & .date-max
-------------------------------------
2020-09-26 13:33:46 +02:00
Type
|Date|_
Default
`` 0 `` and `` null ``
Description
Ignore all posts published before/after this date.
2019-07-16 23:08:27 +02:00
2017-11-22 23:09:08 +01:00
extractor.tumblr.external
-------------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` false ``
Description
Follow external URLs (e.g. from "Link" posts) and try to extract
images from them.
2017-11-22 23:09:08 +01:00
extractor.tumblr.inline
-----------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` true ``
Description
Search posts for inline images and videos.
2017-11-22 23:09:08 +01:00
2022-10-11 10:54:23 +02:00
extractor.tumblr.offset
-----------------------
Type
`` integer ``
Default
`` 0 ``
Description
Custom `` offset `` starting value when paginating over blog posts.
Allows skipping over posts without having to waste API calls.
2022-08-10 20:01:46 +02:00
extractor.tumblr.original
-------------------------
Type
`` bool ``
Default
`` true ``
Description
Download full-resolution `` photo `` and `` inline `` images.
For each photo with "maximum" resolution
(width equal to 2048 or height equal to 3072)
or each inline image,
use an extra HTTP request to find the URL to its full-resolution version.
2022-09-16 22:34:07 +02:00
extractor.tumblr.ratelimit
--------------------------
Type
`` string ``
Default
`` "abort" ``
Description
Selects how to handle exceeding the daily API rate limit.
* `` "abort" `` : Raise an error and stop extraction
* `` "wait" `` : Wait until rate limit reset
2018-01-05 13:00:25 +01:00
extractor.tumblr.reblogs
------------------------
2020-09-26 13:33:46 +02:00
Type
2022-12-20 17:30:46 +01:00
* `` bool ``
* `` string ``
2020-09-26 13:33:46 +02:00
Default
`` true ``
Description
* `` true `` : Extract media from reblogged posts
* `` false `` : Skip reblogged posts
* `` "same-blog" `` : Skip reblogged posts unless the original post
is from the same blog
2018-01-05 13:00:25 +01:00
2017-11-22 23:09:08 +01:00
extractor.tumblr.posts
----------------------
2020-09-26 13:33:46 +02:00
Type
2022-12-20 17:30:46 +01:00
* `` string ``
* `` list `` of `` strings ``
2020-09-26 13:33:46 +02:00
Default
`` "all" ``
Example
2022-12-20 17:30:46 +01:00
* `` "video,audio,link" ``
* `` ["video", "audio", "link"] ``
2020-09-26 13:33:46 +02:00
Description
A (comma-separated) list of post types to extract images, etc. from.
2017-11-22 23:09:08 +01:00
2020-09-26 13:33:46 +02:00
Possible types are `` text `` , `` quote `` , `` link `` , `` answer `` ,
`` video `` , `` audio `` , `` photo `` , `` chat `` .
2017-11-22 23:09:08 +01:00
2022-12-20 17:30:46 +01:00
It is possible to use `` "all" `` instead of listing all types separately.
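One way the `` tumblr `` options described so far might be combined is sketched below. It limits extraction to photo and video posts, keeps reblogs only when the original post is from the same blog, and waits when the daily rate limit is reached. The selection is illustrative.

.. code:: json

    {
        "extractor": {
            "tumblr": {
                "posts": ["photo", "video"],
                "reblogs": "same-blog",
                "ratelimit": "wait"
            }
        }
    }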
2018-06-29 19:38:53 +02:00
2022-10-26 13:53:45 +02:00
extractor.tumblr.fallback-delay
-------------------------------
Type
`` float ``
Default
`` 120.0 ``
Description
Number of seconds to wait between retries
for fetching full-resolution images.
extractor.tumblr.fallback-retries
---------------------------------
Type
`` integer ``
Default
`` 2 ``
Description
Number of retries for fetching full-resolution images.
2022-02-18 00:40:22 +01:00
extractor.twibooru.api-key
--------------------------
Type
`` string ``
Default
`` null ``
Description
Your `Twibooru API Key <https://twibooru.org/users/edit> `__ ,
to use your account's browsing settings and filters.
extractor.twibooru.filter
-------------------------
Type
`` integer ``
Default
`` 2 `` (`Everything <https://twibooru.org/filters/2> `__ filter)
Description
The content filter ID to use.
Setting an explicit filter ID overrides any default filters and can be used
to access 18+ content without `API Key <extractor.twibooru.api-key_> `__ .
See `Filters <https://twibooru.org/filters> `__ for details.
2020-10-22 21:33:53 +02:00
extractor.twitter.cards
-----------------------
Type
2022-12-20 17:30:46 +01:00
* `` bool ``
* `` string ``
2020-10-22 21:33:53 +02:00
Default
2022-05-21 15:39:25 +02:00
`` false ``
2020-10-22 21:33:53 +02:00
Description
2022-01-15 22:02:57 +01:00
Controls how to handle `Twitter Cards <https://developer.twitter.com/en/docs/twitter-for-websites/cards/overview/abouts-cards> `__ .
* `` false `` : Ignore cards
* `` true `` : Download image content from supported cards
* `` "ytdl" `` : Additionally download video content from unsupported cards using `youtube-dl`_
2020-10-22 21:33:53 +02:00
2022-08-31 10:05:26 +02:00
extractor.twitter.cards-blacklist
---------------------------------
Type
`` list `` of `` strings ``
Example
2022-09-17 17:39:34 +02:00
`` ["summary", "youtube.com", "player:twitch.tv"] ``
2022-08-31 10:05:26 +02:00
Description
2022-09-17 17:39:34 +02:00
List of card types to ignore.
Possible values are
* card names
* card domains
* `` <card name>:<card domain> ``
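The two card options are typically set together. The following sketch enables card downloads including `youtube-dl`_ support and reuses the blacklist from the example above, which mixes a card name, a card domain, and a `` <card name>:<card domain> `` pair.

.. code:: json

    {
        "extractor": {
            "twitter": {
                "cards": "ytdl",
                "cards-blacklist": ["summary", "youtube.com", "player:twitch.tv"]
            }
        }
    }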
2022-08-31 10:05:26 +02:00
2021-02-26 13:50:46 +01:00
extractor.twitter.conversations
-------------------------------
Type
2023-06-24 20:49:00 +02:00
* `` bool ``
* `` string ``
2021-02-26 13:50:46 +01:00
Default
`` false ``
Description
2022-08-29 23:00:06 +02:00
For input URLs pointing to a single Tweet,
e.g. `https://twitter.com/i/web/status/<TweetID>` ,
fetch media from all Tweets and replies in this `conversation
<https://help.twitter.com/en/using-twitter/twitter-conversations>`__.
If this option is equal to `` "accessible" `` ,
only download from conversation Tweets
if the given initial Tweet is accessible.
2021-02-26 13:50:46 +01:00
2022-06-13 18:36:39 +02:00
extractor.twitter.csrf
----------------------
Type
`` string ``
Default
`` "cookies" ``
Description
Controls how to handle Cross Site Request Forgery (CSRF) tokens.
* `` "auto" `` : Always auto-generate a token.
* `` "cookies" `` : Use token given by the `` ct0 `` cookie if present.
2022-08-29 23:00:06 +02:00
extractor.twitter.expand
------------------------
Type
`` bool ``
Default
`` false ``
Description
For each Tweet, return *all* Tweets from that initial Tweet's
conversation or thread, i.e. *expand* all Twitter threads.
Going through a timeline with this option enabled is essentially the same
as running `` gallery-dl https://twitter.com/i/web/status/<TweetID> ``
with the `conversations <extractor.twitter.conversations_> `__ option enabled
for each Tweet in said timeline.
Note: This requires at least 1 additional API call per initial Tweet.
2022-12-30 06:20:55 +01:00
Age-restricted replies cannot be expanded when using the
`syndication <extractor.twitter.syndication_> `__ API.
2022-08-29 23:00:06 +02:00
2023-07-18 16:42:55 +02:00
extractor.twitter.include
-------------------------
Type
* `` string ``
* `` list `` of `` strings ``
Default
`` "timeline" ``
Example
* `` "avatar,background,media" ``
* `` ["avatar", "background", "media"] ``
Description
A (comma-separated) list of subcategories to include
when processing a user profile.
Possible values are
`` "avatar" `` ,
`` "background" `` ,
`` "timeline" `` ,
`` "tweets" `` ,
`` "media" `` ,
`` "replies" `` ,
`` "likes" `` .
It is possible to use `` "all" `` instead of listing all values separately.
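Written out in a configuration file, the list form of the example above would look roughly like this (a sketch; the choice of subcategories is arbitrary):

.. code:: json

    {
        "extractor": {
            "twitter": {
                "include": ["avatar", "background", "media"]
            }
        }
    }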
2023-02-02 22:01:36 +01:00
extractor.twitter.transform
---------------------------
Type
`` bool ``
Default
`` true ``
Description
Transform Tweet and User metadata into a simpler, uniform format.
2023-07-18 17:19:32 +02:00
extractor.twitter.tweet-endpoint
--------------------------------
Type
`` string ``
Default
`` "auto" ``
Description
Selects the API endpoint used to retrieve single Tweets.
* `` "restid" `` : `` /TweetResultByRestId `` - accessible to guest users
* `` "detail" `` : `` /TweetDetail `` - more stable
* `` "auto" `` : `` "detail" `` when logged in, `` "restid" `` otherwise
2021-10-05 18:58:10 +02:00
extractor.twitter.size
----------------------
Type
`` list `` of `` strings ``
Default
2021-12-15 23:17:07 +01:00
`` ["orig", "4096x4096", "large", "medium", "small"] ``
2021-10-05 18:58:10 +02:00
Description
The image version to download.
Any entries after the first one will be used for potential
`fallback <extractor.*.fallback_> `_ URLs.
Known available sizes are
`` 4096x4096 `` , `` orig `` , `` large `` , `` medium `` , and `` small `` .
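Since only the first entry is the preferred version and the rest serve as fallbacks, a shorter list narrows the fallback chain. A sketch that prefers `` orig `` and falls back to `` large `` only:

.. code:: json

    {
        "extractor": {
            "twitter": {
                "size": ["orig", "large"]
            }
        }
    }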
2022-03-31 20:31:58 +02:00
extractor.twitter.syndication
-----------------------------
Type
2023-01-02 14:03:01 +01:00
* `` bool ``
* `` string ``
2022-03-31 20:31:58 +02:00
Default
`` false ``
Description
2022-12-30 13:39:36 +01:00
Controls how to retrieve age-restricted content when not logged in.
* `` false `` : Skip age-restricted Tweets.
* `` true `` : Download using Twitter's syndication API.
* `` "extended" `` : Try to fetch Tweet metadata using the normal API
in addition to the syndication API. This requires additional HTTP
requests in some cases (e.g. when `retweets <extractor.twitter.retweets_> `_
are enabled).
Note: This does not apply to search results (including
`timeline strategies <extractor.twitter.timeline.strategy_> `__ ).
To retrieve such content from search results, you must log in and
disable "Hide sensitive content" in your `search settings
<https://twitter.com/settings/search>`__.
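
A short sketch of the string form of this option, for use without a
logged-in account. Whether the extra requests are worthwhile depends on
the use case:

.. code:: json

    {
        "extractor": {
            "twitter": {
                "syndication": "extended"
            }
        }
    }
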
extractor.twitter.logout
------------------------
Type
`` bool ``
Default
`` false ``
Description
Log out and retry as guest when access to another user's Tweets is blocked.
extractor.twitter.pinned
------------------------
Type
`` bool ``
Default
`` false ``
Description
Fetch media from pinned Tweets.
extractor.twitter.quoted
------------------------
Type
`` bool ``
Default
`` false ``
Description
Fetch media from quoted Tweets.
If this option is enabled, gallery-dl will try to fetch
a quoted (original) Tweet when it sees the Tweet which quotes it.
extractor.twitter.ratelimit
---------------------------
Type
`` string ``
Default
`` "wait" ``
Description
Selects how to handle exceeding the API rate limit.
* `` "abort" `` : Raise an error and stop extraction
* `` "wait" `` : Wait until rate limit reset
extractor.twitter.replies
-------------------------
Type
`` bool ``
Default
`` true ``
Description
Fetch media from replies to other Tweets.
If this value is `` "self" `` , only consider replies where
reply and original Tweet are from the same user.
Note: Twitter will automatically expand conversations if you
use the `` /with_replies `` timeline while logged in. For example,
media from Tweets which the user replied to will also be downloaded.
It is possible to exclude unwanted Tweets using `image-filter
<extractor.*.image-filter_>`__.
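
For instance, to keep only self-replies (threads a user posted in response
to their own Tweets), a configuration along these lines might be used:

.. code:: json

    {
        "extractor": {
            "twitter": {
                "replies": "self"
            }
        }
    }
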
extractor.twitter.retweets
--------------------------
Type
`` bool ``
Default
`` false ``
Description
Fetch media from Retweets.
If this value is `` "original" `` , metadata for these files
will be taken from the original Tweets, not the Retweets.
extractor.twitter.timeline.strategy
-----------------------------------
Type
`` string ``
Default
`` "auto" ``
Description
Controls the strategy / tweet source used for timeline URLs
(`` https://twitter.com/USER/timeline `` ).
* `` "tweets" `` : `/tweets <https://twitter.com/USER/tweets> `__ timeline + search
* `` "media" `` : `/media <https://twitter.com/USER/media> `__ timeline + search
* `` "with_replies" `` : `/with_replies <https://twitter.com/USER/with_replies> `__ timeline + search
* `` "auto" `` : `` "tweets" `` or `` "media" `` , depending on `retweets <extractor.twitter.retweets_> `__ and `text-tweets <extractor.twitter.text-tweets_> `__ settings
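
Because this option lives on the subcategory level (``timeline``), it is
nested one level deeper than most other Twitter options. A minimal sketch,
with ``"with_replies"`` chosen only for illustration:

.. code:: json

    {
        "extractor": {
            "twitter": {
                "timeline": {
                    "strategy": "with_replies"
                }
            }
        }
    }
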
extractor.twitter.text-tweets
-----------------------------
Type
`` bool ``
Default
`` false ``
Description
Also emit metadata for text-only Tweets without media content.
This only has an effect with a `` metadata `` (or `` exec `` ) post processor
with `"event": "post" <metadata.event_> `_
and appropriate `filename <metadata.filename_> `_ .
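
A sketch of how these pieces might fit together. It assumes that post
processors are attached through a ``postprocessors`` list on the
``twitter`` level and that ``{tweet_id}`` is an available metadata field;
both are assumptions and should be checked against the rest of this
document:

.. code:: json

    {
        "extractor": {
            "twitter": {
                "text-tweets": true,
                "postprocessors": [
                    {
                        "name": "metadata",
                        "event": "post",
                        "filename": "{tweet_id}.json"
                    }
                ]
            }
        }
    }
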
extractor.twitter.twitpic
-------------------------
Type
`` bool ``
Default
`` false ``
Description
Extract `TwitPic <https://twitpic.com/> `__ embeds.
extractor.twitter.unique
------------------------
Type
`` bool ``
Default
`` true ``
Description
Ignore previously seen Tweets.
extractor.twitter.users
-----------------------
Type
`` string ``
Default
`` "user" ``
Example
`` "https://twitter.com/search?q=from:{legacy[screen_name]}" ``
Description
| Format string for user URLs generated from
`` following `` and `` list-members `` queries,
| whose replacement field values come from Twitter `` user `` objects
(`Example <https://gist.githubusercontent.com/mikf/99d2719b3845023326c7a4b6fb88dd04/raw/275b4f0541a2c7dc0a86d3998f7d253e8f10a588/github.json> `_ )
Special values:
* `` "user" `` : `` https://twitter.com/i/user/{rest_id} ``
* `` "timeline" `` : `` https://twitter.com/id:{rest_id}/timeline ``
* `` "tweets" `` : `` https://twitter.com/id:{rest_id}/tweets ``
* `` "media" `` : `` https://twitter.com/id:{rest_id}/media ``
Note: To allow gallery-dl to follow custom URL formats, set the blacklist__
for `` twitter `` to a non-default value, e.g. an empty string `` "" `` .
.. __: `extractor.*.blacklist & .whitelist`_
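
Combining the custom format string from the example with the note about
the blacklist, a configuration could look roughly like this (a sketch,
not a recommended setting):

.. code:: json

    {
        "extractor": {
            "twitter": {
                "users": "https://twitter.com/search?q=from:{legacy[screen_name]}",
                "blacklist": ""
            }
        }
    }
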
extractor.twitter.videos
------------------------
Type
* `` bool ``
* `` string ``
Default
`` true ``
Description
Control video download behavior.
* `` true `` : Download videos
* `` "ytdl" `` : Download videos using `youtube-dl`_
* `` false `` : Skip video Tweets
extractor.unsplash.format
-------------------------
Type
`` string ``
Default
`` "raw" ``
Description
Name of the image format to download.
Available formats are
`` "raw" `` , `` "full" `` , `` "regular" `` , `` "small" `` , and `` "thumb" `` .
extractor.vsco.videos
---------------------
Type
`` bool ``
Default
`` true ``
Description
Download video files.
extractor.wallhaven.api-key
---------------------------
Type
`` string ``
Default
`` null ``
Description
Your `Wallhaven API Key <https://wallhaven.cc/settings/account> `__ ,
to use your account's browsing settings and default filters when searching.
See https://wallhaven.cc/help/api for more information.
extractor.wallhaven.include
---------------------------
Type
* `` string ``
* `` list `` of `` strings ``
Default
`` "uploads" ``
Example
* `` "uploads,collections" ``
* `` ["uploads", "collections"] ``
Description
A (comma-separated) list of subcategories to include
when processing a user profile.
Possible values are
`` "uploads" `` , `` "collections" `` .
It is possible to use `` "all" `` instead of listing all values separately.
extractor.wallhaven.metadata
----------------------------
Type
`` bool ``
Default
`` false ``
Description
Extract additional metadata (tags, uploader).
Note: This requires 1 additional HTTP request per post.
extractor.weasyl.api-key
------------------------
Type
`` string ``
Default
`` null ``
Description
Your `Weasyl API Key <https://www.weasyl.com/control/apikeys> `__ ,
to use your account's browsing settings and filters.
extractor.weasyl.metadata
-------------------------
Type
`` bool ``
Default
`` false ``
Description
| Fetch extra submission metadata during gallery downloads.
| (`` comments `` , `` description `` , `` favorites `` , `` folder_name `` ,
`` tags `` , `` views `` )
Note: This requires 1 additional HTTP request per submission.
extractor.weibo.include
-----------------------
Type
* `` string ``
* `` list `` of `` strings ``
Default
`` "feed" ``
Description
A (comma-separated) list of subcategories to include
when processing a user profile.
Possible values are
`` "home" `` ,
`` "feed" `` ,
`` "videos" `` ,
`` "newvideo" `` ,
`` "article" `` ,
`` "album" `` .
It is possible to use `` "all" `` instead of listing all values separately.
extractor.weibo.livephoto
-------------------------
Type
`` bool ``
Default
`` true ``
Description
Download `` livephoto `` files.
extractor.weibo.retweets
------------------------
Type
`` bool ``
Default
`` true ``
Description
Fetch media from retweeted posts.
If this value is `` "original" `` , metadata for these files
will be taken from the original posts, not the retweeted posts.
extractor.weibo.videos
----------------------
Type
`` bool ``
Default
`` true ``
Description
Download video files.
extractor.ytdl.enabled
----------------------
Type
`` bool ``
Default
`` false ``
Description
Match **all** URLs, even ones without a `` ytdl: `` prefix.
extractor.ytdl.format
---------------------
Type
`` string ``
Default
youtube-dl's default, currently `` "bestvideo+bestaudio/best" ``
Description
Video `format selection
<https://github.com/ytdl-org/youtube-dl#format-selection>`__
directly passed to youtube-dl.
extractor.ytdl.generic
----------------------
Type
`` bool ``
Default
`` true ``
Description
Controls the use of youtube-dl's generic extractor.
Set this option to `` "force" `` for the same effect as youtube-dl's
`` --force-generic-extractor `` .
extractor.ytdl.logging
----------------------
Type
`` bool ``
Default
`` true ``
Description
Route youtube-dl's output through gallery-dl's logging system.
Otherwise youtube-dl will write its output directly to stdout/stderr.
Note: Set `` quiet `` and `` no_warnings `` in
`extractor.ytdl.raw-options`_ to `` true `` to suppress all output.
extractor.ytdl.module
---------------------
Type
`` string ``
Default
`` null ``
Description
Name of the youtube-dl Python module to import.
Setting this to `` null `` will try to import `` "yt_dlp" ``
followed by `` "youtube_dl" `` as fallback.
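
To pin the module explicitly instead of relying on the fallback order,
something like the following could be used (a sketch; it assumes the
``yt_dlp`` package is installed):

.. code:: json

    {
        "extractor": {
            "ytdl": {
                "module": "yt_dlp"
            }
        }
    }
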
extractor.ytdl.raw-options
--------------------------
Type
`` object `` (`name` -> `value` )
Example
.. code:: json

    {
        "quiet": true,
        "writesubtitles": true,
        "merge_output_format": "mkv"
    }

Description
Additional options passed directly to the `` YoutubeDL `` constructor.
All available options can be found in `youtube-dl's docstrings
<https://github.com/ytdl-org/youtube-dl/blob/master/youtube_dl/YoutubeDL.py#L138-L318>`__.
extractor.ytdl.cmdline-args
---------------------------
Type
* `` string ``
* `` list `` of `` strings ``
Example
* `` "--quiet --write-sub --merge-output-format mkv" ``
* `` ["--quiet", "--write-sub", "--merge-output-format", "mkv"] ``
Description
Additional options specified as youtube-dl command-line arguments.
extractor.ytdl.config-file
--------------------------
Type
|Path|_
Example
`` "~/.config/youtube-dl/config" ``
Description
Location of a youtube-dl configuration file to load options from.
extractor.zerochan.metadata
---------------------------
Type
`` bool ``
Default
`` false ``
Description
Extract additional metadata (date, md5, tags, ...).
Note: This requires 1-2 additional HTTP requests per post.
extractor.[booru].tags
----------------------
Type
`` bool ``
Default
`` false ``
Description
Categorize tags by their respective types
and provide them as `` tags_<type> `` metadata fields.
Note: This requires 1 additional HTTP request per post.
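
The ``[booru]`` placeholder stands for the category name of a booru-style
site. As a sketch, and assuming ``gelbooru`` is one of the categories this
option applies to, enabling it could look like this:

.. code:: json

    {
        "extractor": {
            "gelbooru": {
                "tags": true
            }
        }
    }
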
extractor.[booru].notes
-----------------------
Type
`` bool ``
Default
`` false ``
Description
Extract overlay notes (position and text).
Note: This requires 1 additional HTTP request per post.
extractor.[booru].url
---------------------
Type
`` string ``
Default
`` "file_url" ``
Example
`` "preview_url" ``
Description
Alternate field name to retrieve download URLs from.
extractor.[manga-extractor].chapter-reverse
-------------------------------------------
Type
`` bool ``
Default
`` false ``
Description
Reverse the order of chapter URLs extracted from manga pages.
* `` true `` : Start with the latest chapter
* `` false `` : Start with the first chapter
extractor.[manga-extractor].page-reverse
----------------------------------------
Type
`` bool ``
Default
`` false ``
Description
Download manga chapter pages in reverse order.
Downloader Options
==================
downloader.*.enabled
--------------------
Type
`` bool ``
Default
`` true ``
Description
Enable/Disable this downloader module.
downloader.*.filesize-min & .filesize-max
-----------------------------------------
Type
`` string ``
Default
`` null ``
Example
`` "32000" `` , `` "500k" `` , `` "2.5M" ``
Description
Minimum/Maximum allowed file size in bytes.
Any file smaller/larger than this limit will not be downloaded.
Possible values are valid integer or floating-point numbers
optionally followed by one of `` k `` , `` m `` , `` g `` , `` t `` , or `` p `` .
These suffixes are case-insensitive.
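
A brief sketch that sets both limits on the base ``downloader`` level,
reusing two of the example values above:

.. code:: json

    {
        "downloader": {
            "filesize-min": "500k",
            "filesize-max": "2.5M"
        }
    }
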
downloader.*.mtime
------------------
Type
`` bool ``
Default
`` true ``
Description
Use |Last-Modified|_ HTTP response headers
to set file modification times.
downloader.*.part
-----------------
Type
`` bool ``
Default
`` true ``
Description
Controls the use of `` .part `` files during file downloads.
* `` true `` : Write downloaded data into `` .part `` files and rename
them upon download completion. This mode additionally supports
resuming incomplete downloads.
* `` false `` : Do not use `` .part `` files and write data directly
into the actual output files.
downloader.*.part-directory
---------------------------
Type
|Path|_
Default
`` null ``
Description
Alternate location for `` .part `` files.
Missing directories will be created as needed.
If this value is `` null `` , `` .part `` files are going to be stored
alongside the actual output files.
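
As an illustration, ``.part`` files could be collected in a separate
directory. The path below is purely hypothetical:

.. code:: json

    {
        "downloader": {
            "part": true,
            "part-directory": "/tmp/gallery-dl/parts/"
        }
    }
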
downloader.*.progress
---------------------
Type
`` float ``
Default
`` 3.0 ``
Description
Number of seconds until a download progress indicator
for the current download is displayed.
Set this option to `` null `` to disable this indicator.
downloader.*.rate
-----------------
Type
`` string ``
Default
`` null ``
Example
`` "32000" `` , `` "500k" `` , `` "2.5M" ``
Description
Maximum download rate in bytes per second.
Possible values are valid integer or floating-point numbers
optionally followed by one of `` k `` , `` m `` , `` g `` , `` t `` , or `` p `` .
These suffixes are case-insensitive.
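
The ``*`` in the option name stands for either the base ``downloader``
level or a specific downloader module such as ``http``. The sketch below
assumes that a module-level value overrides the base-level one, in the
same way deeper levels override shallower ones for extractor options:

.. code:: json

    {
        "downloader": {
            "rate": "500k",
            "http": {
                "rate": "2.5M"
            }
        }
    }
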
downloader.*.retries
--------------------
Type
`` integer ``
Default
`extractor.*.retries`_
Description
Maximum number of retries during file downloads,
or `` -1 `` for infinite retries.
downloader.*.timeout
--------------------
Type
`` float ``
Default
`extractor.*.timeout`_
Description
Connection timeout during file downloads.
downloader.*.verify
-------------------
Type
* `` bool ``
* `` string ``
Default
`extractor.*.verify`_
Description
Certificate validation during file downloads.
downloader.*.proxy
------------------
Type
* `` string ``
* `` object `` (`scheme` -> `proxy` )
Default
`extractor.*.proxy`_
Description
Proxy server used for file downloads.
Disable the use of a proxy for file downloads
by explicitly setting this option to `` null `` .
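
A sketch of the case described above: extractor requests go through a
proxy while file downloads bypass it. The proxy address is a made-up
placeholder:

.. code:: json

    {
        "extractor": {
            "proxy": "http://10.10.1.10:3128"
        },
        "downloader": {
            "proxy": null
        }
    }
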
downloader.http.adjust-extensions
---------------------------------
Type
`` bool ``
Default
`` true ``
Description
Check file headers of downloaded files
and adjust their filename extensions if they do not match.
For example, this will change the filename extension (`` {extension} `` )
of a file called `` example.png `` from `` png `` to `` jpg `` when said file
contains JPEG/JFIF data.
downloader.http.consume-content
-------------------------------
Type
`` bool ``
Default
`` false ``
Description
Controls the behavior when an HTTP response is considered
unsuccessful.
If the value is `` true `` , consume the response body. This
avoids closing the connection and therefore improves connection
reuse.
If the value is `` false `` , immediately close the connection
without reading the response. This can be useful if the server
is known to send large bodies for error responses.
downloader.http.chunk-size
--------------------------
Type
* `` integer ``
* `` string ``
Default
`` 32768 ``
Example
`` "50k" `` , `` "0.8M" ``
Description
Number of bytes per downloaded chunk.
Possible values are integer numbers
optionally followed by one of `` k `` , `` m `` , `` g `` , `` t `` , or `` p `` .
These suffixes are case-insensitive.
downloader.http.headers
-----------------------
Type
`` object `` (`name` -> `value` )
Example
`` {"Accept": "image/webp,*/*", "Referer": "https://example.org/"} ``
Description
Additional HTTP headers to send when downloading files.
downloader.http.retry-codes
---------------------------
Type
`` list `` of `` integers ``
Default
`extractor.*.retry-codes`_
Description
Additional `HTTP response status codes <https://developer.mozilla.org/en-US/docs/Web/HTTP/Status> `__
to retry a download on.
Codes `` 200 `` , `` 206 `` , and `` 416 `` (when resuming a `partial <downloader.*.part_> `__
download) will never be retried and always count as success,
regardless of this option.
`` 5xx `` codes (server error responses) will always be retried,
regardless of this option.
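
For example, to additionally retry downloads that fail with ``404`` or
``429`` responses (the choice of codes is only illustrative):

.. code:: json

    {
        "downloader": {
            "http": {
                "retry-codes": [404, 429]
            }
        }
    }
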
downloader.http.validate
------------------------
Type
`` bool ``
Default
`` true ``
Description
Check for invalid responses.
Fail a download when a file does not pass this check
instead of downloading a potentially broken file.
downloader.ytdl.format
----------------------
Type
`` string ``
Default
youtube-dl's default, currently `` "bestvideo+bestaudio/best" ``
Description
Video `format selection
<https://github.com/ytdl-org/youtube-dl#format-selection>`__
directly passed to youtube-dl.
downloader.ytdl.forward-cookies
-------------------------------
Type
`` bool ``
Default
`` false ``
Description
Forward cookies to youtube-dl.
downloader.ytdl.logging
-----------------------
Type
`` bool ``
Default
`` true ``
Description
Route youtube-dl's output through gallery-dl's logging system.
Otherwise youtube-dl will write its output directly to stdout/stderr.
Note: Set `` quiet `` and `` no_warnings `` in
`downloader.ytdl.raw-options`_ to `` true `` to suppress all output.
downloader.ytdl.module
----------------------
Type
`` string ``
Default
`` null ``
Description
Name of the youtube-dl Python module to import.
Setting this to `` null `` will first try to import `` "yt_dlp" ``
and use `` "youtube_dl" `` as fallback.
downloader.ytdl.outtmpl
-----------------------
Type
`` string ``
Default
`` null ``
Description
The `Output Template <https://github.com/ytdl-org/youtube-dl#output-template> `__
used to generate filenames for files downloaded with youtube-dl.
Special values:
* `` null `` : generate filenames with `extractor.*.filename`_
* `` "default" `` : use youtube-dl's default, currently `` "%(title)s-%(id)s.%(ext)s" ``
Note: An output template other than `` null `` might
cause unexpected results in combination with other options
(e.g. `` "skip": "enumerate" `` ).
downloader.ytdl.raw-options
---------------------------
Type
`` object `` (`name` -> `value` )
Example
.. code:: json

    {
        "quiet": true,
        "writesubtitles": true,
        "merge_output_format": "mkv"
    }

Description
Additional options passed directly to the `` YoutubeDL `` constructor.
All available options can be found in `youtube-dl's docstrings
<https://github.com/ytdl-org/youtube-dl/blob/master/youtube_dl/YoutubeDL.py#L138-L318>`__.
downloader.ytdl.cmdline-args
----------------------------
Type
* `` string ``
* `` list `` of `` strings ``
Example
* `` "--quiet --write-sub --merge-output-format mkv" ``
* `` ["--quiet", "--write-sub", "--merge-output-format", "mkv"] ``
Description
Additional options specified as youtube-dl command-line arguments.
downloader.ytdl.config-file
---------------------------
Type
|Path|_
Example
`` "~/.config/youtube-dl/config" ``
Description
Location of a youtube-dl configuration file to load options from.
Output Options
==============
output.mode
-----------
Type
* `` string ``
* `` object `` (`key` -> `format string` )
Default
`` "auto" ``
Description
Controls the output string format and status indicators.
* `` "null" `` : No output
* `` "pipe" `` : Suitable for piping to other processes or files
* `` "terminal" `` : Suitable for the standard Windows console
* `` "color" `` : Suitable for terminals that understand ANSI escape codes and colors
* `` "auto" `` : `` "terminal" `` on Windows with `output.ansi`_ disabled,
`` "color" `` otherwise.
| It is possible to use custom output format strings
by setting this option to an `` object `` and specifying
| `` start `` , `` success `` , `` skip `` , `` progress `` , and `` progress-total `` .
For example, the following will replicate the same output as |mode: color|:

.. code:: json

    {
        "start"  : "{}",
        "success": "\r\u001b[1;32m{}\u001b[0m\n",
        "skip"   : "\u001b[2m{}\u001b[0m\n",
        "progress"      : "\r{0:>7}B {1:>7}B/s ",
        "progress-total": "\r{3:>3}% {0:>7}B {1:>7}B/s "
    }

`` start `` , `` success `` , and `` skip `` are used to output the current
filename, where `` {} `` or `` {0} `` is replaced with said filename.
If a given format string contains printable characters other than that,
their number needs to be specified as `` [<number>, <format string>] ``
to get the correct results for `output.shorten`_ . For example:

.. code:: json

    "start" : [12, "Downloading {}"]

| `` progress `` and `` progress-total `` are used when displaying the
`download progress indicator <downloader.*.progress_> `__ ,
| `` progress `` when the total number of bytes to download is unknown,
`` progress-total `` otherwise.
For these format strings:
* `` {0} `` is the number of bytes downloaded
* `` {1} `` is the number of bytes downloaded per second
* `` {2} `` is the total number of bytes
* `` {3} `` is the percentage of the total bytes downloaded so far
output.stdout & .stdin & .stderr
--------------------------------
Type
* `` string ``
* `` object ``
Example
.. code:: json

    "utf-8"

.. code:: json

    {
        "encoding": "utf-8",
        "errors": "replace",
        "line_buffering": true
    }

Description
`Reconfigure <https://docs.python.org/3/library/io.html#io.TextIOWrapper.reconfigure> `__
a `standard stream <https://docs.python.org/3/library/sys.html#sys.stdin> `__ .
Possible options are
* `` encoding ``
* `` errors ``
* `` newline ``
* `` line_buffering ``
* `` write_through ``
When this option is specified as a simple `` string `` ,
it is interpreted as `` {"encoding": "<string-value>", "errors": "replace"} `` .
Note: `` errors `` always defaults to `` "replace" `` .
output.shorten
--------------
Type
`` bool ``
Default
`` true ``
Description
Controls whether the output strings should be shortened to fit
on one console line.
Set this option to `` "eaw" `` to also work with east-asian characters
with a display width greater than 1.
output.colors
-------------
Type
`` object `` (`key` -> `ANSI color` )
Default
`` {"success": "1;32", "skip": "2"} ``
Description
Controls the `ANSI colors <https://gist.github.com/fnky/458719343aabd01cfb17a3a4f7296797#colors--graphics-mode> `__
used with |mode: color|__ for successfully downloaded or skipped files.
.. __: `output.mode`_
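
A hedged sketch with non-default colors. The values are ordinary ANSI
graphics-mode codes (``1;34`` for bold blue, ``2;33`` for dim yellow)
picked only as an example:

.. code:: json

    {
        "output": {
            "mode": "color",
            "colors": {
                "success": "1;34",
                "skip": "2;33"
            }
        }
    }
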
output.ansi
-----------
Type
`` bool ``
Default
`` false ``
Description
| On Windows, enable ANSI escape sequences and colored output
| by setting the `` ENABLE_VIRTUAL_TERMINAL_PROCESSING `` flag for stdout and stderr.
output.skip
-----------
Type
`` bool ``
Default
`` true ``
Description
Show skipped file downloads.
output.fallback
---------------
Type
`` bool ``
Default
`` true ``
Description
Include fallback URLs in the output of `` -g/--get-urls `` .
output.private
--------------
Type
`` bool ``
Default
`` false ``
Description
Include private fields,
i.e. fields whose name starts with an underscore,
in the output of `` -K/--list-keywords `` and `` -j/--dump-json `` .
output.progress
---------------
Type
* `` bool ``
* `` string ``
Default
`` true ``
Description
Controls the progress indicator when *gallery-dl* is run with
multiple URLs as arguments.
* `` true `` : Show the default progress indicator
(`` "[{current}/{total}] {url}" `` )
* `` false `` : Do not show any progress indicator
* Any `` string `` : Show the progress indicator using this
as a custom `format string`_ . Possible replacement keys are
`` current `` , `` total `` and `` url `` .
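
A custom progress indicator built from the three replacement keys might
look like this (the wording of the string is arbitrary):

.. code:: json

    {
        "output": {
            "progress": "{current} of {total}: {url}"
        }
    }
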
output.log
----------
Type
* `` string ``
* |Logging Configuration|_
Default
`` "[{name}][{levelname}] {message}" ``
Description
Configuration for logging output to stderr.
If this is a simple `` string `` , it specifies
the format string for logging messages.
output.logfile
--------------
Type
* |Path|_
* |Logging Configuration|_
Description
File to write logging output to.
output.unsupportedfile
----------------------
Type
* |Path|_
* |Logging Configuration|_
Description
File to write external URLs unsupported by *gallery-dl* to.
The default format string here is `` "{message}" `` .
output.num-to-str
-----------------
Type
`` bool ``
Default
`` false ``
Description
Convert numeric values (`` integer `` or `` float `` ) to `` string ``
before outputting them as JSON.
Postprocessor Options
=====================
This section lists all options available inside
`Postprocessor Configuration`_ objects.
Each option is titled as `` <name>.<option> `` , meaning a post processor
of type `` <name> `` will look for an `` <option> `` field inside its "body".
For example, an `` exec `` post processor will recognize
an `async <exec.async_> `__ , `command <exec.command_> `__ ,
and `event <exec.event_> `__ field:

.. code:: json

    {
        "name"   : "exec",
        "async"  : false,
        "command": "...",
        "event"  : "after"
    }

classify.mapping
----------------
2020-09-26 13:33:46 +02:00
Type
2023-01-06 13:07:33 +01:00
`` object `` (`directory` -> `extensions` )
2020-09-26 13:33:46 +02:00
Default
.. code:: json

    {
        "Pictures": ["jpg", "jpeg", "png", "gif", "bmp", "svg", "webp"],
        "Video"   : ["flv", "ogv", "avi", "mp4", "mpg", "mpeg", "3gp", "mkv", "webm", "vob", "wmv"],
        "Music"   : ["mp3", "aac", "flac", "ogg", "wma", "m4a", "wav"],
        "Archives": ["zip", "rar", "7z", "tar", "gz", "bz2"]
    }
2018-06-16 15:43:24 +02:00
2020-09-26 13:33:46 +02:00
Description
A mapping from directory names to filename extensions that should
be stored in them.
2018-06-16 15:43:24 +02:00
2020-09-26 13:33:46 +02:00
Files with an extension not listed will be ignored and stored
in their default location.
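For illustration, a ``classify`` post processor with a custom mapping could be written as follows. The directory names ``Images`` and ``Videos`` are arbitrary examples, not defaults:

.. code:: json

    {
        "name"   : "classify",
        "mapping": {
            "Images": ["jpg", "jpeg", "png", "webp"],
            "Videos": ["mp4", "webm", "mkv"]
        }
    }

With this mapping, a ``gif`` file is not listed and therefore stays in its default location.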
2018-06-16 15:43:24 +02:00
2020-01-18 21:08:25 +01:00
compare.action
--------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
Default
`` "replace" ``
Description
2021-10-05 03:03:37 +02:00
The action to take when files do **not** compare as equal.

* ``"replace"``: Replace/Overwrite the old version with the new one
* ``"enumerate"``: Add an enumeration index to the filename of the new
  version like `skip = "enumerate" <extractor.*.skip_>`__
compare.equal
-------------
Type
`` string ``
Default
`` "null" ``
Description
The action to take when files do compare as equal.

* ``"abort:N"``: Stop the current extractor run
  after ``N`` consecutive files compared as equal.
* ``"terminate:N"``: Stop the current extractor run,
  including parent extractors,
  after ``N`` consecutive files compared as equal.
* ``"exit:N"``: Exit the program
  after ``N`` consecutive files compared as equal.
2020-01-18 21:08:25 +01:00
compare.shallow
---------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` false ``
Description
Only compare file sizes. Do not read and compare their content.
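The three ``compare`` options can be combined in one post processor object. The sketch below uses an arbitrarily chosen ``N`` of 3: new versions that differ are enumerated, and the run stops after three consecutive identical files.

.. code:: json

    {
        "name"   : "compare",
        "action" : "enumerate",
        "equal"  : "abort:3",
        "shallow": false
    }

Setting ``"shallow"`` to ``true`` would compare file sizes only and skip reading file contents.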
2020-01-18 21:08:25 +01:00
2018-06-16 15:43:24 +02:00
2023-02-01 13:57:16 +01:00
exec.archive
------------
Type
|Path|_
Description
File to store IDs of executed commands in,
similar to `extractor.*.archive`_ .
2023-02-05 16:05:13 +01:00
`` archive-format `` , `` archive-prefix `` , and `` archive-pragma `` options,
akin to
`extractor.*.archive-format`_ ,
`extractor.*.archive-prefix`_ , and
`extractor.*.archive-pragma`_ , are supported as well.
2023-02-01 13:57:16 +01:00
2018-06-16 15:43:24 +02:00
exec.async
----------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` false ``
Description
Controls whether to wait for a subprocess to finish
or to let it run asynchronously.
2018-06-16 15:43:24 +02:00
exec.command
------------
2020-09-26 13:33:46 +02:00
Type
2022-12-20 17:30:46 +01:00
* `` string ``
* `` list `` of `` strings ``
2020-09-26 13:33:46 +02:00
Example
* `` "convert {} {}.png && rm {}" ``
* `` ["echo", "{user[account]}", "{id}"] ``
Description
The command to run.

* If this is a ``string``, it will be executed using the system's
  shell, e.g. ``/bin/sh``. Any ``{}`` will be replaced
  with the full path of a file or target directory, depending on
  `exec.event`_.
* If this is a ``list``, the first element specifies the program
  name and any further elements its arguments.
  Each element of this list is treated as a `format string`_ using
  the files' metadata as well as ``{_path}``, ``{_directory}``,
  and ``{_filename}``.
2019-11-03 21:45:45 +01:00
2020-11-25 12:12:41 +01:00
exec.event
----------
Type
``string``
Default
``"after"``
Description
The event for which `exec.command`_ is run.
See `metadata.event`_ for a list of available events.
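A minimal sketch of an ``exec`` post processor in list form, which avoids the shell entirely. ``echo`` is only a stand-in for a real program:

.. code:: json

    {
        "name"   : "exec",
        "event"  : "after",
        "async"  : false,
        "command": ["echo", "{_directory}", "{_filename}"]
    }

Because each list element is its own format string, arguments containing spaces do not need any quoting.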
2019-01-17 21:18:12 +01:00
metadata.mode
-------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
Default
`` "json" ``
Description
2022-07-19 12:24:26 +02:00
Selects how to process metadata.

* ``"json"``: write metadata using |json.dump()|_
* ``"jsonl"``: write metadata in `JSON Lines
  <https://jsonlines.org/>`__ format
* ``"tags"``: write ``tags`` separated by newlines
* ``"custom"``: write the result of applying `metadata.content-format`_
  to a file's metadata dictionary
* ``"modify"``: add or modify metadata entries
* ``"delete"``: remove metadata entries
2019-01-17 21:18:12 +01:00
2020-11-25 12:12:41 +01:00
metadata.filename
-----------------
Type
`` string ``
Default
`` null ``
Example
`` "{id}.data.json" ``
Description
A `format string`_ to build the filenames for metadata files with.
(see `extractor.filename <extractor.*.filename_> `__ )
2022-05-30 21:15:16 +02:00
Using `` "-" `` as filename will write all output to `` stdout `` .
2020-11-25 12:12:41 +01:00
If this option is set, `metadata.extension`_ and
`metadata.extension-format`_ will be ignored.
2020-01-02 20:58:10 +01:00
metadata.directory
------------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
Default
`` "." ``
Example
`` "metadata" ``
Description
Directory in which metadata files are stored, relative to the
current target location for file downloads.
2020-01-02 20:58:10 +01:00
2019-01-17 21:18:12 +01:00
metadata.extension
------------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
Default
`` "json" `` or `` "txt" ``
Description
Filename extension for metadata files that will be appended to the
original file names.
2019-11-29 23:12:22 +01:00
metadata.extension-format
-------------------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
Example
* `` "{extension}.json" ``
* `` "json" ``
Description
Custom format string to build filename extensions for metadata
files with, which will replace the original filename extensions.
Note: `metadata.extension`_ is ignored if this option is set.
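As an illustrative sketch, the options for file placement and naming can be combined like this. The directory name is taken from the example above:

.. code:: json

    {
        "name"            : "metadata",
        "mode"            : "json",
        "directory"       : "metadata",
        "extension-format": "{extension}.json"
    }

Here `metadata.extension`_ has no effect because ``extension-format`` is set.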
2019-11-29 23:12:22 +01:00
2019-01-17 21:18:12 +01:00
2020-11-25 12:12:41 +01:00
metadata.event
--------------
Type
`` string ``
Default
`` "file" ``
Description
The event for which metadata gets written to a file.

The available events are:

``init``
    After post processor initialization
    and before the first file download
``finalize``
    On extractor shutdown, e.g. after all files were downloaded
``finalize-success``
    On extractor shutdown when no error occurred
``finalize-error``
    On extractor shutdown when at least one error occurred
``prepare``
    Before a file download
``prepare-after``
    Before a file download,
    but after building and checking file paths
``file``
    When completing a file download,
    but before it gets moved to its target location
``after``
    After a file got moved to its target location
``skip``
    When skipping a file download
``post``
    When starting to download all files of a `post`,
    e.g. a Tweet on Twitter or a post on Patreon.
``post-after``
    After downloading all files of a `post`
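To write one metadata file per post rather than per file, the event can be combined with an explicit filename. This is a sketch that assumes the extractor's metadata contains an ``id`` field:

.. code:: json

    {
        "name"    : "metadata",
        "event"   : "post",
        "filename": "{id}.data.json"
    }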
2020-11-25 12:12:41 +01:00
2022-07-19 00:57:29 +02:00
metadata.fields
---------------
Type
2022-07-19 12:24:26 +02:00
* `` list `` of `` strings ``
* `` object `` (`field name` -> `format string`_ )
2022-07-19 00:57:29 +02:00
Example
.. code:: json

    ["blocked", "watching", "status[creator][name]"]

.. code:: json

    {
        "blocked"         : "*** ",
        "watching"        : "\fE 'yes' if watching else 'no'",
        "status[username]": "{status[creator][name]!l}"
    }
2022-07-19 00:57:29 +02:00
Description
2022-07-19 12:24:26 +02:00
* `` "mode": "delete" `` :
A list of metadata field names to remove.
* `` "mode": "modify" `` :
An object with metadata field names mapping to a `format string`_
whose result is assigned to said field name.
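The shape of ``fields`` has to match the selected mode. A sketch of a complete post processor object for ``"mode": "delete"``, reusing the field names from the example above:

.. code:: json

    {
        "name"  : "metadata",
        "mode"  : "delete",
        "fields": ["blocked", "watching"]
    }

For ``"mode": "modify"``, the list would be replaced by an object mapping field names to format strings.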
2022-07-19 00:57:29 +02:00
2019-11-30 17:27:49 +01:00
metadata.content-format
-----------------------
2020-09-26 13:33:46 +02:00
Type
2022-12-20 17:30:46 +01:00
* `` string ``
* `` list `` of `` strings ``
2020-09-26 13:33:46 +02:00
Example
2020-10-27 20:09:58 +01:00
* `` "tags:\n\n{tags:J\n}\n" ``
* `` ["tags:", "", "{tags:J\n}"] ``
2020-09-26 13:33:46 +02:00
Description
Custom format string to build the content of metadata files with.
2019-01-17 21:18:12 +01:00
2020-09-26 13:33:46 +02:00
Note: Only applies for `` "mode": "custom" `` .
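A possible complete object for custom output, reusing the list form from the example above. The ``txt`` extension is an arbitrary choice:

.. code:: json

    {
        "name"          : "metadata",
        "mode"          : "custom",
        "extension"     : "txt",
        "content-format": ["tags:", "", "{tags:J\n}"]
    }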
2019-07-14 22:37:28 +02:00
2023-02-07 18:28:14 +01:00
metadata.ascii
--------------
Type
`` bool ``
Default
`` false ``
Description
Escape all non-ASCII characters.
See the `` ensure_ascii `` argument of |json.dump()|_ for further details.
Note: Only applies for `` "mode": "json" `` and `` "jsonl" `` .
2023-01-06 13:07:33 +01:00
metadata.indent
---------------
Type
* `` integer ``
* `` string ``
Default
`` 4 ``
Description
Indentation level of JSON output.
See the `` indent `` argument of |json.dump()|_ for further details.
Note: Only applies for `` "mode": "json" `` .
2023-02-07 18:28:14 +01:00
metadata.separators
-------------------
Type
`` list `` with two `` string `` elements
Default
`` [", ", ": "] ``
Description
`` <item separator> `` - `` <key separator> `` pair
to separate JSON keys and values with.
See the `` separators `` argument of |json.dump()|_ for further details.
Note: Only applies for `` "mode": "json" `` and `` "jsonl" `` .
metadata.sort
-------------
Type
`` bool ``
Default
`` false ``
Description
Sort output by `key` .
See the `` sort_keys `` argument of |json.dump()|_ for further details.
Note: Only applies for `` "mode": "json" `` and `` "jsonl" `` .
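The JSON formatting options above can be set together. The following sketch requests ASCII-only output with sorted keys and a two-space indent; the values are illustrative:

.. code:: json

    {
        "name"      : "metadata",
        "mode"      : "json",
        "ascii"     : true,
        "indent"    : 2,
        "separators": [",", ": "],
        "sort"      : true
    }

Note that ``indent`` is documented for ``"mode": "json"`` only, so it should not be relied on with ``"jsonl"``.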
2022-11-07 15:37:22 +01:00
metadata.open
-------------
Type
`` string ``
Default
`` "w" ``
Description
The `` mode `` in which metadata files get opened.
For example,
use `` "a" `` to append to a file's content
or `` "w" `` to truncate it.
2023-01-06 13:07:33 +01:00
See the `` mode `` argument of |open()|_ for further details.
2022-11-20 15:27:36 +01:00
2022-11-07 15:37:22 +01:00
metadata.encoding
-----------------
Type
`` string ``
Default
`` "utf-8" ``
Description
Name of the encoding used to encode a file's content.
2023-01-06 13:07:33 +01:00
See the `` encoding `` argument of |open()|_ for further details.
metadata.private
----------------
Type
`` bool ``
Default
`` false ``
Description
Include private fields,
i.e. fields whose name starts with an underscore.
2022-11-07 15:37:22 +01:00
2023-03-17 23:16:52 +01:00
metadata.skip
-------------
Type
`` bool ``
Default
`` false ``
Description
Do not overwrite already existing files.
2022-03-20 21:16:46 +01:00
metadata.archive
----------------
Type
|Path|_
Description
File to store IDs of generated metadata files in,
similar to `extractor.*.archive`_ .
2023-02-05 16:05:13 +01:00
`` archive-format `` , `` archive-prefix `` , and `` archive-pragma `` options,
akin to
`extractor.*.archive-format`_ ,
`extractor.*.archive-prefix`_ , and
`extractor.*.archive-pragma`_ , are supported as well.
2022-03-20 21:16:46 +01:00
2022-02-22 23:02:13 +01:00
metadata.mtime
--------------
Type
`` bool ``
Default
`` false ``
Description
2022-03-21 10:02:29 +01:00
Set modification times of generated metadata files
according to the accompanying downloaded file.

Enabling this option will only have an effect
*if* there is actual ``mtime`` metadata available, that is

* after a file download (``"event": "file"`` (default), ``"event": "after"``)
* when running *after* an ``mtime`` post processor for the same `event <metadata.event_>`__

For example, a ``metadata`` post processor for ``"event": "post"`` will
*not* be able to set its file's modification time unless an ``mtime``
post processor with ``"event": "post"`` runs *before* it.
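The ordering requirement can be sketched as a list of two post processor objects. This assumes both appear in the same post processor list (for example the value of a ``postprocessors`` option, which is not described in this section) and run in the order given:

.. code:: json

    [
        {
            "name" : "mtime",
            "event": "post"
        },
        {
            "name" : "metadata",
            "event": "post",
            "mtime": true
        }
    ]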
2022-02-22 23:02:13 +01:00
2022-02-22 23:27:40 +01:00
mtime.event
-----------
Type
`` string ``
Default
`` "file" ``
Description
See `metadata.event`_
2019-07-14 22:37:28 +02:00
mtime.key
---------
2020-09-26 13:33:46 +02:00
Type
`` string ``
Default
`` "date" ``
Description
Name of the metadata field whose value should be used.

This value must be either a UNIX timestamp or a
|datetime|_ object.

Note: This option gets ignored if `mtime.value`_ is set.
mtime.value
-----------
Type
`` string ``
Default
`` null ``
Example
* `` "{status[date]}" ``
* `` "{content[0:6]:R22/2022/D%Y%m%d/}" ``
Description
A `format string`_ whose value should be used.

The resulting value must be either a UNIX timestamp or a
|datetime|_ object.
2019-07-14 22:37:28 +02:00
2023-07-24 12:26:40 +02:00
python.archive
--------------
Type
|Path|_
Description
File to store IDs of called Python functions in,
similar to `extractor.*.archive`_ .
`` archive-format `` , `` archive-prefix `` , and `` archive-pragma `` options,
akin to
`extractor.*.archive-format`_ ,
`extractor.*.archive-prefix`_ , and
`extractor.*.archive-pragma`_ , are supported as well.
python.event
------------
Type
`` string ``
Default
`` "file" ``
Description
The event for which `python.function`_ gets called.
See `metadata.event`_ for a list of available events.
python.function
---------------
Type
`` string ``
Example
* `` "my_module:generate_text" ``
* `` "~/.local/share/gdl-utils.py:resize" ``
Description
The Python function to call.
This function gets specified as `` <module>:<function name> ``
and gets called with the current metadata dict as argument.
``module`` is either an importable Python module name
or the |Path|_ to a `.py` file.
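A minimal sketch of a ``python`` post processor object. ``my_module`` and ``generate_text`` are the hypothetical names from the example above and must exist on your system:

.. code:: json

    {
        "name"    : "python",
        "event"   : "file",
        "function": "my_module:generate_text"
    }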
2018-06-16 15:43:24 +02:00
ugoira.extension
----------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
Default
`` "webm" ``
Description
Filename extension for the resulting video files.
2018-06-16 15:43:24 +02:00
ugoira.ffmpeg-args
------------------
2020-09-26 13:33:46 +02:00
Type
`` list `` of `` strings ``
Default
`` null ``
Example
`` ["-c:v", "libvpx-vp9", "-an", "-b:v", "2M"] ``
Description
Additional FFmpeg command-line arguments.
2018-06-16 15:43:24 +02:00
2021-05-26 02:04:21 +02:00
ugoira.ffmpeg-demuxer
---------------------
Type
`` string ``
Default
2022-03-26 21:10:59 +01:00
`` auto ``
2021-05-26 02:04:21 +02:00
Description
2022-03-25 23:20:32 +01:00
FFmpeg demuxer to read and process input files with. Possible values are

* "`concat <https://ffmpeg.org/ffmpeg-formats.html#concat-1>`_" (inaccurate frame timecodes for non-uniform frame delays)
* "`image2 <https://ffmpeg.org/ffmpeg-formats.html#image2-1>`_" (accurate timecodes, requires nanosecond file timestamps, i.e. no Windows or macOS)
* "mkvmerge" (accurate timecodes, only WebM or MKV, requires `mkvmerge <ugoira.mkvmerge-location_>`__)

`"auto"` will select `mkvmerge` if available and fall back to `concat` otherwise.
2022-03-26 21:10:59 +01:00
2021-05-26 02:04:21 +02:00
2018-06-16 15:43:24 +02:00
ugoira.ffmpeg-location
----------------------
2020-09-26 13:33:46 +02:00
Type
|Path|_
Default
`` "ffmpeg" ``
Description
Location of the `` ffmpeg `` (or `` avconv `` ) executable to use.
2018-06-16 15:43:24 +02:00
2022-03-25 23:20:32 +01:00
ugoira.mkvmerge-location
------------------------
Type
|Path|_
Default
`` "mkvmerge" ``
Description
Location of the `` mkvmerge `` executable for use with the
`mkvmerge demuxer <ugoira.ffmpeg-demuxer_> `__ .
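As a sketch, explicitly selecting the ``mkvmerge`` demuxer could look like this. The executable path is a placeholder for wherever ``mkvmerge`` is installed:

.. code:: json

    {
        "name"             : "ugoira",
        "extension"        : "mkv",
        "ffmpeg-demuxer"   : "mkvmerge",
        "mkvmerge-location": "/usr/bin/mkvmerge"
    }

The ``mkv`` extension is chosen because this demuxer only supports WebM or MKV output.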
2018-08-29 15:58:01 +02:00
ugoira.ffmpeg-output
--------------------
2020-09-26 13:33:46 +02:00
Type
2023-08-21 18:19:53 +02:00
* `` bool ``
* `` string ``
2020-09-26 13:33:46 +02:00
Default
2023-08-21 18:19:53 +02:00
`` "error" ``
2020-09-26 13:33:46 +02:00
Description
2023-08-21 18:19:53 +02:00
Controls FFmpeg output.
* `` true `` : Enable FFmpeg output
* `` false `` : Disable all FFmpeg output
* any `` string `` : Pass `` -hide_banner `` and `` -loglevel ``
with this value as argument to FFmpeg
2020-09-26 13:33:46 +02:00
2018-08-29 15:58:01 +02:00
2018-06-20 18:48:10 +02:00
ugoira.ffmpeg-twopass
---------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` false ``
Description
Enable Two-Pass encoding.
2018-06-20 18:48:10 +02:00
2018-07-20 22:06:48 +02:00
ugoira.framerate
----------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
Default
`` "auto" ``
Description
Controls the frame rate argument (``-r``) for FFmpeg.

* ``"auto"``: Automatically assign a fitting frame rate
  based on delays between frames.
* ``"uniform"``: Like ``auto``, but assign an explicit frame rate
  only to Ugoira with uniform frame delays.
* any other ``string``: Use this value as argument for ``-r``.
* ``null`` or an empty ``string``: Don't set an explicit frame rate.
2018-07-20 22:06:48 +02:00
2018-06-18 17:25:52 +02:00
ugoira.keep-files
-----------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` false ``
Description
Keep ZIP archives after conversion.
2018-06-18 17:25:52 +02:00
2018-09-21 19:52:45 +02:00
ugoira.libx264-prevent-odd
--------------------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` true ``
Description
Prevent `` "width/height not divisible by 2" `` errors
when using `` libx264 `` or `` libx265 `` encoders
by applying a simple cropping filter. See this `Stack Overflow
thread <https://stackoverflow.com/questions/20847674>`__
for more information.
This option, when `` libx264/5 `` is used, automatically
adds `` ["-vf", "crop=iw-mod(iw\\,2):ih-mod(ih\\,2)"] ``
to the list of FFmpeg command-line arguments
to reduce an odd width/height by 1 pixel and make them even.
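For illustration, an ``ugoira`` post processor producing H.264 video might be configured as below. The encoder arguments are an assumption modeled on the ``ffmpeg-args`` example above, with ``libx264`` swapped in:

.. code:: json

    {
        "name"               : "ugoira",
        "extension"          : "mp4",
        "ffmpeg-args"        : ["-c:v", "libx264", "-an", "-b:v", "2M"],
        "libx264-prevent-odd": true
    }

With ``libx264-prevent-odd`` left enabled, the cropping filter does not need to be added to ``ffmpeg-args`` by hand.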
2018-06-16 15:43:24 +02:00
2022-03-21 10:02:29 +01:00
ugoira.mtime
------------
Type
`` bool ``
Default
2022-06-29 22:35:34 +02:00
`` true ``
2022-03-21 10:02:29 +01:00
Description
Set modification times of generated ugoira animations.
2021-05-26 02:26:26 +02:00
ugoira.repeat-last-frame
------------------------
Type
`` bool ``
Default
`` true ``
Description
Allow repeating the last frame when necessary
to prevent it from only being displayed for a very short amount of time.
2018-06-16 15:43:24 +02:00
zip.compression
---------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
Default
`` "store" ``
Description
Compression method to use when writing the archive.
2018-06-16 15:43:24 +02:00
2020-09-26 13:33:46 +02:00
Possible values are `` "store" `` , `` "zip" `` , `` "bzip2" `` , `` "lzma" `` .
2018-06-16 15:43:24 +02:00
zip.extension
-------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
Default
`` "zip" ``
Description
Filename extension for the created ZIP archive.
2018-06-16 15:43:24 +02:00
2022-09-09 11:41:27 +02:00
zip.files
---------
Type
``list`` of |Path|_
Example
`` ["info.json"] ``
Description
List of extra files to be added to a ZIP archive.
Note: Relative paths are relative to the current
`download directory <extractor.*.directory_> `__ .
2018-06-16 15:43:24 +02:00
zip.keep-files
--------------
2020-09-26 13:33:46 +02:00
Type
`` bool ``
Default
`` false ``
Description
Keep the actual files after writing them to a ZIP archive.
2018-06-16 15:43:24 +02:00
2019-07-28 18:13:18 +02:00
zip.mode
--------
2020-09-26 13:33:46 +02:00
Type
`` string ``
Default
`` "default" ``
Description
* ``"default"``: Write the central directory file header
  once after everything is done or an exception is raised.

* ``"safe"``: Update the central directory file header
  each time a file is stored in a ZIP archive.

  This greatly reduces the chance a ZIP archive gets corrupted in
  case the Python interpreter gets shut down unexpectedly
  (power outage, SIGKILL) but is also a lot slower.
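A sketch of a ``zip`` post processor that trades speed for robustness, combining the options described above. ``info.json`` is the example file name from `zip.files`_:

.. code:: json

    {
        "name"       : "zip",
        "compression": "store",
        "extension"  : "cbz",
        "files"      : ["info.json"],
        "keep-files" : false,
        "mode"       : "safe"
    }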
2019-07-28 18:13:18 +02:00
2018-06-16 15:43:24 +02:00
2018-03-16 11:49:49 +01:00
Miscellaneous Options
=====================
2020-10-25 03:05:10 +01:00
extractor.modules
-----------------
Type
`` list `` of `` strings ``
Default
The `` modules `` list in
`extractor/__init__.py <../gallery_dl/extractor/__init__.py#L12> `__
Example
`` ["reddit", "danbooru", "mangadex"] ``
Description
2023-01-30 23:32:36 +01:00
List of internal modules to load when searching for a suitable
extractor class. Useful to reduce startup time and memory usage.
2023-01-30 23:32:36 +01:00
extractor.module-sources
------------------------
Type
`` list `` of |Path|_ instances
Example
`` ["~/.config/gallery-dl/modules", null] ``
Description
List of directories to load external extractor modules from.
Any file in a specified directory with a `` .py `` filename extension
gets `imported <https://docs.python.org/3/reference/import.html> `__
and searched for potential extractors,
i.e. classes with a `` pattern `` attribute.
Note: `` null `` references internal extractors defined in
`extractor/__init__.py <../gallery_dl/extractor/__init__.py#L12> `__
or by `extractor.modules`_ .
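Both options can be used together. In this sketch, external modules from the example directory are searched first, followed by a reduced set of internal modules; the nesting under ``extractor`` is assumed from the dotted option names:

.. code:: json

    {
        "extractor": {
            "modules"       : ["reddit", "danbooru", "mangadex"],
            "module-sources": ["~/.config/gallery-dl/modules", null]
        }
    }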
2023-02-28 18:18:55 +01:00
globals
-------
Type
* |Path|_
* `` string ``
Example
* `` "~/.local/share/gdl-globals.py" ``
* `` "gdl-globals" ``
Description
2023-03-16 18:37:00 +01:00
| Path to or name of an
`importable <https://docs.python.org/3/reference/import.html> `__
Python module,
| whose namespace,
in addition to the `` GLOBALS `` dict in `util.py <../gallery_dl/util.py> `__ ,
gets used as |globals parameter|__ for compiled Python expressions.
2023-02-28 18:18:55 +01:00
.. |globals parameter| replace :: `` globals `` parameter
.. __: https://docs.python.org/3/library/functions.html#eval
2018-03-16 11:49:49 +01:00
cache.file
----------
2020-09-26 13:33:46 +02:00
Type
|Path|_
Default
* (`` %APPDATA% `` or `` "~" `` ) + `` "/gallery-dl/cache.sqlite3" `` on Windows
* (`` $XDG_CACHE_HOME `` or `` "~/.cache" `` ) + `` "/gallery-dl/cache.sqlite3" `` on all other platforms
Description
Path of the SQLite3 database used to cache login sessions,
cookies and API tokens across `gallery-dl` invocations.
2018-03-16 11:49:49 +01:00
2020-09-26 13:33:46 +02:00
Set this option to `` null `` or an invalid path to disable
this cache.
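For example, the cache could be disabled like this, assuming the usual nesting by dotted option name:

.. code:: json

    {
        "cache": {
            "file": null
        }
    }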
2018-03-16 11:49:49 +01:00
2022-07-10 13:30:45 +02:00
format-separator
----------------
Type
`` string ``
Default
`` "/" ``
Description
Character(s) used as argument separator in format string
`format specifiers <formatting.md#format-specifiers> `__ .
For example, setting this option to ``"#"`` would allow a replacement
operation to be ``Rold#new#`` instead of the default ``Rold/new/``.
2022-02-13 22:39:26 +01:00
signals-ignore
--------------
Type
`` list `` of `` strings ``
Example
`` ["SIGTTOU", "SIGTTIN", "SIGTERM"] ``
Description
The list of signal names to ignore, i.e. set
`SIG_IGN <https://docs.python.org/3/library/signal.html#signal.SIG_IGN> `_
as signal handler for.
2023-08-21 21:18:40 +02:00
subconfigs
----------
Type
`` list `` of |Path|_
Example
`` ["~/cfg-twitter.json", "~/cfg-reddit.json"] ``
Description
Additional configuration files to load.
2022-07-18 22:20:30 +02:00
warnings
--------
Type
`` string ``
Default
`` "default" ``
Description
The `Warnings Filter action <https://docs.python.org/3/library/warnings.html#the-warnings-filter> `__
used for (urllib3) warnings.
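As a rough sketch, the undotted options in this section would be placed at the top level of the configuration file. That placement is an assumption based on their names, and the values are illustrative:

.. code:: json

    {
        "format-separator": "#",
        "signals-ignore"  : ["SIGTTOU", "SIGTTIN"],
        "subconfigs"      : ["~/cfg-twitter.json"],
        "warnings"        : "ignore"
    }

``"ignore"`` is one of the actions defined by Python's warnings filter; ``"default"`` restores the standard behavior.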
2018-03-16 11:49:49 +01:00
2017-09-09 17:31:42 +02:00
API Tokens & IDs
================
2017-10-12 23:37:28 +02:00
All configuration keys listed in this section have fully functional default
values embedded into *gallery-dl* itself, but if things unexpectedly break
or you want to use your own personal client credentials, you can follow these
instructions to get an alternative set of API tokens and IDs.
2018-05-26 11:26:50 +02:00
2017-09-09 17:31:42 +02:00
extractor.deviantart.client-id & .client-secret
-----------------------------------------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
How To
* login and visit DeviantArt's
  `Applications & Keys <https://www.deviantart.com/developers/apps>`__
  section
* click "Register Application"
* scroll to "OAuth2 Redirect URI Whitelist (Required)"
  and enter "https://mikf.github.io/gallery-dl/oauth-redirect.html"
* scroll to the bottom and agree to the API License Agreement,
  Submission Policy, and Terms of Service.
* click "Save"
* copy ``client_id`` and ``client_secret`` of your new
  application and put them in your configuration file
  as ``"client-id"`` and ``"client-secret"``
* clear your `cache <cache.file_>`__ to delete any remaining
  ``access-token`` entries. (``gallery-dl --clear-cache deviantart``)
* get a new `refresh-token <extractor.deviantart.refresh-token_>`__ for the
  new ``client-id`` (``gallery-dl oauth:deviantart``)
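The resulting configuration entry would look roughly like this. Both values are placeholders to be replaced with the credentials of your own application:

.. code:: json

    {
        "extractor": {
            "deviantart": {
                "client-id"    : "<your client_id>",
                "client-secret": "<your client_secret>"
            }
        }
    }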
2017-09-09 17:31:42 +02:00
extractor.flickr.api-key & .api-secret
--------------------------------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
How To
* login and `Create an App <https://www.flickr.com/services/apps/create/apply/> `__
in Flickr's `App Garden <https://www.flickr.com/services/> `__
* click "APPLY FOR A NON-COMMERCIAL KEY"
* fill out the form with a random name and description
and click "SUBMIT"
2022-12-20 17:30:46 +01:00
* copy `` Key `` and `` Secret `` and put them in your configuration file
as `` "api-key" `` and `` "api-secret" ``
2017-09-09 17:31:42 +02:00
2017-10-10 17:29:46 +02:00
extractor.reddit.client-id & .user-agent
----------------------------------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
How To
* login and visit the `apps <https://www.reddit.com/prefs/apps/>`__
  section of your account's preferences
* click the "are you a developer? create an app..." button
* fill out the form, choose "installed app", preferably set
  "http://localhost:6414/" as "redirect uri" and finally click
  "create app"
* copy the client id (third line, under your application's name and
  "installed app") and put it in your configuration file
  as ``"client-id"``
* use "``Python:<application name>:v1.0 (by /u/<username>)``" as
  ``user-agent`` and replace ``<application name>`` and ``<username>``
  accordingly (see Reddit's
  `API access rules <https://github.com/reddit/reddit/wiki/API>`__)
* clear your `cache <cache.file_>`__ to delete any remaining
  ``access-token`` entries. (``gallery-dl --clear-cache reddit``)
* get a `refresh-token <extractor.reddit.refresh-token_>`__ for the
  new ``client-id`` (``gallery-dl oauth:reddit``)
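Put together, the Reddit entry might look like the following sketch. All angle-bracket values are placeholders:

.. code:: json

    {
        "extractor": {
            "reddit": {
                "client-id" : "<your client id>",
                "user-agent": "Python:<application name>:v1.0 (by /u/<username>)"
            }
        }
    }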
2017-09-09 17:31:42 +02:00
2018-05-26 11:26:50 +02:00
extractor.smugmug.api-key & .api-secret
---------------------------------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
How To
* login and `Apply for an API Key <https://api.smugmug.com/api/developer/apply> `__
* use a random name and description,
set "Type" to "Application", "Platform" to "All",
and "Use" to "Non-Commercial"
* fill out the two checkboxes at the bottom and click "Apply"
* copy ``API Key`` and ``API Secret``
  and put them in your configuration file
  as ``"api-key"`` and ``"api-secret"``
2018-05-26 11:26:50 +02:00
extractor.tumblr.api-key & .api-secret
--------------------------------------
2020-09-26 13:33:46 +02:00
Type
`` string ``
How To
* login and visit Tumblr's
`Applications <https://www.tumblr.com/oauth/apps> `__ section
* click "Register application"
* fill out the form: use a random name and description, set
https://example.org/ as "Application Website" and "Default
callback URL"
* solve Google's "I'm not a robot" challenge and click "Register"
* click "Show secret key" (below "OAuth Consumer Key")
* copy your ``OAuth Consumer Key`` and ``Secret Key``
  and put them in your configuration file
  as ``"api-key"`` and ``"api-secret"``
2017-11-22 23:09:08 +01:00
2018-03-16 11:49:49 +01:00
2018-05-27 16:48:54 +02:00
Custom Types
============
2019-07-16 23:08:27 +02:00
Date
----
2020-09-26 13:33:46 +02:00
Type
2021-09-14 17:40:05 +02:00
* `` string ``
* `` integer ``
2020-09-26 13:33:46 +02:00
Example
* `` "2019-01-01T00:00:00" ``
* `` "2019" `` with `` "%Y" `` as `date-format`_
* `` 1546297200 ``
Description
A |Date|_ value represents a specific point in time.
2019-07-16 23:08:27 +02:00
2020-09-26 13:33:46 +02:00
* If given as `` string `` , it is parsed according to `date-format`_ .
* If given as `` integer `` , it is interpreted as UTC timestamp.
2019-07-16 23:08:27 +02:00
2021-09-14 17:40:05 +02:00
Duration
--------
Type
* `` float ``
* `` list `` with 2 `` floats ``
2021-12-18 23:04:33 +01:00
* `` string ``
2021-09-14 17:40:05 +02:00
Example
* `` 2.85 ``
* `` [1.5, 3.0] ``
2021-12-18 23:04:33 +01:00
* `` "2.85" `` , `` "1.5-3.0" ``
2021-09-14 17:40:05 +02:00
Description
A |Duration|_ represents a span of time in seconds.
* If given as a single ``float``, it will be used as that exact value.
* If given as a ``list`` with 2 floating-point numbers ``a`` & ``b``,
  a value ``N`` will be randomly chosen with uniform distribution such that ``a <= N <= b``.
  (see `random.uniform() <https://docs.python.org/3/library/random.html#random.uniform>`_)
* If given as a ``string``, it can either represent a single ``float``
  value (``"2.85"``) or a range (``"1.5-3.0"``).
2021-09-14 17:40:05 +02:00
2018-05-27 16:48:54 +02:00
Path
----
2020-09-26 13:33:46 +02:00
Type
2021-09-14 17:40:05 +02:00
* `` string ``
* `` list `` of `` strings ``
2020-09-26 13:33:46 +02:00
Example
* `` "file.ext" ``
* `` "~/path/to/file.ext" ``
* `` "$HOME/path/to/file.ext" ``
* `` ["$HOME", "path", "to", "file.ext"] ``
Description
A |Path|_ is a `` string `` representing the location of a file
or directory.
Simple `tilde expansion <https://docs.python.org/3/library/os.path.html#os.path.expanduser> `__
and `environment variable expansion <https://docs.python.org/3/library/os.path.html#os.path.expandvars> `__
is supported.
In Windows environments, backslashes (`` "\" `` ) can, in addition to
forward slashes (`` "/" `` ), be used as path separators.
Because backslashes are JSON's escape character,
they themselves have to be escaped.
The path `` C:\path\to\file.ext `` has therefore to be written as
`` "C:\\path\\to\\file.ext" `` if you want to use backslashes.
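The escaping rule can be seen in the sketch below, which uses two |Path|_-typed options from the output section. The first value uses escaped backslashes, the second the list form; the locations themselves are placeholders:

.. code:: json

    {
        "output": {
            "logfile"        : "C:\\path\\to\\file.ext",
            "unsupportedfile": ["$HOME", "path", "to", "file.ext"]
        }
    }

Writing the first path with forward slashes, as ``"C:/path/to/file.ext"``, avoids the escaping altogether.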
2018-05-27 16:48:54 +02:00
Logging Configuration
---------------------
2020-09-26 13:33:46 +02:00
Type
`` object ``
Example
.. code:: json

    {
        "format"     : "{asctime} {name}: {message}",
        "format-date": "%H:%M:%S",
        "path"       : "~/log.txt",
        "encoding"   : "ascii"
    }

.. code:: json

    {
        "level" : "debug",
        "format": {
            "debug"  : "debug: {message}",
            "info"   : "[{name}] {message}",
            "warning": "Warning: {message}",
            "error"  : "ERROR: {message}"
        }
    }
Description
Extended logging output configuration.
* format
* General format string for logging messages
or a dictionary with format strings for each loglevel.
In addition to the default
`LogRecord attributes <https://docs.python.org/3/library/logging.html#logrecord-attributes> `__ ,
it is also possible to access the current
2023-01-07 15:21:40 +01:00
`extractor <https://github.com/mikf/gallery-dl/blob/v1.24.2/gallery_dl/extractor/common.py#L26> `__ ,
`job <https://github.com/mikf/gallery-dl/blob/v1.24.2/gallery_dl/job.py#L21> `__ ,
`path <https://github.com/mikf/gallery-dl/blob/v1.24.2/gallery_dl/path.py#L27> `__ ,
and `keywords` objects and their attributes, for example
`` "{extractor.url}" `` , `` "{path.filename}" `` , `` "{keywords.title}" ``
* Default: `` "[{name}][{levelname}] {message}" ``
* format-date
* Format string for `` {asctime} `` fields in logging messages
(see `strftime() directives <https://docs.python.org/3/library/time.html#time.strftime> `__ )
* Default: `` "%Y-%m-%d %H:%M:%S" ``
* level
* Minimum logging message level
(one of `` "debug" `` , `` "info" `` , `` "warning" `` , `` "error" `` , `` "exception" `` )
* Default: `` "info" ``
* path
* |Path|_ to the output file
* mode
* Mode in which the file is opened;
use `` "w" `` to truncate or `` "a" `` to append
(see |open()|_)
* Default: `` "w" ``
* encoding
* File encoding
* Default: `` "utf-8" ``
Note: path, mode, and encoding are only applied when configuring
logging output to a file.
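The options above map fairly directly onto Python's standard ``logging``
module. The following is a minimal sketch under that assumption, not
*gallery-dl*'s actual code: ``PerLevelFormatter`` and ``make_handler`` are
hypothetical names, the defaults are copied from the list above, and only
the four standard level names (debug, info, warning, error) are handled.

```python
import logging
import os

DEFAULT_FORMAT = "[{name}][{levelname}] {message}"
DEFAULT_DATE = "%Y-%m-%d %H:%M:%S"


class PerLevelFormatter(logging.Formatter):
    """Pick a '{'-style format string depending on the record's level."""

    def __init__(self, formats, datefmt=DEFAULT_DATE):
        super().__init__(DEFAULT_FORMAT, datefmt, style="{")
        # Levels without their own entry fall back to the default format
        self._fallback = logging.Formatter(DEFAULT_FORMAT, datefmt, style="{")
        self._by_level = {
            getattr(logging, name.upper()):
                logging.Formatter(fmt, datefmt, style="{")
            for name, fmt in formats.items()
        }

    def format(self, record):
        formatter = self._by_level.get(record.levelno, self._fallback)
        return formatter.format(record)


def make_handler(conf):
    """Build a handler from a dict shaped like a Logging Configuration."""
    path = conf.get("path")
    if path:
        # "mode" and "encoding" only matter when writing to a file
        handler = logging.FileHandler(
            os.path.expanduser(path),
            mode=conf.get("mode", "w"),
            encoding=conf.get("encoding", "utf-8"),
        )
    else:
        handler = logging.StreamHandler()

    fmt = conf.get("format", DEFAULT_FORMAT)
    datefmt = conf.get("format-date", DEFAULT_DATE)
    if isinstance(fmt, dict):
        handler.setFormatter(PerLevelFormatter(fmt, datefmt))
    else:
        handler.setFormatter(logging.Formatter(fmt, datefmt, style="{"))

    handler.setLevel(getattr(logging, conf.get("level", "info").upper()))
    return handler


handler = make_handler({
    "level": "debug",
    "format": {"warning": "Warning: {message}"},
})
record = logging.LogRecord(
    "gallery-dl", logging.WARNING, "example.py", 0, "slow down", None, None)
print(handler.format(record))  # Warning: slow down
```

Keeping one formatter per level means each record is formatted by the
stock ``logging.Formatter`` logic, so ``{asctime}`` and the other
``LogRecord`` attributes keep working in every format string.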
Postprocessor Configuration
---------------------------
Type
`` object ``
Example
.. code:: json

    { "name": "mtime" }

.. code:: json

    {
        "name"       : "zip",
        "compression": "store",
        "extension"  : "cbz",
        "filter"     : "extension not in ('zip', 'rar')",
        "whitelist"  : ["mangadex", "exhentai", "nhentai"]
    }
Description
An ``object`` containing a ``"name"`` attribute specifying the
post-processor type, as well as any of its `options <Postprocessor Options_>`__.

It is possible to set a ``"filter"`` expression similar to
`image-filter <extractor.*.image-filter_>`_ to only run a post-processor
conditionally.

It is also possible to set a ``"whitelist"`` or ``"blacklist"`` to
only enable or disable a post-processor for the specified
extractor categories.
The available post-processor types are
`` classify ``
Categorize files by filename extension
`` compare ``
| Compare versions of the same file and replace/enumerate them on mismatch
| (requires `downloader.*.part`_ = `` true `` and `extractor.*.skip`_ = `` false `` )
`` exec ``
Execute external commands
`` metadata ``
Write metadata to separate files
`` mtime ``
Set file modification time according to its metadata
`` python ``
Call Python functions
`` ugoira ``
Convert Pixiv Ugoira to WebM using `FFmpeg <https://www.ffmpeg.org/> `__
`` zip ``
Store files in a ZIP archive
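How the "whitelist", "blacklist", and "filter" settings described above
could gate a post-processor can be sketched as follows. This is an
illustration under stated assumptions rather than *gallery-dl*'s
implementation: ``should_run`` is a hypothetical helper, the filter is
assumed to be a Python expression evaluated against a file's metadata
(as with image-filter), and the order in which the checks are applied is
a guess.

```python
def should_run(pp_conf, category, metadata):
    """Decide whether a post-processor applies to one file.

    pp_conf  -- dict shaped like a Postprocessor Configuration object
    category -- extractor category of the current job, e.g. "mangadex"
    metadata -- metadata dict of the file being processed
    """
    whitelist = pp_conf.get("whitelist")
    if whitelist and category not in whitelist:
        return False

    blacklist = pp_conf.get("blacklist")
    if blacklist and category in blacklist:
        return False

    expr = pp_conf.get("filter")
    if expr:
        # eval() executes arbitrary code; acceptable in this sketch only
        # because the expression comes from the user's own config file.
        # Builtins are disabled here as an illustrative precaution.
        return bool(eval(expr, {"__builtins__": {}}, dict(metadata)))

    return True


zip_conf = {
    "name": "zip",
    "filter": "extension not in ('zip', 'rar')",
    "whitelist": ["mangadex", "exhentai", "nhentai"],
}

print(should_run(zip_conf, "mangadex", {"extension": "jpg"}))  # True
print(should_run(zip_conf, "mangadex", {"extension": "zip"}))  # False
print(should_run(zip_conf, "pixiv", {"extension": "jpg"}))     # False
```

With the ``zip`` example from this section, image files from the three
listed categories are archived, while files that are already archives
and files from any other category are left alone.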
.. |.netrc| replace:: ``.netrc``
.. |requests.request()| replace:: ``requests.request()``
.. |timeout| replace:: ``timeout``
.. |verify| replace:: ``verify``
.. |mature_content| replace:: ``mature_content``
.. |webbrowser.open()| replace:: ``webbrowser.open()``
.. |datetime| replace:: ``datetime``
.. |datetime.max| replace:: ``datetime.max``
.. |Date| replace:: ``Date``
.. |Duration| replace:: ``Duration``
.. |Path| replace:: ``Path``
.. |Last-Modified| replace:: ``Last-Modified``
.. |Logging Configuration| replace:: ``Logging Configuration``
.. |Postprocessor Configuration| replace:: ``Postprocessor Configuration``
.. |strptime| replace:: strftime() and strptime() Behavior
.. |postprocessors| replace:: ``postprocessors``
.. |mode: color| replace:: ``"mode": "color"``
.. |open()| replace:: the built-in ``open()`` function
.. |json.dump()| replace:: ``json.dump()``
.. _base-directory: `extractor.*.base-directory`_
.. _date-format: `extractor.*.date-format`_
.. _deviantart.metadata: `extractor.deviantart.metadata`_
.. _postprocessors: `extractor.*.postprocessors`_
.. _download archive: `extractor.*.archive`_
.. _.netrc: https://stackoverflow.com/tags/.netrc/info
.. _Last-Modified: https://www.w3.org/Protocols/rfc2616/rfc2616-sec14.html#sec14.29
.. _datetime: https://docs.python.org/3/library/datetime.html#datetime-objects
.. _datetime.max: https://docs.python.org/3/library/datetime.html#datetime.datetime.max
.. _strptime: https://docs.python.org/3/library/datetime.html#strftime-strptime-behavior
.. _webbrowser.open(): https://docs.python.org/3/library/webbrowser.html
.. _open(): https://docs.python.org/3/library/functions.html#open
.. _json.dump(): https://docs.python.org/3/library/json.html#json.dump
.. _mature_content: https://www.deviantart.com/developers/http/v1/20160316/object/deviation
.. _Authentication: https://github.com/mikf/gallery-dl#authentication
.. _OAuth: https://github.com/mikf/gallery-dl#oauth
.. _format string: formatting.md
.. _format strings: formatting.md
.. _youtube-dl: https://github.com/ytdl-org/youtube-dl
.. _requests.request(): https://requests.readthedocs.io/en/master/api/#requests.request
.. _timeout: https://requests.readthedocs.io/en/master/user/advanced/#timeouts
.. _verify: https://requests.readthedocs.io/en/master/user/advanced/#ssl-cert-verification
.. _`Requests' proxy documentation`: https://requests.readthedocs.io/en/master/user/advanced/#proxies