Configuration
#############
| Configuration files for *gallery-dl* use a JSON-based file format.
| For a (more or less) complete example with options set to their default values,
see `gallery-dl.conf <gallery-dl.conf>`__.
| For a configuration file example with more involved settings and options,
see `gallery-dl-example.conf <gallery-dl-example.conf>`__.
|
This file lists all available configuration options and their descriptions.
Contents
========
1) `Extractor Options`_
2) `Extractor-specific Options`_
3) `Downloader Options`_
4) `Output Options`_
5) `Postprocessor Options`_
6) `Miscellaneous Options`_
7) `API Tokens & IDs`_
Extractor Options
=================
Each extractor is identified by its ``category`` and ``subcategory``.
The ``category`` is the lowercase site name without any spaces or special
characters, which is usually just the module name
(``pixiv``, ``danbooru``, ...).
The ``subcategory`` is a lowercase word describing the general functionality
of that extractor (``user``, ``favorite``, ``manga``, ...).
Each one of the following options can be specified on multiple levels of the
configuration tree:
================== =======
Base level: ``extractor.<option-name>``
Category level: ``extractor.<category>.<option-name>``
Subcategory level: ``extractor.<category>.<subcategory>.<option-name>``
================== =======
A value at a "deeper" (more specific) level overrides a value of the same name at a
more general level. Setting the ``extractor.pixiv.filename`` value, for example, lets
you specify a general filename pattern for all the different pixiv extractors.
Using the ``extractor.pixiv.user.filename`` value lets you override this
general pattern specifically for ``PixivUserExtractor`` instances.
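
As a minimal sketch of these three levels in a configuration file
(the format strings and the ``num`` and ``user[name]`` keys are
illustrative assumptions, not defaults):

.. code:: json

    {
        "extractor": {
            "filename": "{id}.{extension}",
            "pixiv": {
                "filename": "{id}_p{num}.{extension}",
                "user": {
                    "filename": "{user[name]}_{id}_p{num}.{extension}"
                }
            }
        }
    }

With such a file, ``PixivUserExtractor`` instances would use the innermost
pattern, other pixiv extractors the ``extractor.pixiv.filename`` pattern,
and all remaining extractors the base-level one.
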
The ``category`` and ``subcategory`` of all extractors are included in the
output of ``gallery-dl --list-extractors``. For a specific URL these values
can also be determined by using the ``-K``/``--list-keywords`` command-line
option (see the example below).
extractor.*.filename
--------------------
Type
* ``string``
* ``object`` (`condition` -> `format string`_)
Example
.. code:: json
"{manga}_c{chapter}_{page:>03}.{extension}"
.. code:: json
{
"extension == 'mp4'": "{id}_video.{extension}",
"'nature' in title" : "{id}_{title}.{extension}",
"" : "{id}_default.{extension}"
}
Description
A `format string`_ to build filenames for downloaded files with.
If this is an ``object``, it must contain Python expressions mapping to the
filename format strings to use.
These expressions are evaluated in the order specified when running on Python 3.6+
and in an undetermined order on Python 3.4 and 3.5.
The available replacement keys depend on the extractor used. A list
of keys for a specific one can be acquired by calling *gallery-dl*
with the ``-K``/``--list-keywords`` command-line option.
For example:
.. code::
$ gallery-dl -K http://seiga.nicovideo.jp/seiga/im5977527
Keywords for directory names:
-----------------------------
category
seiga
subcategory
image
Keywords for filenames:
-----------------------
category
seiga
extension
None
image-id
5977527
subcategory
image
Note: Even if the value of the ``extension`` key is missing or
``None``, it will be filled in later when the file download is
starting. This key is therefore always available to provide
a valid filename extension.
extractor.*.directory
---------------------
Type
* ``list`` of ``strings``
* ``object`` (`condition` -> `format strings`_)
Example
.. code:: json
["{category}", "{manga}", "c{chapter} - {title}"]
.. code:: json
{
"'nature' in content": ["Nature Pictures"],
"retweet_id != 0" : ["{category}", "{user[name]}", "Retweets"],
"" : ["{category}", "{user[name]}"]
}
Description
A list of `format strings`_ to build target directory paths with.
If this is an ``object``, it must contain Python expressions mapping to the
list of format strings to use.
Each individual string in such a list represents a single path
segment, which will be joined together and appended to the
base-directory_ to form the complete target directory path.
extractor.*.base-directory
--------------------------
Type
|Path|_
Default
``"./gallery-dl/"``
Description
Directory path used as base for all download destinations.
extractor.*.parent-directory
----------------------------
Type
``bool``
Default
``false``
Description
Use an extractor's current target directory as
base-directory_ for any spawned child extractors.
extractor.*.parent-metadata
---------------------------
extractor.*.metadata-parent
---------------------------
Type
* ``bool``
* ``string``
Default
``false``
Description
If ``true``, overwrite any metadata provided by a child extractor
with its parent's.
| If this is a ``string``, store a parent's metadata in each child's metadata
under a field named after that string.
| For example with ``"parent-metadata": "_p_"``:
.. code:: json
{
"id": "child-id",
"_p_": {"id": "parent-id"}
}
extractor.*.parent-skip
-----------------------
Type
``bool``
Default
``false``
Description
Share number of skipped downloads between parent and child extractors.
extractor.*.path-restrict
-------------------------
Type
* ``string``
* ``object`` (`character` -> `replacement character(s)`)
Default
``"auto"``
Example
* ``"/!? (){}"``
* ``{" ": "_", "/": "-", "|": "-", ":": "_-_", "*": "_+_"}``
Description
| A string of characters to be replaced with the value of
`path-replace <extractor.*.path-replace_>`__
| or an object mapping invalid/unwanted characters to their replacements
| for generated path segment names.
Special values:
* ``"auto"``: Use characters from ``"unix"`` or ``"windows"``
depending on the local operating system
* ``"unix"``: ``"/"``
* ``"windows"``: ``"\\\\|/<>:\"?*"``
* ``"ascii"``: ``"^0-9A-Za-z_."`` (only ASCII digits, letters, underscores, and dots)
* ``"ascii+"``: ``"^0-9@-[\\]-{ #-)+-.;=!}~"`` (all ASCII characters except the ones not allowed by Windows)
Implementation Detail: For ``strings`` with length >= 2, this option uses a
`Regular Expression Character Set <https://www.regular-expressions.info/charclass.html>`__,
meaning that:
* using a caret ``^`` as first character inverts the set
* character ranges are supported (``0-9a-z``)
* ``]``, ``-``, and ``\`` need to be escaped as
``\\]``, ``\\-``, and ``\\\\`` respectively
to use them as literal characters
extractor.*.path-replace
------------------------
Type
``string``
Default
``"_"``
Description
The replacement character(s) for
`path-restrict <extractor.*.path-restrict_>`__
extractor.*.path-remove
-----------------------
Type
``string``
Default
``"\u0000-\u001f\u007f"`` (ASCII control characters)
Description
Set of characters to remove from generated path names.
Note: In a string with 2 or more characters, ``[]^-\`` need to be
escaped with backslashes, e.g. ``"\\[\\]"``
extractor.*.path-strip
----------------------
Type
``string``
Default
``"auto"``
Description
Set of characters to remove from the end of generated path segment names
using `str.rstrip() <https://docs.python.org/3/library/stdtypes.html#str.rstrip>`_
Special values:
* ``"auto"``: Use characters from ``"unix"`` or ``"windows"``
depending on the local operating system
* ``"unix"``: ``""``
* ``"windows"``: ``". "``
extractor.*.path-extended
-------------------------
Type
``bool``
Default
``true``
Description
On Windows, use `extended-length paths <https://learn.microsoft.com/en-us/windows/win32/fileio/maximum-file-path-limitation>`__
prefixed with ``\\?\`` to work around the 260 characters path length limit.
extractor.*.extension-map
-------------------------
Type
``object`` (`extension` -> `replacement`)
Default
.. code:: json
{
"jpeg": "jpg",
"jpe" : "jpg",
"jfif": "jpg",
"jif" : "jpg",
"jfi" : "jpg"
}
Description
A JSON ``object`` mapping filename extensions to their replacements.
extractor.*.skip
----------------
Type
* ``bool``
* ``string``
Default
``true``
Description
Controls the behavior when downloading files that have been
downloaded before, i.e. a file with the same filename already
exists or its ID is in a `download archive <extractor.*.archive_>`__.
* ``true``: Skip downloads
* ``false``: Overwrite already existing files
* ``"abort"``: Stop the current extractor run
* ``"abort:N"``: Skip downloads and stop the current extractor run
after ``N`` consecutive skips
* ``"terminate"``: Stop the current extractor run, including parent extractors
* ``"terminate:N"``: Skip downloads and stop the current extractor run,
including parent extractors, after ``N`` consecutive skips
* ``"exit"``: Exit the program altogether
* ``"exit:N"``: Skip downloads and exit the program
after ``N`` consecutive skips
* ``"enumerate"``: Add an enumeration index to the beginning of the
filename extension (``file.1.ext``, ``file.2.ext``, etc.)
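
For illustration, a hedged sketch that skips already downloaded files and
stops the current extractor run once several of them are encountered in a row.
The count of ``3`` is an arbitrary example value:

.. code:: json

    {
        "extractor": {
            "skip": "abort:3"
        }
    }

This kind of setting tends to suit incremental updates of galleries that list
their newest files first, an assumption that does not hold for every site.
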
extractor.*.skip-filter
-----------------------
Type
``string``
Description
Python expression controlling which skipped files to count towards
``"abort"`` / ``"terminate"`` / ``"exit"``.
extractor.*.sleep
-----------------
Type
|Duration|_
Default
``0``
Description
Number of seconds to sleep before each download.
extractor.*.sleep-extractor
---------------------------
Type
|Duration|_
Default
``0``
Description
Number of seconds to sleep before handling an input URL,
i.e. before starting a new extractor.
extractor.*.sleep-429
---------------------
Type
|Duration|_
Default
``60``
Description
Number of seconds to sleep when receiving a `429 Too Many Requests`
response before `retrying <extractor.*.retries_>`__ the request.
extractor.*.sleep-request
-------------------------
Type
|Duration|_
Default
* ``"0.5-1.5"``
``[Danbooru]``, ``[E621]``, ``[foolfuuka]:search``, ``itaku``,
``koharu``,
``newgrounds``, ``[philomena]``, ``pixiv:novel``, ``plurk``,
``poipiku``, ``pornpics``, ``soundgasm``, ``urlgalleries``,
``vk``, ``zerochan``
* ``"1.0-2.0"``
``flickr``, ``weibo``, ``[wikimedia]``
* ``"2.0-4.0"``
``behance``, ``imagefap``, ``[Nijie]``
* ``"3.0-6.0"``
``exhentai``, ``idolcomplex``, ``[reactor]``, ``readcomiconline``
* ``"6.0-6.1"``
``twibooru``
* ``"6.0-12.0"``
``instagram``
* ``0``
otherwise
Description
Minimal time interval in seconds between each HTTP request
during data extraction.
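
A sketch of how the sleep-related options above might be combined, assuming a
|Duration|_ may be given either as a number of seconds or as a ``"min-max"``
range string as in the defaults listed above. All values are arbitrary examples:

.. code:: json

    {
        "extractor": {
            "sleep": 1,
            "sleep-request": "0.5-1.5",
            "sleep-429": 120,
            "instagram": {
                "sleep-request": "8.0-12.0"
            }
        }
    }

Here the category-level ``sleep-request`` would apply to ``instagram``
extractors only, while the base-level values cover everything else.
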
extractor.*.username & .password
--------------------------------
Type
``string``
Default
``null``
Description
The username and password to use when attempting to log in to
a site.
Specifying username and password is required for
* ``nijie``
* ``horne``
and optional for
* ``aibooru`` (*)
* ``aryion``
* ``atfbooru`` (*)
* ``bluesky``
* ``booruvar`` (*)
* ``coomerparty``
* ``danbooru`` (*)
* ``deviantart``
* ``e621`` (*)
* ``e6ai`` (*)
* ``e926`` (*)
* ``exhentai``
* ``idolcomplex``
* ``imgbb``
* ``inkbunny``
* ``kemonoparty``
* ``koharu``
* ``mangadex``
* ``mangoxo``
* ``pillowfort``
* ``sankaku``
* ``subscribestar``
* ``tapas``
* ``tsumino``
* ``twitter``
* ``vipergirls``
* ``zerochan``
These values can also be specified via the
``-u/--username`` and ``-p/--password`` command-line options or
by using a |.netrc|_ file. (see Authentication_)
(*) The password value for these sites should be
the API key found in your user profile, not the actual account password.
Note: Leave the ``password`` value empty or undefined
to be prompted for a password when performing a login
(see `getpass() <https://docs.python.org/3/library/getpass.html#getpass.getpass>`__).
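
A minimal sketch of per-site credentials. Every value is a placeholder.
For ``danbooru``, the ``password`` field holds an API key as described above,
and the omitted ``password`` for ``inkbunny`` would lead to an interactive
prompt:

.. code:: json

    {
        "extractor": {
            "nijie": {
                "username": "user@example.org",
                "password": "example-password"
            },
            "danbooru": {
                "username": "example-user",
                "password": "EXAMPLE-API-KEY"
            },
            "inkbunny": {
                "username": "example-user"
            }
        }
    }
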
extractor.*.input
-----------------
Type
``bool``
Default
``true`` if `stdin` is attached to a terminal,
``false`` otherwise
Description
Allow prompting the user for interactive input.
extractor.*.netrc
-----------------
Type
``bool``
Default
``false``
Description
Enable the use of |.netrc|_ authentication data.
extractor.*.cookies
-------------------
Type
* |Path|_
* ``object`` (`name` -> `value`)
* ``list``
Description
Source to read additional cookies from. This can be
* The |Path|_ to a Mozilla/Netscape format cookies.txt file
.. code:: json
"~/.local/share/cookies-instagram-com.txt"
* An ``object`` specifying cookies as name-value pairs
.. code:: json
{
"cookie-name": "cookie-value",
"sessionid" : "14313336321%3AsabDFvuASDnlpb%3A31",
"isAdult" : "1"
}
* A ``list`` with up to 5 entries specifying a browser profile.
* The first entry is the browser name
* The optional second entry is a profile name or an absolute path to a profile directory
* The optional third entry is the keyring to retrieve passwords for decrypting cookies from
* The optional fourth entry is a (Firefox) container name (``"none"`` for only cookies with no container)
* The optional fifth entry is the domain to extract cookies for. Prefix it with a dot ``.`` to include cookies for subdomains. Has no effect when also specifying a container.
.. code:: json
["firefox"]
["firefox", null, null, "Personal"]
["chromium", "Private", "kwallet", null, ".twitter.com"]
extractor.*.cookies-update
--------------------------
Type
* ``bool``
* |Path|_
Default
``true``
Description
Export session cookies in cookies.txt format.
* If this is a |Path|_, write cookies to the given file path.
* If this is ``true`` and `extractor.*.cookies`_ specifies the |Path|_
of a valid cookies.txt file, update its contents.
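
As a sketch, the following reads cookies from a cookies.txt file and, because
``cookies-update`` is ``true``, would write updated session cookies back to
that same file. The path is a placeholder:

.. code:: json

    {
        "extractor": {
            "instagram": {
                "cookies": "~/.local/share/cookies-instagram-com.txt",
                "cookies-update": true
            }
        }
    }

To leave the input file untouched, ``cookies-update`` could instead be set to
a separate (hypothetical) path such as ``"~/.cache/cookies-export.txt"``.
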
extractor.*.proxy
-----------------
Type
* ``string``
* ``object`` (`scheme` -> `proxy`)
Example
.. code:: json
"http://10.10.1.10:3128"
.. code:: json
{
"http" : "http://10.10.1.10:3128",
"https": "http://10.10.1.10:1080",
"http://10.20.1.128": "http://10.10.1.10:5323"
}
Description
Proxy (or proxies) to be used for remote connections.
* If this is a ``string``, it is the proxy URL for all
outgoing requests.
* If this is an ``object``, it is a scheme-to-proxy mapping to
specify different proxy URLs for each scheme.
It is also possible to set a proxy for a specific host by using
``scheme://host`` as key.
See `Requests' proxy documentation`_ for more details.
Note: If a proxy URL does not include a scheme,
``http://`` is assumed.
extractor.*.source-address
--------------------------
Type
* ``string``
* ``list`` with 1 ``string`` and 1 ``integer`` as elements
Example
* ``"192.168.178.20"``
* ``["192.168.178.20", 8080]``
Description
Client-side IP address to bind to.
| Can be either a simple ``string`` with just the local IP address
| or a ``list`` with IP and explicit port number as elements.
extractor.*.user-agent
----------------------
Type
``string``
Default
``"Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:128.0) Gecko/20100101 Firefox/128.0"``
Description
User-Agent header value to be used for HTTP requests.
Setting this value to ``"browser"`` will try to automatically detect
and use the User-Agent used by the system's default browser.
Note: This option has no effect on
`pixiv`, `e621`, and `mangadex`
extractors, as these need specific values to function correctly.
extractor.*.browser
-------------------
Type
``string``
Default
* ``"firefox"``: ``artstation``, ``mangasee``, ``patreon``, ``pixiv:series``, ``twitter``
* ``null``: otherwise
Example
* ``"chrome:macos"``
Description
Try to emulate a real browser (``firefox`` or ``chrome``)
by using their default HTTP headers and TLS ciphers for HTTP requests.
Optionally, the operating system used in the ``User-Agent`` header can be
specified after a ``:`` (``windows``, ``linux``, or ``macos``).
Note: ``requests`` and ``urllib3`` only support HTTP/1.1, while a real
browser would use HTTP/2.
extractor.*.referer
-------------------
Type
* ``bool``
* ``string``
Default
``true``
Description
Send `Referer <https://developer.mozilla.org/en-US/docs/Web/HTTP/Headers/Referer>`__
headers with all outgoing HTTP requests.
If this is a ``string``, send it as Referer
instead of the extractor's ``root`` domain.
extractor.*.headers
-------------------
Type
``object`` (`name` -> `value`)
Default
.. code:: json
{
"User-Agent" : "<extractor.*.user-agent>",
"Accept" : "*/*",
"Accept-Language": "en-US,en;q=0.5",
"Accept-Encoding": "gzip, deflate",
"Referer" : "<extractor.*.referer>"
}
Description
Additional `HTTP headers <https://developer.mozilla.org/en-US/docs/Web/HTTP/Headers>`__
to be sent with each HTTP request.
To disable sending a header, set its value to ``null``.
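
A sketch of both uses described above, adding one header and disabling another.
``X-Example-Header`` and its value are hypothetical placeholders:

.. code:: json

    {
        "extractor": {
            "headers": {
                "X-Example-Header": "example-value",
                "Accept-Language": null
            }
        }
    }
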
extractor.*.ciphers
-------------------
Type
``list`` of ``strings``
Example
.. code:: json
["ECDHE-ECDSA-AES128-GCM-SHA256",
"ECDHE-RSA-AES128-GCM-SHA256",
"ECDHE-ECDSA-CHACHA20-POLY1305",
"ECDHE-RSA-CHACHA20-POLY1305"]
Description
List of TLS/SSL cipher suites in
`OpenSSL cipher list format <https://www.openssl.org/docs/manmaster/man1/openssl-ciphers.html#CIPHER-LIST-FORMAT>`__
to be passed to
`ssl.SSLContext.set_ciphers() <https://docs.python.org/3/library/ssl.html#ssl.SSLContext.set_ciphers>`__
extractor.*.tls12
-----------------
Type
``bool``
Default
* ``false``: ``artstation``, ``patreon``, ``pixiv:series``
* ``true``: otherwise
Description
Allow selecting TLS 1.2 cipher suites.
Can be disabled to alter TLS fingerprints
and potentially bypass Cloudflare blocks.
extractor.*.keywords
--------------------
Type
``object`` (`name` -> `value`)
Example
``{"type": "Pixel Art", "type_id": 123}``
Description
Additional name-value pairs to be added to each metadata dictionary.
extractor.*.keywords-eval
-------------------------
Type
``bool``
Default
``false``
Description
Evaluate each `keywords <extractor.*.keywords_>`__ ``string`` value
as a `format string`_.
extractor.*.keywords-default
----------------------------
Type
any
Default
``"None"``
Description
Default value used for missing or undefined keyword names in
`format strings`_.
extractor.*.metadata-url
------------------------
extractor.*.url-metadata
------------------------
Type
``string``
Description
Insert a file's download URL into its metadata dictionary as the given name.
For example, setting this option to ``"gdl_file_url"`` will cause a new
metadata field with name ``gdl_file_url`` to appear, which contains the
current file's download URL.
This can then be used in `filenames <extractor.*.filename_>`_,
with a ``metadata`` post processor, etc.
extractor.*.metadata-path
-------------------------
extractor.*.path-metadata
-------------------------
Type
``string``
Description
Insert a reference to the current
`PathFormat <https://github.com/mikf/gallery-dl/blob/v1.27.0/gallery_dl/path.py#L27>`__
data structure into metadata dictionaries as the given name.
For example, setting this option to ``"gdl_path"`` would make it possible
to access the current file's filename as ``"{gdl_path.filename}"``.
extractor.*.metadata-extractor
------------------------------
extractor.*.extractor-metadata
------------------------------
Type
``string``
Description
Insert a reference to the current
`Extractor <https://github.com/mikf/gallery-dl/blob/v1.27.0/gallery_dl/extractor/common.py#L28>`__
object into metadata dictionaries as the given name.
extractor.*.metadata-http
-------------------------
extractor.*.http-metadata
-------------------------
Type
``string``
Description
Insert an ``object`` containing a file's HTTP headers and
``filename``, ``extension``, and ``date`` parsed from them
into metadata dictionaries as the given name.
For example, setting this option to ``"gdl_http"`` would make it possible
to access the current file's ``Last-Modified`` header as ``"{gdl_http[Last-Modified]}"``
and its parsed form as ``"{gdl_http[date]}"``.
extractor.*.metadata-version
----------------------------
extractor.*.version-metadata
----------------------------
Type
``string``
Description
Insert an ``object`` containing gallery-dl's version info into
metadata dictionaries as the given name.
The content of the object is as follows:
.. code:: json
{
"version" : "string",
"is_executable" : "bool",
"current_git_head": "string or null"
}
extractor.*.category-transfer
-----------------------------
Type
``bool``
Default
Extractor-specific
Description
Transfer an extractor's (sub)category values to all child
extractors spawned by it, to let them inherit their parent's
config options.
extractor.*.blacklist & .whitelist
----------------------------------
Type
``list`` of ``strings``
Default
``["oauth", "recursive", "test"]`` + current extractor category
Example
``["imgur", "redgifs:user", "*:image"]``
Description
A list of extractor identifiers to ignore (or allow)
when spawning child extractors for unknown URLs,
e.g. from ``reddit`` or ``plurk``.
Each identifier can be
* A category or basecategory name (``"imgur"``, ``"mastodon"``)
* | A (base)category-subcategory pair, where both names are separated by a colon (``"redgifs:user"``).
| Both names can be a `*` or left empty, matching all possible names (``"*:image"``, ``":user"``).
Note: Any ``blacklist`` setting will automatically include
``"oauth"``, ``"recursive"``, and ``"test"``.
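
For example, to have ``reddit`` spawn child extractors only for a few chosen
sites, a configuration along these lines might be used. The selection of
identifiers is an arbitrary illustration:

.. code:: json

    {
        "extractor": {
            "reddit": {
                "whitelist": ["imgur", "redgifs"]
            }
        }
    }
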
extractor.*.archive
-------------------
Type
|Path|_
Default
``null``
Example
``"$HOME/.archives/{category}.sqlite3"``
Description
File to store IDs of downloaded files in. Downloads of files
already recorded in this archive file will be
`skipped <extractor.*.skip_>`__.
The resulting archive file is not a plain text file but an SQLite3
database, as either lookup operations are significantly faster or
memory requirements are significantly lower when the
number of stored IDs gets reasonably large.
Note: Archive files that do not already exist get generated automatically.
Note: Archive paths support regular `format string`_ replacements,
but be aware that using external inputs for building local paths
may pose a security risk.
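
A hedged sketch of one archive file per category, with archiving switched off
again for a single site by resetting the option to its default ``null``.
The path is the placeholder from the example above:

.. code:: json

    {
        "extractor": {
            "archive": "$HOME/.archives/{category}.sqlite3",
            "reddit": {
                "archive": null
            }
        }
    }
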
extractor.*.archive-event
-------------------------
Type
+ ``string``
+ ``list`` of ``strings``
Default
``"file"``
Example
* ``"file,skip"``
* ``["file", "skip"]``
Description
`Event(s) <metadata.event_>`__
for which IDs get written to an
`archive <extractor.*.archive_>`__.
Available events are:
``file``, ``skip``
extractor.*.archive-format
--------------------------
Type
``string``
Example
``"{id}_{offset}"``
Description
An alternative `format string`_ to build archive IDs with.
extractor.*.archive-mode
------------------------
Type
``string``
Default
``"file"``
Description
Controls when to write `archive IDs <extractor.*.archive-format_>`__
to the archive database.
* ``"file"``: Write IDs immediately
after completing or skipping a file download.
* ``"memory"``: Keep IDs in memory
and only write them after successful job completion.
extractor.*.archive-prefix
--------------------------
Type
``string``
Default
``"{category}"``
Description
Prefix for archive IDs.
extractor.*.archive-pragma
--------------------------
Type
``list`` of ``strings``
Example
``["journal_mode=WAL", "synchronous=NORMAL"]``
Description
A list of SQLite ``PRAGMA`` statements to run during archive initialization.
See `<https://www.sqlite.org/pragma.html#toc>`__
for available ``PRAGMA`` statements and further details.
extractor.*.actions
-------------------
Type
* ``object`` (`pattern` -> `action(s)`)
* ``list`` of ``lists`` with `pattern` -> `action(s)` pairs as elements
Example
.. code:: json
{
"info:Logging in as .+" : "level = debug",
"warning:(?i)unable to .+": "exit 127",
"error" : [
"status |= 1",
"exec notify.sh 'gdl error'",
"abort"
]
}
.. code:: json
[
["info:Logging in as .+" , "level = debug"],
["warning:(?i)unable to .+", "exit 127" ],
["error" , [
"status |= 1",
"exec notify.sh 'gdl error'",
"abort"
]]
]
Description
Perform an ``action`` when logging a message matched by ``pattern``.
``pattern`` is parsed as severity level (``debug``, ``info``, ``warning``, ``error``, or integer value)
followed by an optional `Python Regular Expression <https://docs.python.org/3/library/re.html#regular-expression-syntax>`__
separated by a colon ``:``.
Using ``*`` as `level` or leaving it empty
matches logging messages of all levels
(e.g. ``*:<re>`` or ``:<re>``).
``action`` is parsed as action type
followed by (optional) arguments.
It is possible to specify more than one ``action`` per ``pattern``
by providing them as a ``list``: ``["<action1>", "<action2>", …]``
Supported Action Types:
``status``:
| Modify job exit status.
| Expected syntax is ``<operator> <value>`` (e.g. ``= 100``).
Supported operators are
``=`` (assignment),
``&`` (bitwise AND),
``|`` (bitwise OR),
``^`` (bitwise XOR).
``level``:
| Modify severity level of the current logging message.
| Can be one of ``debug``, ``info``, ``warning``, ``error`` or an integer value.
``print``:
Write argument to stdout.
``exec``:
Run a shell command.
``abort``:
Stop the current extractor run.
``terminate``:
Stop the current extractor run, including parent extractors.
``restart``:
Restart the current extractor run.
``wait``:
| Sleep for a given Duration_ or
| wait until Enter is pressed when no argument was given.
``exit``:
Exit the program with the given argument as exit status.
extractor.*.postprocessors
--------------------------
Type
``list`` of |Postprocessor Configuration|_ objects
Example
.. code:: json
[
{
"name": "zip" ,
"compression": "store"
},
{
"name": "exec",
"command": ["/home/foobar/script", "{category}", "{image_id}"]
}
]
Description
A list of `post processors <Postprocessor Configuration_>`__
to be applied to each downloaded file in the specified order.
| Unlike other options, a |postprocessors|_ setting at a deeper level
does not override any |postprocessors|_ setting at a lower level.
| Instead, all post processors from all applicable |postprocessors|_
settings get combined into a single list.
For example
* an ``mtime`` post processor at ``extractor.postprocessors``,
* a ``zip`` post processor at ``extractor.pixiv.postprocessors``,
* and using ``--exec``
will run all three post processors - ``mtime``, ``zip``, ``exec`` -
for each downloaded ``pixiv`` file.
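
The combination described above could correspond to a configuration like the
following sketch, with ``--exec`` supplied on the command line. The post
processor objects are reduced to their ``name`` here and would normally carry
further options:

.. code:: json

    {
        "extractor": {
            "postprocessors": [
                {"name": "mtime"}
            ],
            "pixiv": {
                "postprocessors": [
                    {"name": "zip"}
                ]
            }
        }
    }
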
extractor.*.postprocessor-options
---------------------------------
Type
``object`` (`name` -> `value`)
Example
.. code:: json
{
"archive": null,
"keep-files": true
}
Description
Additional `Postprocessor Options`_ that get added to each individual
`post processor object <Postprocessor Configuration_>`__
before initializing it and evaluating filters.
extractor.*.retries
-------------------
Type
``integer``
Default
``4``
Description
Maximum number of times a failed HTTP request is retried before
giving up, or ``-1`` for infinite retries.
extractor.*.retry-codes
-----------------------
Type
``list`` of ``integers``
Example
``[404, 429, 430]``
Description
Additional `HTTP response status codes <https://developer.mozilla.org/en-US/docs/Web/HTTP/Status>`__
to retry an HTTP request on.
``2xx`` codes (success responses) and
``3xx`` codes (redirection messages)
will never be retried and always count as success,
regardless of this option.
``5xx`` codes (server error responses) will always be retried,
regardless of this option.
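
A sketch combining this option with `retries <extractor.*.retries_>`__.
The numbers are arbitrary example values:

.. code:: json

    {
        "extractor": {
            "retries": 10,
            "retry-codes": [404, 429]
        }
    }

With these values, a request answered with ``404`` or ``429`` would be retried
up to 10 times instead of failing immediately.
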
extractor.*.timeout
-------------------
Type
``float``
Default
``30.0``
Description
Amount of time (in seconds) to wait for a successful connection
and response from a remote server.
This value gets internally used as the |timeout|_ parameter for the
|requests.request()|_ method.
extractor.*.verify
------------------
Type
* ``bool``
* ``string``
Default
``true``
Description
Controls whether to verify SSL/TLS certificates for HTTPS requests.
If this is a ``string``, it must be the path to a CA bundle to use
instead of the default certificates.
This value gets internally used as the |verify|_ parameter for the
|requests.request()|_ method.
extractor.*.download
--------------------
Type
``bool``
Default
``true``
Description
Controls whether to download media files.
Setting this to ``false`` won't download any files, but all other
functions (`postprocessors`_, `download archive`_, etc.)
will be executed as normal.
extractor.*.fallback
--------------------
Type
``bool``
Default
``true``
Description
Use fallback download URLs when a download fails.
extractor.*.image-range
-----------------------
Type
* ``string``
* ``list`` of ``strings``
Examples
* ``"10-20"``
* ``"-5, 10, 30-50, 100-"``
* ``"10:21, 30:51:2, :5, 100:"``
* ``["-5", "10", "30-50", "100-"]``
Description
Index range(s) selecting which files to download.
These can be specified as
* index: ``3`` (file number 3)
* range: ``2-4`` (files 2, 3, and 4)
* `slice <https://docs.python.org/3/library/functions.html#slice>`__: ``3:8:2`` (files 3, 5, and 7)
| Arguments for range and slice notation are optional
and will default to begin (``1``) or end (``sys.maxsize``) if omitted.
| For example ``5-``, ``5:``, and ``5::`` all mean "Start at file number 5".
Note: The index of the first file is ``1``.
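
As a worked example, assuming the range and slice semantics described above
(ranges inclusive, slice end exclusive as in ``3:8:2``), the following value
would select files 1 through 5, followed by files 20, 25, and 30:

.. code:: json

    {
        "extractor": {
            "image-range": "-5, 20:31:5"
        }
    }
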
extractor.*.chapter-range
-------------------------
Type
``string``
Description
Like `image-range <extractor.*.image-range_>`__,
but applies to delegated URLs like manga chapters, etc.
extractor.*.image-filter
------------------------
Type
* ``string``
* ``list`` of ``strings``
Examples
* ``"re.search(r'foo(bar)+', description)"``
* ``["width >= 1200", "width/height > 1.2"]``
Description
Python expression controlling which files to download.
A file only gets downloaded when *all* of the given expressions evaluate to ``True``.
Available values are the filename-specific ones listed by ``-K`` or ``-j``.
extractor.*.chapter-filter
--------------------------
Type
* ``string``
* ``list`` of ``strings``
Examples
* ``"lang == 'en'"``
* ``["language == 'French'", "10 <= chapter < 20"]``
Description
Like `image-filter <extractor.*.image-filter_>`__,
but applies to delegated URLs like manga chapters, etc.
extractor.*.image-unique
------------------------
Type
``bool``
Default
``false``
Description
Ignore image URLs that have been encountered before during the
current extractor run.
extractor.*.chapter-unique
--------------------------
Type
``bool``
Default
``false``
Description
Like `image-unique <extractor.*.image-unique_>`__,
but applies to delegated URLs like manga chapters, etc.
extractor.*.date-format
-----------------------
Type
``string``
Default
``"%Y-%m-%dT%H:%M:%S"``
Description
Format string used to parse ``string`` values of
`date-min` and `date-max`.
See |strptime|_ for a list of formatting directives.
Note: Despite its name, this option does **not** control how
``{date}`` metadata fields are formatted.
To use a different formatting for those values other than the default
``%Y-%m-%d %H:%M:%S``, put |strptime|_ formatting directives
after a colon ``:``, for example ``{date:%Y%m%d}``.
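
A minimal sketch of how the two mechanisms differ, assuming that
``date-min`` and ``date-max`` are given as strings: ``date-format`` only
describes how these two strings are parsed.
The dates below are arbitrary illustration values.

.. code:: json

    {
        "extractor": {
            "date-format": "%Y-%m-%d",
            "date-min": "2023-01-01",
            "date-max": "2023-12-31"
        }
    }

The formatting of a ``{date}`` field in a filename or directory pattern is
unaffected by this setting and would still be written as, for instance,
``{date:%Y%m%d}``.
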
extractor.*.write-pages
-----------------------
Type
* ``bool``
* ``string``
Default
``false``
Description
During data extraction,
write received HTTP request data
to enumerated files in the current working directory.
Special values:
* ``"all"``: Include HTTP request and response headers. Hide ``Authorization``, ``Cookie``, and ``Set-Cookie`` values.
* ``"ALL"``: Include all HTTP request and response headers.
Extractor-specific Options
==========================
extractor.artstation.external
-----------------------------
Type
``bool``
Default
``false``
Description
Try to follow external URLs of embedded players.
extractor.artstation.max-posts
------------------------------
Type
``integer``
Default
``null``
Description
Limit the number of posts/projects to download.
extractor.artstation.previews
-----------------------------
Type
``bool``
Default
``false``
Description
Download video previews.
extractor.artstation.videos
---------------------------
Type
``bool``
Default
``true``
Description
Download video clips.
extractor.artstation.search.pro-first
-------------------------------------
Type
``bool``
Default
``true``
Description
Enable the "Show Studio and Pro member artwork first" checkbox
when retrieving search results.
extractor.aryion.recursive
--------------------------
Type
``bool``
Default
``true``
Description
Controls the post extraction strategy.
* ``true``: Start on users' main gallery pages and recursively
descend into subfolders
* ``false``: Get posts from "Latest Updates" pages
extractor.bbc.width
-------------------
Type
``integer``
Default
``1920``
Description
Specifies the requested image width.
This value must be divisible by 16 and gets rounded down otherwise.
The maximum possible value appears to be ``1920``.
extractor.behance.modules
-------------------------
Type
``list`` of ``strings``
Default
``["image", "video", "mediacollection", "embed"]``
Description
Selects which gallery modules to download from.
Supported module types are
``image``, ``video``, ``mediacollection``, ``embed``, ``text``.
extractor.[blogger].api-key
---------------------------
Type
``string``
Description
Custom Blogger API key.
https://developers.google.com/blogger/docs/3.0/using#APIKey
extractor.[blogger].videos
--------------------------
Type
``bool``
Default
``true``
Description
Download embedded videos hosted on https://www.blogger.com/
extractor.bluesky.include
-------------------------
Type
* ``string``
* ``list`` of ``strings``
Default
``"media"``
Example
* ``"avatar,background,posts"``
* ``["avatar", "background", "posts"]``
Description
A (comma-separated) list of subcategories to include
when processing a user profile.
Possible values are
``"avatar"``,
``"background"``,
``"posts"``,
``"replies"``,
``"media"``,
``"likes"``.
It is possible to use ``"all"`` instead of listing all values separately.
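
As a minimal sketch, the list form of the example above sits on the category
level like this. The chosen subcategories are only an illustration.

.. code:: json

    {
        "extractor": {
            "bluesky": {
                "include": ["avatar", "background", "posts"]
            }
        }
    }

The comma-separated string ``"avatar,background,posts"`` is an equivalent way
to write the same value.
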
extractor.bluesky.metadata
--------------------------
Type
* ``bool``
* ``string``
* ``list`` of ``strings``
Default
``false``
Example
* ``"facets,user"``
* ``["facets", "user"]``
Description
Extract additional metadata.
* ``facets``: ``hashtags``, ``mentions``, and ``uris``
* ``user``: detailed ``user`` metadata for the user referenced in the input URL
(See `app.bsky.actor.getProfile <https://www.docs.bsky.app/docs/api/app-bsky-actor-get-profile>`__).
extractor.bluesky.post.depth
----------------------------
Type
``integer``
Default
``0``
Description
Sets the maximum depth of returned reply posts.
(See the ``depth`` parameter of `app.bsky.feed.getPostThread <https://www.docs.bsky.app/docs/api/app-bsky-feed-get-post-thread>`__)
extractor.bluesky.reposts
-------------------------
Type
``bool``
Default
``false``
Description
Process reposts.
extractor.bunkr.tlds
--------------------
Type
``bool``
Default
``false``
Description
Controls which ``bunkr`` TLDs to accept.
* ``true``: Match URLs with *all* possible TLDs (e.g. ``bunkr.xyz`` or ``bunkrrr.duck``)
* ``false``: Match only URLs with known TLDs
extractor.cien.files
--------------------
Type
``list`` of ``strings``
Default
``["image", "video", "download", "gallery"]``
Description
Determines the type and order of files to be downloaded.
Available types are
``image``,
``video``,
``download``,
``gallery``.
extractor.cyberdrop.domain
--------------------------
Type
``string``
Default
``null``
Example
``"cyberdrop.to"``
Description
Specifies the domain used by ``cyberdrop`` regardless of input URL.
Setting this option to ``"auto"``
uses the same domain as a given input URL.
extractor.danbooru.external
---------------------------
Type
``bool``
Default
``false``
Description
For unavailable or restricted posts,
follow the ``source`` and download from there if possible.
extractor.danbooru.ugoira
-------------------------
Type
``bool``
Default
``false``
Description
Controls the download target for Ugoira posts.
* ``true``: Original ZIP archives
* ``false``: Converted video files
extractor.[Danbooru].metadata
-----------------------------
Type
* ``bool``
* ``string``
* ``list`` of ``strings``
Default
``false``
Example
* ``"replacements,comments,ai_tags"``
* ``["replacements", "comments", "ai_tags"]``
Description
Extract additional metadata
(notes, artist commentary, parent, children, uploader).
It is possible to specify a custom list of metadata includes.
See `available_includes <https://github.com/danbooru/danbooru/blob/2cf7baaf6c5003c1a174a8f2d53db010cf05dca7/app/models/post.rb#L1842-L1849>`__
for possible field names. ``aibooru`` also supports ``ai_metadata``.
Note: This requires 1 additional HTTP request per 200-post batch.
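
A minimal sketch of a custom include list, assuming the option is set under the
``danbooru`` category. Other sites in this group would use their own category
name instead.

.. code:: json

    {
        "extractor": {
            "danbooru": {
                "metadata": ["replacements", "comments", "ai_tags"]
            }
        }
    }
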
extractor.[Danbooru].threshold
------------------------------
Type
* ``string``
* ``integer``
Default
``"auto"``
Description
Stop paginating over API results if the length of a batch of returned
posts is less than the specified number. Defaults to the per-page limit
of the current instance, which is 200.
Note: Changing this setting is normally not necessary. When the value is
greater than the per-page limit, gallery-dl will stop after the first
batch. The value cannot be less than 1.
extractor.derpibooru.api-key
----------------------------
Type
``string``
Default
``null``
Description
Your `Derpibooru API Key <https://derpibooru.org/registrations/edit>`__,
to use your account's browsing settings and filters.
extractor.derpibooru.filter
---------------------------
Type
``integer``
Default
``56027`` (`Everything <https://derpibooru.org/filters/56027>`_ filter)
Description
The content filter ID to use.
Setting an explicit filter ID overrides any default filters and can be used
to access 18+ content without `API Key <extractor.derpibooru.api-key_>`_.
See `Filters <https://derpibooru.org/filters>`_ for details.
extractor.derpibooru.svg
------------------------
Type
``bool``
Default
``true``
Description
Download SVG versions of images when available.
Try to download the ``view_url`` version of these posts
when this option is disabled.
extractor.deviantart.auto-watch
-------------------------------
Type
``bool``
Default
``false``
Description
Automatically watch users when encountering "Watchers-Only Deviations"
(requires a `refresh-token <extractor.deviantart.refresh-token_>`_).
extractor.deviantart.auto-unwatch
---------------------------------
Type
``bool``
Default
``false``
Description
After watching a user through `auto-watch <extractor.deviantart.auto-watch_>`_,
unwatch that user at the end of the current extractor run.
extractor.deviantart.comments
-----------------------------
Type
``bool``
Default
``false``
Description
Extract ``comments`` metadata.
extractor.deviantart.comments-avatars
-------------------------------------
Type
``bool``
Default
``false``
Description
Download the avatar of each commenting user.
Note: Enabling this option also enables deviantart.comments_.
extractor.deviantart.extra
--------------------------
Type
``bool``
Default
``false``
Description
Download extra Sta.sh resources from
description texts and journals.
Note: Enabling this option also enables deviantart.metadata_.
extractor.deviantart.flat
-------------------------
Type
``bool``
Default
``true``
Description
Select the directory structure created by the Gallery- and
Favorite-Extractors.
* ``true``: Use a flat directory structure.
* ``false``: Collect a list of all gallery-folders or
favorites-collections and transfer any further work to other
extractors (``folder`` or ``collection``), which will then
create individual subdirectories for each of them.
Note: Going through gallery folders one by one cannot
fetch deviations that are not in any folder.
extractor.deviantart.folders
----------------------------
Type
``bool``
Default
``false``
Description
Provide a ``folders`` metadata field that contains the names of all
folders a deviation is present in.
Note: Gathering this information requires a lot of API calls.
Use with caution.
extractor.deviantart.group
--------------------------
Type
* ``bool``
* ``string``
Default
``true``
Description
Check whether the profile name in a given URL
belongs to a group or a regular user.
When disabled, assume every given profile name
belongs to a regular user.
Special values:
* ``"skip"``: Skip groups
extractor.deviantart.include
----------------------------
Type
* ``string``
* ``list`` of ``strings``
Default
``"gallery"``
Example
* ``"favorite,journal,scraps"``
* ``["favorite", "journal", "scraps"]``
Description
A (comma-separated) list of subcategories to include
when processing a user profile.
Possible values are
``"avatar"``,
``"background"``,
``"gallery"``,
``"scraps"``,
``"journal"``,
``"favorite"``,
``"status"``.
It is possible to use ``"all"`` instead of listing all values separately.
extractor.deviantart.intermediary
---------------------------------
Type
``bool``
Default
``true``
Description
For older non-downloadable images,
download a higher-quality ``/intermediary/`` version.
extractor.deviantart.journals
-----------------------------
Type
``string``
Default
``"html"``
Description
Selects the output format for textual content. This includes journals,
literature and status updates.
* ``"html"``: HTML with (roughly) the same layout as on DeviantArt.
* ``"text"``: Plain text with image references and HTML tags removed.
* ``"none"``: Don't download textual content.
extractor.deviantart.jwt
------------------------
Type
``bool``
Default
``false``
Description
Update `JSON Web Tokens <https://jwt.io/>`__ (the ``token`` URL parameter)
of otherwise non-downloadable, low-resolution images
to be able to download them in full resolution.
Note: No longer functional as of 2023-10-11.
extractor.deviantart.mature
---------------------------
Type
``bool``
Default
``true``
Description
Enable mature content.
This option simply sets the |mature_content|_ parameter for API
calls to either ``"true"`` or ``"false"`` and does not do any other
form of content filtering.
extractor.deviantart.metadata
-----------------------------
Type
* ``bool``
* ``string``
* ``list`` of ``strings``
Default
``false``
Example
* ``"stats,submission"``
* ``["camera", "stats", "submission"]``
Description
Extract additional metadata for deviation objects.
Provides
``description``, ``tags``, ``license``, and ``is_watching``
fields when enabled.
It is possible to request extended metadata by specifying a list of the following categories:
* ``camera`` : EXIF information (if available)
* ``stats`` : deviation statistics
* ``submission`` : submission information
* ``collection`` : favourited folder information (requires a `refresh token <extractor.deviantart.refresh-token_>`__)
* ``gallery`` : gallery folder information (requires a `refresh token <extractor.deviantart.refresh-token_>`__)
Set this option to ``"all"`` to request all extended metadata categories.
See `/deviation/metadata <https://www.deviantart.com/developers/http/v1/20210526/deviation_metadata/7824fc14d6fba6acbacca1cf38c24158>`__
for official documentation.
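
As a minimal sketch, the following requests two of the extended categories
listed above. ``camera`` and ``stats`` are picked here because they are not
marked as requiring a refresh token.

.. code:: json

    {
        "extractor": {
            "deviantart": {
                "metadata": ["camera", "stats"]
            }
        }
    }
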
extractor.deviantart.original
-----------------------------
Type
* ``bool``
* ``string``
Default
``true``
Description
Download original files if available.
Setting this option to ``"images"`` only downloads original
files if they are images and falls back to preview versions for
everything else (archives, etc.).
extractor.deviantart.pagination
-------------------------------
Type
``string``
Default
``"api"``
Description
Controls when to stop paginating over API results.
* ``"api"``: Trust the API and stop when ``has_more`` is ``false``.
* ``"manual"``: Disregard ``has_more`` and only stop when a batch of results is empty.
extractor.deviantart.public
---------------------------
Type
``bool``
Default
``true``
Description
Use a public access token for API requests.
Disable this option to *force* using a private token for all requests
when a `refresh token <extractor.deviantart.refresh-token_>`__ is provided.
extractor.deviantart.quality
----------------------------
Type
* ``integer``
* ``string``
Default
``100``
Description
JPEG quality level of images for which
an original file download is not available.
Set this to ``"png"`` to download a PNG version of these images instead.
extractor.deviantart.refresh-token
----------------------------------
Type
``string``
Default
``null``
Description
The ``refresh-token`` value you get from
`linking your DeviantArt account to gallery-dl <OAuth_>`__.
Using a ``refresh-token`` allows you to access private or otherwise
not publicly available deviations.
Note: The ``refresh-token`` becomes invalid
`after 3 months <https://www.deviantart.com/developers/authentication#refresh>`__
or whenever your `cache file <cache.file_>`__ is deleted or cleared.
extractor.deviantart.wait-min
-----------------------------
Type
``integer``
Default
``0``
Description
Minimum wait time in seconds before API requests.
extractor.deviantart.avatar.formats
-----------------------------------
Type
``list`` of ``strings``
Example
``["original.jpg", "big.jpg", "big.gif", ".png"]``
Description
Avatar URL formats to return.
| Each format is parsed as ``SIZE.EXT``.
| Leave ``SIZE`` empty to download the regular, small avatar format.
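
Because this option lives on the subcategory level, it is nested one level
deeper than the other ``deviantart`` options. A minimal sketch, with format
values taken from the example above:

.. code:: json

    {
        "extractor": {
            "deviantart": {
                "include": "avatar,gallery",
                "avatar": {
                    "formats": ["original.jpg", "big.jpg", ".png"]
                }
            }
        }
    }

The last entry, ``".png"``, has an empty ``SIZE`` part and therefore refers to
the regular, small avatar.
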
extractor.[E621].metadata
-------------------------
Type
* ``bool``
* ``string``
* ``list`` of ``strings``
Default
``false``
Example
* ``"notes,pools"``
* ``["notes", "pools"]``
Description
Extract additional metadata (notes, pool metadata) if available.
Note: This requires 0-2 additional HTTP requests per post.
extractor.[E621].threshold
--------------------------
Type
* ``string``
* ``integer``
Default
``"auto"``
Description
Stop paginating over API results if the length of a batch of returned
posts is less than the specified number. Defaults to the per-page limit
of the current instance, which is 320.
Note: Changing this setting is normally not necessary. When the value is
greater than the per-page limit, gallery-dl will stop after the first
batch. The value cannot be less than 1.
extractor.exhentai.domain
-------------------------
Type
``string``
Default
``"auto"``
Description
* ``"auto"``: Use ``e-hentai.org`` or ``exhentai.org``
depending on the input URL
* ``"e-hentai.org"``: Use ``e-hentai.org`` for all URLs
* ``"exhentai.org"``: Use ``exhentai.org`` for all URLs
extractor.exhentai.fallback-retries
-----------------------------------
Type
``integer``
Default
``2``
Description
Number of times a failed image gets retried
or ``-1`` for infinite retries.
extractor.exhentai.fav
----------------------
Type
``string``
Example
``"4"``
Description
After downloading a gallery,
add it to your account's favorites as the given category number.
Note: Set this to ``"favdel"`` to remove galleries from your favorites.
Note: This will remove any Favorite Notes when applied
to already favorited galleries.
extractor.exhentai.gp
---------------------
Type
``string``
Default
``"resized"``
Description
Selects how to handle "you do not have enough GP" errors.
* ``"resized"``: Continue downloading `non-original <extractor.exhentai.original_>`__ images.
* ``"stop"``: Stop the current extractor run.
* ``"wait"``: Wait for user input before retrying the current image.
extractor.exhentai.limits
-------------------------
Type
``integer``
Default
``null``
Description
Sets a custom image download limit and
stops extraction when it gets exceeded.
extractor.exhentai.metadata
---------------------------
Type
``bool``
Default
``false``
Description
Load extended gallery metadata from the
`API <https://ehwiki.org/wiki/API#Gallery_Metadata>`_.
Adds ``archiver_key``, ``posted``, and ``torrents``.
Makes ``date`` and ``filesize`` more precise.
extractor.exhentai.original
---------------------------
Type
``bool``
Default
``true``
Description
Download full-sized original images if available.
extractor.exhentai.source
-------------------------
Type
``string``
Default
``"gallery"``
Description
Selects an alternative source to download files from.
* ``"hitomi"``: Download the corresponding gallery from ``hitomi.la``
extractor.fanbox.embeds
-----------------------
Type
* ``bool``
* ``string``
Default
``true``
Description
Control behavior on embedded content from external sites.
* ``true``: Extract embed URLs and download them if supported
(videos are not downloaded).
* ``"ytdl"``: Like ``true``, but let |ytdl| handle video
extraction and download for YouTube, Vimeo, and SoundCloud embeds.
* ``false``: Ignore embeds.
extractor.fanbox.metadata
-------------------------
Type
* ``bool``
* ``string``
* ``list`` of ``strings``
Default
``false``
Example
* ``"user,plan"``
* ``["user", "plan"]``
Description
Extract ``plan`` and extended ``user`` metadata.
extractor.flickr.access-token & .access-token-secret
----------------------------------------------------
Type
``string``
Default
``null``
Description
The ``access_token`` and ``access_token_secret`` values you get
from `linking your Flickr account to gallery-dl <OAuth_>`__.
extractor.flickr.contexts
-------------------------
Type
``bool``
Default
``false``
Description
For each photo, return the albums and pools it belongs to
as ``set`` and ``pool`` metadata.
Note: This requires 1 additional API call per photo.
See `flickr.photos.getAllContexts <https://www.flickr.com/services/api/flickr.photos.getAllContexts.html>`__ for details.
extractor.flickr.exif
---------------------
Type
``bool``
Default
``false``
Description
For each photo, return its EXIF/TIFF/GPS tags
as ``exif`` and ``camera`` metadata.
Note: This requires 1 additional API call per photo.
See `flickr.photos.getExif <https://www.flickr.com/services/api/flickr.photos.getExif.html>`__ for details.
extractor.flickr.metadata
-------------------------
Type
* ``bool``
* ``string``
* ``list`` of ``strings``
Default
``false``
Example
* ``"license,last_update,machine_tags"``
* ``["license", "last_update", "machine_tags"]``
Description
Extract additional metadata
(license, date_taken, original_format, last_update, geo, machine_tags, o_dims).
It is possible to specify a custom list of metadata includes.
See `the extras parameter <https://www.flickr.com/services/api/flickr.people.getPhotos.html>`__
in `Flickr's API docs <https://www.flickr.com/services/api/>`__
for possible field names.
extractor.flickr.videos
-----------------------
Type
``bool``
Default
``true``
Description
Extract and download videos.
extractor.flickr.size-max
--------------------------
Type
* ``integer``
* ``string``
Default
``null``
Description
Sets the maximum allowed size for downloaded images.
* If this is an ``integer``, it specifies the maximum image dimension
(width and height) in pixels.
* If this is a ``string``, it should be one of Flickr's format specifiers
(``"Original"``, ``"Large"``, ... or ``"o"``, ``"k"``, ``"h"``,
``"l"``, ...) to use as an upper limit.
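
A minimal sketch of the ``integer`` form. The value ``2048`` is an arbitrary
illustration and not a recommended setting.

.. code:: json

    {
        "extractor": {
            "flickr": {
                "size-max": 2048
            }
        }
    }

Replacing the number with a format specifier string such as ``"Large"`` would
select the ``string`` form described above instead.
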
extractor.furaffinity.descriptions
----------------------------------
Type
``string``
Default
``"text"``
Description
Controls the format of ``description`` metadata fields.
* ``"text"``: Plain text with HTML tags removed
* ``"html"``: Raw HTML content
extractor.furaffinity.external
------------------------------
Type
``bool``
Default
``false``
Description
Follow external URLs linked in descriptions.
extractor.furaffinity.include
-----------------------------
Type
* ``string``
* ``list`` of ``strings``
Default
``"gallery"``
Example
* ``"scraps,favorite"``
* ``["scraps", "favorite"]``
Description
A (comma-separated) list of subcategories to include
when processing a user profile.
Possible values are
``"gallery"``, ``"scraps"``, ``"favorite"``.
It is possible to use ``"all"`` instead of listing all values separately.
extractor.furaffinity.layout
----------------------------
Type
``string``
Default
``"auto"``
Description
Selects which site layout to expect when parsing posts.
* ``"auto"``: Automatically differentiate between ``"old"`` and ``"new"``
* ``"old"``: Expect the *old* site layout
* ``"new"``: Expect the *new* site layout
extractor.gelbooru.api-key & .user-id
-------------------------------------
Type
``string``
Default
``null``
Description
Values from the API Access Credentials section found at the bottom of your
`Account Options <https://gelbooru.com/index.php?page=account&s=options>`__
page.
extractor.gelbooru.favorite.order-posts
---------------------------------------
Type
``string``
Default
``"desc"``
Description
Controls the order in which favorited posts are returned.
* ``"asc"``: Ascending favorite date order (oldest first)
* ``"desc"``: Descending favorite date order (newest first)
* ``"reverse"``: Same as ``"asc"``
extractor.generic.enabled
-------------------------
Type
``bool``
Default
``false``
Description
Match **all** URLs not otherwise supported by gallery-dl,
even ones without a ``generic:`` prefix.
extractor.gofile.api-token
--------------------------
Type
``string``
Default
``null``
Description
API token value found at the bottom of your `profile page <https://gofile.io/myProfile>`__.
If not set, a temporary guest token will be used.
extractor.gofile.website-token
------------------------------
Type
``string``
Description
API token value used during API requests.
An invalid or not up-to-date value
will result in ``401 Unauthorized`` errors.
Keeping this option unset will use an extra HTTP request
to attempt to fetch the current value used by gofile.
extractor.gofile.recursive
--------------------------
Type
``bool``
Default
``false``
Description
Recursively download files from subfolders.
extractor.hentaifoundry.include
-------------------------------
Type
* ``string``
* ``list`` of ``strings``
Default
``"pictures"``
Example
* ``"scraps,stories"``
* ``["scraps", "stories"]``
Description
A (comma-separated) list of subcategories to include
when processing a user profile.
Possible values are
``"pictures"``, ``"scraps"``, ``"stories"``, ``"favorite"``.
It is possible to use ``"all"`` instead of listing all values separately.
extractor.hitomi.format
-----------------------
Type
``string``
Default
``"webp"``
Description
Selects which image format to download.
Available formats are ``"webp"`` and ``"avif"``.
``"original"`` will try to download the original ``jpg`` or ``png`` versions,
but is most likely going to fail with ``403 Forbidden`` errors.
extractor.imagechest.access-token
---------------------------------
Type
``string``
Description
Your personal Image Chest access token.
These tokens allow using the API instead of having to scrape HTML pages,
providing more detailed metadata
(``date``, ``description``, etc.).
See https://imgchest.com/docs/api/1.0/general/authorization
for instructions on how to generate such a token.
extractor.imgur.client-id
-------------------------
Type
``string``
Description
Custom Client ID value for API requests.
extractor.imgur.mp4
-------------------
Type
* ``bool``
* ``string``
Default
``true``
Description
Controls whether to choose the GIF or MP4 version of an animation.
* ``true``: Follow Imgur's advice and choose MP4 if the
``prefer_video`` flag in an image's metadata is set.
* ``false``: Always choose GIF.
* ``"always"``: Always choose MP4.
extractor.inkbunny.orderby
--------------------------
Type
``string``
Default
``"create_datetime"``
Description
Value of the ``orderby`` parameter for submission searches.
(See `API#Search <https://wiki.inkbunny.net/wiki/API#Search>`__
for details)
extractor.instagram.api
-----------------------
Type
``string``
Default
``"rest"``
Description
Selects which API endpoints to use.
* ``"rest"``: REST API - higher-resolution media
* ``"graphql"``: GraphQL API - lower-resolution media
extractor.instagram.cursor
--------------------------
Type
* ``bool``
* ``string``
Default
``true``
Example
``"3414259811154179155_25025320"``
Description
Controls the position from which to start the extraction process.
* ``true``: Start from the beginning.
Log the most recent ``cursor`` value when interrupted before reaching the end.
* ``false``: Start from the beginning.
* any ``string``: Start from the position defined by this value.
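
To continue an interrupted run, a previously logged cursor value can be put
back into the configuration. A minimal sketch, where the cursor string is just
the placeholder from the example above and not a real position:

.. code:: json

    {
        "extractor": {
            "instagram": {
                "cursor": "3414259811154179155_25025320"
            }
        }
    }
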
extractor.instagram.include
---------------------------
Type
* ``string``
* ``list`` of ``strings``
Default
``"posts"``
Example
* ``"stories,highlights,posts"``
* ``["stories", "highlights", "posts"]``
Description
A (comma-separated) list of subcategories to include
when processing a user profile.
Possible values are
``"posts"``,
``"reels"``,
``"tagged"``,
``"stories"``,
``"highlights"``,
``"avatar"``.
It is possible to use ``"all"`` instead of listing all values separately.
extractor.instagram.max-posts
-----------------------------
Type
``integer``
Default
``null``
Description
Limit the number of posts to download.
extractor.instagram.metadata
----------------------------
Type
``bool``
Default
``false``
Description
Provide extended ``user`` metadata even when referring to a user by ID,
e.g. ``instagram.com/id:12345678``.
Note: This metadata is always available when referring to a user by name,
e.g. ``instagram.com/USERNAME``.
extractor.instagram.order-files
-------------------------------
Type
``string``
Default
``"asc"``
Description
Controls the order in which files of each post are returned.
* ``"asc"``: Same order as displayed in a post
* ``"desc"``: Reverse order as displayed in a post
* ``"reverse"``: Same as ``"desc"``
Note: This option does *not* affect ``{num}``.
To enumerate files in reverse order, use ``count - num + 1``.
extractor.instagram.order-posts
-------------------------------
Type
``string``
Default
``"asc"``
Description
Controls the order in which posts are returned.
* ``"asc"``: Same order as displayed
* ``"desc"``: Reverse order as displayed
* ``"id"`` or ``"id_asc"``: Ascending order by ID
* ``"id_desc"``: Descending order by ID
* ``"reverse"``: Same as ``"desc"``
Note: This option only affects ``highlights``.
extractor.instagram.previews
----------------------------
Type
``bool``
Default
``false``
Description
Download video previews.
extractor.instagram.videos
--------------------------
Type
``bool``
Default
``true``
Description
Download video files.
extractor.itaku.videos
----------------------
Type
``bool``
Default
``true``
Description
Download video files.
extractor.kemonoparty.comments
------------------------------
Type
``bool``
Default
``false``
Description
Extract ``comments`` metadata.
Note: This requires 1 additional HTTP request per post.
extractor.kemonoparty.duplicates
--------------------------------
Type
``bool``
Default
``false``
Description
Controls how to handle duplicate files in a post.
* ``true``: Download duplicates
* ``false``: Ignore duplicates
extractor.kemonoparty.dms
-------------------------
Type
``bool``
Default
``false``
Description
Extract a user's direct messages as ``dms`` metadata.
extractor.kemonoparty.announcements
-----------------------------------
Type
``bool``
Default
``false``
Description
Extract a user's announcements as ``announcements`` metadata.
extractor.kemonoparty.favorites
-------------------------------
Type
``string``
Default
``"artist"``
Description
Determines the type of favorites to be downloaded.
Available types are ``artist`` and ``post``.
extractor.kemonoparty.files
---------------------------
Type
``list`` of ``strings``
Default
``["attachments", "file", "inline"]``
Description
Determines the type and order of files to be downloaded.
Available types are ``file``, ``attachments``, and ``inline``.
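
Since the list determines both type and order, a shorter or reordered list
changes what is downloaded. A minimal sketch, assuming that a type left out of
the list is not downloaded:

.. code:: json

    {
        "extractor": {
            "kemonoparty": {
                "files": ["file", "attachments"]
            }
        }
    }
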
extractor.kemonoparty.max-posts
-------------------------------
Type
``integer``
Default
``null``
Description
Limit the number of posts to download.
extractor.kemonoparty.metadata
------------------------------
Type
``bool``
Default
``false``
Description
Extract ``username`` metadata.
extractor.kemonoparty.revisions
-------------------------------
Type
* ``bool``
* ``string``
Default
``false``
Description
Extract post revisions.
Set this to ``"unique"`` to filter out duplicate revisions.
Note: This requires 1 additional HTTP request per post.
extractor.kemonoparty.order-revisions
-------------------------------------
Type
``string``
Default
``"desc"``
Description
Controls the order in which
`revisions <extractor.kemonoparty.revisions_>`__
are returned.
* ``"asc"``: Ascending order (oldest first)
* ``"desc"``: Descending order (newest first)
* ``"reverse"``: Same as ``"asc"``
extractor.khinsider.format
--------------------------
Type
``string``
Default
``"mp3"``
Description
The name of the preferred file format to download.
Use ``"all"`` to download all available formats,
or a (comma-separated) list to select multiple formats.
If the selected format is not available,
the first in the list gets chosen (usually ``mp3``).
extractor.koharu.cbz
--------------------
Type
``bool``
Default
``true``
Description
Download each gallery as a single ``.cbz`` file.
Disabling this option causes a gallery
to be downloaded as individual image files.
extractor.koharu.format
-----------------------
Type
* ``string``
* ``list`` of ``strings``
Default
``["0", "1600", "1280", "980", "780"]``
Description
Name(s) of the image format to download.
When more than one format is given, the first available one is selected.
| Possible formats are
| ``"780"``, ``"980"``, ``"1280"``, ``"1600"``, ``"0"`` (original)
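
A minimal sketch combining both ``koharu`` options: individual image files
instead of a ``.cbz`` archive, preferring the ``"1280"`` format and falling
back to ``"980"``. The chosen formats are only an illustration.

.. code:: json

    {
        "extractor": {
            "koharu": {
                "cbz": false,
                "format": ["1280", "980"]
            }
        }
    }
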
extractor.lolisafe.domain
-------------------------
Type
``string``
Default
``null``
Description
Specifies the domain used by a ``lolisafe`` extractor
regardless of input URL.
Setting this option to ``"auto"``
uses the same domain as a given input URL.
extractor.luscious.gif
----------------------
Type
``bool``
Default
``false``
Description
Format in which to download animated images.
Use ``true`` to download animated images as gifs and ``false``
to download as mp4 videos.
extractor.mangadex.api-server
-----------------------------
Type
``string``
Default
``"https://api.mangadex.org"``
Description
The server to use for API requests.
extractor.mangadex.api-parameters
---------------------------------
Type
``object`` (`name` -> `value`)
Example
``{"order[updatedAt]": "desc"}``
Description
Additional query parameters to send when fetching manga chapters.
(See `/manga/{id}/feed <https://api.mangadex.org/docs/swagger.html#/Manga/get-manga-id-feed>`__
and `/user/follows/manga/feed <https://api.mangadex.org/docs/swagger.html#/Feed/get-user-follows-manga-feed>`__)
extractor.mangadex.lang
-----------------------
Type
* ``string``
* ``list`` of ``strings``
Example
* ``"en"``
* ``"fr,it"``
* ``["fr", "it"]``
Description
`ISO 639-1 <https://en.wikipedia.org/wiki/ISO_639-1>`__ language codes
to filter chapters by.
extractor.mangadex.ratings
--------------------------
Type
``list`` of ``strings``
Default
``["safe", "suggestive", "erotica", "pornographic"]``
Description
List of acceptable content ratings for returned chapters.
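
The ``mangadex`` options above can be combined in one block. The following is a sketch with illustrative values: it restricts chapters to English and French, excludes the two most explicit ratings, and passes the sample ``api-parameters`` value shown earlier.

.. code:: json

    {
        "extractor": {
            "mangadex": {
                "lang": ["en", "fr"],
                "ratings": ["safe", "suggestive"],
                "api-parameters": {"order[updatedAt]": "desc"}
            }
        }
    }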
extractor.mangapark.source
--------------------------
Type
* ``string``
* ``integer``
Example
* ``"koala:en"``
* ``15150116``
Description
Select chapter source and language for a manga.
| The general syntax is ``"<source name>:<ISO 639-1 language code>"``.
| Both are optional, meaning ``"koala"``, ``"koala:"``, ``":en"``,
or even just ``":"`` are possible as well.
Specifying the numeric ``ID`` of a source is also supported.
extractor.[mastodon].access-token
---------------------------------
Type
``string``
Default
``null``
Description
The ``access-token`` value you get from `linking your account to
gallery-dl <OAuth_>`__.
Note: gallery-dl comes with built-in tokens for ``mastodon.social``,
``pawoo`` and ``baraag``. For other instances, you need to obtain an
``access-token`` in order to use usernames in place of numerical
user IDs.
extractor.[mastodon].cards
--------------------------
Type
``bool``
Default
``false``
Description
Fetch media from cards.
extractor.[mastodon].reblogs
----------------------------
Type
``bool``
Default
``false``
Description
Fetch media from reblogged posts.
extractor.[mastodon].replies
----------------------------
Type
``bool``
Default
``true``
Description
Fetch media from replies to other posts.
extractor.[mastodon].text-posts
-------------------------------
Type
``bool``
Default
``false``
Description
Also emit metadata for text-only posts without media content.
extractor.[misskey].access-token
--------------------------------
Type
``string``
Description
Your access token, necessary to fetch favorited notes.
extractor.[misskey].renotes
---------------------------
Type
``bool``
Default
``false``
Description
Fetch media from renoted notes.
extractor.[misskey].replies
---------------------------
Type
``bool``
Default
``true``
Description
Fetch media from replies to other notes.
extractor.[moebooru].pool.metadata
----------------------------------
Type
``bool``
Default
``false``
Description
Extract extended ``pool`` metadata.
Note: Not supported by all ``moebooru`` instances.
extractor.newgrounds.flash
--------------------------
Type
``bool``
Default
``true``
Description
Download original Adobe Flash animations instead of pre-rendered videos.
extractor.newgrounds.format
---------------------------
Type
* ``string``
* ``list`` of ``strings``
Default
``"original"``
Example
* ``"720p"``
* ``["mp4", "mov", "1080p", "720p"]``
Description
Selects the preferred format for video downloads.
If the selected format is not available,
the next smaller one gets chosen.
If this is a ``list``, try each given
filename extension in original resolution or recoded format
until an available format is found.
extractor.newgrounds.include
----------------------------
Type
* ``string``
* ``list`` of ``strings``
Default
``"art"``
Example
* ``"movies,audio"``
* ``["movies", "audio"]``
Description
A (comma-separated) list of subcategories to include
when processing a user profile.
Possible values are
``"art"``, ``"audio"``, ``"games"``, ``"movies"``.
It is possible to use ``"all"`` instead of listing all values separately.
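
For example, a profile download that includes art and movies, prefers pre-rendered videos over Flash files, and asks for 720p video could be configured roughly as follows. This is a sketch and the values are illustrative.

.. code:: json

    {
        "extractor": {
            "newgrounds": {
                "flash": false,
                "format": "720p",
                "include": ["art", "movies"]
            }
        }
    }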
extractor.nijie.include
-----------------------
Type
* ``string``
* ``list`` of ``strings``
Default
``"illustration,doujin"``
Description
A (comma-separated) list of subcategories to include
when processing a user profile.
Possible values are
``"illustration"``, ``"doujin"``, ``"favorite"``, ``"nuita"``.
It is possible to use ``"all"`` instead of listing all values separately.
extractor.nitter.quoted
-----------------------
Type
``bool``
Default
``false``
Description
Fetch media from quoted Tweets.
extractor.nitter.retweets
-------------------------
Type
``bool``
Default
``false``
Description
Fetch media from Retweets.
extractor.nitter.videos
-----------------------
Type
* ``bool``
* ``string``
Default
``true``
Description
Control video download behavior.
* ``true``: Download videos
* ``"ytdl"``: Download videos using |ytdl|
* ``false``: Skip video Tweets
extractor.oauth.browser
-----------------------
Type
``bool``
Default
``true``
Description
Controls how a user is directed to an OAuth authorization page.
* ``true``: Use Python's |webbrowser.open()|_ method to automatically
open the URL in the user's default browser.
* ``false``: Ask the user to copy & paste a URL from the terminal.
extractor.oauth.cache
---------------------
Type
``bool``
Default
``true``
Description
Store tokens received during OAuth authorizations
in `cache <cache.file_>`__.
extractor.oauth.host
--------------------
Type
``string``
Default
``"localhost"``
Description
Host name / IP address to bind to during OAuth authorization.
extractor.oauth.port
--------------------
Type
``integer``
Default
``6414``
Description
Port number to listen on during OAuth authorization.
Note: All redirects will go to port ``6414``, regardless
of the port specified here. You'll have to manually adjust the
port number in your browser's address bar when using a different
port than the default.
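
A sketch of the ``oauth`` options for a machine without a graphical browser: the authorization URL is printed to the terminal instead of being opened automatically, while host and port keep their documented defaults. The values are illustrative.

.. code:: json

    {
        "extractor": {
            "oauth": {
                "browser": false,
                "cache": true,
                "host": "localhost",
                "port": 6414
            }
        }
    }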
extractor.paheal.metadata
-------------------------
Type
``bool``
Default
``false``
Description
Extract additional metadata (``source``, ``uploader``).
Note: This requires 1 additional HTTP request per post.
extractor.patreon.files
-----------------------
Type
``list`` of ``strings``
Default
``["images", "image_large", "attachments", "postfile", "content"]``
Description
Determines the type and order of files to be downloaded.
Available types are
``postfile``, ``images``, ``image_large``, ``attachments``, and ``content``.
extractor.pillowfort.external
-----------------------------
Type
``bool``
Default
``false``
Description
Follow links to external sites, e.g. Twitter.
extractor.pillowfort.inline
---------------------------
Type
``bool``
Default
``true``
Description
Extract inline images.
extractor.pillowfort.reblogs
----------------------------
Type
``bool``
Default
``false``
Description
Extract media from reblogged posts.
extractor.pinterest.domain
--------------------------
Type
``string``
Default
``"auto"``
Description
Specifies the domain used by ``pinterest`` extractors.
Setting this option to ``"auto"``
uses the same domain as a given input URL.
extractor.pinterest.sections
----------------------------
Type
``bool``
Default
``true``
Description
Include pins from board sections.
extractor.pinterest.videos
--------------------------
Type
``bool``
Default
``true``
Description
Download from video pins.
extractor.pixeldrain.api-key
----------------------------
Type
``string``
Description
Your account's `API key <https://pixeldrain.com/user/api_keys>`__
extractor.pixiv.include
-----------------------
Type
* ``string``
* ``list`` of ``strings``
Default
``"artworks"``
Example
* ``"avatar,background,artworks"``
* ``["avatar", "background", "artworks"]``
Description
A (comma-separated) list of subcategories to include
when processing a user profile.
Possible values are
``"artworks"``,
``"avatar"``,
``"background"``,
``"favorite"``,
``"novel-user"``,
``"novel-bookmark"``.
It is possible to use ``"all"`` instead of listing all values separately.
extractor.pixiv.refresh-token
-----------------------------
Type
``string``
Description
The ``refresh-token`` value you get
from running ``gallery-dl oauth:pixiv`` (see OAuth_) or
by using a third-party tool like
`gppt <https://github.com/eggplants/get-pixivpy-token>`__.
extractor.pixiv.novel.covers
----------------------------
Type
``bool``
Default
``false``
Description
Download cover images.
extractor.pixiv.novel.embeds
----------------------------
Type
``bool``
Default
``false``
Description
Download embedded images.
extractor.pixiv.novel.full-series
---------------------------------
Type
``bool``
Default
``false``
Description
When downloading a novel that is part of a series,
download all novels of that series.
extractor.pixiv.metadata
------------------------
Type
``bool``
Default
``false``
Description
Fetch extended ``user`` metadata.
extractor.pixiv.metadata-bookmark
---------------------------------
Type
``bool``
Default
``false``
Description
For works bookmarked by
`your own account <extractor.pixiv.refresh-token_>`__,
fetch bookmark tags as ``tags_bookmark`` metadata.
Note: This requires 1 additional API call per bookmarked post.
extractor.pixiv.work.related
----------------------------
Type
``bool``
Default
``false``
Description
Also download related artworks.
extractor.pixiv.tags
--------------------
Type
``string``
Default
``"japanese"``
Description
Controls the ``tags`` metadata field.
* ``"japanese"``: List of Japanese tags
* ``"translated"``: List of translated tags
* ``"original"``: Unmodified list with both Japanese and translated tags
extractor.pixiv.ugoira
----------------------
Type
``bool``
Default
``true``
Description
Download Pixiv's Ugoira animations or ignore them.
These animations come as a ``.zip`` file containing all
animation frames in JPEG format.
Use an ``ugoira`` post processor to convert them
to watchable videos. (Example__)
.. __: https://github.com/mikf/gallery-dl/blob/v1.12.3/docs/gallery-dl-example.conf#L9-L14
extractor.pixiv.max-posts
-------------------------
Type
``integer``
Default
``0``
Description
When downloading galleries, this sets the maximum number of posts to get.
A value of ``0`` means no limit.
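
The following sketch shows how category-level and subcategory-level ``pixiv`` options might sit side by side: ``include``, ``tags`` and ``max-posts`` apply to all pixiv extractors, while ``full-series`` is nested under the ``novel`` subcategory. The values are illustrative.

.. code:: json

    {
        "extractor": {
            "pixiv": {
                "include": ["artworks", "avatar"],
                "tags": "translated",
                "max-posts": 100,
                "novel": {
                    "full-series": true
                }
            }
        }
    }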
extractor.plurk.comments
------------------------
Type
``bool``
Default
``false``
Description
Also search Plurk comments for URLs.
extractor.[postmill].save-link-post-body
----------------------------------------
Type
``bool``
Default
``false``
Description
Whether or not to save the body for link/image posts.
extractor.reactor.gif
---------------------
Type
``bool``
Default
``false``
Description
Format in which to download animated images.
Use ``true`` to download animated images as gifs and ``false``
to download as mp4 videos.
extractor.readcomiconline.captcha
---------------------------------
Type
``string``
Default
``"stop"``
Description
Controls how to handle redirects to CAPTCHA pages.
* ``"stop"``: Stop the current extractor run.
* ``"wait"``: Ask the user to solve the CAPTCHA and wait.
extractor.readcomiconline.quality
---------------------------------
Type
``string``
Default
``"auto"``
Description
Sets the ``quality`` query parameter of issue pages. (``"lq"`` or ``"hq"``)
``"auto"`` uses the quality parameter of the input URL
or ``"hq"`` if not present.
extractor.reddit.comments
-------------------------
Type
``integer``
Default
``0``
Description
The value of the ``limit`` parameter when loading
a submission and its comments.
This number (roughly) specifies the total amount of comments
being retrieved with the first API call.
Reddit's internal default and maximum values for this parameter
appear to be 200 and 500 respectively.
The value ``0`` ignores all comments and significantly reduces the
time required when scanning a subreddit.
extractor.reddit.morecomments
-----------------------------
Type
``bool``
Default
``false``
Description
Retrieve additional comments by resolving the ``more`` comment
stubs in the base comment tree.
Note: This requires 1 additional API call for every 100 extra comments.
extractor.reddit.date-min & .date-max
-------------------------------------
Type
|Date|_
Default
``0`` and ``253402210800`` (timestamp of |datetime.max|_)
Description
Ignore all submissions posted before/after this date.
extractor.reddit.id-min & .id-max
---------------------------------
Type
``string``
Example
``"6kmzv2"``
Description
Ignore all submissions posted before/after the submission with this ID.
extractor.reddit.previews
-------------------------
Type
``bool``
Default
``true``
Description
For failed downloads from external URLs / child extractors,
download Reddit's preview image/video if available.
extractor.reddit.recursion
--------------------------
Type
``integer``
Default
``0``
Description
Reddit extractors can recursively visit other submissions
linked to in the initial set of submissions.
This value sets the maximum recursion depth.
Special values:
* ``0``: Recursion is disabled
* ``-1``: Infinite recursion (don't do this)
extractor.reddit.refresh-token
------------------------------
Type
``string``
Default
``null``
Description
The ``refresh-token`` value you get from
`linking your Reddit account to gallery-dl <OAuth_>`__.
Using a ``refresh-token`` allows you to access private or otherwise
not publicly available subreddits, given that your account is
authorized to do so,
but requests to the reddit API are going to be rate limited
at 600 requests every 10 minutes/600 seconds.
extractor.reddit.videos
-----------------------
Type
* ``bool``
* ``string``
Default
``true``
Description
Control video download behavior.
* ``true``: Download videos and use |ytdl| to handle
HLS and DASH manifests
* ``"ytdl"``: Download videos and let |ytdl| handle all of
video extraction and download
* ``"dash"``: Extract DASH manifest URLs and use |ytdl|
to download and merge them. (*)
* ``false``: Ignore videos
(*)
This saves 1 HTTP request per video
and might potentially be able to download otherwise deleted videos,
but it will not always get the best video quality available.
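
As a sketch, the ``reddit`` options described above could be combined like this: load up to 200 comments per submission without resolving ``more`` stubs, keep recursion disabled, and fetch videos via DASH manifests. The values are illustrative.

.. code:: json

    {
        "extractor": {
            "reddit": {
                "comments": 200,
                "morecomments": false,
                "recursion": 0,
                "videos": "dash"
            }
        }
    }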
extractor.redgifs.format
------------------------
Type
* ``string``
* ``list`` of ``strings``
Default
``["hd", "sd", "gif"]``
Description
List of names of the preferred animation format, which can be
``"hd"``,
``"sd"``,
``"gif"``,
``"thumbnail"``,
``"vthumbnail"``, or
``"poster"``.
If a selected format is not available, the next one in the list will be
tried until an available format is found.
If the format is given as ``string``, it will be extended with
``["hd", "sd", "gif"]``. Use a list with one element to
restrict it to only one possible format.
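
To illustrate the difference between the two forms: the single-element list below restricts downloads to the ``sd`` format only, whereas the plain string ``"sd"`` would still fall back to the default formats. This is a sketch with an arbitrarily chosen format.

.. code:: json

    {
        "extractor": {
            "redgifs": {
                "format": ["sd"]
            }
        }
    }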
extractor.sankaku.id-format
---------------------------
Type
``string``
Default
``"numeric"``
Description
Format of ``id`` metadata fields.
* ``"alphanumeric"`` or ``"alnum"``: 11-character alphanumeric IDs (``y0abGlDOr2o``)
* ``"numeric"`` or ``"legacy"``: numeric IDs (``360451``)
extractor.sankaku.refresh
-------------------------
Type
``bool``
Default
``false``
Description
Refresh download URLs before they expire.
extractor.sankakucomplex.embeds
-------------------------------
Type
``bool``
Default
``false``
Description
Download video embeds from external sites.
extractor.sankakucomplex.videos
-------------------------------
Type
``bool``
Default
``true``
Description
Download videos.
extractor.skeb.article
----------------------
Type
``bool``
Default
``false``
Description
Download article images.
extractor.skeb.sent-requests
----------------------------
Type
``bool``
Default
``false``
Description
Download sent requests.
extractor.skeb.thumbnails
-------------------------
Type
``bool``
Default
``false``
Description
Download thumbnails.
extractor.skeb.search.filters
-----------------------------
Type
* ``string``
* ``list`` of ``strings``
Default
``["genre:art", "genre:voice", "genre:novel", "genre:video", "genre:music", "genre:correction"]``
Example
``"genre:music OR genre:voice"``
Description
Filters used during searches.
extractor.smugmug.videos
------------------------
Type
``bool``
Default
``true``
Description
Download video files.
extractor.steamgriddb.animated
------------------------------
Type
``bool``
Default
``true``
Description
Include animated assets when downloading from a list of assets.
extractor.steamgriddb.epilepsy
------------------------------
Type
``bool``
Default
``true``
Description
Include assets tagged with epilepsy when downloading from a list of assets.
extractor.steamgriddb.dimensions
--------------------------------
Type
* ``string``
* ``list`` of ``strings``
Default
``"all"``
Examples
* ``"1024x512,512x512"``
* ``["460x215", "920x430"]``
Description
Only include assets that are in the specified dimensions. ``all`` can be
used to specify all dimensions. Valid values are:
* Grids: ``460x215``, ``920x430``, ``600x900``, ``342x482``, ``660x930``,
``512x512``, ``1024x1024``
* Heroes: ``1920x620``, ``3840x1240``, ``1600x650``
* Logos: N/A (will be ignored)
* Icons: ``8x8``, ``10x10``, ``14x14``, ``16x16``, ``20x20``, ``24x24``,
``28x28``, ``32x32``, ``35x35``, ``40x40``, ``48x48``, ``54x54``,
``56x56``, ``57x57``, ``60x60``, ``64x64``, ``72x72``, ``76x76``,
``80x80``, ``90x90``, ``96x96``, ``100x100``, ``114x114``, ``120x120``,
``128x128``, ``144x144``, ``150x150``, ``152x152``, ``160x160``,
``180x180``, ``192x192``, ``194x194``, ``256x256``, ``310x310``,
``512x512``, ``768x768``, ``1024x1024``
extractor.steamgriddb.file-types
--------------------------------
Type
* ``string``
* ``list`` of ``strings``
Default
``"all"``
Examples
* ``"png,jpeg"``
* ``["jpeg", "webp"]``
Description
Only include assets that are in the specified file types. ``all`` can be
used to specify all file types. Valid values are:
* Grids: ``png``, ``jpeg``, ``jpg``, ``webp``
* Heroes: ``png``, ``jpeg``, ``jpg``, ``webp``
* Logos: ``png``, ``webp``
* Icons: ``png``, ``ico``
extractor.steamgriddb.download-fake-png
---------------------------------------
Type
``bool``
Default
``true``
Description
Download fake PNGs alongside the real file.
extractor.steamgriddb.humor
---------------------------
Type
``bool``
Default
``true``
Description
Include assets tagged with humor when downloading from a list of assets.
extractor.steamgriddb.languages
-------------------------------
Type
* ``string``
* ``list`` of ``strings``
Default
``"all"``
Examples
* ``"en,km"``
* ``["fr", "it"]``
Description
Only include assets that are in the specified languages. ``all`` can be
used to specify all languages. Valid values are `ISO 639-1 <https://en.wikipedia.org/wiki/ISO_639-1>`__
language codes.
extractor.steamgriddb.nsfw
--------------------------
Type
``bool``
Default
``true``
Description
Include assets tagged with adult content when downloading from a list of assets.
extractor.steamgriddb.sort
--------------------------
Type
``string``
Default
``"score_desc"``
Description
Set the chosen sorting method when downloading from a list of assets. Can be one of:
* ``score_desc`` (Highest Score (Beta))
* ``score_asc`` (Lowest Score (Beta))
* ``score_old_desc`` (Highest Score (Old))
* ``score_old_asc`` (Lowest Score (Old))
* ``age_desc`` (Newest First)
* ``age_asc`` (Oldest First)
extractor.steamgriddb.static
----------------------------
Type
``bool``
Default
``true``
Description
Include static assets when downloading from a list of assets.
extractor.steamgriddb.styles
----------------------------
Type
* ``string``
* ``list`` of ``strings``
Default
``"all"``
Examples
* ``"white,black"``
* ``["no_logo", "white_logo"]``
Description
Only include assets that are in the specified styles. ``all`` can be used
to specify all styles. Valid values are:
* Grids: ``alternate``, ``blurred``, ``no_logo``, ``material``, ``white_logo``
* Heroes: ``alternate``, ``blurred``, ``material``
* Logos: ``official``, ``white``, ``black``, ``custom``
* Icons: ``official``, ``custom``
extractor.steamgriddb.untagged
------------------------------
Type
``bool``
Default
``true``
Description
Include untagged assets when downloading from a list of assets.
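
The ``steamgriddb`` filters can be combined. The sketch below, with illustrative values, keeps only two grid dimensions, PNG and WebP files, and two grid styles, sorts newest first, and skips assets tagged as adult content. It uses both the comma-separated string form and the list form.

.. code:: json

    {
        "extractor": {
            "steamgriddb": {
                "dimensions": ["460x215", "920x430"],
                "file-types": "png,webp",
                "styles": ["alternate", "no_logo"],
                "sort": "age_desc",
                "nsfw": false
            }
        }
    }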
extractor.[szurubooru].username & .token
----------------------------------------
Type
``string``
Description
Username and login token of your account to access private resources.
To generate a token, visit ``/user/USERNAME/list-tokens``
and click ``Create Token``.
extractor.tumblr.avatar
-----------------------
Type
``bool``
Default
``false``
Description
Download blog avatars.
extractor.tumblr.date-min & .date-max
-------------------------------------
Type
|Date|_
Default
``0`` and ``null``
Description
Ignore all posts published before/after this date.
extractor.tumblr.external
-------------------------
Type
``bool``
Default
``false``
Description
Follow external URLs (e.g. from "Link" posts) and try to extract
images from them.
extractor.tumblr.inline
-----------------------
Type
``bool``
Default
``true``
Description
Search posts for inline images and videos.
extractor.tumblr.offset
-----------------------
Type
``integer``
Default
``0``
Description
Custom ``offset`` starting value when paginating over blog posts.
Allows skipping over posts without having to waste API calls.
extractor.tumblr.original
-------------------------
Type
``bool``
Default
``true``
Description
Download full-resolution ``photo`` and ``inline`` images.
For each photo with "maximum" resolution
(width equal to 2048 or height equal to 3072)
or each inline image,
use an extra HTTP request to find the URL to its full-resolution version.
extractor.tumblr.pagination
---------------------------
Type
``string``
Default
``"offset"``
Description
Controls how to paginate over blog posts.
* ``"api"``: ``next`` parameter provided by the API
(potentially misses posts due to a
`bug <https://github.com/tumblr/docs/issues/76>`__
in Tumblr's API)
* ``"before"``: timestamp of last post
* ``"offset"``: post offset number
extractor.tumblr.ratelimit
--------------------------
Type
``string``
Default
``"abort"``
Description
Selects how to handle exceeding the daily API rate limit.
* ``"abort"``: Raise an error and stop extraction
* ``"wait"``: Wait until rate limit reset
extractor.tumblr.reblogs
------------------------
Type
* ``bool``
* ``string``
Default
``true``
Description
* ``true``: Extract media from reblogged posts
* ``false``: Skip reblogged posts
* ``"same-blog"``: Skip reblogged posts unless the original post
is from the same blog
extractor.tumblr.posts
----------------------
Type
* ``string``
* ``list`` of ``strings``
Default
``"all"``
Example
* ``"video,audio,link"``
* ``["video", "audio", "link"]``
Description
A (comma-separated) list of post types to extract images, etc. from.
Possible types are ``text``, ``quote``, ``link``, ``answer``,
``video``, ``audio``, ``photo``, ``chat``.
It is possible to use ``"all"`` instead of listing all types separately.
extractor.tumblr.fallback-delay
-------------------------------
Type
``float``
Default
``120.0``
Description
Number of seconds to wait between retries
for fetching full-resolution images.
extractor.tumblr.fallback-retries
---------------------------------
Type
``integer``
Default
``2``
Description
Number of retries for fetching full-resolution images
or ``-1`` for infinite retries.
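
A sketch combining several ``tumblr`` options: only photo and video posts are processed, reblogs are skipped unless they originate from the same blog, the extractor waits when the daily rate limit is hit, and full-resolution lookups are retried more often with a shorter delay. All values are illustrative.

.. code:: json

    {
        "extractor": {
            "tumblr": {
                "posts": ["photo", "video"],
                "reblogs": "same-blog",
                "ratelimit": "wait",
                "fallback-retries": 5,
                "fallback-delay": 60.0
            }
        }
    }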
extractor.twibooru.api-key
--------------------------
Type
``string``
Default
``null``
Description
Your `Twibooru API Key <https://twibooru.org/users/edit>`__,
to use your account's browsing settings and filters.
extractor.twibooru.filter
-------------------------
Type
``integer``
Default
``2`` (`Everything <https://twibooru.org/filters/2>`__ filter)
Description
The content filter ID to use.
Setting an explicit filter ID overrides any default filters and can be used
to access 18+ content without `API Key <extractor.twibooru.api-key_>`__.
See `Filters <https://twibooru.org/filters>`__ for details.
extractor.twibooru.svg
----------------------
Type
``bool``
Default
``true``
Description
Download SVG versions of images when available.
Try to download the ``view_url`` version of these posts
when this option is disabled.
extractor.twitter.ads
---------------------
Type
``bool``
Default
``false``
Description
Fetch media from promoted Tweets.
extractor.twitter.cards
-----------------------
Type
* ``bool``
* ``string``
Default
``false``
Description
Controls how to handle `Twitter Cards <https://developer.twitter.com/en/docs/twitter-for-websites/cards/overview/abouts-cards>`__.
* ``false``: Ignore cards
* ``true``: Download image content from supported cards
* ``"ytdl"``: Additionally download video content from unsupported cards
using |ytdl|
extractor.twitter.cards-blacklist
---------------------------------
Type
``list`` of ``strings``
Example
``["summary", "youtube.com", "player:twitch.tv"]``
Description
List of card types to ignore.
Possible values are
* card names
* card domains
* ``<card name>:<card domain>``
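
For instance, the two card options might be used together as sketched below: video content from unsupported cards is handed to |ytdl|, while ``summary`` cards and cards pointing to ``youtube.com`` are ignored. The blacklist entries are taken from the example above.

.. code:: json

    {
        "extractor": {
            "twitter": {
                "cards": "ytdl",
                "cards-blacklist": ["summary", "youtube.com"]
            }
        }
    }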
extractor.twitter.conversations
-------------------------------
Type
* ``bool``
* ``string``
Default
``false``
Description
For input URLs pointing to a single Tweet,
e.g. ``https://twitter.com/i/web/status/<TweetID>``,
fetch media from all Tweets and replies in this `conversation
<https://help.twitter.com/en/using-twitter/twitter-conversations>`__.
If this option is equal to ``"accessible"``,
only download from conversation Tweets
if the given initial Tweet is accessible.
extractor.twitter.csrf
----------------------
Type
``string``
Default
``"cookies"``
Description
Controls how to handle Cross Site Request Forgery (CSRF) tokens.
* ``"auto"``: Always auto-generate a token.
* ``"cookies"``: Use token given by the ``ct0`` cookie if present.
extractor.twitter.cursor
------------------------
Type
* ``bool``
* ``string``
Default
``true``
Example
``"1/DAABCgABGVKi5lE___oKAAIYbfYNcxrQLggAAwAAAAIAAA"``
Description
Controls the position from which to start the extraction process.
* ``true``: Start from the beginning.
Log the most recent ``cursor`` value when interrupted before reaching the end.
* ``false``: Start from the beginning.
* any ``string``: Start from the position defined by this value.
Note: A ``cursor`` value from one timeline cannot be used with another.
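
Resuming an interrupted run could then look roughly like this. The cursor string is the sample value from the example above and would be replaced by the value logged for your own timeline.

.. code:: json

    {
        "extractor": {
            "twitter": {
                "cursor": "1/DAABCgABGVKi5lE___oKAAIYbfYNcxrQLggAAwAAAAIAAA"
            }
        }
    }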
extractor.twitter.expand
------------------------
Type
``bool``
Default
``false``
Description
For each Tweet, return *all* Tweets from that initial Tweet's
conversation or thread, i.e. *expand* all Twitter threads.
Going through a timeline with this option enabled is essentially the same
as running ``gallery-dl https://twitter.com/i/web/status/<TweetID>``
with enabled `conversations <extractor.twitter.conversations_>`__ option
for each Tweet in said timeline.
Note: This requires at least 1 additional API call per initial Tweet.
extractor.twitter.unavailable
-----------------------------
Type
``bool``
Default
``false``
Description
Try to download media marked as ``Unavailable``,
e.g. ``Geoblocked`` videos.
extractor.twitter.include
-------------------------
Type
* ``string``
* ``list`` of ``strings``
Default
``"timeline"``
Example
* ``"avatar,background,media"``
* ``["avatar", "background", "media"]``
Description
A (comma-separated) list of subcategories to include
when processing a user profile.
Possible values are
``"info"``,
``"avatar"``,
``"background"``,
``"timeline"``,
``"tweets"``,
``"media"``,
``"replies"``,
``"likes"``.
It is possible to use ``"all"`` instead of listing all values separately.
extractor.twitter.transform
---------------------------
Type
``bool``
Default
``true``
Description
Transform Tweet and User metadata into a simpler, uniform format.
extractor.twitter.tweet-endpoint
--------------------------------
Type
``string``
Default
``"auto"``
Description
Selects the API endpoint used to retrieve single Tweets.
* ``"restid"``: ``/TweetResultByRestId`` - accessible to guest users
* ``"detail"``: ``/TweetDetail`` - more stable
* ``"auto"``: ``"detail"`` when logged in, ``"restid"`` otherwise
extractor.twitter.size
----------------------
Type
``list`` of ``strings``
Default
``["orig", "4096x4096", "large", "medium", "small"]``
Description
The image version to download.
Any entries after the first one will be used for potential
`fallback <extractor.*.fallback_>`_ URLs.
Known available sizes are
``4096x4096``, ``orig``, ``large``, ``medium``, and ``small``.
extractor.twitter.logout
------------------------
Type
``bool``
Default
``false``
Description
Logout and retry as guest when access to another user's Tweets is blocked.
extractor.twitter.pinned
------------------------
Type
``bool``
Default
``false``
Description
Fetch media from pinned Tweets.
extractor.twitter.quoted
------------------------
Type
``bool``
Default
``false``
Description
Fetch media from quoted Tweets.
If this option is enabled, gallery-dl will try to fetch
a quoted (original) Tweet when it sees the Tweet which quotes it.
extractor.twitter.ratelimit
---------------------------
Type
``string``
Default
``"wait"``
Description
Selects how to handle exceeding the API rate limit.
* ``"abort"``: Raise an error and stop extraction
* ``"wait"``: Wait until rate limit reset
* ``"wait:N"``: Wait for ``N`` seconds
extractor.twitter.relogin
-------------------------
Type
``bool``
Default
``true``
Description
When receiving a "Could not authenticate you" error while logged in with
`username & password <extractor.*.username & .password_>`__,
refresh the current login session and
try to continue from where it left off.
extractor.twitter.locked
------------------------
Type
``string``
Default
``"abort"``
Description
Selects how to handle "account is temporarily locked" errors.
* ``"abort"``: Raise an error and stop extraction
* ``"wait"``: Wait until the account is unlocked and retry
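
A sketch of the error-handling options for long unattended runs: wait a fixed 900 seconds whenever the API rate limit is exceeded, refresh the login session on authentication errors, and wait for a locked account to be unlocked. The interval is an arbitrary illustrative value.

.. code:: json

    {
        "extractor": {
            "twitter": {
                "ratelimit": "wait:900",
                "relogin": true,
                "locked": "wait"
            }
        }
    }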
extractor.twitter.replies
-------------------------
Type
``bool``
Default
``true``
Description
Fetch media from replies to other Tweets.
If this value is ``"self"``, only consider replies where
reply and original Tweet are from the same user.
Note: Twitter will automatically expand conversations if you
use the ``/with_replies`` timeline while logged in. For example,
media from Tweets which the user replied to will also be downloaded.
It is possible to exclude unwanted Tweets using `image-filter
<extractor.*.image-filter_>`__.
extractor.twitter.retweets
--------------------------
Type
``bool``
Default
``false``
Description
Fetch media from Retweets.
If this value is ``"original"``, metadata for these files
will be taken from the original Tweets, not the Retweets.
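A minimal sketch of how the Tweet selection options above might be set
together. The chosen values are only for illustration.

.. code:: json

    {
        "extractor": {
            "twitter": {
                "pinned": true,
                "quoted": true,
                "replies": "self",
                "retweets": "original"
            }
        }
    }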
extractor.twitter.timeline.strategy
-----------------------------------
Type
``string``
Default
``"auto"``
Description
Controls the strategy / tweet source used for timeline URLs
(``https://twitter.com/USER/timeline``).
* ``"tweets"``: `/tweets <https://twitter.com/USER/tweets>`__ timeline + search
* ``"media"``: `/media <https://twitter.com/USER/media>`__ timeline + search
* ``"with_replies"``: `/with_replies <https://twitter.com/USER/with_replies>`__ timeline + search
* ``"auto"``: ``"tweets"`` or ``"media"``, depending on `retweets <extractor.twitter.retweets_>`__ and `text-tweets <extractor.twitter.text-tweets_>`__ settings
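Since ``strategy`` is a subcategory-level option, it sits one level deeper
than the other ``twitter`` options. A sketch, assuming the usual nesting
of ``extractor.<category>.<subcategory>.<option-name>``:

.. code:: json

    {
        "extractor": {
            "twitter": {
                "timeline": {
                    "strategy": "with_replies"
                }
            }
        }
    }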
extractor.twitter.text-tweets
-----------------------------
Type
``bool``
Default
``false``
Description
Also emit metadata for text-only Tweets without media content.
This only has an effect with a ``metadata`` (or ``exec``) post processor
with `"event": "post" <metadata.event_>`_
and appropriate `filename <metadata.filename_>`_.
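The following sketch shows one way this could look. It assumes that
post processors are attached through a ``postprocessors`` list on the
extractor (an option not described in this section) and that
``{tweet_id}`` is an available metadata field. Treat both as assumptions.

.. code:: json

    {
        "extractor": {
            "twitter": {
                "text-tweets": true,
                "postprocessors": [
                    {
                        "name": "metadata",
                        "event": "post",
                        "filename": "{tweet_id}.json"
                    }
                ]
            }
        }
    }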
extractor.twitter.twitpic
-------------------------
Type
``bool``
Default
``false``
Description
Extract `TwitPic <https://twitpic.com/>`__ embeds.
extractor.twitter.unique
------------------------
Type
``bool``
Default
``true``
Description
Ignore previously seen Tweets.
extractor.twitter.username-alt
------------------------------
Type
``string``
Description
Alternate Identifier (username, email, phone number)
when `logging in <extractor.*.username & .password_>`__.
When not specified and asked for by Twitter,
this identifier will need to be entered in an interactive prompt.
extractor.twitter.users
-----------------------
Type
``string``
Default
``"user"``
Example
``"https://twitter.com/search?q=from:{legacy[screen_name]}"``
Description
| Format string for user URLs generated from
``following`` and ``list-members`` queries,
| whose replacement field values come from Twitter ``user`` objects
(`Example <https://gist.githubusercontent.com/mikf/99d2719b3845023326c7a4b6fb88dd04/raw/275b4f0541a2c7dc0a86d3998f7d253e8f10a588/github.json>`_)
Special values:
* ``"user"``: ``https://twitter.com/i/user/{rest_id}``
* ``"timeline"``: ``https://twitter.com/id:{rest_id}/timeline``
* ``"tweets"``: ``https://twitter.com/id:{rest_id}/tweets``
* ``"media"``: ``https://twitter.com/id:{rest_id}/media``
Note: To allow gallery-dl to follow custom URL formats, set the blacklist__
for ``twitter`` to a non-default value, e.g. an empty string ``""``.
.. __: `extractor.*.blacklist & .whitelist`_
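A sketch of a custom user URL format combined with the blacklist change
mentioned in the note. It reuses the format string from the example above
and assumes that ``blacklist`` is set on the ``twitter`` category level.

.. code:: json

    {
        "extractor": {
            "twitter": {
                "users": "https://twitter.com/search?q=from:{legacy[screen_name]}",
                "blacklist": ""
            }
        }
    }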
extractor.twitter.videos
------------------------
Type
* ``bool``
* ``string``
Default
``true``
Description
Control video download behavior.
* ``true``: Download videos
* ``"ytdl"``: Download videos using |ytdl|
* ``false``: Skip video Tweets
extractor.unsplash.format
-------------------------
Type
``string``
Default
``"raw"``
Description
Name of the image format to download.
Available formats are
``"raw"``, ``"full"``, ``"regular"``, ``"small"``, and ``"thumb"``.
extractor.vipergirls.domain
---------------------------
Type
``string``
Default
``"vipergirls.to"``
Description
Specifies the domain used by ``vipergirls`` extractors.
For example, use ``"viper.click"`` if the main domain is blocked or to bypass Cloudflare.
extractor.vipergirls.like
-------------------------
Type
``bool``
Default
``false``
Description
Automatically `like` posts after downloading their images.
Note: Requires `login <extractor.*.username & .password_>`__
or `cookies <extractor.*.cookies_>`__.
extractor.vsco.include
----------------------
Type
* ``string``
* ``list`` of ``strings``
Default
``"gallery"``
Example
* ``"avatar,collection"``
* ``["avatar", "collection"]``
Description
A (comma-separated) list of subcategories to include
when processing a user profile.
Possible values are
``"avatar"``,
``"gallery"``,
``"spaces"``,
``"collection"``.
It is possible to use ``"all"`` instead of listing all values separately.
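For instance, a configuration along these lines would process avatars and
collections in addition to the default gallery. This is a sketch using the
list form.

.. code:: json

    {
        "extractor": {
            "vsco": {
                "include": ["avatar", "gallery", "collection"]
            }
        }
    }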
extractor.vsco.videos
---------------------
Type
``bool``
Default
``true``
Description
Download video files.
extractor.wallhaven.api-key
---------------------------
Type
``string``
Default
``null``
Description
Your `Wallhaven API Key <https://wallhaven.cc/settings/account>`__,
to use your account's browsing settings and default filters when searching.
See https://wallhaven.cc/help/api for more information.
extractor.wallhaven.include
---------------------------
Type
* ``string``
* ``list`` of ``strings``
Default
``"uploads"``
Example
* ``"uploads,collections"``
* ``["uploads", "collections"]``
Description
A (comma-separated) list of subcategories to include
when processing a user profile.
Possible values are
``"uploads"``, ``"collections"``.
It is possible to use ``"all"`` instead of listing all values separately.
extractor.wallhaven.metadata
----------------------------
Type
``bool``
Default
``false``
Description
Extract additional metadata (tags, uploader).
Note: This requires 1 additional HTTP request per post.
extractor.weasyl.api-key
------------------------
Type
``string``
Default
``null``
Description
Your `Weasyl API Key <https://www.weasyl.com/control/apikeys>`__,
to use your account's browsing settings and filters.
extractor.weasyl.metadata
-------------------------
Type
``bool``
Default
``false``
Description
| Fetch extra submission metadata during gallery downloads.
| (``comments``, ``description``, ``favorites``, ``folder_name``,
``tags``, ``views``)
Note: This requires 1 additional HTTP request per submission.
extractor.weibo.gifs
--------------------
Type
* ``bool``
* ``string``
Default
``true``
Description
Download ``gif`` files.
Set this to ``"video"`` to download GIFs as video files.
extractor.weibo.include
-----------------------
Type
* ``string``
* ``list`` of ``strings``
Default
``"feed"``
Description
A (comma-separated) list of subcategories to include
when processing a user profile.
Possible values are
``"home"``,
``"feed"``,
``"videos"``,
``"newvideo"``,
``"article"``,
``"album"``.
It is possible to use ``"all"`` instead of listing all values separately.
extractor.weibo.livephoto
-------------------------
Type
``bool``
Default
``true``
Description
Download ``livephoto`` files.
extractor.weibo.retweets
------------------------
Type
``bool``
Default
``false``
Description
Fetch media from retweeted posts.
If this value is ``"original"``, metadata for these files
will be taken from the original posts, not the retweeted posts.
extractor.weibo.videos
----------------------
Type
``bool``
Default
``true``
Description
Download video files.
extractor.wikimedia.limit
-------------------------
Type
``integer``
Default
``50``
Description
Number of results to return in a single API query.
The value must be between 10 and 500.
extractor.ytdl.cmdline-args
---------------------------
Type
* ``string``
* ``list`` of ``strings``
Example
* ``"--quiet --write-sub --merge-output-format mkv"``
* ``["--quiet", "--write-sub", "--merge-output-format", "mkv"]``
Description
Additional ``ytdl`` options specified as command-line arguments.
See
`yt-dlp options <https://github.com/yt-dlp/yt-dlp#usage-and-options>`__
/
`youtube-dl options <https://github.com/ytdl-org/youtube-dl#options>`__
extractor.ytdl.config-file
--------------------------
Type
|Path|_
Example
``"~/.config/yt-dlp/config"``
Description
Location of a |ytdl| configuration file to load options from.
extractor.ytdl.enabled
----------------------
Type
``bool``
Default
``false``
Description
Process URLs otherwise unsupported by gallery-dl with |ytdl|.
extractor.ytdl.format
---------------------
Type
``string``
Default
| Default of the ``ytdl`` `module <extractor.ytdl.module_>`__ used.
| (``"bestvideo*+bestaudio/best"`` for ``yt_dlp``,
``"bestvideo+bestaudio/best"`` for ``youtube_dl``)
Description
``ytdl`` format selection string.
See
`yt-dlp format selection <https://github.com/yt-dlp/yt-dlp#format-selection>`__
/
`youtube-dl format selection <https://github.com/ytdl-org/youtube-dl#format-selection>`__
extractor.ytdl.generic
----------------------
Type
* ``bool``
* ``string``
Default
``true``
Description
Enables the use of |ytdl's| ``generic`` extractor.
Set this option to ``"force"`` for the same effect as
``--force-generic-extractor``.
extractor.ytdl.logging
----------------------
Type
``bool``
Default
``true``
Description
Route |ytdl's| output through gallery-dl's logging system.
Otherwise it will be written directly to stdout/stderr.
Note: Set ``quiet`` and ``no_warnings`` in
`extractor.ytdl.raw-options`_ to ``true`` to suppress all output.
extractor.ytdl.module
---------------------
Type
``string``
Default
``null``
Description
Name of the ``ytdl`` Python module to import.
Setting this to ``null`` will try to import ``"yt_dlp"``
followed by ``"youtube_dl"`` as fallback.
extractor.ytdl.raw-options
--------------------------
Type
``object`` (`name` -> `value`)
Example
.. code:: json
{
"quiet": true,
"writesubtitles": true,
"merge_output_format": "mkv"
}
Description
Additional options passed directly to the ``YoutubeDL`` constructor.
Available options can be found in
`yt-dlp's docstrings <https://github.com/yt-dlp/yt-dlp/blob/2024.05.27/yt_dlp/YoutubeDL.py#L200>`__
/
`youtube-dl's docstrings <https://github.com/ytdl-org/youtube-dl/blob/0153b387e57e0bb8e580f1869f85596d2767fb0d/youtube_dl/YoutubeDL.py#L157>`__
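Putting the ``extractor.ytdl`` options together, a configuration might look
roughly like the sketch below. It enables the ``ytdl`` extractor, pins the
module to ``yt_dlp``, and silences its output through ``raw-options`` as
suggested in the logging note. The values are illustrative.

.. code:: json

    {
        "extractor": {
            "ytdl": {
                "enabled": true,
                "module": "yt_dlp",
                "format": "bestvideo*+bestaudio/best",
                "raw-options": {
                    "quiet": true,
                    "no_warnings": true
                }
            }
        }
    }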
extractor.zerochan.metadata
---------------------------
Type
``bool``
Default
``false``
Description
Extract additional metadata (date, md5, tags, ...).
Note: This requires 1-2 additional HTTP requests per post.
extractor.zerochan.pagination
-----------------------------
Type
``string``
Default
``"api"``
Description
Controls how to paginate over tag search results.
* ``"api"``: Use the `JSON API <https://www.zerochan.net/api>`__
(no ``extension`` metadata)
* ``"html"``: Parse HTML pages
(limited to 100 pages * 24 posts)
extractor.zerochan.redirects
----------------------------
Type
``bool``
Default
``false``
Description
Automatically follow tag redirects.
extractor.[booru].tags
----------------------
Type
``bool``
Default
``false``
Description
Categorize tags by their respective types
and provide them as ``tags_<type>`` metadata fields.
Note: This requires 1 additional HTTP request per post.
extractor.[booru].notes
-----------------------
Type
``bool``
Default
``false``
Description
Extract overlay notes (position and text).
Note: This requires 1 additional HTTP request per post.
extractor.[booru].url
---------------------
Type
* ``string``
* ``list`` of ``strings``
Default
``"file_url"``
Example
* ``"preview_url"``
* ``["sample_url", "preview_url", "file_url"]``
Description
Alternate field name to retrieve download URLs from.
When multiple names are given, download the first available one.
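A sketch of the list form, which prefers sample images and falls back to
previews and then to the full file. The ``gelbooru`` category is used here
only as an assumed example of a ``[booru]`` site. Substitute the category
you actually use.

.. code:: json

    {
        "extractor": {
            "gelbooru": {
                "url": ["sample_url", "preview_url", "file_url"]
            }
        }
    }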
extractor.[manga-extractor].chapter-reverse
-------------------------------------------
Type
``bool``
Default
``false``
Description
Reverse the order of chapter URLs extracted from manga pages.
* ``true``: Start with the latest chapter
* ``false``: Start with the first chapter
extractor.[manga-extractor].page-reverse
----------------------------------------
Type
``bool``
Default
``false``
Description
Download manga chapter pages in reverse order.
Downloader Options
==================
downloader.*.enabled
--------------------
Type
``bool``
Default
``true``
Description
Enable/Disable this downloader module.
downloader.*.filesize-min & .filesize-max
-----------------------------------------
Type
``string``
Default
``null``
Example
``"32000"``, ``"500k"``, ``"2.5M"``
Description
Minimum/Maximum allowed file size in bytes.
Any file smaller/larger than this limit will not be downloaded.
Possible values are valid integer or floating-point numbers
optionally followed by one of ``k``, ``m``, ``g``, ``t``, or ``p``.
These suffixes are case-insensitive.
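For example, the following sketch would skip files smaller than 500
kilobytes or larger than 2.5 megabytes, using the sample values
shown above.

.. code:: json

    {
        "downloader": {
            "filesize-min": "500k",
            "filesize-max": "2.5M"
        }
    }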
downloader.*.mtime
------------------
Type
``bool``
Default
``true``
Description
Use |Last-Modified|_ HTTP response headers
to set file modification times.
downloader.*.part
-----------------
Type
``bool``
Default
``true``
Description
Controls the use of ``.part`` files during file downloads.
* ``true``: Write downloaded data into ``.part`` files and rename
them upon download completion. This mode additionally supports
resuming incomplete downloads.
* ``false``: Do not use ``.part`` files and write data directly
into the actual output files.
downloader.*.part-directory
---------------------------
Type
|Path|_
Default
``null``
Description
Alternate location for ``.part`` files.
Missing directories will be created as needed.
If this value is ``null``, ``.part`` files are going to be stored
alongside the actual output files.
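A sketch that keeps ``.part`` files enabled but collects them in a separate
directory. The path is a hypothetical placeholder.

.. code:: json

    {
        "downloader": {
            "part": true,
            "part-directory": "/tmp/.download/"
        }
    }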
downloader.*.progress
---------------------
Type
``float``
Default
``3.0``
Description
Number of seconds until a download progress indicator
for the current download is displayed.
Set this option to ``null`` to disable this indicator.
downloader.*.rate
-----------------
Type
``string``
Default
``null``
Example
``"32000"``, ``"500k"``, ``"2.5M"``
Description
Maximum download rate in bytes per second.
Possible values are valid integer or floating-point numbers
optionally followed by one of ``k``, ``m``, ``g``, ``t``, or ``p``.
These suffixes are case-insensitive.
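Because ``*`` stands for a downloader module name, a value can be set for
all downloaders and then overridden for a single one. The sketch below
assumes that a module-level value takes precedence over the general one,
in the same way deeper levels override shallower ones for extractors.

.. code:: json

    {
        "downloader": {
            "rate": "1M",
            "http": {
                "rate": "500k"
            }
        }
    }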
downloader.*.retries
--------------------
Type
``integer``
Default
`extractor.*.retries`_
Description
Maximum number of retries during file downloads,
or ``-1`` for infinite retries.
downloader.*.timeout
--------------------
Type
``float``
Default
`extractor.*.timeout`_
Description
Connection timeout during file downloads.
downloader.*.verify
-------------------
Type
* ``bool``
* ``string``
Default
`extractor.*.verify`_
Description
Certificate validation during file downloads.
downloader.*.proxy
------------------
Type
* ``string``
* ``object`` (`scheme` -> `proxy`)
Default
`extractor.*.proxy`_
Description
Proxy server used for file downloads.
Disable the use of a proxy for file downloads
by explicitly setting this option to ``null``.
downloader.http.adjust-extensions
---------------------------------
Type
``bool``
Default
``true``
Description
Check file headers of downloaded files
and adjust their filename extensions if they do not match.
For example, this will change the filename extension (``{extension}``)
of a file called ``example.png`` from ``png`` to ``jpg`` when said file
contains JPEG/JFIF data.
downloader.http.consume-content
-------------------------------
Type
``bool``
Default
``false``
Description
Controls the behavior when an HTTP response is considered
unsuccessful.
If the value is ``true``, consume the response body. This
avoids closing the connection and therefore improves connection
reuse.
If the value is ``false``, immediately close the connection
without reading the response. This can be useful if the server
is known to send large bodies for error responses.
downloader.http.chunk-size
--------------------------
Type
* ``integer``
* ``string``
Default
``32768``
Example
``"50k"``, ``"0.8M"``
Description
Number of bytes per downloaded chunk.
Possible values are integer numbers
optionally followed by one of ``k``, ``m``, ``g``, ``t``, or ``p``.
These suffixes are case-insensitive.
downloader.http.headers
-----------------------
Type
``object`` (`name` -> `value`)
Example
``{"Accept": "image/webp,*/*", "Referer": "https://example.org/"}``
Description
Additional HTTP headers to send when downloading files.
downloader.http.retry-codes
---------------------------
Type
``list`` of ``integers``
Default
`extractor.*.retry-codes`_
Description
Additional `HTTP response status codes <https://developer.mozilla.org/en-US/docs/Web/HTTP/Status>`__
to retry a download on.
Codes ``200``, ``206``, and ``416`` (when resuming a `partial <downloader.*.part_>`__
download) will never be retried and always count as success,
regardless of this option.
``5xx`` codes (server error responses) will always be retried,
regardless of this option.
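A sketch of several ``downloader.http`` options in one place. The status
codes ``404`` and ``429`` are arbitrary examples of codes one might want
to retry on, and the header values are taken from the example above.

.. code:: json

    {
        "downloader": {
            "http": {
                "chunk-size": "50k",
                "headers": {
                    "Accept": "image/webp,*/*",
                    "Referer": "https://example.org/"
                },
                "retry-codes": [404, 429]
            }
        }
    }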
downloader.http.validate
------------------------
Type
``bool``
Default
``true``
Description
Check for invalid responses.
Fail a download when a file does not pass this check
instead of downloading a potentially broken file.
downloader.ytdl.cmdline-args
----------------------------
Type
* ``string``
* ``list`` of ``strings``
Example
* ``"--quiet --write-sub --merge-output-format mkv"``
* ``["--quiet", "--write-sub", "--merge-output-format", "mkv"]``
Description
Additional ``ytdl`` options specified as command-line arguments.
See
`yt-dlp options <https://github.com/yt-dlp/yt-dlp#usage-and-options>`__
/
`youtube-dl options <https://github.com/ytdl-org/youtube-dl#options>`__
downloader.ytdl.config-file
---------------------------
Type
|Path|_
Example
``"~/.config/yt-dlp/config"``
Description
Location of a |ytdl| configuration file to load options from.
downloader.ytdl.format
----------------------
Type
``string``
Default
| Default of the ``ytdl`` `module <downloader.ytdl.module_>`__ used.
| (``"bestvideo*+bestaudio/best"`` for ``yt_dlp``,
``"bestvideo+bestaudio/best"`` for ``youtube_dl``)
Description
``ytdl`` format selection string.
See
`yt-dlp format selection <https://github.com/yt-dlp/yt-dlp#format-selection>`__
/
`youtube-dl format selection <https://github.com/ytdl-org/youtube-dl#format-selection>`__
downloader.ytdl.forward-cookies
-------------------------------
Type
``bool``
Default
``false``
Description
Forward gallery-dl's cookies to |ytdl|.
downloader.ytdl.logging
-----------------------
Type
``bool``
Default
``true``
Description
Route |ytdl's| output through gallery-dl's logging system.
Otherwise it will be written directly to stdout/stderr.
Note: Set ``quiet`` and ``no_warnings`` in
`downloader.ytdl.raw-options`_ to ``true`` to suppress all output.
downloader.ytdl.module
----------------------
Type
``string``
Default
``null``
Description
Name of the ``ytdl`` Python module to import.
Setting this to ``null`` will try to import ``"yt_dlp"``
followed by ``"youtube_dl"`` as fallback.
downloader.ytdl.outtmpl
-----------------------
Type
``string``
Default
``null``
Description
The `Output Template`
used to generate filenames for files downloaded with ``ytdl``.
See
`yt-dlp output template <https://github.com/yt-dlp/yt-dlp#output-template>`__
/
`youtube-dl output template <https://github.com/ytdl-org/youtube-dl#output-template>`__.
Special values:
* ``null``: generate filenames with `extractor.*.filename`_
* ``"default"``: use |ytdl's| default, currently
``"%(title)s [%(id)s].%(ext)s"`` for yt-dlp_ /
``"%(title)s-%(id)s.%(ext)s"`` for youtube-dl_
Note: An output template other than ``null`` might
cause unexpected results in combination with certain options
(e.g. ``"skip": "enumerate"``)
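A sketch that lets ``ytdl`` name its own files and forwards cookies to it.
Per the note above, this may interact unexpectedly with options such as
``"skip": "enumerate"``.

.. code:: json

    {
        "downloader": {
            "ytdl": {
                "forward-cookies": true,
                "outtmpl": "default"
            }
        }
    }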
downloader.ytdl.raw-options
---------------------------
Type
``object`` (`name` -> `value`)
Example
.. code:: json
{
"quiet": true,
"writesubtitles": true,
"merge_output_format": "mkv"
}
Description
Additional options passed directly to the ``YoutubeDL`` constructor.
Available options can be found in
`yt-dlp's docstrings <https://github.com/yt-dlp/yt-dlp/blob/2024.05.27/yt_dlp/YoutubeDL.py#L200>`__
/
`youtube-dl's docstrings <https://github.com/ytdl-org/youtube-dl/blob/0153b387e57e0bb8e580f1869f85596d2767fb0d/youtube_dl/YoutubeDL.py#L157>`__
Output Options
==============
output.mode
-----------
Type
* ``string``
* ``object`` (`key` -> `format string`)
Default
``"auto"``
Description
Controls the output string format and status indicators.
* ``"null"``: No output
* ``"pipe"``: Suitable for piping to other processes or files
* ``"terminal"``: Suitable for the standard Windows console
* ``"color"``: Suitable for terminals that understand ANSI escape codes and colors
* ``"auto"``: ``"terminal"`` on Windows with `output.ansi`_ disabled,
``"color"`` otherwise.
| It is possible to use custom output format strings
by setting this option to an ``object`` and specifying
| ``start``, ``success``, ``skip``, ``progress``, and ``progress-total``.
For example, the following will replicate the same output as |mode: color|:
.. code:: json
{
"start" : "{}",
"success": "\r\u001b[1;32m{}\u001b[0m\n",
"skip" : "\u001b[2m{}\u001b[0m\n",
"progress" : "\r{0:>7}B {1:>7}B/s ",
"progress-total": "\r{3:>3}% {0:>7}B {1:>7}B/s "
}
``start``, ``success``, and ``skip`` are used to output the current
filename, where ``{}`` or ``{0}`` is replaced with said filename.
If a given format string contains printable characters other than that,
their number needs to be specified as ``[<number>, <format string>]``
to get the correct results for `output.shorten`_. For example:
.. code:: json
"start" : [12, "Downloading {}"]
| ``progress`` and ``progress-total`` are used when displaying the
`download progress indicator <downloader.*.progress_>`__,
| ``progress`` when the total number of bytes to download is unknown,
``progress-total`` otherwise.
For these format strings
* ``{0}`` is the number of bytes downloaded
* ``{1}`` is the number of bytes downloaded per second
* ``{2}`` is the total number of bytes
* ``{3}`` is the percentage of total bytes downloaded
output.stdout & .stdin & .stderr
--------------------------------
Type
* ``string``
* ``object``
Example
.. code:: json
"utf-8"
.. code:: json
{
"encoding": "utf-8",
"errors": "replace",
"line_buffering": true
}
Description
`Reconfigure <https://docs.python.org/3/library/io.html#io.TextIOWrapper.reconfigure>`__
a `standard stream <https://docs.python.org/3/library/sys.html#sys.stdin>`__.
Possible options are
* ``encoding``
* ``errors``
* ``newline``
* ``line_buffering``
* ``write_through``
When this option is specified as a simple ``string``,
it is interpreted as ``{"encoding": "<string-value>", "errors": "replace"}``.
Note: ``errors`` always defaults to ``"replace"``.
output.shorten
--------------
Type
``bool``
Default
``true``
Description
Controls whether the output strings should be shortened to fit
on one console line.
Set this option to ``"eaw"`` to also work with east-asian characters
with a display width greater than 1.
output.colors
-------------
Type
``object`` (`key` -> `ANSI color`)
Default
.. code:: json
{
"success": "1;32",
"skip" : "2",
"debug" : "0;37",
"info" : "1;37",
"warning": "1;33",
"error" : "1;31"
}
Description
Controls the
`ANSI colors <https://gist.github.com/fnky/458719343aabd01cfb17a3a4f7296797#colors--graphics-mode>`__
used for various outputs.
Output for |mode: color|__
* ``success``: successfully downloaded files
* ``skip``: skipped files
Logging Messages:
* ``debug``: debug logging messages
* ``info``: info logging messages
* ``warning``: warning logging messages
* ``error``: error logging messages
.. __: `output.mode`_
output.ansi
-----------
Type
``bool``
Default
``true``
Description
| On Windows, enable ANSI escape sequences and colored output
| by setting the ``ENABLE_VIRTUAL_TERMINAL_PROCESSING`` flag for stdout and stderr.
output.skip
-----------
Type
``bool``
Default
``true``
Description
Show skipped file downloads.
output.fallback
---------------
Type
``bool``
Default
``true``
Description
Include fallback URLs in the output of ``-g/--get-urls``.
output.private
--------------
Type
``bool``
Default
``false``
Description
Include private fields,
i.e. fields whose name starts with an underscore,
in the output of ``-K/--list-keywords`` and ``-j/--dump-json``.
output.progress
---------------
Type
* ``bool``
* ``string``
Default
``true``
Description
Controls the progress indicator when *gallery-dl* is run with
multiple URLs as arguments.
* ``true``: Show the default progress indicator
(``"[{current}/{total}] {url}"``)
* ``false``: Do not show any progress indicator
* Any ``string``: Show the progress indicator using this
as a custom `format string`_. Possible replacement keys are
``current``, ``total`` and ``url``.
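A sketch of a custom progress indicator built from the three replacement
keys listed above. The wording of the string is arbitrary.

.. code:: json

    {
        "output": {
            "progress": "{current} of {total}: {url}"
        }
    }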
output.log
----------
Type
* ``string``
* |Logging Configuration|_
Default
``"[{name}][{levelname}] {message}"``
Description
Configuration for logging output to stderr.
If this is a simple ``string``, it specifies
the format string for logging messages.
output.logfile
--------------
Type
* |Path|_
* |Logging Configuration|_
Description
File to write logging output to.
output.unsupportedfile
----------------------
Type
* |Path|_
* |Logging Configuration|_
Description
File to write external URLs unsupported by *gallery-dl* to.
The default format string here is ``"{message}"``.
output.errorfile
----------------
Type
* |Path|_
* |Logging Configuration|_
Description
File to write input URLs which returned an error to.
The default format string here is also ``"{message}"``.
When combined with
``-I``/``--input-file-comment`` or
``-x``/``--input-file-delete``,
this option will cause *all* input URLs from these files
to be commented/deleted after processing them
and not just successful ones.
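The logging-related options above could be combined as in the following
sketch, where each option uses its simple ``string`` or path form. All
file paths are hypothetical placeholders.

.. code:: json

    {
        "output": {
            "log": "[{name}][{levelname}] {message}",
            "logfile": "~/gallery-dl/log.txt",
            "unsupportedfile": "~/gallery-dl/unsupported.txt",
            "errorfile": "~/gallery-dl/errors.txt"
        }
    }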
output.num-to-str
-----------------
Type
``bool``
Default
``false``
Description
Convert numeric values (``integer`` or ``float``) to ``string``
before outputting them as JSON.
Postprocessor Options
=====================
This section lists all options available inside
`Postprocessor Configuration`_ objects.
Each option is titled as ``<name>.<option>``, meaning a post processor
of type ``<name>`` will look for an ``<option>`` field inside its "body".
For example an ``exec`` post processor will recognize
an `async <exec.async_>`__, `command <exec.command_>`__,
and `event <exec.event_>`__ field:
.. code:: json
{
"name" : "exec",
"async" : false,
"command": "...",
"event" : "after"
}
classify.mapping
----------------
Type
``object`` (`directory` -> `extensions`)
Default
.. code:: json
{
"Pictures": ["jpg", "jpeg", "png", "gif", "bmp", "svg", "webp"],
"Video" : ["flv", "ogv", "avi", "mp4", "mpg", "mpeg", "3gp", "mkv", "webm", "vob", "wmv"],
"Music" : ["mp3", "aac", "flac", "ogg", "wma", "m4a", "wav"],
"Archives": ["zip", "rar", "7z", "tar", "gz", "bz2"]
}
Description
A mapping from directory names to filename extensions that should
be stored in them.
Files with an extension not listed will be ignored and stored
in their default location.
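A sketch of a ``classify`` post processor object with a reduced custom
mapping. The directory names and extension lists are illustrative and
would replace the default mapping shown above.

.. code:: json

    {
        "name": "classify",
        "mapping": {
            "Pictures": ["jpg", "jpeg", "png", "webp"],
            "Video": ["mp4", "webm", "mkv"]
        }
    }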
compare.action
--------------
Type
``string``
Default
``"replace"``
Description
The action to take when files do **not** compare as equal.
* ``"replace"``: Replace/Overwrite the old version with the new one
* ``"enumerate"``: Add an enumeration index to the filename of the new
version like `skip = "enumerate" <extractor.*.skip_>`__
compare.equal
-------------
Type
``string``
Default
``"null"``
Description
The action to take when files do compare as equal.
* ``"abort:N"``: Stop the current extractor run
after ``N`` consecutive files compared as equal.
* ``"terminate:N"``: Stop the current extractor run,
including parent extractors,
after ``N`` consecutive files compared as equal.
* ``"exit:N"``: Exit the program
after ``N`` consecutive files compared as equal.
compare.shallow
---------------
Type
``bool``
Default
``false``
Description
Only compare file sizes. Do not read and compare their content.
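A sketch of a ``compare`` post processor object that enumerates differing
files and stops the current extractor run after three consecutive equal
files. The count of ``3`` is an arbitrary example.

.. code:: json

    {
        "name": "compare",
        "action": "enumerate",
        "equal": "abort:3",
        "shallow": false
    }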
exec.archive
------------
Type
|Path|_
Description
File to store IDs of executed commands in,
similar to `extractor.*.archive`_.
``archive-format``, ``archive-prefix``, and ``archive-pragma`` options,
akin to
`extractor.*.archive-format`_,
`extractor.*.archive-prefix`_, and
`extractor.*.archive-pragma`_, are supported as well.
exec.async
----------
Type
``bool``
Default
``false``
Description
Controls whether to wait for a subprocess to finish
or to let it run asynchronously.
exec.command
------------
Type
* ``string``
* ``list`` of ``strings``
Example
* ``"convert {} {}.png && rm {}"``
* ``["echo", "{user[account]}", "{id}"]``
Description
The command to run.
* If this is a ``string``, it will be executed using the system's
shell, e.g. ``/bin/sh``. Any ``{}`` will be replaced
with the full path of a file or target directory, depending on
`exec.event`_
* If this is a ``list``, the first element specifies the program
name and any further elements its arguments.
Each element of this list is treated as a `format string`_ using
the files' metadata as well as ``{_path}``, ``{_directory}``,
and ``{_filename}``.
exec.event
----------
Type
* ``string``
* ``list`` of ``strings``
Default
``"after"``
Description
The event(s) for which `exec.command`_ is run.
See `metadata.event`_ for a list of available events.
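A sketch of an ``exec`` post processor object using the ``list`` form of
``command``, so no shell is involved and each element is treated as a
format string. The ``echo`` program is only a stand-in for a real command.

.. code:: json

    {
        "name": "exec",
        "event": "after",
        "async": false,
        "command": ["echo", "{_directory}", "{_filename}"]
    }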
hash.chunk-size
---------------
Type
``integer``
Default
``32768``
Description
Number of bytes read per chunk during file hash computation.
hash.event
----------
Type
* ``string``
* ``list`` of ``strings``
Default
``"file"``
Description
The event(s) for which `file hashes <hash.hashes_>`__ are computed.
See `metadata.event`_ for a list of available events.
hash.filename
-------------
Type
* ``bool``
Default
``false``
Description
Rebuild `filenames <extractor.*.filename_>`__ after computing
`hash digests <hash.hashes_>`__ and adding them to the metadata dict.
hash.hashes
-----------
Type
* ``string``
* ``object`` (`field name` -> `hash algorithm`)
Default
``"md5,sha1"``
Example
.. code:: json
"sha256:hash_sha,sha3_512:hash_sha3"
.. code:: json
{
"hash_sha" : "sha256",
"hash_sha3": "sha3_512"
}
Description
Hash digests to compute.
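A sketch of a ``hash`` post processor object that stores a SHA-256 digest
in a metadata field and rebuilds filenames afterwards. The field name
``hash_sha`` follows the example above, and the chunk size is an arbitrary
value.

.. code:: json

    {
        "name": "hash",
        "event": "file",
        "chunk-size": 65536,
        "filename": true,
        "hashes": {
            "hash_sha": "sha256"
        }
    }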
metadata.mode
-------------
Type
``string``
Default
``"json"``
Description
Selects how to process metadata.
* ``"json"``: write metadata using |json.dump()|_
* ``"jsonl"``: write metadata in `JSON Lines
<https://jsonlines.org/>`__ format
* ``"tags"``: write ``tags`` separated by newlines
* ``"custom"``: write the result of applying `metadata.content-format`_
to a file's metadata dictionary
* ``"modify"``: add or modify metadata entries
* ``"delete"``: remove metadata entries
metadata.filename
-----------------
Type
``string``
Default
``null``
Example
``"{id}.data.json"``
Description
A `format string`_ to build the filenames for metadata files with.
(see `extractor.filename <extractor.*.filename_>`__)
Using ``"-"`` as filename will write all output to ``stdout``.
If this option is set, `metadata.extension`_ and
`metadata.extension-format`_ will be ignored.
metadata.directory
------------------
Type
* ``string``
* ``list`` of ``strings``
Default
``"."``
Example
* ``"metadata"``
* ``["..", "metadata", "\fF {id // 500 * 500}"]``
Description
Directory in which metadata files are stored,
relative to `metadata.base-directory`_.
metadata.base-directory
-----------------------
Type
* ``bool``
* |Path|_
Default
``false``
Description
Selects the relative location for metadata files.
* ``false``: current target location for file downloads (base-directory_ + directory_)
* ``true``: current base-directory_ location
* any |Path|_: custom location
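To illustrate how `metadata.directory`_ and `metadata.base-directory`_ interact, the following sketch should place all metadata files in a ``metadata`` directory directly below the current base-directory_, independent of where the downloaded files themselves end up.

.. code:: json

    {
        "name"          : "metadata",
        "base-directory": true,
        "directory"     : "metadata"
    }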
metadata.extension
------------------
Type
``string``
Default
``"json"`` or ``"txt"``
Description
Filename extension for metadata files that will be appended to the
original file names.
metadata.extension-format
-------------------------
Type
``string``
Example
* ``"{extension}.json"``
* ``"json"``
Description
Custom format string to build filename extensions for metadata
files with, which will replace the original filename extensions.
Note: `metadata.extension`_ is ignored if this option is set.
metadata.event
--------------
Type
* ``string``
* ``list`` of ``strings``
Default
``"file"``
Example
* ``"prepare,file,after"``
* ``["prepare-after", "skip"]``
Description
The event(s) for which metadata gets written to a file.
Available events are:
``init``
After post processor initialization
and before the first file download
``finalize``
On extractor shutdown, e.g. after all files were downloaded
``finalize-success``
On extractor shutdown when no error occurred
``finalize-error``
On extractor shutdown when at least one error occurred
``prepare``
Before a file download
``prepare-after``
Before a file download,
but after building and checking file paths
``file``
When completing a file download,
but before it gets moved to its target location
``after``
After a file got moved to its target location
``skip``
When skipping a file download
``post``
When starting to download all files of a `post`,
e.g. a Tweet on Twitter or a post on Patreon.
``post-after``
After downloading all files of a `post`
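For example, a ``metadata`` post processor that should write its output both when a download completes and when a download gets skipped could list both events, as in this sketch:

.. code:: json

    {
        "name" : "metadata",
        "event": ["file", "skip"]
    }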
metadata.include
----------------
Type
``list`` of ``strings``
Example
``["id", "width", "height", "description"]``
Description
Include only the given top-level keys when writing JSON data.
Note: Missing or undefined fields will be silently ignored.
metadata.exclude
----------------
Type
``list`` of ``strings``
Example
``["blocked", "watching", "status"]``
Description
Exclude all given keys from written JSON data.
Note: Cannot be used with `metadata.include`_.
metadata.fields
---------------
Type
* ``list`` of ``strings``
* ``object`` (`field name` -> `format string`_)
Example
.. code:: json
["blocked", "watching", "status[creator][name]"]
.. code:: json
{
"blocked" : "***",
"watching" : "\fE 'yes' if watching else 'no'",
"status[username]": "{status[creator][name]!l}"
}
Description
* ``"mode": "delete"``:
A list of metadata field names to remove.
* ``"mode": "modify"``:
An object with metadata field names mapping to a `format string`_
whose result is assigned to said field name.
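The two forms could be used in a list of post processors like the one below. It is a sketch that reuses the field names from the examples above, which are placeholders and may not exist for a given extractor. The first entry removes two fields, the second one assigns a lowercased creator name to ``status[username]``.

.. code:: json

    [
        {
            "name"  : "metadata",
            "mode"  : "delete",
            "fields": ["blocked", "watching"]
        },
        {
            "name"  : "metadata",
            "mode"  : "modify",
            "fields": {
                "status[username]": "{status[creator][name]!l}"
            }
        }
    ]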
metadata.content-format
-----------------------
Type
* ``string``
* ``list`` of ``strings``
Example
* ``"tags:\n\n{tags:J\n}\n"``
* ``["tags:", "", "{tags:J\n}"]``
Description
Custom format string to build the content of metadata files with.
Note: Only applies for ``"mode": "custom"``.
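A complete post processor object using this option might look like the following sketch. It assumes that the file's metadata contains a ``tags`` list and writes it to a ``.txt`` file, one tag per line, below a ``tags:`` heading.

.. code:: json

    {
        "name"          : "metadata",
        "mode"          : "custom",
        "extension"     : "txt",
        "content-format": ["tags:", "", "{tags:J\n}"]
    }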
metadata.ascii
--------------
Type
``bool``
Default
``false``
Description
Escape all non-ASCII characters.
See the ``ensure_ascii`` argument of |json.dump()|_ for further details.
Note: Only applies for ``"mode": "json"`` and ``"jsonl"``.
metadata.indent
---------------
Type
* ``integer``
* ``string``
Default
``4``
Description
Indentation level of JSON output.
See the ``indent`` argument of |json.dump()|_ for further details.
Note: Only applies for ``"mode": "json"``.
metadata.separators
-------------------
Type
``list`` with two ``string`` elements
Default
``[", ", ": "]``
Description
``<item separator>`` - ``<key separator>`` pair
to separate JSON keys and values with.
See the ``separators`` argument of |json.dump()|_ for further details.
Note: Only applies for ``"mode": "json"`` and ``"jsonl"``.
metadata.sort
-------------
Type
``bool``
Default
``false``
Description
Sort output by `key`.
See the ``sort_keys`` argument of |json.dump()|_ for further details.
Note: Only applies for ``"mode": "json"`` and ``"jsonl"``.
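The JSON formatting options above can be combined. The following sketch requests output with two-space indentation, sorted keys, and escaped non-ASCII characters; the values are illustrative and are passed on to |json.dump()|_ as described.

.. code:: json

    {
        "name"      : "metadata",
        "mode"      : "json",
        "ascii"     : true,
        "indent"    : 2,
        "separators": [",", ": "],
        "sort"      : true
    }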
metadata.open
-------------
Type
``string``
Default
``"w"``
Description
The ``mode`` in which metadata files get opened.
For example,
use ``"a"`` to append to a file's content
or ``"w"`` to truncate it.
See the ``mode`` argument of |open()|_ for further details.
metadata.encoding
-----------------
Type
``string``
Default
``"utf-8"``
Description
Name of the encoding used to encode a file's content.
See the ``encoding`` argument of |open()|_ for further details.
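One possible use of ``"open": "a"`` is collecting the metadata of many files in a single JSON Lines file. In the sketch below, the constant filename ``metadata.jsonl`` is an arbitrary choice; each processed file would append one line to it.

.. code:: json

    {
        "name"    : "metadata",
        "mode"    : "jsonl",
        "filename": "metadata.jsonl",
        "open"    : "a",
        "encoding": "utf-8"
    }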
metadata.private
----------------
Type
``bool``
Default
``false``
Description
Include private fields,
i.e. fields whose name starts with an underscore.
metadata.skip
-------------
Type
``bool``
Default
``false``
Description
Do not overwrite already existing files.
metadata.archive
----------------
Type
|Path|_
Description
File to store IDs of generated metadata files in,
similar to `extractor.*.archive`_.
``archive-format``, ``archive-prefix``, and ``archive-pragma`` options,
akin to
`extractor.*.archive-format`_,
`extractor.*.archive-prefix`_, and
`extractor.*.archive-pragma`_, are supported as well.
metadata.mtime
--------------
Type
``bool``
Default
``false``
Description
Set modification times of generated metadata files
according to the accompanying downloaded file.
Enabling this option will only have an effect
*if* there is actual ``mtime`` metadata available, that is
* after a file download (``"event": "file"`` (default), ``"event": "after"``)
* when running *after* an ``mtime`` post processor for the same `event <metadata.event_>`__
For example, a ``metadata`` post processor for ``"event": "post"`` will
*not* be able to set its file's modification time unless an ``mtime``
post processor with ``"event": "post"`` runs *before* it.
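The ordering requirement described above can be sketched as a list of two post processors, with ``mtime`` placed before ``metadata`` and both using ``"event": "post"``. The ``{id}`` field in the filename is an assumption about the available metadata.

.. code:: json

    [
        {
            "name" : "mtime",
            "event": "post"
        },
        {
            "name"    : "metadata",
            "event"   : "post",
            "filename": "{id}.json",
            "mtime"   : true
        }
    ]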
mtime.event
-----------
Type
* ``string``
* ``list`` of ``strings``
Default
``"file"``
Description
The event(s) for which `mtime.key`_ or `mtime.value`_ get evaluated.
See `metadata.event`_ for a list of available events.
mtime.key
---------
Type
``string``
Default
``"date"``
Description
Name of the metadata field whose value should be used.
This value must be either a UNIX timestamp or a
|datetime|_ object.
Note: This option gets ignored if `mtime.value`_ is set.
mtime.value
-----------
Type
``string``
Default
``null``
Example
* ``"{status[date]}"``
* ``"{content[0:6]:R22/2022/D%Y%m%d/}"``
Description
A `format string`_ whose value should be used.
The resulting value must be either a UNIX timestamp or a
|datetime|_ object.
python.archive
--------------
Type
|Path|_
Description
File to store IDs of called Python functions in,
similar to `extractor.*.archive`_.
``archive-format``, ``archive-prefix``, and ``archive-pragma`` options,
akin to
`extractor.*.archive-format`_,
`extractor.*.archive-prefix`_, and
`extractor.*.archive-pragma`_, are supported as well.
python.event
------------
Type
* ``string``
* ``list`` of ``strings``
Default
``"file"``
Description
The event(s) for which `python.function`_ gets called.
See `metadata.event`_ for a list of available events.
python.function
---------------
Type
``string``
Example
* ``"my_module:generate_text"``
* ``"~/.local/share/gdl-utils.py:resize"``
Description
The Python function to call.
This function is specified as ``<module>:<function name>``
and gets called with the current metadata dict as argument.
``module`` is either an importable Python module name
or the |Path|_ to a ``.py`` file.
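Put together, a ``python`` post processor object might look like this sketch. ``my_module`` and ``generate_text`` are the placeholder names from the example above, not names shipped with *gallery-dl*; the module would have to be importable and define a function accepting the metadata dict.

.. code:: json

    {
        "name"    : "python",
        "event"   : "file",
        "function": "my_module:generate_text"
    }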
rename.from
-----------
Type
``string``
Description
The `format string`_ for filenames to rename.
When no value is given, `extractor.*.filename`_ is used.
rename.to
---------
Type
``string``
Description
The `format string`_ for target filenames.
When no value is given, `extractor.*.filename`_ is used.
Note:
With default settings, the potential download to `extractor.*.filename`_
still happens, even when using this post processor.
Disabling `file downloads <extractor.*.download_>`__
when using this option is recommended.
rename.skip
-----------
Type
``bool``
Default
``true``
Description
Do not rename a file when another file with the target name already exists.
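As a sketch, the following ``rename`` post processor would rename files previously stored as ``<id>.<extension>`` to a name that also includes the title. The ``{id}`` and ``{title}`` fields are assumptions about the available metadata and need to be adapted to the extractor in use.

.. code:: json

    {
        "name": "rename",
        "from": "{id}.{extension}",
        "to"  : "{id}_{title}.{extension}",
        "skip": true
    }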
ugoira.extension
----------------
Type
``string``
Default
``"webm"``
Description
Filename extension for the resulting video files.
ugoira.ffmpeg-args
------------------
Type
``list`` of ``strings``
Default
``null``
Example
``["-c:v", "libvpx-vp9", "-an", "-b:v", "2M"]``
Description
Additional |ffmpeg| command-line arguments.
ugoira.ffmpeg-demuxer
---------------------
Type
``string``
Default
``"auto"``
Description
|ffmpeg| demuxer to read and process input files with.
Possible values are
* "`concat <https://ffmpeg.org/ffmpeg-formats.html#concat-1>`_" (inaccurate frame timecodes for non-uniform frame delays)
* "`image2 <https://ffmpeg.org/ffmpeg-formats.html#image2-1>`_" (accurate timecodes, requires nanosecond file timestamps, i.e. no Windows or macOS)
* "mkvmerge" (accurate timecodes, only WebM or MKV, requires `mkvmerge <ugoira.mkvmerge-location_>`__)
``"auto"`` will select ``mkvmerge`` if available and fall back to ``concat`` otherwise.
ugoira.ffmpeg-location
----------------------
Type
|Path|_
Default
``"ffmpeg"``
Description
Location of the ``ffmpeg`` (or ``avconv``) executable to use.
ugoira.mkvmerge-location
------------------------
Type
|Path|_
Default
``"mkvmerge"``
Description
Location of the ``mkvmerge`` executable for use with the
`mkvmerge demuxer <ugoira.ffmpeg-demuxer_>`__.
ugoira.ffmpeg-output
--------------------
Type
* ``bool``
* ``string``
Default
``"error"``
Description
Controls |ffmpeg| output.
* ``true``: Enable |ffmpeg| output
* ``false``: Disable all |ffmpeg| output
* any ``string``: Pass ``-hide_banner`` and ``-loglevel``
with this value as argument to |ffmpeg|
ugoira.ffmpeg-twopass
---------------------
Type
``bool``
Default
``false``
Description
Enable Two-Pass encoding.
ugoira.framerate
----------------
Type
``string``
Default
``"auto"``
Description
Controls the frame rate argument (``-r``) for |ffmpeg|
* ``"auto"``: Automatically assign a fitting frame rate
based on delays between frames.
* ``"uniform"``: Like ``auto``, but assign an explicit frame rate
only to Ugoira with uniform frame delays.
* any other ``string``: Use this value as argument for ``-r``.
* ``null`` or an empty ``string``: Don't set an explicit frame rate.
ugoira.keep-files
-----------------
Type
``bool``
Default
``false``
Description
Keep ZIP archives after conversion.
ugoira.libx264-prevent-odd
--------------------------
Type
``bool``
Default
``true``
Description
Prevent ``"width/height not divisible by 2"`` errors
when using ``libx264`` or ``libx265`` encoders
by applying a simple cropping filter. See this `Stack Overflow
thread <https://stackoverflow.com/questions/20847674>`__
for more information.
This option, when ``libx264/5`` is used, automatically
adds ``["-vf", "crop=iw-mod(iw\\,2):ih-mod(ih\\,2)"]``
to the list of |ffmpeg| command-line arguments
to reduce an odd width/height by 1 pixel and make them even.
ugoira.mtime
------------
Type
``bool``
Default
``true``
Description
Set modification times of generated ugoira animations.
ugoira.repeat-last-frame
------------------------
Type
``bool``
Default
``true``
Description
Allow repeating the last frame when necessary
to prevent it from only being displayed for a very short amount of time.
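Several of the ``ugoira`` options above could be combined as in the following sketch, which reuses the ``ffmpeg-args`` example to encode VP9 WebM files with two-pass encoding. The specific values are illustrative, not tuned recommendations.

.. code:: json

    {
        "name"             : "ugoira",
        "extension"        : "webm",
        "ffmpeg-args"      : ["-c:v", "libvpx-vp9", "-an", "-b:v", "2M"],
        "ffmpeg-demuxer"   : "auto",
        "ffmpeg-twopass"   : true,
        "framerate"        : "auto",
        "keep-files"       : false,
        "repeat-last-frame": true
    }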
zip.compression
---------------
Type
``string``
Default
``"store"``
Description
Compression method to use when writing the archive.
Possible values are ``"store"``, ``"zip"``, ``"bzip2"``, ``"lzma"``.
zip.extension
-------------
Type
``string``
Default
``"zip"``
Description
Filename extension for the created ZIP archive.
zip.files
---------
Type
``list`` of |Path|
Example
``["info.json"]``
Description
List of extra files to be added to a ZIP archive.
Note: Relative paths are relative to the current
`download directory <extractor.*.directory_>`__.
zip.keep-files
--------------
Type
``bool``
Default
``false``
Description
Keep the actual files after writing them to a ZIP archive.
zip.mode
--------
Type
``string``
Default
``"default"``
Description
* ``"default"``: Write the central directory file header
once after everything is done or an exception is raised.
* ``"safe"``: Update the central directory file header
each time a file is stored in a ZIP archive.
This greatly reduces the chance a ZIP archive gets corrupted in
case the Python interpreter gets shut down unexpectedly
(power outage, SIGKILL) but is also a lot slower.
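A ``zip`` post processor using these options might be configured as sketched below. It trades speed for robustness with ``"mode": "safe"`` and adds an extra ``info.json`` file, which is assumed to exist in the download directory.

.. code:: json

    {
        "name"       : "zip",
        "compression": "store",
        "extension"  : "cbz",
        "files"      : ["info.json"],
        "keep-files" : false,
        "mode"       : "safe"
    }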
Miscellaneous Options
=====================
extractor.modules
-----------------
Type
``list`` of ``strings``
Default
The ``modules`` list in
`extractor/__init__.py <https://github.com/mikf/gallery-dl/blob/master/gallery_dl/extractor/__init__.py#L12>`__
Example
``["reddit", "danbooru", "mangadex"]``
Description
List of internal modules to load when searching for a suitable
extractor class. Useful to reduce startup time and memory usage.
extractor.module-sources
------------------------
Type
``list`` of |Path|_ instances
Example
``["~/.config/gallery-dl/modules", null]``
Description
List of directories to load external extractor modules from.
Any file in a specified directory with a ``.py`` filename extension
gets `imported <https://docs.python.org/3/reference/import.html>`__
and searched for potential extractors,
i.e. classes with a ``pattern`` attribute.
Note: ``null`` references internal extractors defined in
`extractor/__init__.py <https://github.com/mikf/gallery-dl/blob/master/gallery_dl/extractor/__init__.py#L12>`__
or by `extractor.modules`_.
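Both options sit at the base level of the ``extractor`` block. The sketch below restricts the internal modules to three sites and additionally loads external modules from a user directory, which is searched before the internal ones (``null``).

.. code:: json

    {
        "extractor": {
            "modules"       : ["reddit", "danbooru", "mangadex"],
            "module-sources": ["~/.config/gallery-dl/modules", null]
        }
    }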
globals
-------
Type
* |Path|_
* ``string``
Example
* ``"~/.local/share/gdl-globals.py"``
* ``"gdl-globals"``
Description
| Path to or name of an
`importable <https://docs.python.org/3/reference/import.html>`__
Python module,
| whose namespace,
in addition to the ``GLOBALS`` dict in
`util.py <https://github.com/mikf/gallery-dl/blob/v1.27.0/gallery_dl/util.py#L566-L578>`__,
gets used as |globals parameter|__ for compiled Python expressions.
.. |globals parameter| replace:: ``globals`` parameter
.. __: https://docs.python.org/3/library/functions.html#eval
cache.file
----------
Type
|Path|_
Default
* (``%APPDATA%`` or ``"~"``) + ``"/gallery-dl/cache.sqlite3"`` on Windows
* (``$XDG_CACHE_HOME`` or ``"~/.cache"``) + ``"/gallery-dl/cache.sqlite3"`` on all other platforms
Description
Path of the SQLite3 database used to cache login sessions,
cookies and API tokens across `gallery-dl` invocations.
Set this option to ``null`` or an invalid path to disable
this cache.
filters-environment
-------------------
Type
``bool``
Default
``true``
Description
Evaluate filter expressions that raise an exception as ``false``
instead of aborting the current extractor run.
This is done by wrapping them in a ``try``/``except`` block.
format-separator
----------------
Type
``string``
Default
``"/"``
Description
Character(s) used as argument separator in format string
`format specifiers <formatting.md#format-specifiers>`__.
For example, setting this option to ``"#"`` would allow a replacement
operation to be ``Rold#new#`` instead of the default ``Rold/new/``.
input-files
-----------
Type
``list`` of |Path|_
Example
``["~/urls.txt", "$HOME/input"]``
Description
Additional input files.
signals-ignore
--------------
Type
``list`` of ``strings``
Example
``["SIGTTOU", "SIGTTIN", "SIGTERM"]``
Description
The list of signal names to ignore, i.e. set
`SIG_IGN <https://docs.python.org/3/library/signal.html#signal.SIG_IGN>`_
as signal handler for.
subconfigs
----------
Type
``list`` of |Path|_
Example
``["~/cfg-twitter.json", "~/cfg-reddit.json"]``
Description
Additional configuration files to load.
warnings
--------
Type
``string``
Default
``"default"``
Description
The `Warnings Filter action <https://docs.python.org/3/library/warnings.html#the-warnings-filter>`__
used for (urllib3) warnings.
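The miscellaneous options might appear in a configuration file as sketched below. This assumes that dotted names such as ``cache.file`` map to nested JSON objects in the same way as extractor options, and that names without a dot sit at the top level. All values are illustrative; ``"file": null`` disables the cache as described above.

.. code:: json

    {
        "cache": {
            "file": null
        },
        "filters-environment": true,
        "format-separator"   : "#",
        "input-files"        : ["~/urls.txt"],
        "signals-ignore"     : ["SIGTTOU", "SIGTTIN"],
        "subconfigs"         : ["~/cfg-twitter.json"],
        "warnings"           : "default"
    }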
API Tokens & IDs
================
All configuration keys listed in this section have fully functional default
values embedded into *gallery-dl* itself, but if things unexpectedly break
or you want to use your own personal client credentials, you can follow these
instructions to get an alternative set of API tokens and IDs.
extractor.deviantart.client-id & .client-secret
-----------------------------------------------
Type
``string``
How To
* log in and visit DeviantArt's
`Applications & Keys <https://www.deviantart.com/developers/apps>`__
section
* click "Register Application"
* scroll to "OAuth2 Redirect URI Whitelist (Required)"
and enter "https://mikf.github.io/gallery-dl/oauth-redirect.html"
* scroll to the bottom and agree to the API License Agreement,
Submission Policy, and Terms of Service.
* click "Save"
* copy ``client_id`` and ``client_secret`` of your new
application and put them in your configuration file
as ``"client-id"`` and ``"client-secret"``
* clear your `cache <cache.file_>`__ to delete any remaining
``access-token`` entries. (``gallery-dl --clear-cache deviantart``)
* get a new `refresh-token <extractor.deviantart.refresh-token_>`__ for the
new ``client-id`` (``gallery-dl oauth:deviantart``)
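The resulting configuration entries could look like this sketch, where the two placeholder values stand for the ``client_id`` and ``client_secret`` shown for your own application:

.. code:: json

    {
        "extractor": {
            "deviantart": {
                "client-id"    : "<your client_id>",
                "client-secret": "<your client_secret>"
            }
        }
    }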
extractor.flickr.api-key & .api-secret
--------------------------------------
Type
``string``
How To
* log in and `Create an App <https://www.flickr.com/services/apps/create/apply/>`__
in Flickr's `App Garden <https://www.flickr.com/services/>`__
* click "APPLY FOR A NON-COMMERCIAL KEY"
* fill out the form with a random name and description
and click "SUBMIT"
* copy ``Key`` and ``Secret`` and put them in your configuration file
as ``"api-key"`` and ``"api-secret"``
extractor.reddit.client-id & .user-agent
----------------------------------------
Type
``string``
How To
* log in and visit the `apps <https://www.reddit.com/prefs/apps/>`__
section of your account's preferences
* click the "are you a developer? create an app..." button
* fill out the form:
* choose a name
* select "installed app"
* set ``http://localhost:6414/`` as "redirect uri"
* solve the "I'm not a robot" reCAPTCHA if needed
* click "create app"
* copy the client id (third line, under your application's name and
"installed app") and put it in your configuration file
as ``"client-id"``
* use "``Python:<application name>:v1.0 (by /u/<username>)``" as
``user-agent`` and replace ``<application name>`` and ``<username>``
accordingly (see Reddit's
`API access rules <https://github.com/reddit/reddit/wiki/API>`__)
* clear your `cache <cache.file_>`__ to delete any remaining
``access-token`` entries. (``gallery-dl --clear-cache reddit``)
* get a `refresh-token <extractor.reddit.refresh-token_>`__ for the
new ``client-id`` (``gallery-dl oauth:reddit``)
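After these steps, the Reddit section of a configuration file might resemble the following sketch. The client ID, application name, and username are placeholders to be replaced with your own values.

.. code:: json

    {
        "extractor": {
            "reddit": {
                "client-id" : "<your client id>",
                "user-agent": "Python:<application name>:v1.0 (by /u/<username>)"
            }
        }
    }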
extractor.smugmug.api-key & .api-secret
---------------------------------------
Type
``string``
How To
* log in and `Apply for an API Key <https://api.smugmug.com/api/developer/apply>`__
* use a random name and description,
set "Type" to "Application", "Platform" to "All",
and "Use" to "Non-Commercial"
* fill out the two checkboxes at the bottom and click "Apply"
* copy ``API Key`` and ``API Secret``
and put them in your configuration file
as ``"api-key"`` and ``"api-secret"``
extractor.tumblr.api-key & .api-secret
--------------------------------------
Type
``string``
How To
* log in and visit Tumblr's
`Applications <https://www.tumblr.com/oauth/apps>`__ section
* click "Register application"
* fill out the form: use a random name and description, set
https://example.org/ as "Application Website" and "Default
callback URL"
* solve Google's "I'm not a robot" challenge and click "Register"
* click "Show secret key" (below "OAuth Consumer Key")
* copy your ``OAuth Consumer Key`` and ``Secret Key``
and put them in your configuration file
as ``"api-key"`` and ``"api-secret"``
Custom Types
============
Date
----
Type
* ``string``
* ``integer``
Example
* ``"2019-01-01T00:00:00"``
* ``"2019"`` with ``"%Y"`` as `date-format`_
* ``1546297200``
Description
A |Date|_ value represents a specific point in time.
* If given as ``string``, it is parsed according to `date-format`_.
* If given as ``integer``, it is interpreted as UTC timestamp.
Duration
--------
Type
* ``float``
* ``list`` with 2 ``floats``
* ``string``
Example
* ``2.85``
* ``[1.5, 3.0]``
* ``"2.85"``, ``"1.5-3.0"``
Description
A |Duration|_ represents a span of time in seconds.
* If given as a single ``float``, it will be used as that exact value.
* If given as a ``list`` with 2 floating-point numbers ``a`` and ``b``,
a value ``N`` will be randomly chosen with uniform distribution such that ``a <= N <= b``.
(see `random.uniform() <https://docs.python.org/3/library/random.html#random.uniform>`_)
* If given as a ``string``, it can either represent a single ``float``
value (``"2.85"``) or a range (``"1.5-3.0"``).
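The different notations can be sketched with two options that are assumed here to accept |Duration|_ values, ``sleep`` and ``sleep-request``; check their own entries before relying on this. The first uses the range ``string`` form, the second the ``list`` form.

.. code:: json

    {
        "extractor": {
            "sleep"        : "1.5-3.0",
            "sleep-request": [0.5, 1.5]
        }
    }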
Path
----
Type
* ``string``
* ``list`` of ``strings``
Example
* ``"file.ext"``
* ``"~/path/to/file.ext"``
* ``"$HOME/path/to/file.ext"``
* ``["$HOME", "path", "to", "file.ext"]``
Description
A |Path|_ is a ``string`` representing the location of a file
or directory.
Simple `tilde expansion <https://docs.python.org/3/library/os.path.html#os.path.expanduser>`__
and `environment variable expansion <https://docs.python.org/3/library/os.path.html#os.path.expandvars>`__
are supported.
In Windows environments, backslashes (``"\"``) can, in addition to
forward slashes (``"/"``), be used as path separators.
Because backslashes are JSON's escape character,
they themselves have to be escaped.
The path ``C:\path\to\file.ext`` therefore has to be written as
``"C:\\path\\to\\file.ext"`` if you want to use backslashes.
Logging Configuration
---------------------
Type
``object``
Example
.. code:: json
{
"format" : "{asctime} {name}: {message}",
"format-date": "%H:%M:%S",
"path" : "~/log.txt",
"encoding" : "ascii"
}
.. code:: json
{
"level" : "debug",
"format": {
"debug" : "debug: {message}",
"info" : "[{name}] {message}",
"warning": "Warning: {message}",
"error" : "ERROR: {message}"
}
}
Description
Extended logging output configuration.
* format
* General format string for logging messages
or an ``object`` with format strings for each loglevel.
In addition to the default
`LogRecord attributes <https://docs.python.org/3/library/logging.html#logrecord-attributes>`__,
it is also possible to access the current
`extractor <https://github.com/mikf/gallery-dl/blob/v1.27.0/gallery_dl/extractor/common.py#L28>`__,
`job <https://github.com/mikf/gallery-dl/blob/v1.27.0/gallery_dl/job.py#L33>`__,
`path <https://github.com/mikf/gallery-dl/blob/v1.27.0/gallery_dl/path.py#L27>`__,
and `keywords` objects and their attributes, for example
``"{extractor.url}"``, ``"{path.filename}"``, ``"{keywords.title}"``
* Default: ``"[{name}][{levelname}] {message}"``
* format-date
* Format string for ``{asctime}`` fields in logging messages
(see `strftime() directives <https://docs.python.org/3/library/time.html#time.strftime>`__)
* Default: ``"%Y-%m-%d %H:%M:%S"``
* level
* Minimum logging message level
(one of ``"debug"``, ``"info"``, ``"warning"``, ``"error"``, ``"exception"``)
* Default: ``"info"``
* path
* |Path|_ to the output file
* mode
* Mode in which the file is opened;
use ``"w"`` to truncate or ``"a"`` to append
(see |open()|_)
* Default: ``"w"``
* encoding
* File encoding
* Default: ``"utf-8"``
Note: path, mode, and encoding are only applied when configuring
logging output to a file.
Postprocessor Configuration
---------------------------
Type
``object``
Example
.. code:: json
{ "name": "mtime" }
.. code:: json
{
"name" : "zip",
"compression": "store",
"extension" : "cbz",
"filter" : "extension not in ('zip', 'rar')",
"whitelist" : ["mangadex", "exhentai", "nhentai"]
}
Description
An ``object`` containing a ``"name"`` attribute specifying the
post-processor type, as well as any of its `options <Postprocessor Options_>`__.
It is possible to set a ``"filter"`` expression similar to
`image-filter <extractor.*.image-filter_>`_ to only run a post-processor
conditionally.
It is also possible to set a ``"whitelist"`` or ``"blacklist"`` to
only enable or disable a post-processor for the specified
extractor categories.
The available post-processor types are
``classify``
Categorize files by filename extension
``compare``
| Compare versions of the same file and replace/enumerate them on mismatch
| (requires `downloader.*.part`_ = ``true`` and `extractor.*.skip`_ = ``false``)
``exec``
Execute external commands
``hash``
Compute file hash digests
``metadata``
Write metadata to separate files
``mtime``
Set file modification time according to its metadata
``python``
Call Python functions
``rename``
Rename previously downloaded files
``ugoira``
Convert Pixiv Ugoira to WebM using |ffmpeg|
``zip``
Store files in a ZIP archive
.. |ytdl| replace:: `yt-dlp`_/`youtube-dl`_
.. |ytdl's| replace:: yt-dlp's/youtube-dl's
.. |ffmpeg| replace:: FFmpeg_
.. |.netrc| replace:: ``.netrc``
.. |requests.request()| replace:: ``requests.request()``
.. |timeout| replace:: ``timeout``
.. |verify| replace:: ``verify``
.. |mature_content| replace:: ``mature_content``
.. |webbrowser.open()| replace:: ``webbrowser.open()``
.. |datetime| replace:: ``datetime``
.. |datetime.max| replace:: ``datetime.max``
.. |Date| replace:: ``Date``
.. |Duration| replace:: ``Duration``
.. |Path| replace:: ``Path``
.. |Last-Modified| replace:: ``Last-Modified``
.. |Logging Configuration| replace:: ``Logging Configuration``
.. |Postprocessor Configuration| replace:: ``Postprocessor Configuration``
.. |strptime| replace:: strftime() and strptime() Behavior
.. |postprocessors| replace:: ``postprocessors``
.. |mode: color| replace:: ``"mode": "color"``
.. |open()| replace:: the built-in ``open()`` function
.. |json.dump()| replace:: ``json.dump()``
.. _directory: `extractor.*.directory`_
.. _base-directory: `extractor.*.base-directory`_
.. _date-format: `extractor.*.date-format`_
.. _deviantart.metadata: `extractor.deviantart.metadata`_
.. _deviantart.comments: `extractor.deviantart.comments`_
.. _postprocessors: `extractor.*.postprocessors`_
.. _download archive: `extractor.*.archive`_
.. _.netrc: https://stackoverflow.com/tags/.netrc/info
.. _Last-Modified: https://www.w3.org/Protocols/rfc2616/rfc2616-sec14.html#sec14.29
.. _datetime: https://docs.python.org/3/library/datetime.html#datetime-objects
.. _datetime.max: https://docs.python.org/3/library/datetime.html#datetime.datetime.max
.. _strptime: https://docs.python.org/3/library/datetime.html#strftime-strptime-behavior
.. _webbrowser.open(): https://docs.python.org/3/library/webbrowser.html
.. _open(): https://docs.python.org/3/library/functions.html#open
.. _json.dump(): https://docs.python.org/3/library/json.html#json.dump
.. _mature_content: https://www.deviantart.com/developers/http/v1/20160316/object/deviation
.. _Authentication: https://github.com/mikf/gallery-dl#authentication
.. _OAuth: https://github.com/mikf/gallery-dl#oauth
.. _format string: formatting.md
.. _format strings: formatting.md
.. _youtube-dl: https://github.com/ytdl-org/youtube-dl
.. _yt-dlp: https://github.com/yt-dlp/yt-dlp
.. _FFmpeg: https://www.ffmpeg.org/
.. _requests.request(): https://requests.readthedocs.io/en/master/api/#requests.request
.. _timeout: https://requests.readthedocs.io/en/master/user/advanced/#timeouts
.. _verify: https://requests.readthedocs.io/en/master/user/advanced/#ssl-cert-verification
.. _`Requests' proxy documentation`: https://requests.readthedocs.io/en/master/user/advanced/#proxies