Mike Fährmann
cad85640de
move 'util.PathFormat' into its own 'path' module
...
to prevent circular imports between 'formatter' and 'util'
2021-09-27 21:29:37 +02:00
Mike Fährmann
0038a8c1a4
implement a way to specify 'extended' format strings
...
Starting a format string with '\f<Type> ' allows to set a different
format string type than the default. Available ones are
- T: A template file containing the actual format string
"\fT ~/.templates/booru.txt
- E: An arbitrary Python expression
"\fE title.upper().replace(' ', '-')"
- M: Name of a Python module followed by a function name inside it.
This function gets called with the current metadata dict as
argument and should return a string.
"\fM my_module:generate_text"
'\f' was chosen since it is highly unlikely that a regular format string
would start with it, but that character could be changed to for example
'\\' or '/' etc.
2021-09-27 19:46:14 +02:00
Mike Fährmann
76c7d3f977
add FUNDING.yml ( #1877 )
2021-09-27 19:21:39 +02:00
Mike Fährmann
74145467dd
move 'util.Formatter' into its own 'formatter' module
2021-09-27 02:37:04 +02:00
Mike Fährmann
9377543162
[mastodon] add 'following' extractor ( #1891 )
2021-09-26 00:12:34 +02:00
Mike Fährmann
2c2932973c
[mastodon] support specifying accounts by ID
...
Same as a3b473bd
for Twitter
Instead of just
https://instance.tld/@user
it is now also possible to refer to that account with
https://instance.tld/users/user
https://instance.tld/@id:12345
https://instance.tld/users/id:12345
2021-09-25 20:28:16 +02:00
Mike Fährmann
94143eb86c
[twitter] add 'quote_by' metadata field ( #1481 )
...
Only present for tweets quoted by another tweet.
Represents the tweet_id of said tweet quoting this one.
2021-09-25 18:15:14 +02:00
Mike Fährmann
1cc73e11aa
add '--write-infojson' command-line option
2021-09-23 18:02:20 +02:00
Mike Fährmann
a23f5d45d7
[deviantart] fix bug with fetching premium content ( #1879 )
...
When a user has both 'watchers' and 'paid' folders and one of them is
inaccessible, the other one could get handled as inaccessible as well.
2021-09-23 18:02:20 +02:00
Mike Fährmann
ada36c2044
[deviantart] update default archive_fmt for single deviations
...
(#1874 )
use the same as gallery downloads
2021-09-23 18:02:19 +02:00
Mike Fährmann
140a7bfdcd
update --clear-cache command in deviantart client-id instructions
2021-09-23 18:02:19 +02:00
Mike Fährmann
da16eabb82
[twitter] ensure card entries have a 'url' ( #1868 )
2021-09-23 18:02:19 +02:00
Mike Fährmann
e69ee41f25
implement 'page-reverse' option ( #1854 )
2021-09-23 18:02:19 +02:00
cyberdrop-me
c83668c2ff
[CyberDrop] Change directory name format ( #1871 )
...
Album IDs are random, organization would be much better having the album name then the identifier at the end
2021-09-20 15:20:47 +02:00
Mike Fährmann
e4684c5cb9
[desktopography] simplify ( #1740 )
2021-09-17 20:09:24 +02:00
Giacomo Rossetto
4a7d7899ff
Implement desktopography extractor ( #1740 )
2021-09-17 19:59:51 +02:00
Alice
9992ff38da
[fantia] add 'date' metadata field ( #1853 )
2021-09-17 19:59:00 +02:00
Mike Fährmann
fba95c3a9e
[nozomi] preserve case of search tags ( fixes #1860 )
2021-09-16 16:43:06 +02:00
Mike Fährmann
7b05600056
[pinterest] only mention cookies for authentication ( #1856 )
...
regular login is broken due to CAPTCHA
2021-09-16 16:43:05 +02:00
Mike Fährmann
4b3e309b90
[aryion] update/improve pagination ( #1849 )
...
Manually increment the 'p' query parameter,
instead of relying on a "Next" link which only works up to page 200.
2021-09-16 16:27:25 +02:00
Mike Fährmann
266ed9b62e
[aryion] add 'tag' extractor ( closes #1849 )
2021-09-14 23:33:33 +02:00
Mike Fährmann
6bbeaac029
[mangadex] fix extraction ( fixes #1852 )
2021-09-14 17:48:41 +02:00
Mike Fährmann
e9bf8d2591
[instagram] update default delay to 6-12 seconds ( #1835 )
2021-09-14 17:47:11 +02:00
Mike Fährmann
c9e6693530
allow specifying a minimum/maximum for 'sleep-*' options ( #1835 )
...
for example '"sleep-request": [5.0, 10.0]' to wait between 5 and 10
seconds between each HTTP request
2021-09-14 17:40:05 +02:00
Mike Fährmann
bd845303ad
implement a way to shorten filenames with east-asian characters
...
(#1377 )
Setting 'output.shorten' to "eaw" (East-Asian Width) uses a slower
algorithm that also considers characters with a width > 1.
2021-09-13 21:38:33 +02:00
Mike Fährmann
2ff2974353
[common] update default argument handling in Extractor.request()
...
more lines of code, but slightly less execution time
2021-09-11 01:26:11 +02:00
Mike Fährmann
0fd959a2a7
[twitter] support '/with_replies' URLs ( closes #1833 )
2021-09-10 20:44:26 +02:00
Mike Fährmann
e93360e45d
[reddit] extend subcategory depending on input URL ( closes #1836 )
...
- https://www.reddit.com/r/lavaporn/
-> 'subreddit'
- https://www.reddit.com/r/lavaporn/new/
-> 'subreddit-new'
- https://www.reddit.com/user/username/
-> 'user'
- https://www.reddit.com/user/username/gilded/
-> 'user-gilded'
2021-09-10 20:44:17 +02:00
Mike Fährmann
7bbb1f92d7
[gelbooru_v02] add 'favorite' extractor ( closes #1834 )
2021-09-10 20:43:59 +02:00
Mike Fährmann
4ec11af6a4
[kemonoparty] implement login with username & password ( #1824 )
2021-09-09 01:06:25 +02:00
Mike Fährmann
0e33746fe0
[artstation] use '/album/all' view for user portfolios ( #1826 )
2021-09-08 21:46:58 +02:00
Mike Fährmann
4f5f9ed1e5
[oauth] fix typo
...
this has been here since February ...
(8974f036
)
2021-09-08 20:14:39 +02:00
Mike Fährmann
83bbb628d8
[kemonoparty] add 'favorite' extractor ( #1824 )
2021-09-08 00:32:49 +02:00
Mike Fährmann
35d75a4071
[erome] send Referer header for file downloads ( fixes #1829 )
2021-09-08 00:29:16 +02:00
Mike Fährmann
44f572c27f
[deviantart] implement a 'auto-unwatch' option ( #1466 , #1757 )
2021-09-08 00:27:28 +02:00
Mike Fährmann
d79bcb6236
allow extractors to register a 'finalize()' method
2021-09-07 21:15:30 +02:00
Mike Fährmann
9158853ff7
release version 1.18.4
2021-09-04 21:09:56 +02:00
Mike Fährmann
268c9ca034
update default gallery-dl.conf
2021-09-03 19:37:12 +02:00
Mike Fährmann
47a780942c
update extractor test results
2021-09-03 19:36:12 +02:00
Mike Fährmann
eed6ef3de0
[pixiv] fix pixivision title extraction
2021-09-02 22:34:59 +02:00
Mike Fährmann
7645cdfb88
[inkbunny] fix extraction ( closes #1816 )
...
'digitalsales', 'forsale', and 'printsales'
are no longer included in the data returned from the API.
2021-09-02 18:41:08 +02:00
Mike Fährmann
3e36543c98
[nhentai] add 'favorite' extractor ( #1814 )
2021-09-02 18:26:22 +02:00
Mike Fährmann
656358ea92
[nhentai] use API endpoint for gallery data
2021-09-02 17:40:41 +02:00
Mike Fährmann
8cd7759682
[reddit] cleanup ReddeitAPI.__init__ ( #1813 )
...
- remove warning about 'client-id'/'user-agent' mismatch
- only use 'user-agent' from config for custom 'client-id'
2021-08-31 17:08:02 +02:00
Mike Fährmann
0a94fe5774
[reddit] delay RedditAPI initialization ( #1813 )
...
Move it outside the constructor so that eventual exceptions can get
caught in the expected places.
2021-08-30 22:39:40 +02:00
Mike Fährmann
57854624a1
[exhentai] improve image limits check ( #1808 )
...
Check for a 'text/html' Content-Type instead of the very specific
137 bytes Content-Length, which might change depending on compression
or other factors.
2021-08-29 21:19:55 +02:00
Mike Fährmann
96fec14ef7
[deviantart] rename 'watch' option to 'auto-watch'
...
(#1466 , #1757 )
Similar reason as in e05a96db
.
'watch' is already used by the WatchExtractor class.
2021-08-29 21:10:53 +02:00
Mike Fährmann
e75f2de9da
[deviantart] add 'comments' option ( #1800 )
2021-08-29 20:56:34 +02:00
Mike Fährmann
6ce16c6d31
[deviantart] add 'tag' extractor ( closes #1803 )
2021-08-28 23:42:10 +02:00
Mike Fährmann
4e9f8fe395
[shopify] support windsorstore.com ( #1793 )
2021-08-28 18:34:12 +02:00