Mike Fährmann
f4e3cee6ac
use yt-dlp by default ( #1850 , #2028 )
2021-11-29 18:24:26 +01:00
Mike Fährmann
f1b142e993
[kemonoparty] change default 'files' order to attachments,file,inline
...
(#1991 )
2021-11-29 04:41:30 +01:00
Mike Fährmann
275543b2d2
update extractor test results
2021-11-27 19:26:44 +01:00
Mike Fährmann
e7ea4f2567
[mangoxo] fix metadata extraction
2021-11-27 18:19:51 +01:00
Mike Fährmann
e298882acc
[kemonoparty] match URLs with www subdomain
2021-11-26 18:58:26 +01:00
Mike Fährmann
addb72e1bb
[reactor] support thatpervert.com ( closes #2029 )
2021-11-26 18:58:07 +01:00
Mike Fährmann
d8d9502e1e
[reactor] inherit from BaseExtractor
2021-11-26 18:58:07 +01:00
Mike Fährmann
f4ea216c95
[shopify] support loungeunderwear.com ( closes #2053 )
2021-11-26 18:58:06 +01:00
Mike Fährmann
93cef78450
[gelbooru] workaround pagination limits
...
Gelbooru only allows retrieving the latest 20k posts for a tag search.
Add 'id:<N' to the search tags to work around that limitation, where N
is the ID of the last retrieved post.
http://gelbooru.me/index.php?page=forum&s=view&id=1467
2021-11-26 18:56:31 +01:00
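The workaround described in this commit could look roughly like the sketch below. It is an illustration only, not the extractor's actual code: `fetch_page` is a hypothetical callable standing in for the API request, and `page_limit` is an assumed parameter for the number of pages reachable before the limit applies.

```python
def iter_posts(tags, fetch_page, page_limit):
    """Yield posts for a tag search, restarting with 'id:<N' at the limit.

    fetch_page(tags, page) is a hypothetical callable returning a list of
    post dicts (newest first), each with an integer 'id'.
    """
    search = tags
    page = 0
    while True:
        posts = fetch_page(search, page)
        if not posts:
            return
        yield from posts
        page += 1
        if page >= page_limit:
            # Restart pagination below the ID of the last retrieved post.
            search = "{} id:<{}".format(tags, posts[-1]["id"])
            page = 0
```

Restarting at page 0 with a narrowed search keeps every request inside the range the site allows.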
Mike Fährmann
f2ae179713
[exhentai] fix extraction for disowned galleries ( closes #2055 )
2021-11-24 21:26:16 +01:00
Alice
612850438e
[skeb] add 'thumbnails' option ( #2047 ) ( #2051 )
2021-11-23 21:16:42 +01:00
Mike Fährmann
11a3d96d13
[mangadex] load additional metadata using includes[] directives
...
- always provide 'artist', 'author', and 'group' metadata fields (#2049 )
- remove 'metadata' option
2021-11-22 01:16:33 +01:00
Mike Fährmann
19e00f1322
[dynastyscans] provide 'date' as proper datetime object ( #2050 )
2021-11-21 22:50:52 +01:00
Mike Fährmann
af6424f398
allow testing metadata in list elements
2021-11-21 22:46:34 +01:00
Mike Fährmann
c67756e187
[kemonoparty] add 'dms' option ( #2008 )
2021-11-20 23:36:16 +01:00
Mike Fährmann
3a7a19c7b9
[dynastyscans] add 'manga' extractor ( closes #2035 )
2021-11-19 22:51:26 +01:00
Mike Fährmann
9bc83af3a6
[kemonoparty] 'postfile' -> 'file' ( #1991 )
...
to stay consistent with the existing file types for kemono
2021-11-19 01:50:48 +01:00
Mike Fährmann
522782c09d
[subscribestar] emit metadata for posts without media ( #1569 )
2021-11-18 23:42:17 +01:00
Mike Fährmann
1c8aaf9318
[subscribestar] add 'num' enumeration index ( closes #2040 )
2021-11-18 23:38:41 +01:00
Mike Fährmann
d433735750
[kemonoparty] skip duplicate files ( #2032 , #1991 , #1899 )
...
Extract the SHA-256 file hash from URLs
and skip files with the same hash in the same post.
- provide a 'hash' metadata field (empty string if not available)
- remove 'patreon-skip-file' option
2021-11-17 22:44:15 +01:00
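A minimal sketch of the idea in this commit, assuming the hash appears in the URL as 64 consecutive lowercase hexadecimal characters. The pattern and the function name `unique_files` are assumptions made for illustration, not the extractor's real code.

```python
import re

# Assumption: a SHA-256 digest shows up as 64 lowercase hex characters.
HASH_RE = re.compile(r"[0-9a-f]{64}")


def unique_files(urls):
    """Yield (url, hash) pairs, skipping URLs whose hash was already seen."""
    seen = set()
    for url in urls:
        match = HASH_RE.search(url)
        # Use an empty string when no hash is available.
        file_hash = match.group(0) if match else ""
        if file_hash:
            if file_hash in seen:
                continue
            seen.add(file_hash)
        yield url, file_hash
```

Files without a recognizable hash are always kept, since nothing is known about their content.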
Mike Fährmann
d4ec245554
[kemonoparty] implement a 'files' option ( #1991 )
...
similar to 8d676151
2021-11-17 22:43:41 +01:00
Mike Fährmann
ab8eea1a24
[twitter] fix extractor for direct image links ( fixes #2030 )
2021-11-16 22:57:46 +01:00
Mike Fährmann
2076d40681
[ytdl] improve error handling ( #1680 )
2021-11-15 22:56:42 +01:00
Mike Fährmann
2aaac3c997
[instagram] include user metadata for 'tagged' downloads ( #2024 )
...
Adds
- tagged_owner_id
- tagged_full_name
- tagged_username
containing the values for the user profile the URL originated from,
e.g. 'instagram' for https://www.instagram.com/instagram/tagged/ .
2021-11-15 21:21:59 +01:00
Mike Fährmann
cfa4876848
[philomena] support furbooru.org ( closes #1995 )
2021-11-15 20:57:51 +01:00
Mike Fährmann
4377f1c284
[twitter] distinguish between fatal & nonfatal errors ( #2020 )
...
only show a warning for nonfatal errors
and do not raise a StopExtraction exception
2021-11-13 22:46:40 +01:00
Kyle Anthony Williams
a14b72be21
[webtoons] Use swebtoon-phinf.pstatic.net instead of webtoon-phinf.pstatic.net ( #2005 )
...
* [webtoons] Use swebtoon-phinf.pstatic.net instead of webtoon-phinf.pstatic.net
This trick to avoid having to set a Referer header comes from
Webtoon's RSS feeds. The two URLs below are equivalent in content:
https://webtoon-phinf.pstatic.net/20210929_153/1632867980912DmcGK_JPEG/16328679808882705182.jpg?type=q90
https://swebtoon-phinf.pstatic.net/20210929_153/1632867980912DmcGK_JPEG/16328679808882705182.jpg?type=q90
The URL with the domain "webtoon-phinf.pstatic.net" needs a Referer
header, and the domain "swebtoon-phinf.pstatic.net" does not. This
is because of the environment "swebtoon" images live in, one without
explicit network control: RSS feeds on sites such as Feedly. This change should
make it easier for gallery-dl developers to embed Webtoon comics without
worrying about headers.
2021-11-11 20:03:34 +01:00
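The domain swap described in this commit amounts to a single string replacement. The helper name `use_rss_domain` below is hypothetical; only the two domain names come from the commit message.

```python
def use_rss_domain(url):
    """Rewrite a webtoon-phinf image URL to its swebtoon-phinf equivalent."""
    # Only the host prefix changes; path and query string stay the same.
    return url.replace("://webtoon-phinf.", "://swebtoon-phinf.", 1)
```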
Mike Fährmann
6e3658ef52
[kemonoparty] provide 'date' metadata for gumroad ( #2007 )
...
Not the 'published' or 'edited' values since they are 'null',
but still better than nothing at all.
2021-11-11 19:38:10 +01:00
Mike Fährmann
37c9dedee1
[seisoparty] remove module
2021-11-09 22:41:04 +01:00
Mike Fährmann
efa178cc91
[ytdl] implement parsing ytdl command-line options ( #1680 )
...
- adds 'config-file' and 'cmdline-args' options
for both ytdl downloader and extractor
- create 'ytdl' helper module, which combines YoutubeDL creation
and option parsing.
- most likely a buggy mess due to incompatibilities between the
original youtube-dl and yt-dlp.
2021-11-07 02:44:11 +01:00
Mike Fährmann
7cb303d745
[redgifs] improve URL extraction
...
Fields inside 'urls' can be None, which would have caused an exception
with the old method.
2021-11-05 20:02:43 +01:00
Mike Fährmann
2befed1a96
[redgifs] update search URL pattern ( #1984 )
2021-11-05 20:00:06 +01:00
Mike Fährmann
b315a0ecef
[redgifs] update to API v2 ( #1984 )
2021-11-04 21:31:20 +01:00
Mike Fährmann
f0fc3b0ba1
[kemonoparty] add 'comments' option ( #1980 )
2021-11-03 23:02:13 +01:00
Mike Fährmann
1fac74b14d
[reddit] prevent crash for galleries with no 'media_metadata'
...
(fixes #2001 )
2021-11-03 17:55:40 +01:00
Mike Fährmann
211de95dd0
update extractor test results
2021-11-01 02:58:53 +01:00
Mike Fährmann
8bea02c38c
[deviantart] fix 'index' values for stashed deviations
2021-11-01 01:08:24 +01:00
Mike Fährmann
dd88a7d980
[cyberdrop] restore video extraction ( fixes #1993 )
...
fixes a regression introduced in f33c2ef7
2021-10-31 04:34:01 +01:00
Mike Fährmann
fa5646eadc
[mangoxo] fix login and extraction
2021-10-31 02:16:13 +01:00
Mike Fährmann
4c49174579
[mangakakalot] update domain and fix extraction
2021-10-31 02:16:13 +01:00
YongChan Cho
14852f7050
[hitomi] fix image path ( #1988 )
2021-10-30 21:45:01 +02:00
Mike Fährmann
dad2875a3e
fix calculating retry sleep times ( fixes #1990 )
2021-10-29 23:53:48 +02:00
Mike Fährmann
9156e90f1f
[twitter] add 'pinned' option
2021-10-29 22:10:58 +02:00
Mike Fährmann
06b414c9a3
[redgifs] 'gfyId' -> 'id' ( #1984 )
2021-10-29 02:05:39 +02:00
Ryu juheon
d4614e5ba4
[hitomi] fix image URLs ( #1982 )
2021-10-28 19:29:48 +02:00
Mike Fährmann
6434ccf9e8
[redgifs] split from 'gfycat' ( #1984 )
...
Update API endpoints and metadata names - mostly 'gfycat' -> 'gif' -
and remove some obsolete checks.
2021-10-28 19:22:41 +02:00
Mike Fährmann
e4696b40ba
[instagram] update query hashes
2021-10-27 21:37:31 +02:00
Alice
bfd7401b1e
[skeb] add 'user' and 'post' extractors ( #1031 ) ( #1971 )
...
* Create skeb.py
* Update __init__.py
* Update supportedsites.py
* Update supportedsites.md
* Update supportedsites.py
* Update skeb.py
2021-10-26 20:00:41 +02:00
Ryu juheon
6b6d92d51c
[hitomi]: fix image URLs ( #1975 )
2021-10-26 19:35:01 +02:00
Mike Fährmann
dcb201ff19
[gfycat] show warning when there are no available formats
2021-10-26 19:26:50 +02:00
Mike Fährmann
e436a2607b
[gfycat] consistent 'userName' values for 'user' downloads ( #1962 )
...
by using the name from the input URL and not relying on possibly faulty
or incomplete API results.
'userData[username]', if available, will still have the original name.
2021-10-26 19:15:30 +02:00
Mike Fährmann
f1487a3cfa
[kemonoparty:discord] improve 'inline' extraction ( #1940 )
...
- extract media.discordapp.*NET* URLs
- rewrite media.discordapp.net to cdn.discordapp.com
- use a more restricted set of characters for the URL path
2021-10-24 21:15:21 +02:00
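The three points of this commit can be sketched as one pattern plus a rewrite step. The character class used for the URL path is an assumption chosen for illustration; the commit does not state which characters the extractor accepts.

```python
import re

# Match both hosts; the path character set is an assumed, restricted one.
INLINE_RE = re.compile(
    r"https?://(?:cdn\.discordapp\.com|media\.discordapp\.net)"
    r"(/[A-Za-z0-9\-._~:/?#@!$&()*+,;%=]+)"
)


def find_inline(text):
    """Return inline file URLs, always pointing at cdn.discordapp.com."""
    return ["https://cdn.discordapp.com" + path
            for path in INLINE_RE.findall(text)]
```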
Mike Fährmann
02a247f4e5
[deviantart] full resolution for non-downloadable images ( #293 )
...
Many thanks to @Ironchest337 for discovering this method
and providing a well-documented implementation.
2021-10-24 21:11:12 +02:00
Mike Fährmann
a7ddb5f5fa
[deviantart] update 'search' argument handling ( fixes #1911 )
...
- use 'alltime' by default
- support newer 'order' values (most-recent, this-week, etc)
2021-10-23 21:48:02 +02:00
Mike Fährmann
c19e762fdf
[vk] add 'album' extractor ( #474 , fixes #1952 )
...
todo: better metadata for albums
2021-10-23 00:46:20 +02:00
Mike Fährmann
8bb442f20d
[redgifs][gfycat] provide fallback URLs ( fixes #1962 )
...
and extend the 'format' option
2021-10-22 22:47:29 +02:00
Mike Fährmann
b6443c576d
[kemonoparty:discord] extract 'inline' files
2021-10-22 02:50:47 +02:00
Mike Fährmann
bcbf9bcf36
[kemonoparty] split 'discord' extractor ( #1940 )
...
into 'server' and 'channel'
2021-10-18 04:04:58 +02:00
Mike Fährmann
db857b40d8
[kemonoparty] improve inline extraction ( #1899 )
2021-10-17 21:47:11 +02:00
Mike Fährmann
975e0a4fe0
[furaffinity] unquote search queries ( #1958 )
...
instead of unescape
(unquote -> url params, unescape -> html entities)
2021-10-17 20:48:08 +02:00
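The difference between the two standard-library functions named in this commit can be seen in a short example. The sample strings are made up for illustration.

```python
from html import unescape
from urllib.parse import unquote

query = "cats%20%26%20dogs"

# unquote() decodes percent-encoded URL parameters.
decoded = unquote(query)

# unescape() only handles HTML entities, so the query is left untouched.
unchanged = unescape(query)

# HTML entities are what unescape() is meant for.
entity = unescape("cats &amp; dogs")
```

Here `decoded` and `entity` are both `cats & dogs`, while `unchanged` still contains the percent escapes.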
Mike Fährmann
8d676151b7
[patreon] implement 'files' option ( #1935 )
2021-10-17 04:17:10 +02:00
Mike Fährmann
6695ef2e10
[patreon] better filenames for 'content' images ( #1954 )
2021-10-16 23:21:14 +02:00
Mike Fährmann
70005e3275
[kemonoparty:discord] support downloading from a specific channel
...
https://kemono.party/discord/server/<server-id>#<channel-name>
2021-10-15 18:50:08 +02:00
Mike Fährmann
003f25931d
[kemonoparty:discord] provide a 'channel_name'
2021-10-15 18:37:08 +02:00
Mike Fährmann
28bdd58e6d
[nhentai] simplify
2021-10-14 16:45:53 +02:00
Mike Fährmann
50098762e3
[nhentai] add 'tag' extractor ( closes #1950 )
2021-10-14 16:23:47 +02:00
Mike Fährmann
fe6ce5495a
[kemonoparty] add 'discord' extractor ( #1827 , #1940 )
2021-10-13 20:33:05 +02:00
Mike Fährmann
918fc9974d
[picarto] add 'gallery' extractor ( closes #1931 )
2021-10-13 01:22:10 +02:00
Mike Fährmann
e33125ad39
[pixiv] add 'sketch' extractor ( #1497 )
2021-10-13 00:02:11 +02:00
Mike Fährmann
e9dc6ff262
[inkbunny] add 'following' extractor ( #515 )
2021-10-11 21:10:20 +02:00
Mike Fährmann
9c8fc6e7b4
[inkbunny] match "long" URLs for pools and favorites ( #1937 )
2021-10-11 17:24:22 +02:00
Mike Fährmann
f33c2ef73b
[cyberdrop] extract direct download URLs ( #1943 )
...
do not rely on redirects from f.cyberdrop.cc
2021-10-10 23:05:29 +02:00
Mike Fährmann
b93915c113
[inkbunny] add 'pool' extractor ( #1937 )
2021-10-09 22:07:07 +02:00
Mike Fährmann
373d3e1c57
[seisoparty] implement login with username & password ( #1906 )
2021-10-08 22:44:31 +02:00
Mike Fährmann
7c5f62b453
[seisoparty] add 'favorite' extractor ( #1906 )
2021-10-08 22:26:34 +02:00
Mike Fährmann
d93b5474c3
[mangadex] update parameter handling for API requests
...
- move common parameters into '_pagination()'
- add 'ratings' (#1908 ) and 'api-parameters' options
2021-10-05 19:46:48 +02:00
Mike Fährmann
cd66c3c415
[twitter] add 'size' option ( #1881 )
2021-10-05 19:14:54 +02:00
Mike Fährmann
fb98b3fdaf
[redgifs][gfycat] remove webtoken code ( fixes #1907 )
2021-10-01 17:08:33 +02:00
Mike Fährmann
96215c926e
[mangadex] fix retrieving chapters from 'pornographic' titles
...
(fixes #1908 )
2021-10-01 16:48:53 +02:00
Mike Fährmann
da9685609c
[kemonoparty] update file download URLs
...
(closes #1902 , fixes #1903 )
2021-09-30 23:02:46 +02:00
Mike Fährmann
783eae6fc5
[hiperdex] fix extraction
2021-09-30 00:32:34 +02:00
Mike Fährmann
e0bdacd932
[fappic] add 'image' extractor ( closes #1898 )
2021-09-28 23:35:29 +02:00
Mike Fährmann
9377543162
[mastodon] add 'following' extractor ( #1891 )
2021-09-26 00:12:34 +02:00
Mike Fährmann
2c2932973c
[mastodon] support specifying accounts by ID
...
Same as a3b473bd for Twitter.
Instead of just
https://instance.tld/@user
it is now also possible to refer to that account with
https://instance.tld/users/user
https://instance.tld/@id:12345
https://instance.tld/users/id:12345
2021-09-25 20:28:16 +02:00
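The four URL forms listed in this commit could be told apart with a pattern like the one below. This is a sketch under the assumption that an `id:` prefix marks a numeric account ID; the regular expression and the name `parse_account` are hypothetical.

```python
import re

ACCOUNT_RE = re.compile(r"https?://[^/?#]+/(?:@|users/)(id:)?([^/?#]+)")


def parse_account(url):
    """Return ('id', value) or ('name', value), or None if nothing matches."""
    match = ACCOUNT_RE.match(url)
    if not match:
        return None
    is_id, value = match.groups()
    return ("id", value) if is_id else ("name", value)
```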
Mike Fährmann
94143eb86c
[twitter] add 'quote_by' metadata field ( #1481 )
...
Only present for tweets quoted by another tweet.
Represents the tweet_id of said tweet quoting this one.
2021-09-25 18:15:14 +02:00
Mike Fährmann
a23f5d45d7
[deviantart] fix bug with fetching premium content ( #1879 )
...
When a user has both 'watchers' and 'paid' folders and one of them is
inaccessible, the other one could get handled as inaccessible as well.
2021-09-23 18:02:20 +02:00
Mike Fährmann
ada36c2044
[deviantart] update default archive_fmt for single deviations
...
(#1874 )
use the same as gallery downloads
2021-09-23 18:02:19 +02:00
Mike Fährmann
da16eabb82
[twitter] ensure card entries have a 'url' ( #1868 )
2021-09-23 18:02:19 +02:00
Mike Fährmann
e69ee41f25
implement 'page-reverse' option ( #1854 )
2021-09-23 18:02:19 +02:00
cyberdrop-me
c83668c2ff
[CyberDrop] Change directory name format ( #1871 )
...
Album IDs are random, so directories are easier to organize with the album name first and the identifier at the end.
2021-09-20 15:20:47 +02:00
Mike Fährmann
e4684c5cb9
[desktopography] simplify ( #1740 )
2021-09-17 20:09:24 +02:00
Giacomo Rossetto
4a7d7899ff
Implement desktopography extractor ( #1740 )
2021-09-17 19:59:51 +02:00
Alice
9992ff38da
[fantia] add 'date' metadata field ( #1853 )
2021-09-17 19:59:00 +02:00
Mike Fährmann
fba95c3a9e
[nozomi] preserve case of search tags ( fixes #1860 )
2021-09-16 16:43:06 +02:00
Mike Fährmann
4b3e309b90
[aryion] update/improve pagination ( #1849 )
...
Manually increment the 'p' query parameter,
instead of relying on a "Next" link which only works up to page 200.
2021-09-16 16:27:25 +02:00
Mike Fährmann
266ed9b62e
[aryion] add 'tag' extractor ( closes #1849 )
2021-09-14 23:33:33 +02:00
Mike Fährmann
6bbeaac029
[mangadex] fix extraction ( fixes #1852 )
2021-09-14 17:48:41 +02:00
Mike Fährmann
e9bf8d2591
[instagram] update default delay to 6-12 seconds ( #1835 )
2021-09-14 17:47:11 +02:00
Mike Fährmann
c9e6693530
allow specifying a minimum/maximum for 'sleep-*' options ( #1835 )
...
for example '"sleep-request": [5.0, 10.0]' to wait between 5 and 10
seconds between each HTTP request
2021-09-14 17:40:05 +02:00
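One way to interpret such an option value is sketched below. The function name `sleep_duration` is hypothetical, and picking a uniformly random value from the range is an assumption; the commit message only says the wait lies between the two numbers.

```python
import random


def sleep_duration(value):
    """Return a wait time in seconds for a number or a [min, max] pair."""
    if isinstance(value, (list, tuple)):
        low, high = value
        # Assumption: pick a uniformly random value inside the range.
        return random.uniform(low, high)
    return float(value)
```

With `[5.0, 10.0]` every call returns some value between 5 and 10 seconds, while a plain number is used as-is.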
Mike Fährmann
2ff2974353
[common] update default argument handling in Extractor.request()
...
more lines of code, but slightly less execution time
2021-09-11 01:26:11 +02:00
Mike Fährmann
0fd959a2a7
[twitter] support '/with_replies' URLs ( closes #1833 )
2021-09-10 20:44:26 +02:00
Mike Fährmann
e93360e45d
[reddit] extend subcategory depending on input URL ( closes #1836 )
...
- https://www.reddit.com/r/lavaporn/
-> 'subreddit'
- https://www.reddit.com/r/lavaporn/new/
-> 'subreddit-new'
- https://www.reddit.com/user/username/
-> 'user'
- https://www.reddit.com/user/username/gilded/
-> 'user-gilded'
2021-09-10 20:44:17 +02:00
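The mapping listed in this commit could be derived from the URL roughly as follows. The pattern and the function name `subcategory` are assumptions for illustration and do not reproduce the extractor's code.

```python
import re

REDDIT_RE = re.compile(
    r"https?://(?:www\.)?reddit\.com/(r|user)/([^/?#]+)(?:/([a-z]+))?"
)


def subcategory(url):
    """Return e.g. 'subreddit', 'subreddit-new', 'user' or 'user-gilded'."""
    match = REDDIT_RE.match(url)
    if not match:
        return None
    kind, _name, listing = match.groups()
    base = "subreddit" if kind == "r" else "user"
    # Append the listing part of the path, if the URL has one.
    return base + "-" + listing if listing else base
```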
Mike Fährmann
7bbb1f92d7
[gelbooru_v02] add 'favorite' extractor ( closes #1834 )
2021-09-10 20:43:59 +02:00
Mike Fährmann
4ec11af6a4
[kemonoparty] implement login with username & password ( #1824 )
2021-09-09 01:06:25 +02:00
Mike Fährmann
0e33746fe0
[artstation] use '/album/all' view for user portfolios ( #1826 )
2021-09-08 21:46:58 +02:00
Mike Fährmann
4f5f9ed1e5
[oauth] fix typo
...
this has been here since February ... (8974f036)
2021-09-08 20:14:39 +02:00
Mike Fährmann
83bbb628d8
[kemonoparty] add 'favorite' extractor ( #1824 )
2021-09-08 00:32:49 +02:00
Mike Fährmann
35d75a4071
[erome] send Referer header for file downloads ( fixes #1829 )
2021-09-08 00:29:16 +02:00
Mike Fährmann
44f572c27f
[deviantart] implement an 'auto-unwatch' option ( #1466 , #1757 )
2021-09-08 00:27:28 +02:00
Mike Fährmann
d79bcb6236
allow extractors to register a 'finalize()' method
2021-09-07 21:15:30 +02:00
Mike Fährmann
47a780942c
update extractor test results
2021-09-03 19:36:12 +02:00
Mike Fährmann
eed6ef3de0
[pixiv] fix pixivision title extraction
2021-09-02 22:34:59 +02:00
Mike Fährmann
7645cdfb88
[inkbunny] fix extraction ( closes #1816 )
...
'digitalsales', 'forsale', and 'printsales'
are no longer included in the data returned from the API.
2021-09-02 18:41:08 +02:00
Mike Fährmann
3e36543c98
[nhentai] add 'favorite' extractor ( #1814 )
2021-09-02 18:26:22 +02:00
Mike Fährmann
656358ea92
[nhentai] use API endpoint for gallery data
2021-09-02 17:40:41 +02:00
Mike Fährmann
8cd7759682
[reddit] cleanup RedditAPI.__init__ ( #1813 )
...
- remove warning about 'client-id'/'user-agent' mismatch
- only use 'user-agent' from config for custom 'client-id'
2021-08-31 17:08:02 +02:00
Mike Fährmann
0a94fe5774
[reddit] delay RedditAPI initialization ( #1813 )
...
Move it outside the constructor so that eventual exceptions can get
caught in the expected places.
2021-08-30 22:39:40 +02:00
Mike Fährmann
57854624a1
[exhentai] improve image limits check ( #1808 )
...
Check for a 'text/html' Content-Type instead of the very specific
137 bytes Content-Length, which might change depending on compression
or other factors.
2021-08-29 21:19:55 +02:00
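The check described in this commit can be sketched as a test on the response headers. The function name `is_limit_page` is hypothetical, and a plain dict stands in for a real response's headers.

```python
def is_limit_page(headers):
    """Return True if the response is an HTML page rather than an image."""
    # Look at the media type only; the Content-Length may vary.
    content_type = headers.get("Content-Type", "")
    return content_type.lower().startswith("text/html")
```

Ignoring the length makes the check independent of compression and similar factors.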
Mike Fährmann
96fec14ef7
[deviantart] rename 'watch' option to 'auto-watch'
...
(#1466 , #1757 )
Similar reason as in e05a96db.
'watch' is already used by the WatchExtractor class.
2021-08-29 21:10:53 +02:00
Mike Fährmann
e75f2de9da
[deviantart] add 'comments' option ( #1800 )
2021-08-29 20:56:34 +02:00
Mike Fährmann
6ce16c6d31
[deviantart] add 'tag' extractor ( closes #1803 )
2021-08-28 23:42:10 +02:00
Mike Fährmann
4e9f8fe395
[shopify] support windsorstore.com ( #1793 )
2021-08-28 18:34:12 +02:00
Mike Fährmann
95157e0f4b
[shopify] use API for product listings ( #1793 )
2021-08-28 18:23:22 +02:00
Mike Fährmann
6651da27e9
[twitter] fix 'url' extraction for users without 'expanded_url'
...
(#1532 , #1787 )
2021-08-27 18:41:16 +02:00
Mike Fährmann
ecc8da4704
[deviantart] implement a 'watch' option ( #1466 , #1757 )
2021-08-27 18:39:43 +02:00
Mike Fährmann
a4f249c22e
[deviantart] prevent exception on empty videos ( fixes #1796 )
2021-08-25 20:14:50 +02:00
Mike Fährmann
ae78d95a5f
[twitter] fix issue when filtering quote tweets ( #1792 )
...
When a user quotes his own Tweet and that Tweet gets filtered by
'"quoted": false', it could also get filtered when it appeared later
as regular Tweet.
2021-08-25 20:04:22 +02:00
Mike Fährmann
6b229ac829
[furaffinity] expand URL pattern for searches ( closes #1780 )
2021-08-23 23:38:52 +02:00
Mike Fährmann
0817f468ef
[twitter] expand t.co links in user descriptions ( #1532 , #1787 )
2021-08-23 23:34:59 +02:00
Mike Fährmann
7c0ae88185
[twitter] add 'url' to user objects ( #1532 , #1787 )
2021-08-23 22:51:35 +02:00
Mike Fährmann
5919dc5b5a
[twitter] slightly improve '_transform_user()'
2021-08-23 22:28:09 +02:00
Mike Fährmann
c04f7ab139
[foolfuuka] add 'gallery' extractor ( #1785 )
2021-08-21 22:46:23 +02:00
Mike Fährmann
ddd175de77
[mangadex] prevent KeyError for manga without English title
2021-08-21 22:46:23 +02:00
Mike Fährmann
20ee091289
[420chan] add 'thread' and 'board' extractors ( closes #1773 )
2021-08-21 22:46:22 +02:00
Mike Fährmann
6b56b3ebe1
[twitter] report API errors as generic StopExtraction exceptions
...
prevents duplicate logging messages for nonexistent users
(#1759 )
2021-08-21 22:46:22 +02:00
Mike Fährmann
51eb50749f
[foolslide] remove entry for kobato.hologfx.com
2021-08-21 22:46:22 +02:00
Mike Fährmann
4718f9c5dd
[oauth] use defaults when config values are set to None/null
...
(fixes #1778 )
2021-08-21 22:46:01 +02:00
James C. Wise
1f02878351
[Deviantart] [ #1776 ] Remove the "you need session cookies to download mature scraps" warning ( #1777 )
2021-08-16 22:50:53 +02:00
Mike Fährmann
bb6a130942
automatically set required DDoS-GUARD cookies ( #1779 )
...
for kemono.party and seiso.party
2021-08-16 17:40:29 +02:00
Mike Fährmann
bd08ee2859
remove most 'yield Message.Version' statements
...
only leave them in oauth.py as noop results
2021-08-16 03:10:48 +02:00
Mike Fährmann
c866fcba48
[twitter] fix 'logout' ( #1719 )
...
delete 'auth_token' cookie and cookies.txt path
2021-08-16 01:36:34 +02:00
Mike Fährmann
9cb5ea5eda
update default User-Agent headers
2021-08-14 04:01:41 +02:00
Mike Fährmann
52984f7e22
[twitter] add option to log out when blocked ( #1719 )
2021-08-12 19:11:41 +02:00
Mike Fährmann
ed4b3c48cb
fix flake8 and other tests
2021-08-12 16:05:26 +02:00
enormous-muscles
975e1ac6e2
Add Wikieat extractor ( #1699 )
...
* Add Wikieat extractor
* Add Wikieat extractor to extractor list
2021-08-12 15:13:20 +02:00
Nyasume
fa6af46756
Added ability to download GIFs instead of mp4 from Luscious and Reactor ( #1701 )
2021-08-12 15:12:42 +02:00
Ryu JuHeon
9429eaa0a3
[hitomi]: fix image URLs ( #1765 )
2021-08-12 14:39:10 +02:00
Mike Fährmann
c34dbc86bb
[kemonoparty] update file server domain ( #1764 )
2021-08-10 22:24:08 +02:00
Mike Fährmann
e5a93e113f
[twitter] extend 'replies' option ( #1254 )
...
Allow setting 'replies' to '"self"' to only download self-replies.
2021-08-10 22:14:00 +02:00
Mike Fährmann
f9096584ab
[behance] fix 'collection' extraction
2021-08-10 00:48:31 +02:00
Mike Fährmann
229498b8aa
[twitter] warn about suspended accounts etc ( closes #1759 )
2021-08-09 02:58:27 +02:00
Mike Fährmann
a5de2244d4
[furaffinity] fix using 'category-transfer' ( #1274 )
2021-08-08 23:50:32 +02:00
Mike Fährmann
cadfad4eea
[danbooru] add 'external' option ( closes #1747 )
2021-08-08 19:18:41 +02:00
Mike Fährmann
5b1c62bfa9
[furaffinity] add 'external' option ( closes #1492 )
2021-08-08 19:16:02 +02:00
Mike Fährmann
5d5ab669fa
[instagram] use custom User-Agent header for video downloads
...
(#1682 , #1623 , #1580 )
2021-08-05 22:26:47 +02:00
Mike Fährmann
7b029dfe85
[instagram] increase default delay between HTTP requests to 8s
...
(closes #1732 )
2021-08-05 22:08:35 +02:00
Mike Fährmann
5eca3781be
[kemonoparty] fix username extraction ( #1750 )
2021-08-04 16:09:22 +02:00
Mike Fährmann
fe970fc87f
[vk] prevent exception for empty/private profiles ( fixes #1742 )
2021-08-02 15:38:51 +02:00
Mike Fährmann
ac91a84543
[bbc] provide fallback URLs ( #1706 )
2021-07-30 01:14:07 +02:00
Mike Fährmann
a316e44f8e
[bbc] add 'width' option ( #1706 )
2021-07-30 01:09:32 +02:00
Mike Fährmann
c37c2818fb
[nsfwalbum] retry all requests when extracting image URLs
...
(#1733 , fixes #1271 )
2021-07-30 00:23:04 +02:00
Mike Fährmann
220cfe244e
[deviantart] get original files for GIF previews ( #1731 )
2021-07-30 00:20:53 +02:00
Mike Fährmann
7a0da4f93f
[newgrounds] add 'format' option ( closes #1729 )
2021-07-29 19:11:20 +02:00
Mike Fährmann
223a4e79cd
[newgrounds] fix using 'category-transfer' ( #1274 )
2021-07-29 15:54:04 +02:00
Mike Fährmann
4e95cef6d2
[nsfwalbum] retry backend requests ( fixes #1733 )
2021-07-29 15:29:37 +02:00
Mike Fährmann
6c11105587
[bbc] improve image dimensions ( #1706 )
...
download the 1920xN versions instead of 976x549
2021-07-29 03:22:15 +02:00
Mike Fährmann
57c1a86082
[bbc] support multi-page gallery listings ( closes #1730 )
2021-07-28 22:42:33 +02:00
Mike Fährmann
486474800f
[kemonoparty] skip duplicated patreon files ( closes #1689 )
...
this behavior can be disabled with the 'patreon-skip-file' option
2021-07-23 16:38:57 +02:00
Mike Fährmann
da7297c0b9
[comicvine] add extractor ( closes #1712 )
2021-07-23 16:17:06 +02:00
Mike Fährmann
e4788fa663
[bbc] add 'gallery' and 'programme' extractors ( closes #1706 )
2021-07-22 20:37:05 +02:00
Mike Fährmann
c3b5c88b04
update extractor test results
2021-07-20 20:21:33 +02:00
Mike Fährmann
3868ec02d1
[pururin] update domain and fix extraction
2021-07-18 04:11:59 +02:00
Mike Fährmann
b89a44090f
[naverwebtoon] fix comic extraction
2021-07-18 03:41:54 +02:00
Mike Fährmann
c8e678a5b4
[instagram] fix extraction of '/explore/tags/' posts
...
(closes #1666 )
2021-07-18 00:21:18 +02:00
Mike Fährmann
a6a51f207d
[moebooru] fix 'tags' ending with a '+' when logged in ( #1702 )
2021-07-16 15:42:03 +02:00
Mike Fährmann
f5b097165e
[ytdl] transfer YoutubeDL objects to downloader ( #1680 )
...
allows specifying downloader-specific options per subcategory
but overwrites all downloader.ytdl settings
2021-07-16 15:40:54 +02:00
Mike Fährmann
06e69ea79a
[ytdl] actually set options for YoutubeDL objects ( #1680 )
...
I somehow managed to remove the options parameter for
the YoutubeDL constructor in 9a849cdf without noticing ...
2021-07-16 15:05:55 +02:00
Mike Fährmann
dfe1f490e9
[mangadex] use custom User-Agent header ( #1535 )
2021-07-15 16:39:32 +02:00
Mike Fährmann
36a2aff363
[vk] improve metadata extraction and URL pattern ( fixes #1691 )
...
- always fetch all user metadata
- use 'user[name]' for directory names if available
2021-07-15 00:43:42 +02:00
Mike Fährmann
e622e004f0
[ytdl] improve module imports ( #1680 )
...
Apply 'extractor.ytdl.module' for every URL, not just the first.
2021-07-14 03:08:00 +02:00
Mike Fährmann
193401ce3b
[ytdl] "fix" cookie transfer between session and ytdl ( #1680 )
...
requests' CookieJar class is not quite compatible with the standard
http.cookiejar.CookieJar used by youtube_dl
2021-07-12 18:50:25 +02:00
Mike Fährmann
9a849cdf61
[ytdl] allow setting 'module' for subcategories ( #1680 )
2021-07-12 18:47:12 +02:00
Mike Fährmann
dff0da60f9
[ytdl] add 'generic' option ( #1680 )
2021-07-11 23:48:18 +02:00
Mike Fährmann
d3da96142a
[ytdl] support cookies + username&password ( #1680 )
2021-07-11 22:51:57 +02:00
Mike Fährmann
36ac2197db
[ytdl] add extractor for sites supported by youtube-dl
...
(#1680 , #878 )
Can be used by prefixing any URL with 'ytdl:',
or by setting 'extractor.ytdl.enabled' to 'true'.
2021-07-10 20:55:47 +02:00
Mike Fährmann
64240c8d42
[imagevenue] fix extraction
...
(closes #1677 )
2021-07-09 20:13:18 +02:00
Mike Fährmann
d287d2eb88
[kemonoparty] parse 'o' query parameters ( #1674 )
2021-07-09 18:29:50 +02:00
Mike Fährmann
8b036778e3
[kemonoparty] add 'max-posts' option ( #1674 )
2021-07-09 18:19:02 +02:00
Mike Fährmann
5612ca31c2
[hitomi] fix image URLs ( closes #1679 )
2021-07-09 18:01:49 +02:00
Mike Fährmann
8ecca3af58
[pixiv] add extractor for 'pixivision' articles ( #1672 )
2021-07-07 16:27:16 +02:00
Mike Fährmann
312a28e78a
[mastodon] add 'replies' option ( #1669 )
2021-07-07 00:59:02 +02:00
Mike Fährmann
513c491cea
[mastodon] reset 'params' after first pagination iteration
...
otherwise query parameters in 'params' get specified twice the second
time around - once from the 'links["next"]' URL and once from 'params'
itself.
2021-07-07 00:07:18 +02:00
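The fix described in this commit can be sketched as a pagination loop that drops its parameters after the first request. `get` is a hypothetical callable returning the items of one page together with the URL of the next page (or None); it is not part of any real library.

```python
def paginate(get, url, params):
    """Yield items from all pages, sending 'params' only with the first request."""
    while url:
        items, url = get(url, params)
        yield from items
        # The 'next' URL already carries the query parameters, so sending
        # 'params' again would specify them twice.
        params = None
```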
Mike Fährmann
a1f5b78039
[mastodon] add 'reblogs' option ( #1669 )
2021-07-06 23:27:32 +02:00
Mike Fährmann
21c2da454f
update extractor test results
2021-07-04 22:00:32 +02:00
Mike Fährmann
7f591c78cb
[mangafox] cleanup
2021-07-04 03:21:02 +02:00
FollieHiyuki
4763bc1e4e
Add MangaExtractor for mangafox ( #1633 )
2021-07-03 22:53:21 +02:00
Mike Fährmann
b519bf567c
[hiperdex] use domain from input URL
2021-07-02 23:23:42 +02:00
Mike Fährmann
93d356712c
[mastodon] implement 'text-posts' option ( #1569 )
...
similar to Twitter's 'text-tweets'
2021-07-02 22:12:41 +02:00
Mike Fährmann
414bdc95a3
[twitter] set 'retweet_id' for original retweets ( #1481 )
2021-07-02 21:50:37 +02:00
Mike Fährmann
5323c1c73a
[twitter] ensure guest tokens are returned as string ( #1665 )
2021-07-01 14:35:53 +02:00