pubak42
e7326cdf1d
[sex.com] Download videos from cdn ( #3408 )
...
The format of video sources was changed recently to be a full URL with https:// in the beginning.
The original extractor code appended the video source URL to root url of the website, thus yielding
invalid url in format ...sex.comhttps... that failed to resolve.
2022-12-16 10:16:40 +01:00
ClosedPort22
d0ad6d0e67
[e621] implement manual pagination mode
2022-12-16 16:15:36 +08:00
Mike Fährmann
6f0735568c
[2chen] fix file URLs
2022-12-15 18:05:32 +01:00
enduser420
a2be06d873
[2chen] add '.club' support ( #3406 )
2022-12-15 17:51:02 +01:00
Mike Fährmann
a6d4733e11
[pixiv] extract 'date_url' metadata ( #3405 )
...
i.e. the datetime encoded in each file URL.
https://i.pximg.net/img-master/img/2022/12/01/13/44/55/12345678_p0.jpg
->
2022-12-01 13:44:55 +09:00
->
2022-12-01 04:44:55
2022-12-15 11:40:20 +01:00
Mike Fährmann
7f99d4b3f7
link to required MSVC package for Windows executables
...
(#3331 , #3372 )
2022-12-14 20:00:48 +01:00
Mike Fährmann
1317625ec4
[webmshare] add 'video' extractor ( #2410 )
2022-12-14 19:59:07 +01:00
Mike Fährmann
90a9c0790f
[twitter] update 'search' pagination ( #544 )
...
Only stop when list of all returned Tweets is empty
instead of when no valid Tweet was found.
2022-12-14 19:56:59 +01:00
Mike Fährmann
1cbc234819
[mangafox] extract more metadata ( #3167 )
2022-12-14 19:56:59 +01:00
Mike Fährmann
3082544fff
misc fixes
...
- fix typo (#3399 )
- remove double assignment
- [bunkr] update things I forgot in 6b6f886d
- [soundgasm] adjust 'archive_fmt' (#3388 )
2022-12-14 13:30:27 +01:00
enduser420
41bf236d36
[lynxchan] add generic extractors for lynxchan imageboards ( #3394 )
...
* [lynxchan] add generic extractors for lynxchan imageboards
includes kohlchan.net, endchan.org:wq
* [lynxchan] set pop default to empty tuple
* Apply suggestions from code review
Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2022-12-12 13:13:39 +01:00
Mike Fährmann
3c75c3bbc4
[soundgasm] add 'user' extractor ( #3384 )
...
based on code from PR #3388 by @enduser420
2022-12-12 12:23:59 +01:00
Mike Fährmann
2952add4a8
[reddit] increase 'id-max' default value ( #3397 )
...
to float("inf")
2022-12-12 12:10:06 +01:00
Mike Fährmann
a001c9c06f
[instagram] prevent post 'date' overwriting file 'date' ( #3392 )
2022-12-12 11:38:48 +01:00
Mike Fährmann
6b6f886dcf
[bunkr] update domain ( #3391 )
...
and improve bunkr/app.bunkr handling
2022-12-11 23:18:01 +01:00
ClosedPort22
bf3fd5951a
Merge branch 'master' into da-extra-stash
2022-12-10 21:34:37 +08:00
Mike Fährmann
eb94568e1f
[soundgasm] add 'audio' extractor ( #3384 )
2022-12-09 23:19:07 +01:00
Mike Fährmann
dfe7b23579
support Firefox containers for --cookies-from-browser ( #3346 )
2022-12-09 19:43:55 +01:00
Mike Fährmann
cd931e1139
update extractor test results
2022-12-08 18:58:29 +01:00
Mike Fährmann
989ec9fc79
[khinsider] fix metadata extraction
2022-12-07 21:36:34 +01:00
Mike Fährmann
1c25cc7a3e
[warosu] fix and update
2022-12-07 21:23:45 +01:00
Mike Fährmann
79e52f3539
[imgth] rewrite
...
- inherit from GalleryExtractor
- fix image URLs
- better metadata
2022-12-07 21:19:06 +01:00
Mike Fährmann
202c1210d5
[exhentai] fix pagination
2022-12-06 22:39:13 +01:00
Mike Fährmann
ca4742200b
use util.NONE as 'keyword-default' default value
2022-12-06 22:27:57 +01:00
Mike Fährmann
43c211f1a7
extend and rename util.CustomNone
2022-12-06 22:08:51 +01:00
Mike Fährmann
6afb3cc766
restore paths for archived files ( #3362 )
2022-12-06 22:01:37 +01:00
Mike Fährmann
4a3a1f4c87
[komikcast] update domain and fix extraction
2022-12-06 22:00:23 +01:00
ClosedPort22
13d825731e
[deviantart] fix test for sta.sh URL extraction
...
Without the 'count' assertion, the test would be essentially useless.
2022-12-07 00:32:03 +08:00
ClosedPort22
6356c9be96
[deviantart] extract sta.sh URLs from 'text_content'
2022-12-06 22:57:53 +08:00
Mike Fährmann
5f57a27ba6
[imagetwist] fix extraction
2022-12-05 22:15:59 +01:00
Mike Fährmann
a42ba25ca1
[foolslide] remove 'kireicake'
...
site redirects to (unclaimed) mangadex group
2022-12-05 19:47:56 +01:00
Mike Fährmann
86f0597c95
[kissgoddess] remove module
...
site does not host albums anymore
2022-12-05 19:28:50 +01:00
Mike Fährmann
049d1bae9a
release version 1.24.1
2022-12-04 21:10:14 +01:00
Mike Fährmann
d0b160461a
terrible workaround for errors with 'http-metadata' ( #3334 )
2022-12-04 19:36:21 +01:00
Mike Fährmann
20e12b5d7c
[nitter] support '/i/user/' URLs ( #3310 )
...
as well as using 'id:<userid>' as username
not all nitter instances seem to support '/i/user/' ...
2022-12-04 16:21:13 +01:00
Mike Fährmann
f820fbed53
remove no longer used entries from supportedsites.py
2022-12-04 16:21:13 +01:00
Mike Fährmann
fceaee3c4f
[lolisafe] remove zz.ht
2022-12-04 16:21:13 +01:00
Mike Fährmann
4554c43d5f
[bunkr] use 'media-files' servers for more file types
2022-12-04 16:21:13 +01:00
enduser420
4bc756dfe0
[2chen] fix extraction ( #3356 )
...
update 'archive_fmt'
update tests
update 'board' regex
2022-12-04 16:19:36 +01:00
enduser420
54844944ab
[pixhost] add 'gallery' support ( #3353 )
2022-12-04 16:19:14 +01:00
enduser420
213676c785
[fapachi] add 'post' and 'user' extractors ( #3347 )
...
* [fapachi] add 'post' and 'user' extractors
* [fapachi] add 'keyword' to test
* [fapachi] remove whitespaces
2022-12-02 13:54:29 +01:00
Mike Fährmann
a18511e346
[nitter] retry downloads on 404 ( #3313 )
2022-12-01 11:09:02 +01:00
Mike Fährmann
80102fa367
[downloader:http] add 'retry-codes' option ( #3313 )
2022-12-01 11:08:23 +01:00
Mike Fährmann
88610c3478
[patreon] update API query parameters
2022-12-01 10:02:28 +01:00
Mike Fährmann
c19b1f03b9
[patreon] fix '403 Forbidden' errors
...
send 'Content-Type' headers for API requests
2022-12-01 09:52:36 +01:00
Mike Fährmann
b4253f69c9
[downloader:http] fix ZeroDivisionError ( #3328 )
...
ensure 'time_elapsed' only get used as divisor
when it is greater than zero
2022-11-30 21:56:18 +01:00
Mike Fährmann
fc34f76cc5
[bunkr] fix video downloads ( #3326 )
...
by sending 'https://stream.bunkr.is/ ' as Referer header
2022-11-30 15:10:31 +01:00
Mike Fährmann
86a396e086
[bcy] fix JSONDecodeError ( #3321 )
2022-11-30 14:44:19 +01:00
Mike Fährmann
5b9a22af7f
[patreon] improve 'campaign_id' extraction ( #3235 )
2022-11-30 11:57:52 +01:00
Mike Fährmann
1bdd0e4338
[nitter] support '/i/web/' Tweet URLs ( #3310 )
2022-11-30 11:45:56 +01:00