Mike Fährmann
7b562907c3
[nijie] add favorites extractor
...
adds support for 'https://nijie.info/user_like_illust_view.php?id= ...'
2018-03-31 18:54:25 +02:00
Mike Fährmann
445db75955
[nijie] improve extraction and metadata
...
- add 'title' and 'description'
- split 'artist_id' into 'user_id' and 'artist_id'
- 'user_id' is the ID of the user from which the image entry
originates from
- 'artist_id' is the ID of the actual image artist
- improve pagination and URL patterns
2018-03-31 18:48:41 +02:00
Mike Fährmann
a112e3f2a0
[nijie] add doujin extractor
...
adds support for "https://nijie.info/members_dojin.php?id= <artist_id>"
2018-03-31 18:17:41 +02:00
Mike Fährmann
f39153b6e9
[nhentai] add extractor for search results
2018-03-28 17:21:44 +02:00
Mike Fährmann
52d41c41e7
[exhentai] add extractor for favorited galleries
2018-03-27 18:58:42 +02:00
Mike Fährmann
63cc2599c4
[exhentai] add extractor for search results
2018-03-27 16:50:47 +02:00
Mike Fährmann
d1c91a1f2b
[mangadex] fix manga-page extraction
2018-03-25 17:22:12 +02:00
Mike Fährmann
299ae24996
[test] add a few downloader tests
2018-03-25 15:10:25 +02:00
Mike Fährmann
dd314279fb
[test] add unit tests for extractor module functions
2018-03-25 11:49:42 +02:00
Mike Fährmann
a993d0ea90
release version 1.3.2
2018-03-23 15:27:56 +01:00
Mike Fährmann
e7525b1b0e
[artstation] add challenge extractor ( #80 )
2018-03-23 15:06:09 +01:00
Mike Fährmann
3f2dd6b6f8
avoid double path-separators
...
(#74 )
2018-03-22 10:24:59 +01:00
Mike Fährmann
f5c6a2d7f5
[nhentai] use API to get gallery info
2018-03-21 12:58:41 +01:00
Mike Fährmann
b2ba2b821d
[hitomi] fix image URLs and improve metadata
...
- use '?a.hitomi.la' as subdomain depending in gallery-id
- add 'characters', 'tags' and 'date' information
- support multiple entires per metadata-value
- rename 'num' to 'page'
2018-03-20 18:09:42 +01:00
Mike Fährmann
3905474805
[booru] call update_page() with correct dict ( closes #82 )
2018-03-19 11:33:19 +01:00
Mike Fährmann
44c267e362
[artstation] add search extractor ( #80 )
2018-03-17 19:04:37 +01:00
Mike Fährmann
40ca562d7b
[artstation] add album extractor ( #80 )
2018-03-17 17:36:31 +01:00
Mike Fährmann
7121eeae8b
check supportedsites.rst in release script
2018-03-17 15:35:38 +01:00
Mike Fährmann
c59f9b71f1
release version 1.3.1
2018-03-16 12:29:02 +01:00
Mike Fährmann
f367d5c281
[deviantart] move delay-increase after expect_error check
...
[ci skip]
2018-03-15 16:44:58 +01:00
Mike Fährmann
557cb94f81
[deviantart] use proper exponential backoff on API errors
...
... and use separate API credentials for unit tests.
2018-03-15 16:01:42 +01:00
Mike Fährmann
723cc66bb1
[artstation] add user-, image- and likes-extractors
2018-03-14 14:05:14 +01:00
Mike Fährmann
b69cc94f0e
[util] implement bencode()
2018-03-14 13:17:34 +01:00
Mike Fährmann
4d74749496
[tests] rework filters for extractor tests
...
CI incompatible tests will now only be skipped if tests are run in
a CI environment.
2018-03-13 13:11:10 +01:00
Mike Fährmann
d6ef52897c
[imgchili] remove module
...
All previously hosted images yield a 404
and the main page is just a logo.
2018-03-13 11:02:58 +01:00
Mike Fährmann
7847ab1d5a
[imagehosts] remove even more dead sites
...
All removed sites either
- reject all incoming connections or
- display a message from their domain registrar
2018-03-12 21:25:13 +01:00
Mike Fährmann
5f37d40a3e
[komikcast] bypass cloudflare challenge
2018-03-10 16:09:40 +01:00
Mike Fährmann
f9884e2338
[pixiv] update URL pattern
...
add support for 'https://www.pixiv.net/user/ <id>'
2018-03-10 16:05:12 +01:00
Mike Fährmann
85ed023c2e
[mangadex] remove the trailing ' - MangaDex' in a better way
...
str.rstrip() works differently than assumed.
2018-03-10 15:54:50 +01:00
Mike Fährmann
9fb82e6b43
apply expand_path() to archive paths
2018-03-08 18:06:39 +01:00
Mike Fährmann
32bbd12f08
update extractor tests
2018-03-08 18:04:34 +01:00
Mike Fährmann
ca326bd275
[deviantart] fix folder and collection archive IDs
...
{folder[index]} and {collection[index]} are both '0' when being
delegated from Gallery- or FavoriteExtractors, as there is no
way of knowing a folder's index when getting folder-information
from the API.
2018-03-08 14:32:23 +01:00
Mike Fährmann
e32fe1cdf1
[pinterest] cast IDs to int
...
... and update test results.
Image URLs changed from
https://s-media-cache-ak0.pinimg.com/ ... to
https://i.pinimg.com/ ...
2018-03-06 14:28:21 +01:00
Mike Fährmann
179ecee965
[turboimagehost] fix extraction
2018-03-06 14:25:10 +01:00
Mike Fährmann
1400868f53
[mangadex] general improvements
...
- support >100 chapter entries per manga
- custom archive ID format
- detect non-existing chapters
2018-03-06 14:15:15 +01:00
Mike Fährmann
749fbbfa6c
[mangadex] add chapter- and manga-extractor
2018-03-05 18:37:21 +01:00
Mike Fährmann
b58449fd88
release version 1.3.0
2018-03-02 18:47:17 +01:00
Mike Fährmann
6e38cf5aab
[mangareader] use 'https://'
...
The site now redirects from http://mangareader.net/
to https://mangareader.net/
2018-03-02 17:19:17 +01:00
Mike Fährmann
1d71123f91
[pixiv] update archive IDs and add metadata-fields
...
(Pixiv bookmarks actually have their own IDs, comments and tags,
independent of the bookmarked image, which makes creating an
archive ID a lot easier)
2018-03-02 16:11:53 +01:00
Mike Fährmann
858fdbdb22
[tumblr] improve 'inline' extraction
...
'quote' posts store their HTML content in the 'source' field
2018-03-02 06:59:44 +01:00
Mike Fährmann
1d54a8e07d
fix logging output during downloads
...
from:
filename.ext[download][warning] ...
to:
filename.ext
[download][warning] ...
2018-03-01 18:43:43 +01:00
Mike Fährmann
5008e105ee
update archive IDs
...
... to behave in a more straightforward way when dealing with
bookmarks/favourites/etc.
specific IDs are now grouped by their owner, album-id, ... to
allow for duplicates when it would be expected.
2018-03-01 18:20:50 +01:00
Mike Fährmann
829ddf4ac1
[sankaku] general improvements
...
- simplify regex
- unquote search tags
- increase default wait-time between HTTP requests
- downloading several hundreds of images always resulted
in '429 Too Many Requests' eventually
- circumvent paging restrictions for unauthenticated users by only
using the 'next' parameter
- setting 'page' to a constant, low value (or simply omitting it)
does the trick
2018-02-27 16:51:14 +01:00
Jad
49463f76bb
support multi-page URL ( #79 )
...
* support multi-page URL
* fix
* all done.
* fix, again
2018-02-26 11:13:49 +01:00
Mike Fährmann
19aefdfde3
[directlink] update test results
2018-02-26 03:01:23 +01:00
Mike Fährmann
74029c50bb
[directlink] unquote metadata fields
2018-02-26 02:12:47 +01:00
Mike Fährmann
2fad0b1f1b
add 'U' conversion for format strings to unquote their content
...
(#74 )
2018-02-25 21:57:59 +01:00
Mike Fährmann
8cdce21dcb
make archive keys user-configurable
2018-02-25 21:57:01 +01:00
Mike Fährmann
8f338347b6
[imagehosts] cleanup
...
removed
- chronos.to - unable to resolve hostname
- coreimg.net - same
- imgmaid.net - same
- hosturimage.com - everything returns 404
- imageontime.org - redirects to some shady site
- imgupload.yt - cloudflare error 522, host down
- img4ever.net - read timeout
2018-02-23 01:05:42 +01:00
Mike Fährmann
edfd3d9fc9
[yeet] remove module
...
- archive.yeet.net returns a 500 server error
- yeet.net moved to yeet.rip, but the archive is gone
2018-02-23 01:05:41 +01:00