1
0
mirror of https://github.com/mikf/gallery-dl.git synced 2024-11-23 03:02:50 +01:00
Commit Graph

1019 Commits

Author SHA1 Message Date
Mike Fährmann
f5c6a2d7f5
[nhentai] use API to get gallery info 2018-03-21 12:58:41 +01:00
Mike Fährmann
b2ba2b821d
[hitomi] fix image URLs and improve metadata
- use '?a.hitomi.la' as subdomain depending in gallery-id
- add 'characters', 'tags' and 'date' information
- support multiple entires per metadata-value
- rename 'num' to 'page'
2018-03-20 18:09:42 +01:00
Mike Fährmann
3905474805
[booru] call update_page() with correct dict (closes #82) 2018-03-19 11:33:19 +01:00
Mike Fährmann
44c267e362
[artstation] add search extractor (#80) 2018-03-17 19:04:37 +01:00
Mike Fährmann
40ca562d7b
[artstation] add album extractor (#80) 2018-03-17 17:36:31 +01:00
Mike Fährmann
7121eeae8b
check supportedsites.rst in release script 2018-03-17 15:35:38 +01:00
Mike Fährmann
c59f9b71f1
release version 1.3.1 2018-03-16 12:29:02 +01:00
Mike Fährmann
f367d5c281
[deviantart] move delay-increase after expect_error check
[ci skip]
2018-03-15 16:44:58 +01:00
Mike Fährmann
557cb94f81
[deviantart] use proper exponential backoff on API errors
... and use separate API credentials for unit tests.
2018-03-15 16:01:42 +01:00
Mike Fährmann
723cc66bb1
[artstation] add user-, image- and likes-extractors 2018-03-14 14:05:14 +01:00
Mike Fährmann
b69cc94f0e
[util] implement bencode() 2018-03-14 13:17:34 +01:00
Mike Fährmann
4d74749496
[tests] rework filters for extractor tests
CI incompatible tests will now only be skipped if tests are run in
a CI environment.
2018-03-13 13:11:10 +01:00
Mike Fährmann
d6ef52897c
[imgchili] remove module
All previously hosted images yield a 404
and the main page is just a logo.
2018-03-13 11:02:58 +01:00
Mike Fährmann
7847ab1d5a
[imagehosts] remove even more dead sites
All removed sites either
- reject all incoming connections or
- display a message from their domain registrar
2018-03-12 21:25:13 +01:00
Mike Fährmann
5f37d40a3e
[komikcast] bypass cloudflare challenge 2018-03-10 16:09:40 +01:00
Mike Fährmann
f9884e2338
[pixiv] update URL pattern
add support for 'https://www.pixiv.net/user/<id>'
2018-03-10 16:05:12 +01:00
Mike Fährmann
85ed023c2e
[mangadex] remove the trailing ' - MangaDex' in a better way
str.rstrip() works differently than assumed.
2018-03-10 15:54:50 +01:00
Mike Fährmann
9fb82e6b43
apply expand_path() to archive paths 2018-03-08 18:06:39 +01:00
Mike Fährmann
32bbd12f08
update extractor tests 2018-03-08 18:04:34 +01:00
Mike Fährmann
ca326bd275
[deviantart] fix folder and collection archive IDs
{folder[index]} and {collection[index]} are both '0' when being
delegated from Gallery- or FavoriteExtractors, as there is no
way of knowing a folder's index when getting folder-information
from the API.
2018-03-08 14:32:23 +01:00
Mike Fährmann
e32fe1cdf1
[pinterest] cast IDs to int
... and update test results.

Image URLs changed from
https://s-media-cache-ak0.pinimg.com/... to
https://i.pinimg.com/...
2018-03-06 14:28:21 +01:00
Mike Fährmann
179ecee965
[turboimagehost] fix extraction 2018-03-06 14:25:10 +01:00
Mike Fährmann
1400868f53
[mangadex] general improvements
- support >100 chapter entries per manga
- custom archive ID format
- detect non-existing chapters
2018-03-06 14:15:15 +01:00
Mike Fährmann
749fbbfa6c
[mangadex] add chapter- and manga-extractor 2018-03-05 18:37:21 +01:00
Mike Fährmann
b58449fd88
release version 1.3.0 2018-03-02 18:47:17 +01:00
Mike Fährmann
6e38cf5aab
[mangareader] use 'https://'
The site now redirects from http://mangareader.net/
to https://mangareader.net/
2018-03-02 17:19:17 +01:00
Mike Fährmann
1d71123f91
[pixiv] update archive IDs and add metadata-fields
(Pixiv bookmarks actually have their own IDs, comments and tags,
independent of the bookmarked image, which makes creating an
archive ID a lot easier)
2018-03-02 16:11:53 +01:00
Mike Fährmann
858fdbdb22
[tumblr] improve 'inline' extraction
'quote' posts store their HTML content in the 'source' field
2018-03-02 06:59:44 +01:00
Mike Fährmann
1d54a8e07d
fix logging output during downloads
from:
filename.ext[download][warning] ...

to:
filename.ext
[download][warning] ...
2018-03-01 18:43:43 +01:00
Mike Fährmann
5008e105ee
update archive IDs
... to behave in a more straightforward way when dealing with
bookmarks/favourites/etc.

specific IDs are now grouped by their owner, album-id, ... to
allow for duplicates when it would be expected.
2018-03-01 18:20:50 +01:00
Mike Fährmann
829ddf4ac1
[sankaku] general improvements
- simplify regex
- unquote search tags
- increase default wait-time between HTTP requests
  - downloading several hundreds of images always resulted
    in '429 Too Many Requests' eventually
- circumvent paging restrictions for unauthenticated users by only
  using the 'next' parameter
  - setting 'page' to a constant, low value (or simply omitting it)
    does the trick
2018-02-27 16:51:14 +01:00
Jad
49463f76bb support multi-page URL (#79)
* support multi-page URL

* fix

* all done.

* fix, again
2018-02-26 11:13:49 +01:00
Mike Fährmann
19aefdfde3
[directlink] update test results 2018-02-26 03:01:23 +01:00
Mike Fährmann
74029c50bb
[directlink] unquote metadata fields 2018-02-26 02:12:47 +01:00
Mike Fährmann
2fad0b1f1b
add 'U' conversion for format strings to unquote their content
(#74)
2018-02-25 21:57:59 +01:00
Mike Fährmann
8cdce21dcb
make archive keys user-configurable 2018-02-25 21:57:01 +01:00
Mike Fährmann
8f338347b6
[imagehosts] cleanup
removed
- chronos.to      - unable to resolve hostname
- coreimg.net     - same
- imgmaid.net     - same
- hosturimage.com - everything returns 404
- imageontime.org - redirects to some shady site
- imgupload.yt    - cloudflare error 522, host down
- img4ever.net    - read timeout
2018-02-23 01:05:42 +01:00
Mike Fährmann
edfd3d9fc9
[yeet] remove module
- archive.yeet.net returns a 500 server error
- yeet.net moved to yeet.rip, but the archive is gone
2018-02-23 01:05:41 +01:00
Mike Fährmann
e1e0668ca8
add option to set default replacement field value
Missing or undefined keywords will now be replaced with the value
set for 'keywords-default'. The default is Python's 'None', which
is equivalent to setting this option to JSON's 'null'.
2018-02-23 00:59:20 +01:00
Mike Fährmann
ac3da8115e
[util] don't add text: URLs to list of downloaded URLs 2018-02-20 18:14:27 +01:00
Mike Fährmann
8704d850bf
add explicit proxy support (#76)
- '--proxy' as command-line argument
- 'extractor.*.proxy' as config option
2018-02-19 18:45:06 +01:00
Mike Fährmann
367b963d37
[pixiv] fix ugoira extraction ... again (#78)
Some animations are not available for mobile devices, so we
pretend to be a desktop browser when requesting the ugoira page.
2018-02-19 16:50:12 +01:00
Mike Fährmann
b79f1f2ca7
[pixiv] fix ugoira extraction (closes #78) 2018-02-19 08:51:09 +01:00
Mike Fährmann
731ffd4986
improve text.filename_from_url() performance
- urlsplit() is faster than urlparse()
- rpartition() is faster than rindex() + slicing
- new version is 2.3 times as fast
2018-02-18 16:50:07 +01:00
Mike Fährmann
d122203be1
[mangastream] fix extraction 2018-02-17 22:40:16 +01:00
Mike Fährmann
8809b32aed
release version 1.2.0 2018-02-16 22:29:57 +01:00
Mike Fährmann
b50bdbf3d7
change config specifiers in input file format
Instead of a dictionary/object, input file options are now specified
by a 'key=value' pair starting with '-' for options only applying to
the next URL or '-G' for Global options applying to all following URLs.

See the docstring of parse_inputfile() for details.

Example option specifiers:

- filename = "{id}.{extension}"
- extractor.pixiv.user.directory = ["Pixiv Users", "{user[id]}"]
-spaces="are_optional"
-G keywords = {"global": "option"}
2018-02-16 03:10:41 +01:00
Mike Fährmann
f970a8f13c
fix adding keys to download archive when using skip=false 2018-02-13 23:45:30 +01:00
Mike Fährmann
179bcdd349
adjust archive-ids 2018-02-13 04:50:45 +01:00
Mike Fährmann
be3ea4425d
test archive-id creation and uniqueness 2018-02-12 23:02:09 +01:00