mirror of https://github.com/mikf/gallery-dl.git synced 2024-11-25 20:22:36 +01:00

58 Commits

Author SHA1 Message Date
Mike Fährmann
6940ad0e72
[booru] allow multiple 'url' keys (#5859) 2024-07-17 20:49:52 +02:00
Mike Fährmann
bf44add132
[booru] prevent crash when file URL is empty (#5859)
- catch all exceptions, including IndexError, and not just a select few
- change logging message level to warning
2024-07-17 20:20:17 +02:00
Mike Fährmann
512abeb4ae
[booru] add 'url' option 2023-01-13 16:32:32 +01:00
Mike Fährmann
775895f44b
[booru] refactor 'tags' and 'notes' extraction
- move HTML request for post pages into its own function
- move gelbooru_v02.py notes extraction to gelbooru.py
  since it only works there
- clean up some code
2022-10-31 12:01:19 +01:00
Mike Fährmann
9ebc20e290
[booru] call nameext_from_url() before update() and _prepare()
to be able to overwrite filename and extension in _prepare()
2022-02-18 00:37:59 +01:00
thatfuckingbird
dff03a6605
[booru] add an option to extract notes (only gelbooru for now) (#1457)
* [booru] add an option to extract notes (currently implemented only for gelbooru)

* appease linter

* [gelbooru] rename "text" to "body" in note extraction

* add a code comment about reusing return value of _extended_tags
2021-04-13 23:40:24 +02:00
Mike Fährmann
08d7934c6e
move extractors from booru.py into their own gelbooru_v02 module 2021-02-17 00:26:24 +01:00
Mike Fährmann
0978c1e184
[booru] use BaseExtractor 2021-01-27 01:33:01 +01:00
Mike Fährmann
e41e2be2f9
[booru] split '_prepare_post()' 2020-12-24 01:13:54 +01:00
Mike Fährmann
78061658ea
[booru] reduce exceptions caught during _prepare_post()
don't catch HttpErrors etc.
2020-12-21 02:05:59 +01:00
Mike Fährmann
8d2e4e5f13
[booru] improve error handling
e.g. for posts without a valid 'file_url' (#1176)
2020-12-17 01:16:45 +01:00
Mike Fährmann
a3a863fc13
[booru] add generalized extractors for *booru sites
similar to cc15fbe7
2020-12-08 18:34:30 +01:00
Mike Fährmann
1e3dd7330e
merge SharedConfigMixin functionality into Extractor 2020-11-17 00:34:07 +01:00
Mike Fährmann
7a0ba370d1
[gelbooru] rewrite mp4 video URLs (fixes #1048) 2020-10-15 15:14:18 +02:00
Mike Fährmann
fb98b567fa
[gelbooru] improve post ID extraction for pools 2020-04-22 21:28:18 +02:00
Mike Fährmann
f117e32910
[danbooru] restore 'popular' functionality 2020-02-29 23:37:53 +01:00
Mike Fährmann
86c00f9e66
[danbooru] move extractor logic from booru.py 2020-02-28 22:53:45 +01:00
Mike Fährmann
f02a768b5c
[danbooru] add 'ugoira' option (#406)
to choose between ZIP archives or converted video files
for Ugoira posts
2019-08-31 21:51:52 +02:00
Mike Fährmann
7a5e78741c
[booru] build directory path for each file (#385) 2019-08-18 23:28:33 +02:00
Mike Fährmann
a4ba34c835
[booru] prevent crash when no tags are present (#259) 2019-05-17 19:32:53 +02:00
Mike Fährmann
4b1880fa5e
propagate 'match' to base extractor constructor 2019-02-11 13:31:10 +01:00
Mike Fährmann
6284731107
simplify extractor constants
- single strings for URL patterns
- tuples instead of lists for 'directory_fmt' and 'test'
- single-tuple tests where applicable
2019-02-08 13:45:40 +01:00
Mike Fährmann
4d656a81ca
replace SharedConfigExtractor class with a Mixin 2019-02-04 13:46:02 +01:00
Mike Fährmann
1d43cbbf52
[gelbooru] tag-splitting for non-api mode 2018-07-06 15:24:19 +02:00
Mike Fährmann
3dbc7c5f8d
[gelbooru] restore pool functionality 2018-07-04 12:21:41 +02:00
Mike Fährmann
a2c74bc6f0
[gelbooru] inherit from BooruExtractor class
Breaks pool functionality when using API calls (for now),
but reduces code clutter and enables the `tags` option.
2018-07-04 12:21:41 +02:00
Mike Fährmann
4a57509392
generalize tag-splitting option (#92)
- extend functionality to other booru sites:
  - http://behoimi.org/
  - https://konachan.com/
  - https://e621.net/
  - https://rule34.xxx/
  - https://safebooru.org/
  - https://yande.re/
2018-07-04 12:21:16 +02:00
Mike Fährmann
87853538b4
[yandere] add option to split tags by type (#92) 2018-06-29 19:38:53 +02:00
Mike Fährmann
95392554ee
use text.urljoin() 2018-04-26 17:00:26 +02:00
Mike Fährmann
3905474805
[booru] call update_page() with correct dict (closes #82) 2018-03-19 11:33:19 +01:00
Mike Fährmann
5008e105ee
update archive IDs
... to behave in a more straightforward way when dealing with
bookmarks/favourites/etc.

specific IDs are now grouped by their owner, album-id, ... to
allow for duplicates when it would be expected.
2018-03-01 18:20:50 +01:00
Mike Fährmann
34873dbd90
set 'archive_fmt' values
These are going to be used to create a unique ID for each image.
2018-02-01 15:30:49 +01:00
Mike Fährmann
974e73bdbb
[booru] smaller code adjustments 2018-01-06 17:48:49 +01:00
Mike Fährmann
9e8a84ab6c
[booru] rewrite using Mixin classes (#59)
- improved code structure
- improved URL patterns
- better pagination to work around page limits on
  - Danbooru
  - e621
  - 3dbooru
2018-01-04 00:01:39 +01:00
Mike Fährmann
00420ff202
[booru] consistent order for "popular" results 2017-09-06 12:33:19 +02:00
Mike Fährmann
be30fb2f98
add common config category for boorus and foolslide 2017-08-29 22:42:48 +02:00
Mike Fährmann
07214f4007
[booru] place subcategories into base classes 2017-08-26 22:27:55 +02:00
Mike Fährmann
18e6ed1c7e
[booru] add extractors for "Popular" images 2017-08-24 21:29:22 +02:00
Mike Fährmann
58e95a7487
share extractor and downloader sessions
There was never any "good" reason for the strict separation
between extractors and downloaders. This change allows for
reduced resource usage (probably unnoticeable) and less lines
of code at the "cost" of tighter coupling.
2017-06-30 19:38:14 +02:00
Mike Fährmann
c184e47ee3
put common directory- and filename formats in base classes 2017-05-30 12:10:16 +02:00
Mike Fährmann
85a46ed700
[booru] fix issue with multiple tags 2017-05-04 11:58:51 +02:00
Mike Fährmann
ff92674379
[booru] simple skip functionality
(#6)
2017-03-04 23:21:55 +01:00
Mike Fährmann
94e10f249a
code adjustments according to pep8 nr2 2017-02-01 00:53:19 +01:00
Mike Fährmann
19c2d4ff6f
remove explicit (sub)category keywords 2016-09-25 14:22:07 +02:00
Mike Fährmann
d7e168799d
consistent extractor naming scheme + docstrings 2016-09-12 10:34:31 +02:00
Mike Fährmann
bb8f2a3e9d
[gelbooru] update to new format 2015-11-21 02:40:30 +01:00
Mike Fährmann
1bce63124b
[3dbooru] update to new format 2015-11-21 01:48:44 +01:00
Mike Fährmann
74199422a8
[booru] add base classes for tag/pool/post extractors 2015-11-21 00:54:29 +01:00
Mike Fährmann
80917f5392
[danbooru] rewrite to use multiple extractors 2015-11-20 20:24:15 +01:00
Mike Fährmann
c2f0720184
code cleanup to use nameext_from_url 2015-11-16 17:32:26 +01:00