Mike Fährmann
|
87a14a50e7
|
[zerochan] improve redirect handling, add 'redirects' option (#5891)
|
2024-08-10 11:32:30 +02:00 |
|
Mike Fährmann
|
8a6e208605
|
[zerochan] fix 'Invalid control character' errors (#5892)
|
2024-07-29 11:24:17 +02:00 |
|
Mike Fährmann
|
70f18b7a78
|
[zerochan] fix tag redirections (#5891)
|
2024-07-26 20:41:34 +02:00 |
|
Mike Fährmann
|
5207a0c2e0
|
[zerochan] implement 'tags' option (#5874)
allow splitting tags into separate lists by category
|
2024-07-23 10:21:33 +02:00 |
|
Mike Fährmann
|
1aadc29c5b
|
[zerochan] fix 'source' extraction
|
2024-07-23 09:34:44 +02:00 |
|
Mike Fährmann
|
ae40c61c21
|
[zerochan] fix tag category extraction (#5874)
|
2024-07-23 09:16:32 +02:00 |
|
Mike Fährmann
|
fef80a2f55
|
[zerochan] fetch metadata for each post separately (#5869)
instead of processing all posts at once before returning any of them
|
2024-07-20 02:11:27 +02:00 |
|
Mike Fährmann
|
b376fa814e
|
[zerochan] handle "KeyError - 'items'" (#5826)
Zerochan sometimes sends an empty response when there are no more
accessible posts to be had.
|
2024-07-05 21:34:33 +02:00 |
|
Mike Fährmann
|
cc6b9e4c18
|
[zerochan] use API by default (#3669)
add 'pagination' option
|
2024-02-25 00:36:14 +01:00 |
|
Mike Fährmann
|
42335ea880
|
[zerochan] fix skipping every other post
|
2024-02-15 02:51:01 +01:00 |
|
Mike Fährmann
|
adc3aa0b77
|
[zerochan] fix metadata extraction
author, path, tags
|
2023-11-24 21:21:14 +01:00 |
|
Mike Fährmann
|
a453335a9f
|
remove test results in extractor modules
and add generic example URLs
|
2023-09-11 16:30:55 +02:00 |
|
Mike Fährmann
|
d97b8c2fba
|
consistent cookie-related names
- rename every cookie variable or method to 'cookies_*'
- simplify '.session.cookies' to just '.cookies'
- more consistent 'login()' structure
|
2023-07-22 01:20:50 +02:00 |
|
enduser420
|
d52ed2bc5a
|
[zerochan] fix 'tags' extraction
|
2023-07-18 16:38:04 +05:30 |
|
Mike Fährmann
|
ed2d715019
|
fix 'keywords' in extractor tests (#3491)
|
2023-01-03 15:14:23 +01:00 |
|
Mike Fährmann
|
4063563cd7
|
[zerochan] update for layout v3
- remove cookie disabling v3
- fix and improve metadata extraction
|
2022-12-17 12:51:51 +01:00 |
|
Mike Fährmann
|
b0cb4a1b9c
|
replace 'text.extract()' with 'text.extr()' where possible
|
2022-11-05 01:14:09 +01:00 |
|
Mike Fährmann
|
3cb8327c60
|
[zerochan] add 'metadata' option (#2861)
|
2022-09-02 23:25:19 +02:00 |
|
Mike Fährmann
|
21ff77fea0
|
[zerochan] extract more metadata for single posts
Neither HTML pages nor RSS feed entries have *all* metadata.
It might be necessary to do 1-2 extra HTTP requests to grab everything.
|
2022-08-14 17:26:29 +02:00 |
|
Mike Fährmann
|
98af5a0409
|
[zerochan] implement login with username & password (#1434)
|
2022-07-29 12:56:20 +02:00 |
|
Mike Fährmann
|
3a8addfe45
|
[zerochan] add 'tag' and 'image' extractors (#1434)
|
2022-07-27 22:58:23 +02:00 |
|