Mike Fährmann
|
a453335a9f
|
remove test results in extractor modules
and add generic example URLs
|
2023-09-11 16:30:55 +02:00 |
|
Mike Fährmann
|
d97b8c2fba
|
consistent cookie-related names
- rename every cookie variable or method to 'cookies_*'
- simplify '.session.cookies' to just '.cookies'
- more consistent 'login()' structure
|
2023-07-22 01:20:50 +02:00 |
|
enduser420
|
d52ed2bc5a
|
[zerochan] fix 'tags' extraction
|
2023-07-18 16:38:04 +05:30 |
|
Mike Fährmann
|
ed2d715019
|
fix 'keywords' in extractor tests (#3491)
|
2023-01-03 15:14:23 +01:00 |
|
Mike Fährmann
|
4063563cd7
|
[zerochan] update for layout v3
- remove cookie disabling v3
- fix and improve metadata extraction
|
2022-12-17 12:51:51 +01:00 |
|
Mike Fährmann
|
b0cb4a1b9c
|
replace 'text.extract()' with 'text.extr()' where possible
|
2022-11-05 01:14:09 +01:00 |
|
Mike Fährmann
|
3cb8327c60
|
[zerochan] add 'metadata' option (#2861)
|
2022-09-02 23:25:19 +02:00 |
|
Mike Fährmann
|
21ff77fea0
|
[zerochan] extract more metadata for single posts
Neither HTML pages nor RSS feed entries have *all* metadata.
It might be necessary to do 1-2 extra HTTP requests to grab everything.
|
2022-08-14 17:26:29 +02:00 |
|
Mike Fährmann
|
98af5a0409
|
[zerochan] implement login with username & password (#1434)
|
2022-07-29 12:56:20 +02:00 |
|
Mike Fährmann
|
3a8addfe45
|
[zerochan] add 'tag' and 'image' extractors (#1434)
|
2022-07-27 22:58:23 +02:00 |
|