Mike Fährmann
bd08ee2859
remove most 'yield Message.Version' statements
only leave them in oauth.py as noop results
2021-08-16 03:10:48 +02:00
Mike Fährmann
968d3e8465
remove '&' from URL patterns
'/?&#' -> '/?#' and '?&#' -> '?#'
According to https://www.ietf.org/rfc/rfc3986.txt , URLs are
"organized hierarchically" by using "the slash ("/"), question
mark ("?"), and number sign ("#") characters to delimit components"
2020-10-22 23:31:25 +02:00
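
The effect of dropping '&' from the character classes in 968d3e8465 can be sketched in a few lines of Python. This is only an illustration: the host `blog.example.org` and both patterns are hypothetical and are not copied from the project's extractors.

```python
import re
from urllib.parse import urlsplit

# Hypothetical patterns for illustration only.
# OLD treats '&' as a component delimiter, NEW follows RFC 3986
# and stops only at '/', '?' and '#'.
OLD = re.compile(r"https?://blog\.example\.org/([^/?&#]+)")
NEW = re.compile(r"https?://blog\.example\.org/([^/?#]+)")

url = "https://blog.example.org/a&b/archives/1?page=2#top"

old_name = OLD.match(url).group(1)  # cut short at the '&'
new_name = NEW.match(url).group(1)  # keeps the whole path segment

# The standard library splits the URL the same way: '&' stays in the path.
parts = urlsplit(url)

print(old_name, new_name, parts.path, parts.query, parts.fragment)
```

With the old class the first path segment is truncated to `a`; with the new one it is captured whole as `a&b`, which agrees with how `urlsplit` separates path, query, and fragment.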
Mike Fährmann
e89413da22
update test results
2020-01-12 21:41:06 +01:00
Mike Fährmann
c0a1241648
[livedoor] force https:// for image URLs
2019-12-03 16:14:05 +01:00
Mike Fährmann
978cb03f81
update misc test results
- Livedoor now uses https:// for its image URLs
- Instagram image URLs got simplified
2019-11-20 21:45:48 +01:00
Mike Fährmann
da6789b2b0
disable unique archive id checks for some tests
- same image twice in a livedoor blog post
- unreliable results for related pinterest items
2019-11-10 17:04:51 +01:00
Mike Fährmann
4e8a548a61
[livedoor] update metadata extraction
2019-09-06 21:44:25 +02:00
Mike Fährmann
40c7eb3424
[livedoor] improve extraction (fixes #301)
2019-06-06 15:22:27 +02:00
Mike Fährmann
1cde38110d
[livedoor] return 'date' as datetime object
2019-05-25 23:45:56 +02:00
Mike Fährmann
e88824e1a7
[livedoor] fix adjustments for https:// URLs
2019-05-25 23:45:22 +02:00
Mike Fährmann
f2cf1c1d73
use 'text.extract_from()' in a few places
2019-04-21 15:19:20 +02:00
Mike Fährmann
35919a9bb8
[livedoor] add blog- and post-extractors (#190)
2019-04-06 16:27:48 +02:00