Mike Fährmann
e0bdacd932
[fappic] add 'image' extractor (closes #1898)
2021-09-28 23:35:29 +02:00
Mike Fährmann
bd08ee2859
remove most 'yield Message.Version' statements
...
only leave them in oauth.py as noop results
2021-08-16 03:10:48 +02:00
Mike Fährmann
64240c8d42
[imagevenue] fix extraction
...
(closes #1677)
2021-07-09 20:13:18 +02:00
Mike Fährmann
204523611c
[imgclick] use 'http://' for image URLs
...
The TLS certificate for main.imgclick.net is invalid.
2021-03-19 01:30:49 +01:00
Mike Fährmann
ca6b0fc2ac
[imagehosts] cleanup
2021-02-10 17:36:45 +01:00
Mike Fährmann
95a66bdad6
[imgclick] add 'image' extractor (closes #1307)
...
basically reverts b0e8daf415
2021-02-10 17:22:12 +01:00
Mike Fährmann
e5e591b848
[vipr] simplify and add test (#1258)
2021-01-20 21:40:04 +01:00
v-delta
e707e060cb
[vipr] add image extractor (#1258)
...
* [vipr] add image extractor
Adds support for images hosted on https://vipr.im
* Fix codestyle issues
2021-01-20 21:38:43 +01:00
Mike Fährmann
1e3dd7330e
merge SharedConfigMixin functionality into Extractor
2020-11-17 00:34:07 +01:00
Mike Fährmann
968d3e8465
remove '&' from URL patterns
...
'/?&#' -> '/?#' and '?&#' -> '?#'
According to https://www.ietf.org/rfc/rfc3986.txt, URLs are
"organized hierarchically" by using "the slash ("/"), question
mark ("?"), and number sign ("#") characters to delimit components"
2020-10-22 23:31:25 +02:00
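A minimal sketch of what dropping '&' from a negated character class changes. The two patterns and the example.org URL are hypothetical and only illustrate the '/?&#' -> '/?#' rewrite; they are not taken from any extractor.

```python
import re

# Hypothetical patterns, for illustration only.
# OLD treats '&' as a delimiter; NEW only stops at '/', '?' and '#',
# the component delimiters named in RFC 3986.
OLD = re.compile(r"https?://example\.org/([^/?&#]+)")
NEW = re.compile(r"https?://example\.org/([^/?#]+)")


def first_segment(pattern, url):
    """Return the first path segment captured by 'pattern', or None."""
    match = pattern.match(url)
    return match.group(1) if match else None


url = "https://example.org/a&b?x=1#frag"
old_segment = first_segment(OLD, url)  # stops at '&'
new_segment = first_segment(NEW, url)  # stops at '?'
```

With the old class the captured segment ends at the '&'; with the new one it runs up to the query delimiter.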
Mike Fährmann
9be7ff600e
[imagetwist] replace test image
...
the old one expired, it seems
2019-12-22 22:13:13 +01:00
Mike Fährmann
d0920e84e9
update test results
2019-12-19 18:33:48 +01:00
Mike Fährmann
a2af2d2965
adjust cache maxage values
2019-03-14 22:21:49 +01:00
Mike Fährmann
fffbfd3dce
[imgspice] fix extraction
2019-03-09 20:29:23 +01:00
Mike Fährmann
5530871b5a
change results of text.nameext_from_url()
...
Instead of getting a complete 'filename' from a URL and splitting that
into 'name' and 'extension', the new approach gets rid of the complete
version and renames 'name' to 'filename'. (Using anything other than
{extension} for a filename extension doesn't really work anyway)
Example: "https://example.org/path/filename.ext"
before:
- filename : filename.ext
- name : filename
- extension: ext
now:
- filename : filename
- extension: ext
2019-02-14 16:07:17 +01:00
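The before/after result described in this commit can be sketched as follows. The helper name split_name_ext is hypothetical and this is not the actual body of text.nameext_from_url(); it only reproduces the 'now' layout with 'filename' and 'extension' keys.

```python
from urllib.parse import unquote, urlsplit


def split_name_ext(url):
    """Hypothetical helper: return the new-style result for 'url'.

    'filename' holds the name without its extension and
    'extension' holds the part after the last dot (or "").
    """
    path = urlsplit(url).path
    name = unquote(path.rpartition("/")[2])
    stem, dot, ext = name.rpartition(".")
    if dot and stem:
        return {"filename": stem, "extension": ext}
    # No dot, or a leading dot only: treat everything as the filename.
    return {"filename": name, "extension": ""}


result = split_name_ext("https://example.org/path/filename.ext")
```

Here 'result' has 'filename' set to "filename" and 'extension' set to "ext", matching the 'now' case listed in this commit.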
Mike Fährmann
2e516a1e3e
store the full original URL in Extractor.url
2019-02-12 18:46:48 +01:00
Mike Fährmann
4b1880fa5e
propagate 'match' to base extractor constructor
2019-02-11 13:31:10 +01:00
Mike Fährmann
abbd45d0f4
update handling of extractor URL patterns
...
When loading extractor classes during 'extractor.find(…)', their
'pattern' attribute will be replaced with a compiled version of itself.
2019-02-08 20:08:16 +01:00
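A rough sketch of the idea, assuming a lookup function that compiles each class's 'pattern' string on first use and stores the compiled object back on the class. ExampleExtractor, its pattern, and this find() are hypothetical stand-ins, not the project's actual code.

```python
import re


class ExampleExtractor:
    """Hypothetical extractor class with a single string pattern."""
    pattern = r"(?:https?://)?example\.org/img/(\w+)"

    def __init__(self, match):
        self.match = match


def find(url, classes):
    """Return an instance of the first class whose pattern matches 'url'."""
    for cls in classes:
        # Assumption: replace the string with its compiled form once,
        # so later lookups reuse the compiled pattern.
        if isinstance(cls.pattern, str):
            cls.pattern = re.compile(cls.pattern)
        match = cls.pattern.match(url)
        if match:
            return cls(match)
    return None


extractor = find("https://example.org/img/abc123", [ExampleExtractor])
```

After the first call, ExampleExtractor.pattern is a compiled pattern object rather than a string, and the match object is handed to the constructor.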
Mike Fährmann
6284731107
simplify extractor constants
...
- single strings for URL patterns
- tuples instead of lists for 'directory_fmt' and 'test'
- single-tuple tests where applicable
2019-02-08 13:45:40 +01:00
Mike Fährmann
34bab080ae
rewrite URL patterns to use only 1 per extractor
2019-02-08 12:03:10 +01:00
Mike Fährmann
793b24e513
[imagehosts] fix and improve various extractors
2019-02-06 17:41:26 +01:00
Mike Fährmann
6126615698
update URLs for supportedsites.rst
2019-01-30 16:18:22 +01:00
Mike Fährmann
e53cdfd6a8
update build_supportedsites.py
2019-01-09 14:58:35 +01:00
Mike Fährmann
fd8ed35591
[turboimagehost] fix extraction
2018-10-23 21:08:24 +02:00
Mike Fährmann
f3793660ef
update tests
2018-08-02 14:57:28 +02:00
Mike Fährmann
ecdc3475b8
[pixhost] support .to TLDs
2018-05-23 18:32:34 +02:00
Mike Fährmann
95392554ee
use text.urljoin()
2018-04-26 17:00:26 +02:00
Mike Fährmann
564e12ca8f
replace 'imgyt' with 'imxto'
...
https://img.yt/ wasn't available for a couple of days, but has now
re-emerged as https://imx.to/ with a new web-interface.
Links to older images still work (see tests).
2018-04-09 15:53:20 +02:00
Mike Fährmann
7847ab1d5a
[imagehosts] remove even more dead sites
...
All removed sites either
- reject all incoming connections or
- display a message from their domain registrar
2018-03-12 21:25:13 +01:00
Mike Fährmann
179ecee965
[turboimagehost] fix extraction
2018-03-06 14:25:10 +01:00
Mike Fährmann
8f338347b6
[imagehosts] cleanup
...
removed
- chronos.to - unable to resolve hostname
- coreimg.net - same
- imgmaid.net - same
- hosturimage.com - everything returns 404
- imageontime.org - redirects to some shady site
- imgupload.yt - cloudflare error 522, host down
- img4ever.net - read timeout
2018-02-23 01:05:42 +01:00
Mike Fährmann
34873dbd90
set 'archive_fmt' values
...
These are going to be used to create a unique id for each image.
2018-02-01 15:30:49 +01:00
Mike Fährmann
239d7afea7
[hosturimage] fix extraction of larger images
2017-10-25 12:56:16 +02:00
Mike Fährmann
68a0a7579c
fix/improve some regular expressions
2017-10-09 22:37:50 +02:00
Mike Fährmann
8e14714c2b
[imgspice] fix extraction
2017-09-26 21:04:48 +02:00
Mike Fährmann
f32b1a0292
[imgyt] fix extraction
2017-09-14 15:04:32 +02:00
Mike Fährmann
49c7e70c10
[acidimg] add image extractor
2017-09-09 15:19:18 +02:00
Mike Fährmann
0245a0ba5f
fix extraction and update test results
...
- fixes for hbrowse, imgyt, imgcandy, hosturimage
- test updates for deviantart, gfycat
2017-08-08 19:11:13 +02:00
Mike Fährmann
c951d6276c
[imagetwist] use https
2017-06-24 16:21:00 +02:00
Mike Fährmann
c184e47ee3
put common directory- and filename formats in base classes
2017-05-30 12:10:16 +02:00
Mike Fährmann
244ab75cad
[kissmanga] update AES key retrieval
2017-04-21 20:36:47 +02:00
Chen John L
a5485a46cb
fixed the module for pixhost
2017-04-21 19:54:10 +08:00
Mike Fährmann
841fd50242
move code into util.py
2017-03-28 13:12:44 +02:00
Mike Fährmann
4e7661ab01
[imgtrex] re-add extractor
2017-03-21 15:47:51 +01:00
Mike Fährmann
0b59d9f8c7
disable urllib3's InsecureConnectionWarning
2017-02-11 21:21:57 +01:00
Mike Fährmann
0af02007a9
[imagetwist] fix site access
2017-02-08 22:59:00 +01:00
Mike Fährmann
7880cc1ad7
[imgtrex] remove extractor - domain no longer exists
2017-02-05 16:54:04 +01:00
Mike Fährmann
21e0dfbe20
[chronos] raise NotFoundError instead of crashing
2017-02-02 15:54:50 +01:00
Mike Fährmann
2b38398940
[imgyt] raise NotFoundError instead of crashing
2017-02-02 15:52:48 +01:00
Mike Fährmann
94e10f249a
code adjustments according to pep8 nr2
2017-02-01 00:53:19 +01:00