Mike Fährmann
fffbfd3dce
[imgspice] fix extraction
2019-03-09 20:29:23 +01:00
Mike Fährmann
5530871b5a
change results of text.nameext_from_url()
...
Instead of getting a complete 'filename' from a URL and splitting that
into 'name' and 'extension', the new approach gets rid of the complete
version and renames 'name' to 'filename'. (Using anything other than
{extension} for a filename extension doesn't really work anyway)
Example: "https://example.org/path/filename.ext"
before:
- filename : filename.ext
- name : filename
- extension: ext
now:
- filename : filename
- extension: ext
2019-02-14 16:07:17 +01:00
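As a rough illustration of the new result shape described in the commit above, the following Python sketch splits the last path segment of a URL into 'filename' and 'extension'. The function name nameext_sketch and its handling of names without a dot are assumptions made for this example; it is not the actual text.nameext_from_url() implementation.

```python
from urllib.parse import unquote, urlsplit


def nameext_sketch(url):
    """Hypothetical helper: return 'filename' (without extension) and 'extension'."""
    # Take the last path segment, ignoring query string and fragment.
    basename = unquote(urlsplit(url).path.rpartition("/")[2])
    # Split at the last dot; a leading dot alone does not count as an extension.
    name, sep, ext = basename.rpartition(".")
    if sep and name:
        return {"filename": name, "extension": ext}
    return {"filename": basename, "extension": ""}


print(nameext_sketch("https://example.org/path/filename.ext"))
# prints {'filename': 'filename', 'extension': 'ext'}
```

The complete name can still be rebuilt from the two parts as "{filename}.{extension}" when needed.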
Mike Fährmann
2e516a1e3e
store the full original URL in Extractor.url
2019-02-12 18:46:48 +01:00
Mike Fährmann
4b1880fa5e
propagate 'match' to base extractor constructor
2019-02-11 13:31:10 +01:00
Mike Fährmann
abbd45d0f4
update handling of extractor URL patterns
...
When loading extractor classes during 'extractor.find(…)', their
'pattern' attribute will be replaced with a compiled version of itself.
2019-02-08 20:08:16 +01:00
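The lazy compilation described in the commit above could look roughly like the sketch below. The ExampleExtractor class, its pattern, and this simplified find() function are hypothetical stand-ins; the real 'extractor.find(…)' may differ in signature and behavior.

```python
import re


class ExampleExtractor:
    """Hypothetical extractor class with its URL pattern stored as a string."""
    pattern = r"(?:https?://)?example\.org/img/(\w+)"


def find(url, classes):
    """Return (class, match) for the first class whose pattern matches url."""
    for cls in classes:
        # On first use, replace the string with its compiled version,
        # so later lookups reuse the compiled pattern object.
        if isinstance(cls.pattern, str):
            cls.pattern = re.compile(cls.pattern)
        match = cls.pattern.match(url)
        if match:
            return cls, match
    return None


cls, match = find("https://example.org/img/abc123", [ExampleExtractor])
print(cls.__name__, match.group(1))
# prints ExampleExtractor abc123
```

Storing the compiled object back on the class means each pattern is compiled at most once, and only for classes that are actually checked.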
Mike Fährmann
6284731107
simplify extractor constants
...
- single strings for URL patterns
- tuples instead of lists for 'directory_fmt' and 'test'
- single-tuple tests where applicable
2019-02-08 13:45:40 +01:00
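A minimal sketch of what the simplified constants from the commit above might look like on an extractor class. The class name, the pattern, and the expected-results dictionary are placeholders invented for this example and are not taken from the repository.

```python
class ExampleImageExtractor:
    """Hypothetical extractor; all values below are illustrative only."""
    category = "example"
    # A single pattern string instead of a list of pattern strings.
    pattern = r"(?:https?://)?example\.org/img/(\w+)"
    # A tuple instead of a list.
    directory_fmt = ("{category}",)
    # One test written directly as a (URL, expected results) tuple
    # rather than as a list containing one such tuple.
    test = ("https://example.org/img/abc123", {"keyword": {"filename": "abc123"}})


print(type(ExampleImageExtractor.directory_fmt).__name__)
# prints tuple
```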
Mike Fährmann
34bab080ae
rewrite URL patterns to use only 1 per extractor
2019-02-08 12:03:10 +01:00
Mike Fährmann
793b24e513
[imagehosts] fix and improve various extractors
2019-02-06 17:41:26 +01:00
Mike Fährmann
6126615698
update URLs for supportedsites.rst
2019-01-30 16:18:22 +01:00
Mike Fährmann
e53cdfd6a8
update build_supportedsites.py
2019-01-09 14:58:35 +01:00
Mike Fährmann
fd8ed35591
[turboimagehost] fix extraction
2018-10-23 21:08:24 +02:00
Mike Fährmann
f3793660ef
update tests
2018-08-02 14:57:28 +02:00
Mike Fährmann
ecdc3475b8
[pixhost] support .to TLDs
2018-05-23 18:32:34 +02:00
Mike Fährmann
95392554ee
use text.urljoin()
2018-04-26 17:00:26 +02:00
Mike Fährmann
564e12ca8f
replace 'imgyt' with 'imxto'
...
https://img.yt/ wasn't available for a couple of days, but has now
re-emerged as https://imx.to/ with a new web-interface.
Links to older images still work (see tests).
2018-04-09 15:53:20 +02:00
Mike Fährmann
7847ab1d5a
[imagehosts] remove even more dead sites
...
All removed sites either
- reject all incoming connections or
- display a message from their domain registrar
2018-03-12 21:25:13 +01:00
Mike Fährmann
179ecee965
[turboimagehost] fix extraction
2018-03-06 14:25:10 +01:00
Mike Fährmann
8f338347b6
[imagehosts] cleanup
...
removed
- chronos.to - unable to resolve hostname
- coreimg.net - same
- imgmaid.net - same
- hosturimage.com - everything returns 404
- imageontime.org - redirects to some shady site
- imgupload.yt - cloudflare error 522, host down
- img4ever.net - read timeout
2018-02-23 01:05:42 +01:00
Mike Fährmann
34873dbd90
set 'archive_fmt' values
...
These are going to be used to create a unique ID for each image.
2018-02-01 15:30:49 +01:00
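One way a format string such as 'archive_fmt' could be turned into a per-image ID is sketched below. The archive_id() helper, the "{token}" format, and the idea of prefixing the category are assumptions for illustration; the commit message above does not say how the IDs are actually built.

```python
def archive_id(category, archive_fmt, metadata):
    """Hypothetical helper: build an ID by filling archive_fmt from image metadata."""
    # Prefixing the category keeps IDs from different sites apart
    # even when their formatted parts happen to be equal.
    return category + archive_fmt.format_map(metadata)


metadata = {"token": "abc123", "filename": "picture", "extension": "jpg"}
print(archive_id("example", "{token}", metadata))
# prints exampleabc123
```

Such an ID can be stored after a successful download and checked beforehand to skip images that were already fetched.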
Mike Fährmann
239d7afea7
[hosturimage] fix extraction of larger images
2017-10-25 12:56:16 +02:00
Mike Fährmann
68a0a7579c
fix/improve some regular expressions
2017-10-09 22:37:50 +02:00
Mike Fährmann
8e14714c2b
[imgspice] fix extraction
2017-09-26 21:04:48 +02:00
Mike Fährmann
f32b1a0292
[imgyt] fix extraction
2017-09-14 15:04:32 +02:00
Mike Fährmann
49c7e70c10
[acidimg] add image extractor
2017-09-09 15:19:18 +02:00
Mike Fährmann
0245a0ba5f
fix extraction and update test results
...
- fixes for hbrowse, imgyt, imgcandy, hosturimage
- test updates for deviantart, gfycat
2017-08-08 19:11:13 +02:00
Mike Fährmann
c951d6276c
[imagetwist] use https
2017-06-24 16:21:00 +02:00
Mike Fährmann
c184e47ee3
put common directory- and filename formats in base classes
2017-05-30 12:10:16 +02:00
Mike Fährmann
244ab75cad
[kissmanga] update AES key retrieval
2017-04-21 20:36:47 +02:00
Chen John L
a5485a46cb
fixed the module for pixhost
2017-04-21 19:54:10 +08:00
Mike Fährmann
841fd50242
move code into util.py
2017-03-28 13:12:44 +02:00
Mike Fährmann
4e7661ab01
[imgtrex] re-add extractor
2017-03-21 15:47:51 +01:00
Mike Fährmann
0b59d9f8c7
disable urllib3's InsecureConnectionWarning
2017-02-11 21:21:57 +01:00
Mike Fährmann
0af02007a9
[imagetwist] fix site access
2017-02-08 22:59:00 +01:00
Mike Fährmann
7880cc1ad7
[imgtrex] remove extractor - domain no longer exists
2017-02-05 16:54:04 +01:00
Mike Fährmann
21e0dfbe20
[chronos] raise NotFoundError instead of crashing
2017-02-02 15:54:50 +01:00
Mike Fährmann
2b38398940
[imgyt] raise NotFoundError instead of crashing
2017-02-02 15:52:48 +01:00
Mike Fährmann
94e10f249a
code adjustments according to pep8 nr2
2017-02-01 00:53:19 +01:00
Mike Fährmann
d82508f245
fix tests for turboimagehost and pinterest
2017-01-27 22:40:18 +01:00
Mike Fährmann
ad4b02508f
trying to understand travis-ci unit test failures
...
- added some debug output via logging module
- unit tests work on my machine (tm)
2017-01-12 22:35:42 +01:00
Mike Fährmann
c604a65b88
[imgyt] use token as filename if none is given
2016-12-25 00:53:32 +01:00
Mike Fährmann
b0e8daf415
[imgclick] remove extractor - uses captcha
2016-12-08 16:51:25 +01:00
Mike Fährmann
2fae0b1803
[fapat] add extractor
2016-12-07 08:45:52 +01:00
Mike Fährmann
583f1b8bbb
[postimg] add extractor
2016-12-06 12:46:41 +01:00
Mike Fährmann
d1cd9acf54
[pixhost] adjust to new site layout
2016-12-06 10:05:24 +01:00
Mike Fährmann
d402e644bf
update tests
2016-11-29 17:11:41 +01:00
Mike Fährmann
46440fda2d
[imagevenue] add extractor
2016-11-28 22:30:00 +01:00
Mike Fährmann
99440ca51a
[imgtrial] add extractor
2016-11-13 21:25:37 +01:00
Mike Fährmann
5f2824dfe6
[imgspot] add extractor
2016-11-13 21:24:38 +01:00
Mike Fährmann
88193718e8
[pixhost] add extractor
2016-11-09 12:03:14 +01:00
Mike Fährmann
07e9e2c4f1
[imgmaid] add extractor
2016-11-08 00:17:10 +01:00