Mike Fährmann
da6789b2b0
disable unique archive id checks for some tests
...
- same image twice in a livedoor blog post
- unreliable results for related pinterest items
2019-11-10 17:04:51 +01:00
Mike Fährmann
67e54ed8ea
release version 1.11.1
2019-11-09 00:52:06 +01:00
Mike Fährmann
ce98a86c0e
fix data file inclusion in source distributions
2019-11-09 00:47:13 +01:00
Mike Fährmann
6c86fbfe2a
release version 1.11.0
2019-11-08 22:34:56 +01:00
Mike Fährmann
ba083b30b2
fix snap build
...
… hopefully
2019-11-08 21:44:12 +01:00
Mike Fährmann
94a94f3b86
miscellaneous stuff
2019-11-08 20:58:53 +01:00
Mike Fährmann
4867ece13b
update README.rst
...
- use 'python3 -m pip' instead of pure 'pip'
- advice Windows users to use 'py -3'
- use '.tar.gz' for archive links since they are quite a bit smaller
- reword and simplify some stuff
2019-11-08 16:41:27 +01:00
Mike Fährmann
4f39d2ae64
install bash completion into share/bash-completion/completions
2019-11-07 23:23:48 +01:00
Mike Fährmann
b0197098e6
[imgur] get title from webpage if missing in API response
...
(closes #467 )
2019-11-07 21:10:04 +01:00
Mike Fährmann
dd5d2b2eac
[deviantart] add user profile extractor ( #377 , #419 )
2019-11-07 18:29:49 +01:00
Mike Fährmann
a437e78620
[deviantart] minimize cookie usage during scraps extraction
...
(#445 )
2019-11-05 21:55:13 +01:00
Mike Fährmann
1a197d2195
store the original cookiejar as Extractor._cookiejar
2019-11-05 21:53:22 +01:00
Mike Fährmann
de83ae4576
make 'method' argument of Extractor.request keyword-only
2019-11-05 17:28:09 +01:00
Mike Fährmann
a5be08a830
[downloader:ytdl] forward proxy settings
2019-11-05 16:16:26 +01:00
Mike Fährmann
4325695d74
[luscious] expand GraphQL queries
2019-11-04 21:17:22 +01:00
Mike Fährmann
94dbdbf506
[nijie] change default filename format
...
… to be consistent with Pixiv filenames
2019-11-04 20:47:38 +01:00
HRXN
794aace0bc
Add basic package version notice to README.rst ( #465 )
2019-11-03 23:52:50 +01:00
Mike Fährmann
9e88e7a344
[postprocessor:exec] improve ( #421 , #413 )
...
- add 'final' option
- include job status in pp finalization
- improve and extend documentation
2019-11-03 21:45:45 +01:00
Mike Fährmann
c18fadc221
[instagram] extract videos without youtube-dl ( #391 )
2019-11-03 14:02:56 +01:00
Mike Fährmann
f15eedb634
[sexcom] set Referer header for file downloads ( closes #464 )
2019-11-03 13:27:58 +01:00
Mike Fährmann
2a3bd4e3c7
rename extractor classes starting with a digit
2019-11-02 20:42:09 +01:00
Mike Fährmann
b3b9da6d74
[photobucket] replace test URL
...
The other user deleted all of is images.
2019-11-02 20:17:08 +01:00
Mike Fährmann
64786363be
[4chan] simplify
...
- remove 'chan.py'
- slight adjustments to directory and filenames
2019-11-02 20:11:21 +01:00
Mike Fährmann
557e2c018b
[8chan] remove module
2019-11-02 20:06:47 +01:00
Mike Fährmann
e14782a948
[instagram] simplify graphql extraction for post pages
2019-11-01 22:08:25 +01:00
Mike Fährmann
c01ff78467
[twitter] extend 'videos' option to force extraction with ytdl
...
(closes #459 )
2019-11-01 22:06:07 +01:00
Mike Fährmann
f8ac67ce50
[hitomi] extend URL pattern + follow redirects
2019-11-01 21:40:10 +01:00
Mike Fährmann
e782b4b230
update setup.py
...
- add 'cloudflare' and 'video' extra install target (#460 )
(e.g. 'pip install gallery-dl[cloudflare]')
- remove py2exe and distutils code
- get version without calling exec()
- inline variables
2019-11-01 21:33:48 +01:00
Mike Fährmann
e877ca97c3
[naver] adjust directory names and metadata structure
2019-10-31 16:53:48 +01:00
Mike Fährmann
702f2fbd1f
[issuu] add publication and user extractors ( #413 )
2019-10-31 16:52:57 +01:00
Mike Fährmann
8361d874d7
[hitomi] fix extraction
2019-10-29 16:23:20 +01:00
Mike Fährmann
5fa6ff04dd
[instagram] extract '__additionalDataLoaded' ( #391 )
...
The '_sharedData' of Post pages is missing its 'graphql' part for
logged in users. This data is now included in the parameters of a
function call to '__additionalDataLoaded(...)'
And, of course, video extraction with youtube-dl broke because of
this change as well.
2019-10-29 16:00:31 +01:00
Mike Fährmann
5af291ba5c
include failed downloads and child extractors in exit status
2019-10-29 15:56:54 +01:00
Mike Fährmann
322c2e7ed4
renaming variables
...
mostly 'keyword(s)' to 'kwdict'
2019-10-29 15:46:35 +01:00
Mike Fährmann
87a87bff7e
[simplyhentai] fix image URLs
2019-10-28 21:11:06 +01:00
Mike Fährmann
4409d00141
embed error messages in StopExtraction exceptions
2019-10-28 16:39:49 +01:00
Mike Fährmann
d5e3910270
adjust 'util.raises()'
2019-10-28 15:06:17 +01:00
Mike Fährmann
d44f790e81
adjust output for HTTP status related errors
2019-10-27 23:55:02 +01:00
Mike Fährmann
03e0cec715
return with non-zero exit status on error
2019-10-27 23:54:18 +01:00
Mike Fährmann
c887493a80
overhaul exception stuff
2019-10-27 23:53:37 +01:00
Mike Fährmann
109718a5e3
[blogger] add blog and post extractors ( closes #364 )
2019-10-26 14:15:55 +02:00
Mike Fährmann
244d396b0b
add '--ugoira-conv-lossless' command-line option ( #432 )
...
and cleanup the arguments for the regular '--ugoira-conv':
- remove '-an'
- enable two-pass encoding
2019-10-26 00:32:19 +02:00
Mike Fährmann
49a6b1b6c0
[twitter] extract video stream info without youtube-dl ( #452 )
...
This should allow video downloads when logged in without
'forward-cookies' disabled and from protected tweets.
youtube-dl still gets used to download HLS playlists, but the data
extraction part, which doesn't work with youtube-dl at the moment,
now gets handled by gallery-dl itself.
2019-10-25 13:41:36 +02:00
Mike Fährmann
9f0dbf2a72
[twitter] raise proper exception for protected Tweets
2019-10-25 13:26:16 +02:00
Mike Fährmann
083e14ad9a
[downloader:ytdl] add data from '_ytdl_extra' to info_dicts
2019-10-25 13:17:13 +02:00
Mike Fährmann
6e08ada4fe
[luscious] simplify some metadata entries
2019-10-25 13:14:59 +02:00
Mike Fährmann
9e3a8607ee
[deviantart] update usernames ( #455 )
...
In the case that a user changed his username, requesting deviations
with an old name might cause problems (missing deviations, etc.)
The internal 'username' value therefore now gets updated to the
current username taken from the user profile.
2019-10-24 22:23:16 +02:00
Mike Fährmann
2eb38810c5
[twitter] fix image extraction when logged in ( #452 )
...
... for individual tweets.
To get a Tweet page with the old Twitter layout, an Internet
Explorer User-Agent (e.g. Mozilla/5.0 (Windows NT 6.1; WOW64;
Trident/7.0; rv:11.0) like Gecko) as well as a Referer header
pointing to the page itself is required. The "app_shell_visited"
cookie appears to be optional at the moment, but that is what
a regular web browser would send.
2019-10-23 22:18:29 +02:00
Mike Fährmann
8f38a35b91
[imgur] use API with "public" client_id ( #446 )
...
Using the API endpoints makes it possible to access NSFW content
without logging in.
2019-10-23 21:43:55 +02:00
Mike Fährmann
b23c822b23
[luscious] use GraphQL
2019-10-22 21:17:08 +02:00