gallery-dl

mirror of https://github.com/mikf/gallery-dl.git synced 2024-11-26 12:42:29 +01:00

Author	SHA1	Message	Date
Mike Fährmann	8361d874d7	[hitomi] fix extraction	2019-10-29 16:23:20 +01:00
Mike Fährmann	5fa6ff04dd	[instagram] extract '__additionalDataLoaded' (#391 ) The '_sharedData' of Post pages is missing its 'graphql' part for logged in users. This data is now included in the parameters of a function call to '__additionalDataLoaded(...)' And, of course, video extraction with youtube-dl broke because of this change as well.	2019-10-29 16:00:31 +01:00
Mike Fährmann	5af291ba5c	include failed downloads and child extractors in exit status	2019-10-29 15:56:54 +01:00
Mike Fährmann	322c2e7ed4	renaming variables mostly 'keyword(s)' to 'kwdict'	2019-10-29 15:46:35 +01:00
Mike Fährmann	87a87bff7e	[simplyhentai] fix image URLs	2019-10-28 21:11:06 +01:00
Mike Fährmann	4409d00141	embed error messages in StopExtraction exceptions	2019-10-28 16:39:49 +01:00
Mike Fährmann	d5e3910270	adjust 'util.raises()'	2019-10-28 15:06:17 +01:00
Mike Fährmann	d44f790e81	adjust output for HTTP status related errors	2019-10-27 23:55:02 +01:00
Mike Fährmann	03e0cec715	return with non-zero exit status on error	2019-10-27 23:54:18 +01:00
Mike Fährmann	c887493a80	overhaul exception stuff	2019-10-27 23:53:37 +01:00
Mike Fährmann	109718a5e3	[blogger] add blog and post extractors (closes #364 )	2019-10-26 14:15:55 +02:00
Mike Fährmann	244d396b0b	add '--ugoira-conv-lossless' command-line option (#432 ) and cleanup the arguments for the regular '--ugoira-conv': - remove '-an' - enable two-pass encoding	2019-10-26 00:32:19 +02:00
Mike Fährmann	49a6b1b6c0	[twitter] extract video stream info without youtube-dl (#452 ) This should allow video downloads when logged in without 'forward-cookies' disabled and from protected tweets. youtube-dl still gets used to download HLS playlists, but the data extraction part, which doesn't work with youtube-dl at the moment, now gets handled by gallery-dl itself.	2019-10-25 13:41:36 +02:00
Mike Fährmann	9f0dbf2a72	[twitter] raise proper exception for protected Tweets	2019-10-25 13:26:16 +02:00
Mike Fährmann	083e14ad9a	[downloader:ytdl] add data from '_ytdl_extra' to info_dicts	2019-10-25 13:17:13 +02:00
Mike Fährmann	6e08ada4fe	[luscious] simplify some metadata entries	2019-10-25 13:14:59 +02:00
Mike Fährmann	9e3a8607ee	[deviantart] update usernames (#455 ) In the case that a user changed his username, requesting deviations with an old name might cause problems (missing deviations, etc.) The internal 'username' value therefore now gets updated to the current username taken from the user profile.	2019-10-24 22:23:16 +02:00
Mike Fährmann	2eb38810c5	[twitter] fix image extraction when logged in (#452 ) ... for individual tweets. To get a Tweet page with the old Twitter layout, an Internet Explorer User-Agent (e.g. Mozilla/5.0 (Windows NT 6.1; WOW64; Trident/7.0; rv:11.0) like Gecko) as well as a Referer header pointing to the page itself is required. The "app_shell_visited" cookie appears to be optional at the moment, but that is what a regular web browser would send.	2019-10-23 22:18:29 +02:00
Mike Fährmann	8f38a35b91	[imgur] use API with "public" client_id (#446 ) Using the API endpoints makes it possible to access NSFW content without logging in.	2019-10-23 21:43:55 +02:00
Mike Fährmann	b23c822b23	[luscious] use GraphQL	2019-10-22 21:17:08 +02:00
Mike Fährmann	ef17d94469	update test results	2019-10-21 21:53:21 +02:00
Mike Fährmann	2057c6ba29	[naver] add blog and post extractors (closes #447 )	2019-10-21 16:59:15 +02:00
Mike Fährmann	a5e6252137	update deviantart client-id/-secret instructions (#451 , #424 )	2019-10-21 16:57:45 +02:00
Mike Fährmann	389d2d7e38	implement 'cookies-update' option (#445 )	2019-10-19 15:23:55 +02:00
Mike Fährmann	fbc0a6a059	[nozomi] skip unavailable posts (#388 )	2019-10-17 23:05:04 +02:00
Mike Fährmann	ae98dbcbb3	[nozomi] implement searching for negated terms (#388 ) It's incredibly slow and resource intensive (> 1GB of memory), but that is also how it is implemented on nozomi.la itself.	2019-10-17 22:53:37 +02:00
Mike Fährmann	1c03a389df	[twitter] small improvements to search extractor - put search results in separate directories - set 'max_position' to '-1' for first request -> prevent duplicate results - add a test - flake8	2019-10-17 19:50:59 +02:00
Mike Fährmann	c3042978b8	[deviantart] match "/gallery/all" (closes #449 )	2019-10-17 17:54:44 +02:00
Alice	bcddcca6db	Add search downloading to twitter.py (#448 ) Adds the functionality to download search results on twitter.com/search. Since twitter only allows downloading of up to 3,200 of a users most recent tweets, you will be unable to download old images from users with a lot of tweets. To bypass this, you can use the twitter search to get the tweets from the sections in time you were stopped at. An example search would be "from:user since:2015-01-01 until:2016-01-01 filter:images". The URL you would use will look something like this https://twitter.com/search?f=tweets&q=from%3Asupernaturepics%20since%3A2015-01-01%20until%3A2016-01-01%20filter%3Aimages&src=typd&lang=en The _tweets_from_api function had to be changed because it would not get the next page of results using the last "data-tweet-id". It would return the same JSON but with a "min_position" string added. Using this string for the "max_position" param from the second page onwards correctly returned the next pages. This change does not interfere with how the other extractors work as far as I know. The 2 regex patterns in the extractors had to be changed to not match the search URL.	2019-10-16 18:23:10 +02:00
Mike Fährmann	1693d97bd3	update extractor class hierarchies - let the GalleryExtractor class inherit directly from Extractor - make ChapterExtractor a subclass of GalleryExtractor - change enumeration field names of GalleryExtractors to 'num'	2019-10-16 18:15:29 +02:00
Mike Fährmann	7ebd984e8d	[imgur] print error message if no JSON data is found (#446 )	2019-10-16 17:45:14 +02:00
Mike Fährmann	5882b00f2f	[imgur] implement login support (#446 )	2019-10-15 22:00:22 +02:00
Mike Fährmann	91643ca54b	[nozomi] add search extractor (#388 )	2019-10-14 23:49:46 +02:00
Mike Fährmann	1adafdd3d0	document cache file requirement for DeviantArt refresh tokens	2019-10-13 23:01:57 +02:00
Mike Fährmann	df2b3c6888	restore OAuth2 authentication error messages	2019-10-13 22:48:01 +02:00
Mike Fährmann	6779512fc7	[nozomi] add post and tag extractors (#388 )	2019-10-13 22:16:03 +02:00
Mike Fährmann	6abe5f5bbb	[patreon] fix pagination (#444 ) The Patreon-provided URLs for the next set of posts aren't always complete, i.e. they can be missing their scheme and the subsequent double slash: "www.patreon.com/…"	2019-10-12 22:30:51 +02:00
Mike Fährmann	ff1e4a86aa	release version 1.10.6	2019-10-11 20:15:56 +02:00
Mike Fährmann	d4ffd6c952	[yaplog] improve metadata extraction (#443 ) - provide a fallback if there is no numerical image ID - add a 'filename' field - convert 'date' to an actual datetime object	2019-10-11 18:39:52 +02:00
Mike Fährmann	15af2f8464	[hitomi] fallback to /reader/ page if main page returns 404 Some galleries return a 404: Not Found error when trying to access them through the main gallery URL, but their content is still available on the respective /reader/ page.	2019-10-11 18:39:52 +02:00
Mike Fährmann	8af59a4bba	fix & update docs - update Requests links - add example for --exec - set '-dev' version	2019-10-11 18:36:25 +02:00
Mike Fährmann	dc6ad81e2e	[yaplog] prevent crash on empty posts (#443 )	2019-10-10 21:19:09 +02:00
Mike Fährmann	94eb7c6cad	[deviantart] fix sta.sh extraction (436)	2019-10-10 18:40:15 +02:00
Mike Fährmann	1032cfa34b	[downloader:http] extend mimetype map with archive formats	2019-10-10 18:30:23 +02:00
Mike Fährmann	27b5b2497e	[deviantart] fix download URLs (#436 ) ... except for sta.sh content. Instead of using the old '/api/v1/oauth2/deviation/download' endpoint, which started delivering URLs to 404 pages a while ago, it is also possible to get a download URL from the relatively new '/_napi/da-browse/shared_api/deviation/extended_fetch' endpoint used by DeviantArt's Eclipse interface. The current strategy is therefore: - Iterate over deviations using the OAuth2 API - Fetch original download URLs with the new NAPI/Shared API	2019-10-09 20:35:52 +02:00
Mike Fährmann	93aac8dfea	[yaplog] fix incomplete image URLs (#443 )	2019-10-09 17:42:15 +02:00
Mike Fährmann	a782b009b8	[yaplog] match blog names with '-' (#443 )	2019-10-09 17:40:30 +02:00
Mike Fährmann	cf5e716b9d	[hitomi] fix image URLs	2019-10-09 17:21:37 +02:00
Mike Fährmann	ad81c07204	[postprocessor] match logger names of downloader modules The logger name for a postprocessor object got changed to "postprocessor.<module-name>" instead of just "postprocessor"	2019-10-06 23:30:18 +02:00
Mike Fährmann	03bc8adfc7	[postprocessor:exec] run after file moved to target location (#421)	2019-10-06 23:12:22 +02:00

1 2 3 4 5 ...

2012 Commits