gallery-dl

mirror of https://github.com/mikf/gallery-dl.git synced 2024-11-22 18:53:21 +01:00

Author	SHA1	Message	Date
Mike Fährmann	ec61696316	add 't' format string conversion (closes #1065 ) to Trim whitespace from the beginning and end of strings. Example: '{field!t}' becomes 'foo' for 'field' == " \nfoo\t\r"	2020-10-16 00:37:22 +02:00
Mike Fährmann	1b1cf01d0d	add a general 'generate_csrf_token()' function	2020-10-15 15:14:18 +02:00
Mike Fährmann	d5fa716d89	fix crash when using 'skip=false' and archive (fixes #1023 ) Separating the archive check from pathfmt.exists() in `b5243297` had some unintended side effects. It is also not possible to monkey-patch a dunder method like __contains__ because of the special method lookup that gets performed for them.	2020-09-23 19:07:40 +02:00
Mike Fährmann	65744a7a31	use alternative for all falsey values in format strings … and not just None (#525) It would be better to consistently use None for all non-existent fields and/or fields without a valid value, but this is a good enough workaround for now.	2020-09-19 22:02:47 +02:00
Mike Fährmann	b5243297ff	write skipped files to archive (closes #550 )	2020-09-03 18:37:38 +02:00
Mike Fährmann	4d8b3e4f70	defer directory creation (fixes #722 ) Only call os.makedirs() before a file is getting downloaded, and not immediately for every Directory message.	2020-07-04 22:15:23 +02:00
Mike Fährmann	1ae1df0d27	update '--write-pages' (#737 ) - fix infinite recursion for responses with multiple entries in 'history' - hide values of Set-Cookie headers - only write the response content by default (use '-o write-pages=all' to also include HTTP headers)	2020-06-18 15:07:30 +02:00
Mike Fährmann	1fcf938f9c	implement a general 'delete_items()' function	2020-06-06 23:49:49 +02:00
Mike Fährmann	ddc253cf9a	implement a 'path-replace' option (#662 , #755 )	2020-05-25 22:21:58 +02:00
Mike Fährmann	15c3d29062	move dump_response() into a separate function (#737 )	2020-05-25 22:21:58 +02:00
Mike Fährmann	bc53302ad6	extend 'path-restrict' option Allow its value to be a JSON object / Python dict that specifies a mapping from invalid/unwanted input characters to specific output characters. For example {"/": "-", "": "+"} will transform "foo / bar*" into "foo - +++bar+++" (closes #662, #755)	2020-05-25 22:21:56 +02:00
Mike Fährmann	3201fe3521	add global SENTINEL object	2020-05-19 22:32:53 +02:00
Mike Fährmann	c8787647ed	add global WINDOWS bool	2020-05-19 22:32:53 +02:00
Mike Fährmann	ece73b5b2a	make 'path' and 'keywords' available in logging messages Wrap all loggers used by job, extractor, downloader, and postprocessor objects into a (custom) LoggerAdapter that provides access to the underlying job, extractor, pathfmt, and kwdict objects and their properties. __init__() signatures for all downloader and postprocessor classes have been changed to take the current Job object as their first argument, instead of the current extractor or pathfmt. (#574, #575)	2020-05-18 19:04:51 +02:00
Mike Fährmann	abbd8fbbd9	reset filenames on empty file extensions (#733 )	2020-05-18 19:04:50 +02:00
Mike Fährmann	38bc6430d3	[downloader:http] don't overwrite existing '_mtime' fields	2020-04-10 23:08:03 +02:00
Mike Fährmann	9159cb8fb3	remove trailing dots and spaces from directory names (#647 )	2020-03-19 21:12:18 +01:00
Mike Fährmann	90e4c645ba	[formatter] allow multiple "special" format specifiers (#595 ) It is now, for example, possible to specify multiple replacement operations per format replacement field: {name:Ra/b/Rc/d/}	2020-02-16 21:47:08 +01:00
Mike Fährmann	219c4cc78c	[formatter] allow for numeric list and string indices	2020-02-15 22:46:22 +01:00
Mike Fährmann	7d1da614d9	[formatter] implement field name alternatives (#525 ) The format string '{a\|b\|c}' will now try to use the value from 'a' and fall back to 'b' and 'c' if accessing a field raises an exception or if its value is None.	2020-02-15 17:58:21 +01:00
Mike Fährmann	56f1c96168	implement 'parent-directory' option (#551 )	2020-01-29 18:32:37 +01:00
Mike Fährmann	2a9be48511	improve util.load/save_cookiestxt() and add tests - take a file object as argument instead of an filename - accept whitespace before comments (" # comment") - map expiration "0" to None and not the number 0	2020-01-25 23:02:15 +01:00
Mike Fährmann	c1a6862863	implement functions to load/save cookies.txt files (closes #586 ) The methods of the standard libraries' MozillaCookieJar have several shortcomings (#HttpOnly_ cookies, 0 expiration timestamps, etc.) and require construction of an ultimately pointless CookieJar object.	2020-01-21 21:59:36 +01:00
Mike Fährmann	760b9b4db4	add remove_file() and remove_directory() helpers these functions call os.unlink() or os.rmdir() while catching and suppressing potential OSErrors	2020-01-18 00:21:26 +01:00
Mike Fährmann	b2d542ad40	improve PathFormat._enum_file() open only one try-except block for the whole loop, instead of one for each iteration in os.path.exists()	2020-01-18 00:21:25 +01:00
Mike Fährmann	025f6e3398	add fallback for missing WITHOUT ROWID support (#553 )	2020-01-03 22:58:28 +01:00
Mike Fährmann	58391d492d	cache archive keys generated in __contains__() (#524 ) To avoid writing a different key to the archive than what was checked against before the file download.	2019-12-20 16:43:08 +01:00
Mike Fährmann	0f1538af78	split filename formatting into its own function	2019-11-29 22:32:07 +01:00
Mike Fährmann	3fc1e12949	[postprocessor:metadata] filter private entries i.e. keys starting with an underscore	2019-11-21 16:58:44 +01:00
Mike Fährmann	d5e3910270	adjust 'util.raises()'	2019-10-28 15:06:17 +01:00
Mike Fährmann	c887493a80	overhaul exception stuff	2019-10-27 23:53:37 +01:00
Mike Fährmann	776e9e073f	close archive on job completion (#417 )	2019-09-10 22:43:51 +02:00
Mike Fährmann	0ce98169b8	improve path generation - fix 'abspath()' results for Python <3.7 (closes #402) - 'abspath()' in Python 3.7+ removes trailing path separators - in Python <3.7 it doesn't - filter empty path segments	2019-08-28 23:25:18 +02:00
Mike Fährmann	3284c62f22	ensure PathFormat.directory ends with a path separator ... plus some other small optimizations	2019-08-20 00:25:13 +02:00
Mike Fährmann	e77a656437	optimize directory path generation - use str.join() instead of os.path.join() (less "features", but 10x as fast) - cache directory formatters - detect and optimize field access for 1-element format strings	2019-08-19 15:56:20 +02:00
Mike Fährmann	454bf1ebf9	preserve enumeration index after 'set_extension()' (#306 )	2019-08-16 23:12:33 +02:00
Mike Fährmann	f5039b897f	replace DownloadArchive.check() with __contains__() Interestingly enough, 'a in obj' is slightly faster than 'obj.check(a)' and is also nicer to look at, I think.	2019-08-16 23:12:32 +02:00
Mike Fährmann	5a210991b6	Remove control characters from filesystem paths - add 'path-remove' option to specify the set of characters that should be removed - rename 'restrict-filenames' to 'path-restrict' - #348, #380	2019-08-16 23:12:16 +02:00
Mike Fährmann	0bb873757a	update PathFormat class - change 'has_extension' from a simple flag/bool to a field that contains the original filename extension - rename 'keywords' to 'kwdict' and some other stuff as well - inline 'adjust_path()' - put enumeration index before filename extension (#306)	2019-08-12 21:40:37 +02:00
Mike Fährmann	8dc42bb178	implement 'enumerate' for 'extractor.skip' (#306 ) [ci skip]	2019-08-08 18:37:54 +02:00
Mike Fährmann	b1bea8aaeb	add 'restrict-filenames' option (#348 )	2019-07-23 17:41:24 +02:00
Mike Fährmann	7b77ecc35a	fix paths for files without extension (#220 )	2019-07-15 16:39:03 +02:00
Mike Fährmann	16c582aaf9	implement 'mtime' post-processor (#332 ) This can set a file's modification time according to a UNIX timestamp or a datetime object from its metadata.	2019-07-14 22:39:17 +02:00
Mike Fährmann	40da44b17f	Merge branch 'v1.9.0'	2019-06-29 15:39:52 +02:00
Mike Fährmann	95b1e4c3c0	implement R<old>/<new>/ format option (#318 )	2019-06-23 22:45:44 +02:00
Mike Fährmann	f4ba98771d	use Last-Modified header to set file modification time (#236, #277)	2019-06-19 23:16:32 +02:00
Mike Fährmann	523ebc9b0b	Fix serialization of 'datetime' objects in '--write-metadata' Simplified universal serialization support in json.dump() can be achieved by passing 'default=str', which was already the case in DataJob.run() for -j/--dump-json, but not for the 'metadata' post-processor. This commit introduces util.dump_json() that (more or less) unifies the JSON output procedure of both --write-metadata and --dump-json. (#251, #252)	2019-05-09 16:49:22 +02:00
Mike Fährmann	23baecb29e	fix 'CONVERSIONS' variable name	2019-03-05 22:50:56 +01:00
Mike Fährmann	105097ddcf	add 'S' conversion options for format string fields Same as 's' (convert to string), but has a better, human-readable conversion for lists.	2019-03-04 21:13:34 +01:00
Mike Fährmann	148b8f15d0	update tests for util.py	2019-02-14 11:15:19 +01:00

1 2 3

126 Commits