Commit Graph

64 Commits

Author SHA1 Message Date
Nikolaj Olsson
c78dda9571 Improve OCR replace list guessses 2020-06-14 20:23:35 +02:00
Nikolaj Olsson
4be1867bf0 Fix for ocr auto-guesses in last letter 2020-06-14 17:15:37 +02:00
Nikolaj Olsson
22eb7df74e Work on OCR fix engine
split words before j/y for guesses + update dictionaries
2020-06-14 17:01:44 +02:00
Nikolaj Olsson
83fd957887 Work on OCR + work on #4195 2020-05-23 21:22:05 +02:00
Nikolaj Olsson
32b8d875dc Fix bug in OCR auto guesses +fix new italic space detect for nOcr 2020-05-18 19:02:34 +02:00
Nikolaj Olsson
7b98e2e2f2 Fix switch of end chars in OCR-fix - thx moob :)
Fix #4119
2020-04-17 08:33:13 +02:00
Nikolaj Olsson
9e96ad4434 Minor OCR fixes 2020-04-16 09:42:27 +02:00
Nikolaj Olsson
afef6d0623 Minor refactor 2020-04-15 13:03:48 +02:00
Nikolaj Olsson
b2e8d3bb97 Remove some hardcoded OCR rules + add some softcoded rules 2020-04-12 09:05:39 +02:00
Nikolaj Olsson
fd1a288424 Fix issue with spell check replace - thx LithiumFlower :)
(also allow ".mts" ext for m2ts like files - thx borifax)
2020-03-15 08:46:15 +01:00
Waldi Ravens
5828cec1d1 [NameList] Add processing of user Names list 2020-01-23 03:42:21 +01:00
Waldi Ravens
5bff5c7a41 [OcrFixReplaceList] Fix XML tag name "PartialLinesAlways"
and replace inconsistent field name.
2020-01-21 00:27:02 +01:00
Nikolaj Olsson
b9af4c6c30 Fix issues with user-replace-list - thx shag00 :)
Fix #3904
2020-01-11 09:31:05 +01:00
Waldi Ravens
46782409f6 [OcrFixReplaceList] Fix initialisation from User list 2019-11-25 20:28:31 +01:00
Nikolaj Olsson
f1f8a4df8f Yet another fix - thx xylographe :) 2019-11-25 19:54:58 +01:00
Nikolaj Olsson
b2b20dc7e2 Fix xml tags 2019-11-25 19:40:27 +01:00
Nikolaj Olsson
626d290d1f Fix a few more issues with OcrFixReplaceList - thx xylographe :) 2019-11-25 19:29:51 +01:00
Nikolaj Olsson
fbd659fc09 Fix BeginLines/EndLines/WholeWords/RegEx for OcrFixReplaceLst_User files - thx Maitch :)
Fix #3850
2019-11-25 18:28:15 +01:00
niksedk
074ec4565a Improve "fix ocvr errors" regarding uppercase words + periods - thx Araynilmar :)
Fix #3733
2019-09-01 10:39:23 +02:00
Waldi Ravens
66e36a8321 Formatting (whitespace only) 2019-08-16 22:55:32 +02:00
niksedk
92a427c4d5 Fix reading name list online - thx splerman :)
Fix #3693
2019-08-14 17:00:29 +02:00
Nikolaj Olsson
28c4deee01 Fix "Change all" for whole text in OCR - thx wtester7 :)
(user file was overwriting "x_OCRFixReplaceList.xml")
Work on #3431
2019-02-28 19:43:29 +01:00
Nikolaj Olsson
2e8a12803e Refact - thx invandrofly :)
fix #3350
2019-02-13 18:35:55 +01:00
Nikolaj Olsson
ea1b28a06e Refactor (minor) 2019-02-02 12:42:30 +01:00
Nikolaj Olsson
ce37dc1d75 Refactor minor stuff 2019-01-29 21:33:20 +01:00
Nikolaj Olsson
5bcfd6191c Refactor - fix minor issues from codacy 2019-01-21 09:53:15 +01:00
Nikolaj Olsson
0548443c8c Refactor - fix minor issues from codacy 2019-01-20 18:14:52 +01:00
Nikolaj Olsson
21c6910aef Refactor - add braces for libse 2019-01-19 14:40:37 +01:00
Nikolaj Olsson
f23f4e742c Refact - add more braces 2019-01-13 01:08:28 +01:00
Nikolaj Olsson
6b358478ae Refactor NameList
Fix #2767
2018-03-20 08:36:50 +01:00
Nikolaj Olsson
01bcd63181 Fix issue with Greek "V" in replace list 2018-03-13 18:55:30 +01:00
Nikolaj Olsson
5cc8f2ae13 Fix some issues found by PVS-Studio
work on #2810
2018-03-06 23:33:24 +01:00
Nikolaj Olsson
7d74f00193 Work on "Remove text for HI" 2018-02-03 22:35:44 +01:00
Nikolaj Olsson
ed514669f5 Improve OCR fix engine a little bit
Work on #2694
2018-01-02 17:41:49 +01:00
Nikolaj Olsson
3fe6337160 Optimize two string operations 2017-12-10 22:18:13 +01:00
Ivandro Ismael
9950d9499b RegexUtils] - move regex methods to regexutils. 2017-12-03 16:23:08 +00:00
Nikolaj Olsson
c02f3287ab Minor refact 2017-08-07 17:42:48 +02:00
Nikolaj Olsson
5886609fc9 Add to names language list is now culture neutral (accordingly to changes in SE 3.5.3) 2017-05-26 22:04:13 +02:00
Ivandro Ismael
550c8819c7 [namelist] - update part 4. 2017-05-07 15:26:23 +01:00
Ivandro Ismael
401450e0fd [NameList] - change NameEtc => names 2017-05-03 23:04:50 +01:00
Nikolaj Olsson
7f4439f013 Optimize reading of names lists
About 45% faster (now uses "XmlReader", results might depend on disk speed too)
2017-04-16 17:37:50 +02:00
Ivandro Ismael
fd10a31b5b [NamesList] - Minor fix for IsInNamesMultiWordList(). 2017-04-11 03:45:39 +01:00
Ivandro Ismael
91fa8d59c5 [NamesList] - Remove postfix Etc in method name. 2017-04-11 03:42:26 +01:00
Ivandro Ismael
db9b5fd61f [Name-List] - Update names_etc.xml (#2278)
* [Name-List] - Update names_etc.xml

* [Dictionary] - Update part 2.

* [Namelist] - <ignore_list> => <names> and new <blacklist> added.

* [Namelist] - Make name list culture insensitive.

* [Namelist] - Update installer.

* Fix broken codes.
2017-04-10 19:23:24 +02:00
Ivandro Ismael
3397886a46 [Configuration] - Cache directory
Suffix everyting out with 'Directory'
2016-10-31 01:50:15 +00:00
aaaxx
74c5c0a29e Updated OcrFixReplaceList.cs
Added other common Latin ligatures present in Unicode.

Also added the acute accent, which I've often seen used instead of the
apostrophe, either as an OCR error or because people mistake it for the
curly apostrophe.

Closes #1961
2016-09-25 11:15:04 +02:00
aaaxx
a02a82a599 minor comment edit in OcrFixReplaceList.cs 2016-09-19 03:00:26 +02:00
Ivandro Ismael
e2ea2c87e2
[OcrReplaceList] - Remove fruitless replaces. 2016-09-07 04:29:32 +01:00
Nikolaj Olsson
1342b062fb Merge pull request #1801 from ivandrofly/patch-ocr
[OCR] - Fix from prev PR.
2016-06-18 23:47:09 +02:00
Nikolaj Olsson
7b5c874f4d Minor fix for last commit (fix #1800) 2016-06-18 17:08:27 +02:00