Nikolaj Olsson
c78dda9571
Improve OCR replace list guessses
2020-06-14 20:23:35 +02:00
Nikolaj Olsson
4be1867bf0
Fix for ocr auto-guesses in last letter
2020-06-14 17:15:37 +02:00
Nikolaj Olsson
22eb7df74e
Work on OCR fix engine
...
split words before j/y for guesses + update dictionaries
2020-06-14 17:01:44 +02:00
Nikolaj Olsson
83fd957887
Work on OCR + work on #4195
2020-05-23 21:22:05 +02:00
Nikolaj Olsson
32b8d875dc
Fix bug in OCR auto guesses +fix new italic space detect for nOcr
2020-05-18 19:02:34 +02:00
Nikolaj Olsson
7b98e2e2f2
Fix switch of end chars in OCR-fix - thx moob :)
...
Fix #4119
2020-04-17 08:33:13 +02:00
Nikolaj Olsson
9e96ad4434
Minor OCR fixes
2020-04-16 09:42:27 +02:00
Nikolaj Olsson
afef6d0623
Minor refactor
2020-04-15 13:03:48 +02:00
Nikolaj Olsson
b2e8d3bb97
Remove some hardcoded OCR rules + add some softcoded rules
2020-04-12 09:05:39 +02:00
Nikolaj Olsson
fd1a288424
Fix issue with spell check replace - thx LithiumFlower :)
...
(also allow ".mts" ext for m2ts like files - thx borifax)
2020-03-15 08:46:15 +01:00
Waldi Ravens
5828cec1d1
[NameList] Add processing of user Names list
2020-01-23 03:42:21 +01:00
Waldi Ravens
5bff5c7a41
[OcrFixReplaceList] Fix XML tag name "PartialLinesAlways"
...
and replace inconsistent field name.
2020-01-21 00:27:02 +01:00
Nikolaj Olsson
b9af4c6c30
Fix issues with user-replace-list - thx shag00 :)
...
Fix #3904
2020-01-11 09:31:05 +01:00
Waldi Ravens
46782409f6
[OcrFixReplaceList] Fix initialisation from User list
2019-11-25 20:28:31 +01:00
Nikolaj Olsson
f1f8a4df8f
Yet another fix - thx xylographe :)
2019-11-25 19:54:58 +01:00
Nikolaj Olsson
b2b20dc7e2
Fix xml tags
2019-11-25 19:40:27 +01:00
Nikolaj Olsson
626d290d1f
Fix a few more issues with OcrFixReplaceList - thx xylographe :)
2019-11-25 19:29:51 +01:00
Nikolaj Olsson
fbd659fc09
Fix BeginLines/EndLines/WholeWords/RegEx for OcrFixReplaceLst_User files - thx Maitch :)
...
Fix #3850
2019-11-25 18:28:15 +01:00
niksedk
074ec4565a
Improve "fix ocvr errors" regarding uppercase words + periods - thx Araynilmar :)
...
Fix #3733
2019-09-01 10:39:23 +02:00
Waldi Ravens
66e36a8321
Formatting (whitespace only)
2019-08-16 22:55:32 +02:00
niksedk
92a427c4d5
Fix reading name list online - thx splerman :)
...
Fix #3693
2019-08-14 17:00:29 +02:00
Nikolaj Olsson
28c4deee01
Fix "Change all" for whole text in OCR - thx wtester7 :)
...
(user file was overwriting "x_OCRFixReplaceList.xml")
Work on #3431
2019-02-28 19:43:29 +01:00
Nikolaj Olsson
2e8a12803e
Refact - thx invandrofly :)
...
fix #3350
2019-02-13 18:35:55 +01:00
Nikolaj Olsson
ea1b28a06e
Refactor (minor)
2019-02-02 12:42:30 +01:00
Nikolaj Olsson
ce37dc1d75
Refactor minor stuff
2019-01-29 21:33:20 +01:00
Nikolaj Olsson
5bcfd6191c
Refactor - fix minor issues from codacy
2019-01-21 09:53:15 +01:00
Nikolaj Olsson
0548443c8c
Refactor - fix minor issues from codacy
2019-01-20 18:14:52 +01:00
Nikolaj Olsson
21c6910aef
Refactor - add braces for libse
2019-01-19 14:40:37 +01:00
Nikolaj Olsson
f23f4e742c
Refact - add more braces
2019-01-13 01:08:28 +01:00
Nikolaj Olsson
6b358478ae
Refactor NameList
...
Fix #2767
2018-03-20 08:36:50 +01:00
Nikolaj Olsson
01bcd63181
Fix issue with Greek "V" in replace list
2018-03-13 18:55:30 +01:00
Nikolaj Olsson
5cc8f2ae13
Fix some issues found by PVS-Studio
...
work on #2810
2018-03-06 23:33:24 +01:00
Nikolaj Olsson
7d74f00193
Work on "Remove text for HI"
2018-02-03 22:35:44 +01:00
Nikolaj Olsson
ed514669f5
Improve OCR fix engine a little bit
...
Work on #2694
2018-01-02 17:41:49 +01:00
Nikolaj Olsson
3fe6337160
Optimize two string operations
2017-12-10 22:18:13 +01:00
Ivandro Ismael
9950d9499b
RegexUtils] - move regex methods to regexutils.
2017-12-03 16:23:08 +00:00
Nikolaj Olsson
c02f3287ab
Minor refact
2017-08-07 17:42:48 +02:00
Nikolaj Olsson
5886609fc9
Add to names language list is now culture neutral (accordingly to changes in SE 3.5.3)
2017-05-26 22:04:13 +02:00
Ivandro Ismael
550c8819c7
[namelist] - update part 4.
2017-05-07 15:26:23 +01:00
Ivandro Ismael
401450e0fd
[NameList] - change NameEtc => names
2017-05-03 23:04:50 +01:00
Nikolaj Olsson
7f4439f013
Optimize reading of names lists
...
About 45% faster (now uses "XmlReader", results might depend on disk speed too)
2017-04-16 17:37:50 +02:00
Ivandro Ismael
fd10a31b5b
[NamesList] - Minor fix for IsInNamesMultiWordList().
2017-04-11 03:45:39 +01:00
Ivandro Ismael
91fa8d59c5
[NamesList] - Remove postfix Etc in method name.
2017-04-11 03:42:26 +01:00
Ivandro Ismael
db9b5fd61f
[Name-List] - Update names_etc.xml ( #2278 )
...
* [Name-List] - Update names_etc.xml
* [Dictionary] - Update part 2.
* [Namelist] - <ignore_list> => <names> and new <blacklist> added.
* [Namelist] - Make name list culture insensitive.
* [Namelist] - Update installer.
* Fix broken codes.
2017-04-10 19:23:24 +02:00
Ivandro Ismael
3397886a46
[Configuration] - Cache directory
...
Suffix everyting out with 'Directory'
2016-10-31 01:50:15 +00:00
aaaxx
74c5c0a29e
Updated OcrFixReplaceList.cs
...
Added other common Latin ligatures present in Unicode.
Also added the acute accent, which I've often seen used instead of the
apostrophe, either as an OCR error or because people mistake it for the
curly apostrophe.
Closes #1961
2016-09-25 11:15:04 +02:00
aaaxx
a02a82a599
minor comment edit in OcrFixReplaceList.cs
2016-09-19 03:00:26 +02:00
Ivandro Ismael
e2ea2c87e2
[OcrReplaceList] - Remove fruitless replaces.
2016-09-07 04:29:32 +01:00
Nikolaj Olsson
1342b062fb
Merge pull request #1801 from ivandrofly/patch-ocr
...
[OCR] - Fix from prev PR.
2016-06-18 23:47:09 +02:00
Nikolaj Olsson
7b5c874f4d
Minor fix for last commit (fix #1800 )
2016-06-18 17:08:27 +02:00