Commit Graph

138 Commits

Author SHA1 Message Date
niksedk
dd27e5fe3d Improve English OCR rule a little - thx tormento :)
Related to https://forum.doom9.org/showthread.php?p=1958951#post1958951
2021-12-15 16:08:41 +01:00
niksedk
fe26a640c5 Improve rule slightly for t5 2021-12-02 16:59:03 +01:00
niksedk
44e9165666 Fix OCR replace list entry with "|" > "I"
+ a little clean
2021-11-05 21:08:21 +01:00
Nikolaj Olsson
64589b90c3 Minor fix for OCR
space after "-" or "'" for nOCR/BIC + update dictionaries
2020-06-17 18:25:31 +02:00
Nikolaj Olsson
c78dda9571 Improve OCR replace list guesses 2020-06-14 20:23:35 +02:00
Nikolaj Olsson
22eb7df74e Work on OCR fix engine
split words before j/y for guesses + update dictionaries
2020-06-14 17:01:44 +02:00
Nikolaj Olsson
12b30549e0 Update dictionaries 2020-06-05 14:21:12 +02:00
Waldi Ravens
233b1eece3 Normalize EOLs in Git repository 2020-05-26 13:50:11 +02:00
Nikolaj Olsson
83fd957887 Work on OCR + work on #4195 2020-05-23 21:22:05 +02:00
Nikolaj Olsson
d4e42042b1 Update dictionaries (minor) 2020-05-20 21:08:10 +02:00
Nikolaj Olsson
7bf3c1a2db Update dictionaries (minor) 2020-05-20 14:32:07 +02:00
Nikolaj Olsson
001e361d7c Add language context menu to edit bic db + update OCR dictionaries 2020-05-18 15:03:26 +02:00
Nikolaj Olsson
a4310aec3d Work on OCR/italic 2020-05-17 23:06:01 +02:00
Nikolaj Olsson
44e686593a Improve italic detection for "Binary image compare" OCR - thx tormento :)
+ a few related improvements
See doom9 posts around http://forum.doom9.net/showthread.php?p=1910580#post1910580
2020-05-17 09:29:12 +02:00
Nikolaj Olsson
38a75d048d Minor OCR stuff 2020-05-16 12:52:00 +02:00
Nikolaj Olsson
77f98581ff Improve ocr dictionaries slightly 2020-05-07 07:57:02 +02:00
Nikolaj Olsson
3beb5c53f4 Add/update OCR dictionaries 2020-05-03 19:40:24 +02:00
nikolaj.olsson
aa7f24b094 Remove "+" from regex - thx ivandrofly :) 2020-04-25 07:55:19 +02:00
Nikolaj Olsson
eb21f3af76 Update English OCR fix list (minor) 2020-04-24 12:21:48 +02:00
Nikolaj Olsson
a3e42a4026 Minor fixes for OCR
Handle "1" as "I" in some situations + don't count "I" and "a" as wrong letters in English
2020-04-24 12:05:53 +02:00
Nikolaj Olsson
11820b2273 Add word to English OCR fix replace list 2020-04-23 19:55:13 +02:00
Nikolaj Olsson
215ab7a165 Update English OCR replace list 2020-04-23 09:52:58 +02:00
Nikolaj Olsson
9e96ad4434 Minor OCR fixes 2020-04-16 09:42:27 +02:00
Nikolaj Olsson
b2e8d3bb97 Remove some hardcoded OCR rules + add some softcoded rules 2020-04-12 09:05:39 +02:00
Nikolaj Olsson
e7201b0fb3 Improve ocr fixes slightly - thx tormento :)
"l..." to "I..."
2020-03-21 16:24:49 +01:00
Nikolaj Olsson
e29e769d31 Remove allowed word - thx GCRaistlin
<Word from="backseat" to="back seat" />
2020-03-18 15:18:20 +01:00
Nikolaj Olsson
0e59b29cc3 Update dictionaries 2020-01-19 13:10:00 +01:00
Nikolaj Olsson
b6da4414c5 Use display-friendly language name in "Fix common errors" - thx Zoltan :) 2019-11-16 14:47:09 +01:00
Ivandro Ismael
a5b62d939e Update English OCR xml
Remove consecutive words
2019-09-17 03:12:38 +01:00
Waldi Ravens
0469c7f59f dictionaries: automated XML upkeep 2019-08-10 21:50:57 +02:00
Waldi Ravens
a3629d4816 Update English OCRFixReplaceList 2019-05-10 11:16:51 +02:00
Waldi Ravens
575d52c61c dictionaries: automated XML upkeep 2019-05-10 00:22:06 +02:00
Nikolaj Olsson
3a213ece5d Update dictionaries (minor) 2019-02-28 19:33:27 +01:00
Nikolaj Olsson
ad3a391689 Improve change casing - add stuttering support 2019-02-23 21:20:33 +01:00
Nikolaj Olsson
1fd26db4d0 Remove "gotta" to "got to" from English OCR fix replace list 2019-02-13 05:10:51 +01:00
nikolaj.olsson
85c46c0d22 Spell check English OCR fix replace list - thx Ding-adong :)
Fix #3357
2019-02-12 17:49:39 +01:00
Ivandro Ismael
1e3a13da10 Normalize single quote 2019-02-08 00:52:36 +00:00
Nikolaj Olsson
ee27c440f2 Work on #3343 2019-02-07 19:26:30 +01:00
Nikolaj Olsson
14af6cba65 Add many correction to eng_OCRFixReplaceList.xml - thx Ding-adong :)
Fix #3339
2019-02-06 18:41:35 +01:00
Ivandro Ismael
e3bf46ab98 update names 2019-02-03 14:29:48 +00:00
Ivandro Ismael
274dbd5205 update #2 2019-02-02 04:38:03 +00:00
nikolaj.olsson
8a6dbb3994 Some fixes for eng_OCRFixReplaceList.xml - thx Ding-adong :)
Fix #3319
2019-01-30 12:03:47 +01:00
Nikolaj Olsson
b81d97363b Add two words to en ocr fix list 2019-01-17 20:07:27 +01:00
May Kittens Devour Your Soul
1a36e01e68 Update eng_OCRFixReplaceList.xml 2019-01-14 15:43:15 +01:00
Nikolaj Olsson
6a6f51e052 Work on #3289 2019-01-12 01:11:11 +01:00
Nikolaj Olsson
98c189a20f Improve eng_OCRFixReplaceList - thx Ding-adong :)
Fix #3289
2019-01-12 00:22:43 +01:00
Nikolaj Olsson
64caff97fe Fix a few minor issues in "Fix common errors" - thx darnn :)
Fix #3244
2019-01-04 14:20:38 +01:00
Nikolaj Olsson
047a8cdfc3 Improve eng_OCRFixReplaceList.xml - thx Ding-adong :)
Work on #3269
2019-01-02 17:02:25 +01:00
nikolaj.olsson
de2667da87 Improve OCR of comma / quote - thx Tuukka :) 2018-12-29 07:03:02 +01:00
Nikolaj Olsson
8ae0a6e89d Work on OCR 2018-11-24 12:04:44 +01:00