Nikolaj Olsson
|
64589b90c3
|
Minor fix for OCR
space after "-" or "'" for nOCR/BIC + update dictionaries
|
2020-06-17 18:25:31 +02:00 |
|
Nikolaj Olsson
|
c78dda9571
|
Improve OCR replace list guessses
|
2020-06-14 20:23:35 +02:00 |
|
Nikolaj Olsson
|
22eb7df74e
|
Work on OCR fix engine
split words before j/y for guesses + update dictionaries
|
2020-06-14 17:01:44 +02:00 |
|
Nikolaj Olsson
|
12b30549e0
|
Update dictionaries
|
2020-06-05 14:21:12 +02:00 |
|
Waldi Ravens
|
233b1eece3
|
Normalize EOLs in Git repository
|
2020-05-26 13:50:11 +02:00 |
|
Nikolaj Olsson
|
83fd957887
|
Work on OCR + work on #4195
|
2020-05-23 21:22:05 +02:00 |
|
Nikolaj Olsson
|
d4e42042b1
|
Update dictionaries (minor)
|
2020-05-20 21:08:10 +02:00 |
|
Nikolaj Olsson
|
7bf3c1a2db
|
Update dictionaries (minor)
|
2020-05-20 14:32:07 +02:00 |
|
Nikolaj Olsson
|
001e361d7c
|
Add language context menu to edit bic db + update OCR dictionaries
|
2020-05-18 15:03:26 +02:00 |
|
Nikolaj Olsson
|
a4310aec3d
|
Work on OCR/italic
|
2020-05-17 23:06:01 +02:00 |
|
Nikolaj Olsson
|
44e686593a
|
Improve italic detection for "Binary image compare" OCR - thx tormento :)
+ a few related improvements
See doom9 posts around http://forum.doom9.net/showthread.php?p=1910580#post1910580
|
2020-05-17 09:29:12 +02:00 |
|
Nikolaj Olsson
|
38a75d048d
|
Minor OCR stuff
|
2020-05-16 12:52:00 +02:00 |
|
Nikolaj Olsson
|
77f98581ff
|
Improve ocr dictionaries slightly
|
2020-05-07 07:57:02 +02:00 |
|
Nikolaj Olsson
|
3beb5c53f4
|
Add/update OCR dictionaries
|
2020-05-03 19:40:24 +02:00 |
|
nikolaj.olsson
|
aa7f24b094
|
Remove "+" from regex - thx ivandrofly :)
|
2020-04-25 07:55:19 +02:00 |
|
Nikolaj Olsson
|
eb21f3af76
|
Update English OCR fix list (minor)
|
2020-04-24 12:21:48 +02:00 |
|
Nikolaj Olsson
|
a3e42a4026
|
Minor fixes for OCR
Handle "1" as "I" i some situations + don't count "I" and "a" as wrong letters in English
|
2020-04-24 12:05:53 +02:00 |
|
Nikolaj Olsson
|
11820b2273
|
Add word to English OCR fix replace list
|
2020-04-23 19:55:13 +02:00 |
|
Nikolaj Olsson
|
215ab7a165
|
Update English OCR replace list
|
2020-04-23 09:52:58 +02:00 |
|
Nikolaj Olsson
|
9e96ad4434
|
Minor OCR fixes
|
2020-04-16 09:42:27 +02:00 |
|
Nikolaj Olsson
|
b2e8d3bb97
|
Remove some hardcoded OCR rules + add some softcoded rules
|
2020-04-12 09:05:39 +02:00 |
|
Nikolaj Olsson
|
e7201b0fb3
|
Improve ocr fixes slightly - thx tormento :)
"l..." to "I..."
|
2020-03-21 16:24:49 +01:00 |
|
Nikolaj Olsson
|
e29e769d31
|
Remove allowed word - thx GCRaistlin
<Word from="backseat" to="back seat" />
|
2020-03-18 15:18:20 +01:00 |
|
Nikolaj Olsson
|
0e59b29cc3
|
Update dictionaries
|
2020-01-19 13:10:00 +01:00 |
|
Nikolaj Olsson
|
b6da4414c5
|
Use display-friendly language name in "Fix common errors" - thx Zoltan :)
|
2019-11-16 14:47:09 +01:00 |
|
Ivandro Ismael
|
a5b62d939e
|
Update English OCR xml
Remove consecutive words
|
2019-09-17 03:12:38 +01:00 |
|
Waldi Ravens
|
0469c7f59f
|
dictionaries: automated XML upkeep
|
2019-08-10 21:50:57 +02:00 |
|
Waldi Ravens
|
a3629d4816
|
Update English OCRFixReplaceList
|
2019-05-10 11:16:51 +02:00 |
|
Waldi Ravens
|
575d52c61c
|
dictionaries: automated XML upkeep
|
2019-05-10 00:22:06 +02:00 |
|
Nikolaj Olsson
|
3a213ece5d
|
Update dictionaries (minor)
|
2019-02-28 19:33:27 +01:00 |
|
Nikolaj Olsson
|
ad3a391689
|
Improve change casing - add stuttering support
|
2019-02-23 21:20:33 +01:00 |
|
Nikolaj Olsson
|
1fd26db4d0
|
Remove "gotta" to "got to" from English OCR fix replace list
|
2019-02-13 05:10:51 +01:00 |
|
nikolaj.olsson
|
85c46c0d22
|
Spell check English OCR fix replace list - thx Ding-adong :)
Fix #3357
|
2019-02-12 17:49:39 +01:00 |
|
Ivandro Ismael
|
1e3a13da10
|
Normalize single quote
|
2019-02-08 00:52:36 +00:00 |
|
Nikolaj Olsson
|
ee27c440f2
|
Work on #3343
|
2019-02-07 19:26:30 +01:00 |
|
Nikolaj Olsson
|
14af6cba65
|
Add many correction to eng_OCRFixReplaceList.xml - thx Ding-adong :)
Fix #3339
|
2019-02-06 18:41:35 +01:00 |
|
Ivandro Ismael
|
e3bf46ab98
|
update names
|
2019-02-03 14:29:48 +00:00 |
|
Ivandro Ismael
|
274dbd5205
|
update #2
|
2019-02-02 04:38:03 +00:00 |
|
nikolaj.olsson
|
8a6dbb3994
|
Some fixes for eng_OCRFixReplaceList.xml - thx Ding-adong :)
Fix #3319
|
2019-01-30 12:03:47 +01:00 |
|
Nikolaj Olsson
|
b81d97363b
|
Add two words to en ocr fix list
|
2019-01-17 20:07:27 +01:00 |
|
May Kittens Devour Your Soul
|
1a36e01e68
|
Update eng_OCRFixReplaceList.xml
|
2019-01-14 15:43:15 +01:00 |
|
Nikolaj Olsson
|
6a6f51e052
|
Work on #3289
|
2019-01-12 01:11:11 +01:00 |
|
Nikolaj Olsson
|
98c189a20f
|
Improve eng_OCRFixReplaceList - thx Ding-adong :)
Fix #3289
|
2019-01-12 00:22:43 +01:00 |
|
Nikolaj Olsson
|
64caff97fe
|
Fix a few minor issues in "Fix common errors" - thx darnn :)
Fix #3244
|
2019-01-04 14:20:38 +01:00 |
|
Nikolaj Olsson
|
047a8cdfc3
|
Improve eng_OCRFixReplaceList.xml - thx Ding-adong :)
Work on #3269
|
2019-01-02 17:02:25 +01:00 |
|
nikolaj.olsson
|
de2667da87
|
Improve OCR of comma / quote - thx Tuukka :)
|
2018-12-29 07:03:02 +01:00 |
|
Nikolaj Olsson
|
8ae0a6e89d
|
Work on OCR
|
2018-11-24 12:04:44 +01:00 |
|
Nikolaj Olsson
|
87af4f872c
|
Work on ocr
|
2018-11-22 16:12:12 +01:00 |
|
Nikolaj Olsson
|
d8535f5e05
|
Minor work on ocr
|
2018-11-20 20:14:32 +01:00 |
|
Nikolaj Olsson
|
490a8ff1c2
|
Work on Tesseract4/OCR (make images binary)
|
2018-11-14 22:55:20 +01:00 |
|