Commit Graph

150 Commits

Author SHA1 Message Date
Nikolaj Olsson
c6deddd55a Fix some wrong OCR (English) corrections - thx Zoltan :) 2024-09-28 06:57:14 +02:00
niksedk
8766af6581 Fix en ocr fix for "He is Lt. Fleming." 2022-12-18 09:48:16 +01:00
niksedk
0a6cb6ce79 Do not change L to I in "l-l-let's" 2022-12-18 08:46:39 +01:00
niksedk
344ebde9fe A little OCR dictionary update 2022-12-10 18:14:44 +01:00
niksedk
f7704137f6 Fix #6315 2022-10-08 10:19:14 +02:00
niksedk
7f641fbccf A few fixes for eng_OCRFixReplaceList.xml - thx Ding-adong :)
Working on #6315
2022-10-08 09:44:26 +02:00
niksedk
f2ff61ec9a Improve OCR for English ordinals - thx RedSoxFan04 :)
Fix #6304
2022-10-04 21:43:06 +02:00
niksedk
6f02791f59 Add a little to ocr dictionaries 2022-04-23 20:42:36 +02:00
niksedk
17e60433eb Remove wrong ocr replace rule - thx Omair :) 2022-03-21 20:46:12 +01:00
niksedk
6da49f0b23 Update change log 2021-12-26 08:41:05 +01:00
niksedk
8d286f6163 Work on dictionaries 2021-12-25 11:15:39 +01:00
niksedk
6b74c201b6 Minor improvements for the new word split
See https://github.com/SubtitleEdit/subtitleedit/discussions/5616
2021-12-20 12:12:14 +01:00
niksedk
dd27e5fe3d Improve English OCR rule a little - thx tormento :)
Related to https://forum.doom9.org/showthread.php?p=1958951#post1958951
2021-12-15 16:08:41 +01:00
niksedk
fe26a640c5 Improve rule slightly for t5 2021-12-02 16:59:03 +01:00
niksedk
44e9165666 Fix OCR replace list entry with "|" > "I"
+ a little clean
2021-11-05 21:08:21 +01:00
Nikolaj Olsson
64589b90c3 Minor fix for OCR
space after "-" or "'" for nOCR/BIC + update dictionaries
2020-06-17 18:25:31 +02:00
Nikolaj Olsson
c78dda9571 Improve OCR replace list guesses 2020-06-14 20:23:35 +02:00
Nikolaj Olsson
22eb7df74e Work on OCR fix engine
split words before j/y for guesses + update dictionaries
2020-06-14 17:01:44 +02:00
Nikolaj Olsson
12b30549e0 Update dictionaries 2020-06-05 14:21:12 +02:00
Waldi Ravens
233b1eece3 Normalize EOLs in Git repository 2020-05-26 13:50:11 +02:00
Nikolaj Olsson
83fd957887 Work on OCR + work on #4195 2020-05-23 21:22:05 +02:00
Nikolaj Olsson
d4e42042b1 Update dictionaries (minor) 2020-05-20 21:08:10 +02:00
Nikolaj Olsson
7bf3c1a2db Update dictionaries (minor) 2020-05-20 14:32:07 +02:00
Nikolaj Olsson
001e361d7c Add language context menu to edit bic db + update OCR dictionaries 2020-05-18 15:03:26 +02:00
Nikolaj Olsson
a4310aec3d Work on OCR/italic 2020-05-17 23:06:01 +02:00
Nikolaj Olsson
44e686593a Improve italic detection for "Binary image compare" OCR - thx tormento :)
+ a few related improvements
See doom9 posts around http://forum.doom9.net/showthread.php?p=1910580#post1910580
2020-05-17 09:29:12 +02:00
Nikolaj Olsson
38a75d048d Minor OCR stuff 2020-05-16 12:52:00 +02:00
Nikolaj Olsson
77f98581ff Improve ocr dictionaries slightly 2020-05-07 07:57:02 +02:00
Nikolaj Olsson
3beb5c53f4 Add/update OCR dictionaries 2020-05-03 19:40:24 +02:00
nikolaj.olsson
aa7f24b094 Remove "+" from regex - thx ivandrofly :) 2020-04-25 07:55:19 +02:00
Nikolaj Olsson
eb21f3af76 Update English OCR fix list (minor) 2020-04-24 12:21:48 +02:00
Nikolaj Olsson
a3e42a4026 Minor fixes for OCR
Handle "1" as "I" in some situations + don't count "I" and "a" as wrong letters in English
2020-04-24 12:05:53 +02:00
Nikolaj Olsson
11820b2273 Add word to English OCR fix replace list 2020-04-23 19:55:13 +02:00
Nikolaj Olsson
215ab7a165 Update English OCR replace list 2020-04-23 09:52:58 +02:00
Nikolaj Olsson
9e96ad4434 Minor OCR fixes 2020-04-16 09:42:27 +02:00
Nikolaj Olsson
b2e8d3bb97 Remove some hardcoded OCR rules + add some softcoded rules 2020-04-12 09:05:39 +02:00
Nikolaj Olsson
e7201b0fb3 Improve ocr fixes slightly - thx tormento :)
"l..." to "I..."
2020-03-21 16:24:49 +01:00
Nikolaj Olsson
e29e769d31 Remove allowed word - thx GCRaistlin :)
<Word from="backseat" to="back seat" />
2020-03-18 15:18:20 +01:00
Nikolaj Olsson
0e59b29cc3 Update dictionaries 2020-01-19 13:10:00 +01:00
Nikolaj Olsson
b6da4414c5 Use display-friendly language name in "Fix common errors" - thx Zoltan :) 2019-11-16 14:47:09 +01:00
Ivandro Ismael
a5b62d939e Update English OCR xml
Remove consecutive words
2019-09-17 03:12:38 +01:00
Waldi Ravens
0469c7f59f dictionaries: automated XML upkeep 2019-08-10 21:50:57 +02:00
Waldi Ravens
a3629d4816 Update English OCRFixReplaceList 2019-05-10 11:16:51 +02:00
Waldi Ravens
575d52c61c dictionaries: automated XML upkeep 2019-05-10 00:22:06 +02:00
Nikolaj Olsson
3a213ece5d Update dictionaries (minor) 2019-02-28 19:33:27 +01:00
Nikolaj Olsson
ad3a391689 Improve change casing - add stuttering support 2019-02-23 21:20:33 +01:00
Nikolaj Olsson
1fd26db4d0 Remove "gotta" to "got to" from English OCR fix replace list 2019-02-13 05:10:51 +01:00
nikolaj.olsson
85c46c0d22 Spell check English OCR fix replace list - thx Ding-adong :)
Fix #3357
2019-02-12 17:49:39 +01:00
Ivandro Ismael
1e3a13da10 Normalize single quote 2019-02-08 00:52:36 +00:00
Nikolaj Olsson
ee27c440f2 Work on #3343 2019-02-07 19:26:30 +01:00