Commit Graph

1025 Commits

Author SHA1 Message Date
niksedk
514b1f509e More work on split list 2021-12-20 16:05:31 +01:00
niksedk
6b74c201b6 Minor improvements for the new word split
See https://github.com/SubtitleEdit/subtitleedit/discussions/5616
2021-12-20 12:12:14 +01:00
niksedk
6e93a8248f Change how names list works with split word list - thx Dnkhatri :) 2021-12-20 09:44:39 +01:00
niksedk
7ace645355 Improve Italian ocr replace list a little - thx tormento :)
See https://forum.doom9.org/showthread.php?p=1958951#post1958951
2021-12-19 13:05:52 +01:00
niksedk
5d6d2efacd Update change log + word lists 2021-12-18 19:28:12 +01:00
niksedk
fdafbaeff8 Improve words-without-space-split 2021-12-18 17:31:56 +01:00
niksedk
ac395f9b5d Improve words-without-space-split 2021-12-18 15:43:05 +01:00
niksedk
91d9f69431 Improve ocr string-split-when-space-is-missing - thx Dnkhatri :)
Related to #5616
2021-12-18 13:49:06 +01:00
niksedk
dd27e5fe3d Improve English OCR rule a little - thx tormento :)
Related to https://forum.doom9.org/showthread.php?p=1958951#post1958951
2021-12-15 16:08:41 +01:00
niksedk
fe26a640c5 Improve rule slightly for t5 2021-12-02 16:59:03 +01:00
niksedk
44e9165666 Fix OCR replace list entry with "|" > "I"
+ a little clean
2021-11-05 21:08:21 +01:00
May Kittens Devour Your Soul
7af23f741d
Update hrv_OCRFixReplaceList.xml 2021-08-27 16:45:25 +02:00
May Kittens Devour Your Soul
4366d1399d
Update hrv_OCRFixReplaceList.xml 2021-08-27 16:27:25 +02:00
May Kittens Devour Your Soul
d9d4d5d1fa
Update hrv_OCRFixReplaceList.xml 2021-07-02 10:56:48 +02:00
Nikolaj Olsson
bd792d4a29 Improve spell check slightly 2021-06-06 09:13:08 +02:00
Nikolaj Olsson
8789beb5f8 Fix #5095 2021-06-04 11:15:54 +02:00
Nikolaj Olsson
fb3c6ca018 Update en_names 2021-01-03 21:03:50 +01:00
Παναγιώτης
4a81e4d951
Update el_NoBreakAfterList.xml 2020-12-28 12:53:39 +02:00
May Kittens Devour Your Soul
09ef534444
Update hrv_OCRFixReplaceList.xml 2020-11-18 10:38:27 +01:00
May Kittens Devour Your Soul
0dd4070aa6
Update hrv_OCRFixReplaceList.xml 2020-11-18 10:37:26 +01:00
Nikolaj Olsson
ceffca3cd8 Add a few user words 2020-11-07 23:59:21 +01:00
Nikolaj Olsson
73c7ce57ee Add name 2020-11-07 07:39:20 +01:00
Nikolaj Olsson
6612d767d3 Update dictionaries 2020-10-10 10:10:40 +02:00
May Kittens Devour Your Soul
cfa8aad5a1
Update hrv_OCRFixReplaceList.xml 2020-10-02 14:29:15 +02:00
Nikolaj Olsson
03b49b1dcb Add Russian no-break-after list - thx Elheym :) 2020-10-01 13:39:38 +02:00
Nikolaj Olsson
49fa63cafc Update Greek no-break-after list - thx Lero91 :)
See https://github.com/SubtitleEdit/subtitleedit/issues/4393#issuecomment-700646141
2020-09-29 14:45:44 +02:00
Nikolaj Olsson
5e2bb456b2 Update ocr dictionaries 2020-08-02 13:57:01 +02:00
Nikolaj Olsson
49a0c3c942
Merge pull request #4257 from diomed/patch-4
Update hrv_OCRFixReplaceList.xml
2020-07-24 16:53:44 +02:00
Nikolaj Olsson
63edf983d4 Add "COVID-19" to names list 2020-07-19 09:05:17 +02:00
May Kittens Devour Your Soul
5445af33c7
Update hrv_OCRFixReplaceList.xml 2020-07-01 13:13:12 +02:00
May Kittens Devour Your Soul
c7df3f2fae
Update hrv_OCRFixReplaceList.xml 2020-06-29 10:46:26 +02:00
May Kittens Devour Your Soul
97de2da091
Update hrv_OCRFixReplaceList.xml 2020-06-29 09:32:03 +02:00
Nikolaj Olsson
e0ac8d33a6 Add name 2020-06-20 08:13:58 +02:00
Nikolaj Olsson
64589b90c3 Minor fix for OCR
space after "-" or "'" for nOCR/BIC + update dictionaries
2020-06-17 18:25:31 +02:00
Waldi Ravens
55b9af30f0 dictionaries: automated XML upkeep 2020-06-15 20:57:30 +02:00
Waldi Ravens
51763d2542 dictionaries: Fix Swedish OCRFixReplaceList 2020-06-15 20:55:39 +02:00
Waldi Ravens
92f88a63bc dictionaries: Update Portuguese no-break-after list - thx moob :) 2020-06-14 21:31:13 +02:00
Nikolaj Olsson
c78dda9571 Improve OCR replace list guessses 2020-06-14 20:23:35 +02:00
Waldi Ravens
a1c35e349e dictionaries: automated XML upkeep 2020-06-14 19:35:41 +02:00
Waldi Ravens
c705d7f4f1 dictionaries: Update Greek no-break-after list - thx Lero91 :) 2020-06-14 19:01:35 +02:00
Nikolaj Olsson
22eb7df74e Work on OCR fix engine
split words before j/y for guesses + update dictionaries
2020-06-14 17:01:44 +02:00
Nikolaj Olsson
d52a9994ad Work on OCR 2020-06-12 19:12:38 +02:00
Nikolaj Olsson
94754fc3de Work on OCR 2020-06-12 07:48:32 +02:00
May Kittens Devour Your Soul
4c64f8a3a9
Update hrv_OCRFixReplaceList.xml 2020-06-08 10:57:05 +02:00
Nikolaj Olsson
12b30549e0 Update dictionaries 2020-06-05 14:21:12 +02:00
xylographe
0d8140d728
Merge pull request #4200 from diomed/patch-2
Update hrv_OCRFixReplaceList.xml
2020-05-26 16:58:08 +02:00
Waldi Ravens
233b1eece3 Normalize EOLs in Git repository 2020-05-26 13:50:11 +02:00
May Kittens Devour Your Soul
6e60b7edee
Update hrv_OCRFixReplaceList.xml 2020-05-26 13:15:34 +02:00
Nikolaj Olsson
83fd957887 Work on OCR + work on #4195 2020-05-23 21:22:05 +02:00
Waldi Ravens
80fce956b9 dictionaries: Fix pol_OCRFixReplaceList.xml syntax 2020-05-22 11:03:41 +02:00
Nikolaj Olsson
dc4a52af1c Update change log 2020-05-21 15:51:26 +02:00
Nikolaj Olsson
dfad7c2e5e Add Polish OCR fix replace list - thx Janusz :) 2020-05-21 15:51:00 +02:00
Nikolaj Olsson
d4e42042b1 Update dictionaries (minor) 2020-05-20 21:08:10 +02:00
Nikolaj Olsson
7bf3c1a2db Update dictionaries (minor) 2020-05-20 14:32:07 +02:00
Nikolaj Olsson
001e361d7c Add language context menu to edit bic db + update OCR dictionaries 2020-05-18 15:03:26 +02:00
Nikolaj Olsson
a4310aec3d Work on OCR/italic 2020-05-17 23:06:01 +02:00
Nikolaj Olsson
44e686593a Improve italic detection for "Binary image compare" OCR - thx tormento :)
+ a few related improvements
See doom9 posts around http://forum.doom9.net/showthread.php?p=1910580#post1910580
2020-05-17 09:29:12 +02:00
Nikolaj Olsson
38a75d048d Minor OCR stuff 2020-05-16 12:52:00 +02:00
Omar Si
bf8b3678e1 Update ar_NoBreakAfterList.xml
Closes #4178
2020-05-11 15:22:34 +02:00
Waldi Ravens
6c38fff7ef Update ar_NoBreakAfterList.xml - thx OmrSi :) 2020-05-10 21:56:40 +02:00
Waldi Ravens
16867fc909 Update ar_NoBreakAfterList.xml - thx OmrSi :) 2020-05-10 21:04:36 +02:00
May Kittens Devour Your Soul
c1f977671d Update hrv_OCRFixReplaceList.xml
Closes #4176
2020-05-10 12:52:40 +02:00
Waldi Ravens
08f3674751 dictionaries: automated XML upkeep 2020-05-09 22:04:35 +02:00
Nikolaj Olsson
307c57b57a Add Greek no-break-after list - thx Lero91 :) 2020-05-09 20:10:19 +02:00
May Kittens Devour Your Soul
cc0a99c2a5
Update hrv_OCRFixReplaceList.xml 2020-05-08 09:49:19 +02:00
Nikolaj Olsson
1775b751f3 Remove wrontly committed backup file - thx xylographe :)
See comment 77f98581ff (commitcomment-39016183)
2020-05-07 21:00:31 +02:00
Nikolaj Olsson
77f98581ff Improve ocr dictionaries slightly 2020-05-07 07:57:02 +02:00
Nikolaj Olsson
138a313f6c Update Bulgarian "no-break-after-list" - thx Eva :) 2020-05-04 13:42:36 +02:00
Nikolaj Olsson
3beb5c53f4 Add/update OCR dictionaries 2020-05-03 19:40:24 +02:00
May Kittens Devour Your Soul
809fb420d6 Update hrv_OCRFixReplaceList.xml
Closes #4162
2020-05-03 12:48:52 +02:00
Nikolaj Olsson
4864b4933f Add Bulgarian no-break-after-list 2020-05-02 20:09:35 +02:00
May Kittens Devour Your Soul
9eb3b1fedc
Update hrv_OCRFixReplaceList.xml 2020-04-28 10:48:41 +02:00
nikolaj.olsson
aa7f24b094 Remove "+" from regex - thx ivandrofly :) 2020-04-25 07:55:19 +02:00
Nikolaj Olsson
eb21f3af76 Update English OCR fix list (minor) 2020-04-24 12:21:48 +02:00
Nikolaj Olsson
a3e42a4026 Minor fixes for OCR
Handle "1" as "I" i some situations + don't count "I" and "a" as wrong letters in English
2020-04-24 12:05:53 +02:00
Nikolaj Olsson
11820b2273 Add word to English OCR fix replace list 2020-04-23 19:55:13 +02:00
Nikolaj Olsson
215ab7a165 Update English OCR replace list 2020-04-23 09:52:58 +02:00
May Kittens Devour Your Soul
5bc924d73f Update hrv_OCRFixReplaceList.xml
Closes #4136
2020-04-21 21:03:12 +02:00
Waldi Ravens
c92ec03788 Update ar_NoBreakAfterList.xml - thx OmrSi :) 2020-04-17 23:54:35 +02:00
Waldi Ravens
e21ab99f74 Add Arabic no-break-after list - thx OmrSi :) 2020-04-17 21:01:35 +02:00
Nikolaj Olsson
9e96ad4434 Minor OCR fixes 2020-04-16 09:42:27 +02:00
May Kittens Devour Your Soul
2a66d70005
Update hrv_OCRFixReplaceList.xml 2020-04-12 19:50:04 +02:00
Nikolaj Olsson
b2e8d3bb97 Remove some hardcoded OCR rules + add some softcoded rules 2020-04-12 09:05:39 +02:00
Nikolaj Olsson
7d835c1496 Add two names 2020-04-12 07:04:33 +02:00
May Kittens Devour Your Soul
8b409e7d38 Update hrv_OCRFixReplaceList.xml
Closes #4093
2020-04-07 15:28:36 +02:00
May Kittens Devour Your Soul
e04cf61a38 Update hrv_OCRFixReplaceList.xml
Closes #4064
2020-03-30 15:13:12 +02:00
Nikolaj Olsson
08c3cd62ca Minor OCR update 2020-03-28 19:18:26 +01:00
May Kittens Devour Your Soul
5f94882710
Update hrv_OCRFixReplaceList.xml 2020-03-22 18:52:27 +01:00
Nikolaj Olsson
e7201b0fb3 Improve ocr fixes slightly - thx tormento :)
"l..." to "I..."
2020-03-21 16:24:49 +01:00
Nikolaj Olsson
37e57e35e5 Add "Sunday" to English names - thx Raistlin :) 2020-03-21 09:43:48 +01:00
May Kittens Devour Your Soul
639f9287be Update hrv_OCRFixReplaceList.xml
Closes #4050
2020-03-19 20:02:34 +01:00
Nikolaj Olsson
e29e769d31 Remove allowed word - thx GCRaistlin
<Word from="backseat" to="back seat" />
2020-03-18 15:18:20 +01:00
May Kittens Devour Your Soul
611ac74900
Update hrb_OCRFixReplaceList.xml 2020-03-15 18:57:21 +01:00
May Kittens Devour Your Soul
ccf7cd09b9 Update hrv_OCRFixReplaceList.xml
Closes #4044
2020-03-14 22:20:58 +01:00
Nikolaj Olsson
a3d2b775e2 Add a few words to English user word list 2020-03-12 14:37:23 +01:00
May Kittens Devour Your Soul
9beab186a2
Update hrv_OCRFixReplaceList.xml 2020-03-10 14:28:04 +01:00
May Kittens Devour Your Soul
51781e673f
Update hrv_OCRFixReplaceList.xml 2020-03-03 22:02:04 +01:00
May Kittens Devour Your Soul
c1cbfa3f5a Update hrv_OCRFixReplaceList.xml
Closes #4010
2020-02-26 10:40:38 +01:00
Nikolaj Olsson
0239c7876a Update dictionaries 2020-02-25 11:57:00 +01:00
May Kittens Devour Your Soul
4b17abefad Update hrv_OCRFixReplaceList.xml
Closes #4001
2020-02-19 08:30:41 +01:00