Commit Graph

1048 Commits

Author SHA1 Message Date
May Kittens Devour Your Soul
7d2600831f Update hrv_OCRFixReplaceList.xml 2022-01-26 13:34:46 +01:00
May Kittens Devour Your Soul
e0bfd5fe8d Update hrv_OCRFixReplaceList.xml 2022-01-26 12:27:28 +01:00
niksedk
8a084b6a17 Undo name 2022-01-25 14:07:07 +01:00
niksedk
9b4b9760ec Minor fixes 2022-01-25 06:40:51 +01:00
niksedk
86fb003192 Update dictionaries 2022-01-21 19:44:40 +01:00
niksedk
b536d16f4c Add German word split list 2022-01-17 08:01:21 +01:00
niksedk
fe1e11d0db Update names/nocr-db a little 2022-01-14 16:39:10 +01:00
niksedk
9563d6bbff Add a few extra words to the Macedonian word split list 2022-01-13 12:05:12 +01:00
niksedk
bdd160c162 Add two new word split lists (mkd + rus) 2022-01-11 16:43:34 +01:00
niksedk
dd830b7942 New Polish word split list - thx Janusz :) 2022-01-10 14:44:59 +01:00
niksedk
4e6e932401 Work on dictionaries 2022-01-10 11:08:01 +01:00
niksedk
796548f036 Work on dictionaries 2022-01-09 22:18:18 +01:00
niksedk
dd0a8e1a73 Try to improve assa properties - thx Leon :)
Related somewhat to #5684
2022-01-09 10:16:07 +01:00
niksedk
78dbf89011 Improve spell check regarding Yen symbol (¥) - thx Dnkhatri :) 2021-12-27 19:03:43 +01:00
niksedk
5ca26ec918 More work related to word-split-list 2021-12-26 20:10:49 +01:00
niksedk
6da49f0b23 Update change log 2021-12-26 08:41:05 +01:00
niksedk
8d286f6163 Work on dictionaries 2021-12-25 11:15:39 +01:00
niksedk
b85d77f48e Add Polish word split list 2021-12-23 16:36:07 +01:00
niksedk
bed4b50fdb Add more words to the English word-split-list 2021-12-22 21:57:14 +01:00
niksedk
db9cda1082 Update dictionaries 2021-12-22 14:40:24 +01:00
niksedk
6e34450925 Add Spanish split list 2021-12-20 20:39:13 +01:00
niksedk
7fa610fd39 Add French word split list 2021-12-20 20:28:31 +01:00
niksedk
58b75cf09c Add more words to split list 2021-12-20 18:37:10 +01:00
niksedk
514b1f509e More work on split list 2021-12-20 16:05:31 +01:00
niksedk
6b74c201b6 Minor improvements for the new word split
See https://github.com/SubtitleEdit/subtitleedit/discussions/5616
2021-12-20 12:12:14 +01:00
niksedk
6e93a8248f Change how names list works with split word list - thx Dnkhatri :) 2021-12-20 09:44:39 +01:00
niksedk
7ace645355 Improve Italian ocr replace list a little - thx tormento :)
See https://forum.doom9.org/showthread.php?p=1958951#post1958951
2021-12-19 13:05:52 +01:00
niksedk
5d6d2efacd Update change log + word lists 2021-12-18 19:28:12 +01:00
niksedk
fdafbaeff8 Improve words-without-space-split 2021-12-18 17:31:56 +01:00
niksedk
ac395f9b5d Improve words-without-space-split 2021-12-18 15:43:05 +01:00
niksedk
91d9f69431 Improve ocr string-split-when-space-is-missing - thx Dnkhatri :)
Related to #5616
2021-12-18 13:49:06 +01:00
niksedk
dd27e5fe3d Improve English OCR rule a little - thx tormento :)
Related to https://forum.doom9.org/showthread.php?p=1958951#post1958951
2021-12-15 16:08:41 +01:00
niksedk
fe26a640c5 Improve rule slightly for t5 2021-12-02 16:59:03 +01:00
niksedk
44e9165666 Fix OCR replace list entry with "|" > "I"
+ a little clean
2021-11-05 21:08:21 +01:00
May Kittens Devour Your Soul
7af23f741d Update hrv_OCRFixReplaceList.xml 2021-08-27 16:45:25 +02:00
May Kittens Devour Your Soul
4366d1399d Update hrv_OCRFixReplaceList.xml 2021-08-27 16:27:25 +02:00
May Kittens Devour Your Soul
d9d4d5d1fa Update hrv_OCRFixReplaceList.xml 2021-07-02 10:56:48 +02:00
Nikolaj Olsson
bd792d4a29 Improve spell check slightly 2021-06-06 09:13:08 +02:00
Nikolaj Olsson
8789beb5f8 Fix #5095 2021-06-04 11:15:54 +02:00
Nikolaj Olsson
fb3c6ca018 Update en_names 2021-01-03 21:03:50 +01:00
Παναγιώτης
4a81e4d951 Update el_NoBreakAfterList.xml 2020-12-28 12:53:39 +02:00
May Kittens Devour Your Soul
09ef534444 Update hrv_OCRFixReplaceList.xml 2020-11-18 10:38:27 +01:00
May Kittens Devour Your Soul
0dd4070aa6 Update hrv_OCRFixReplaceList.xml 2020-11-18 10:37:26 +01:00
Nikolaj Olsson
ceffca3cd8 Add a few user words 2020-11-07 23:59:21 +01:00
Nikolaj Olsson
73c7ce57ee Add name 2020-11-07 07:39:20 +01:00
Nikolaj Olsson
6612d767d3 Update dictionaries 2020-10-10 10:10:40 +02:00
May Kittens Devour Your Soul
cfa8aad5a1 Update hrv_OCRFixReplaceList.xml 2020-10-02 14:29:15 +02:00
Nikolaj Olsson
03b49b1dcb Add Russian no-break-after list - thx Elheym :) 2020-10-01 13:39:38 +02:00
Nikolaj Olsson
49fa63cafc Update Greek no-break-after list - thx Lero91 :)
See https://github.com/SubtitleEdit/subtitleedit/issues/4393#issuecomment-700646141
2020-09-29 14:45:44 +02:00
Nikolaj Olsson
5e2bb456b2 Update ocr dictionaries 2020-08-02 13:57:01 +02:00
Nikolaj Olsson
49a0c3c942 Merge pull request #4257 from diomed/patch-4
Update hrv_OCRFixReplaceList.xml
2020-07-24 16:53:44 +02:00
Nikolaj Olsson
63edf983d4 Add "COVID-19" to names list 2020-07-19 09:05:17 +02:00
May Kittens Devour Your Soul
5445af33c7 Update hrv_OCRFixReplaceList.xml 2020-07-01 13:13:12 +02:00
May Kittens Devour Your Soul
c7df3f2fae Update hrv_OCRFixReplaceList.xml 2020-06-29 10:46:26 +02:00
May Kittens Devour Your Soul
97de2da091 Update hrv_OCRFixReplaceList.xml 2020-06-29 09:32:03 +02:00
Nikolaj Olsson
e0ac8d33a6 Add name 2020-06-20 08:13:58 +02:00
Nikolaj Olsson
64589b90c3 Minor fix for OCR
space after "-" or "'" for nOCR/BIC + update dictionaries
2020-06-17 18:25:31 +02:00
Waldi Ravens
55b9af30f0 dictionaries: automated XML upkeep 2020-06-15 20:57:30 +02:00
Waldi Ravens
51763d2542 dictionaries: Fix Swedish OCRFixReplaceList 2020-06-15 20:55:39 +02:00
Waldi Ravens
92f88a63bc dictionaries: Update Portuguese no-break-after list - thx moob :) 2020-06-14 21:31:13 +02:00
Nikolaj Olsson
c78dda9571 Improve OCR replace list guessses 2020-06-14 20:23:35 +02:00
Waldi Ravens
a1c35e349e dictionaries: automated XML upkeep 2020-06-14 19:35:41 +02:00
Waldi Ravens
c705d7f4f1 dictionaries: Update Greek no-break-after list - thx Lero91 :) 2020-06-14 19:01:35 +02:00
Nikolaj Olsson
22eb7df74e Work on OCR fix engine
split words before j/y for guesses + update dictionaries
2020-06-14 17:01:44 +02:00
Nikolaj Olsson
d52a9994ad Work on OCR 2020-06-12 19:12:38 +02:00
Nikolaj Olsson
94754fc3de Work on OCR 2020-06-12 07:48:32 +02:00
May Kittens Devour Your Soul
4c64f8a3a9 Update hrv_OCRFixReplaceList.xml 2020-06-08 10:57:05 +02:00
Nikolaj Olsson
12b30549e0 Update dictionaries 2020-06-05 14:21:12 +02:00
xylographe
0d8140d728 Merge pull request #4200 from diomed/patch-2
Update hrv_OCRFixReplaceList.xml
2020-05-26 16:58:08 +02:00
Waldi Ravens
233b1eece3 Normalize EOLs in Git repository 2020-05-26 13:50:11 +02:00
May Kittens Devour Your Soul
6e60b7edee Update hrv_OCRFixReplaceList.xml 2020-05-26 13:15:34 +02:00
Nikolaj Olsson
83fd957887 Work on OCR + work on #4195 2020-05-23 21:22:05 +02:00
Waldi Ravens
80fce956b9 dictionaries: Fix pol_OCRFixReplaceList.xml syntax 2020-05-22 11:03:41 +02:00
Nikolaj Olsson
dc4a52af1c Update change log 2020-05-21 15:51:26 +02:00
Nikolaj Olsson
dfad7c2e5e Add Polish OCR fix replace list - thx Janusz :) 2020-05-21 15:51:00 +02:00
Nikolaj Olsson
d4e42042b1 Update dictionaries (minor) 2020-05-20 21:08:10 +02:00
Nikolaj Olsson
7bf3c1a2db Update dictionaries (minor) 2020-05-20 14:32:07 +02:00
Nikolaj Olsson
001e361d7c Add language context menu to edit bic db + update OCR dictionaries 2020-05-18 15:03:26 +02:00
Nikolaj Olsson
a4310aec3d Work on OCR/italic 2020-05-17 23:06:01 +02:00
Nikolaj Olsson
44e686593a Improve italic detection for "Binary image compare" OCR - thx tormento :)
+ a few related improvements
See doom9 posts around http://forum.doom9.net/showthread.php?p=1910580#post1910580
2020-05-17 09:29:12 +02:00
Nikolaj Olsson
38a75d048d Minor OCR stuff 2020-05-16 12:52:00 +02:00
Omar Si
bf8b3678e1 Update ar_NoBreakAfterList.xml
Closes #4178
2020-05-11 15:22:34 +02:00
Waldi Ravens
6c38fff7ef Update ar_NoBreakAfterList.xml - thx OmrSi :) 2020-05-10 21:56:40 +02:00
Waldi Ravens
16867fc909 Update ar_NoBreakAfterList.xml - thx OmrSi :) 2020-05-10 21:04:36 +02:00
May Kittens Devour Your Soul
c1f977671d Update hrv_OCRFixReplaceList.xml
Closes #4176
2020-05-10 12:52:40 +02:00
Waldi Ravens
08f3674751 dictionaries: automated XML upkeep 2020-05-09 22:04:35 +02:00
Nikolaj Olsson
307c57b57a Add Greek no-break-after list - thx Lero91 :) 2020-05-09 20:10:19 +02:00
May Kittens Devour Your Soul
cc0a99c2a5 Update hrv_OCRFixReplaceList.xml 2020-05-08 09:49:19 +02:00
Nikolaj Olsson
1775b751f3 Remove wrontly committed backup file - thx xylographe :)
See comment 77f98581ff (commitcomment-39016183)
2020-05-07 21:00:31 +02:00
Nikolaj Olsson
77f98581ff Improve ocr dictionaries slightly 2020-05-07 07:57:02 +02:00
Nikolaj Olsson
138a313f6c Update Bulgarian "no-break-after-list" - thx Eva :) 2020-05-04 13:42:36 +02:00
Nikolaj Olsson
3beb5c53f4 Add/update OCR dictionaries 2020-05-03 19:40:24 +02:00
May Kittens Devour Your Soul
809fb420d6 Update hrv_OCRFixReplaceList.xml
Closes #4162
2020-05-03 12:48:52 +02:00
Nikolaj Olsson
4864b4933f Add Bulgarian no-break-after-list 2020-05-02 20:09:35 +02:00
May Kittens Devour Your Soul
9eb3b1fedc Update hrv_OCRFixReplaceList.xml 2020-04-28 10:48:41 +02:00
nikolaj.olsson
aa7f24b094 Remove "+" from regex - thx ivandrofly :) 2020-04-25 07:55:19 +02:00
Nikolaj Olsson
eb21f3af76 Update English OCR fix list (minor) 2020-04-24 12:21:48 +02:00
Nikolaj Olsson
a3e42a4026 Minor fixes for OCR
Handle "1" as "I" i some situations + don't count "I" and "a" as wrong letters in English
2020-04-24 12:05:53 +02:00
Nikolaj Olsson
11820b2273 Add word to English OCR fix replace list 2020-04-23 19:55:13 +02:00
Nikolaj Olsson
215ab7a165 Update English OCR replace list 2020-04-23 09:52:58 +02:00