Commit Graph

1232 Commits

Author SHA1 Message Date
May Kittens Devour Your Soul
50bc6b9ca8
Update hrv_OCRFixReplaceList.xml 2023-06-29 17:39:48 +02:00
May Kittens Devour Your Soul
4190d37533
Update hrv_OCRFixReplaceList.xml 2023-06-29 17:12:21 +02:00
May Kittens Devour Your Soul
3e91e80f6f
Update srp_OCRFixReplaceList.xml 2023-06-18 21:20:37 +02:00
niksedk
05510ff485 Add Czech to word lists window 2023-06-10 11:25:41 +02:00
May Kittens Devour Your Soul
d03caa4421
Update hrv_OCRFixReplaceList.xml 2023-05-18 16:18:26 +02:00
niksedk
478d1e5032 Add a few words 2023-03-19 11:41:09 +01:00
Μητσάκης Παναγιώτης
ed516c5449
Removed wrong word 2023-01-11 11:53:44 +02:00
niksedk
2c81b8eb5a Update dictionaries 2022-12-24 15:30:03 +01:00
niksedk
5ba89058da Add a few common movie names 2022-12-24 14:46:13 +01:00
niksedk
8766af6581 Fix en ocr fix for "He is Lt. Fleming." 2022-12-18 09:48:16 +01:00
niksedk
0a6cb6ce79 Do not change L to I in "l-l-let's" 2022-12-18 08:46:39 +01:00
niksedk
344ebde9fe A little OCR dictionary update 2022-12-10 18:14:44 +01:00
niksedk
7b783d3e89 update dictionaries 2022-12-09 17:59:06 +01:00
niksedk
12acc21d5b Update dictionaries 2022-12-06 21:11:13 +01:00
niksedk
202f3edd05 Update dictionaries 2022-12-04 17:01:38 +01:00
niksedk
13989785bf Add a few English words 2022-12-03 12:13:13 +01:00
niksedk
b21fb7055b Work on dictionaries 2022-12-01 15:25:54 +01:00
niksedk
2a8fee7806 Update words 2022-11-25 19:36:53 +01:00
niksedk
06279828df update dictionaries + remove two duplicate lines 2022-11-20 19:21:52 +01:00
niksedk
45dbc4b24a Update dictionaries 2022-11-19 08:57:47 +01:00
niksedk
40e6fd0cd1 Update dictionaries 2022-11-13 19:30:25 +01:00
niksedk
e5cc8ef307 Update dictionaries 2022-11-06 16:52:09 +01:00
niksedk
df4d9d5eff Work on dictionaries 2022-11-04 05:48:48 +01:00
niksedk
34d1758979 Update dictionaries 2022-11-03 20:46:48 +01:00
niksedk
2bcfa7f596 Update dictionaries 2022-11-02 20:24:36 +01:00
niksedk
e3bdff09c7 Work on wordlists 2022-11-01 19:12:38 +01:00
niksedk
33aa384342 Update dictionaries 2022-10-31 19:55:37 +01:00
niksedk
2103f5da94 Add a few English words 2022-10-31 18:33:58 +01:00
niksedk
2b6bd9c0fb Update dictionaries - thx Jean-Pierre :) 2022-10-29 18:59:15 +02:00
niksedk
8beecf1638 Fix for Dutch word 2022-10-28 20:54:03 +02:00
niksedk
f7704137f6 Fix #6315 2022-10-08 10:19:14 +02:00
niksedk
7f641fbccf A few fixes for eng_OCRFixReplaceList.xml - thx Ding-adong :)
Working on  #6315
2022-10-08 09:44:26 +02:00
niksedk
f2ff61ec9a Improve OCR for English ordinals - thx RedSoxFan04 :)
Fix #6304
2022-10-04 21:43:06 +02:00
niksedk
9377781f53 Do not overwrite user word dictionaries
Somewhat related to #6292
2022-09-28 20:18:43 +02:00
May Kittens Devour Your Soul
7c73903914
Update hrv_OCRFixReplaceList.xml 2022-08-11 17:22:50 +02:00
niksedk
53625c7f0c Add a few German nouns 2022-07-30 19:42:27 +02:00
niksedk
678b00c3a4 Add a few German nouns 2022-07-29 21:28:34 +02:00
niksedk
87539e3fb7 Add a few German nouns 2022-07-27 15:45:42 +02:00
niksedk
8a477299aa Add a few more German nouns 2022-07-23 20:54:42 +02:00
niksedk
daec03f49c Split out "word lists" in own menu item 2022-07-23 18:41:43 +02:00
niksedk
eb137dd5c4 Add a few German nouns 2022-07-21 20:29:30 +02:00
niksedk
afb5778c75 Allow for "SE" spell check dictionary file (besides user file) 2022-07-19 12:46:59 +02:00
niksedk
537dd4e706 Update dictionaries 2022-07-17 08:30:51 +02:00
niksedk
48c7dec568 Update BOM 2022-07-16 21:10:21 +02:00
niksedk
c62d90b77d Add a few more German nouns 2022-07-16 21:02:04 +02:00
niksedk
7f16256d2e Update deu user words - thx Stefan :) 2022-07-16 20:16:24 +02:00
niksedk
df89ab2bbc Update German nouns 2022-07-16 19:53:45 +02:00
niksedk
7c248996c6 Add a few more German nouns 2022-07-15 22:42:55 +02:00
niksedk
9d3c300d71 Update German nouns 2022-07-12 22:11:45 +02:00
niksedk
35603d0642 Update German nouns 2022-07-11 17:25:07 +02:00
niksedk
36af54a7e7 Add a few more German nouns 2022-07-06 07:12:03 +02:00
niksedk
5e2877df84 Work on German nouns 2022-07-04 20:35:25 +02:00
niksedk
7ea49d65fd Update german nouns 2022-07-03 20:04:06 +02:00
niksedk
266c53e013 Add name 2022-06-30 20:51:00 +02:00
niksedk
96d7a823a2 Add German noun 2022-06-30 07:24:20 +02:00
niksedk
1afb012399 Work on German nouns... 2022-06-29 05:58:19 +02:00
niksedk
2d225420aa Testing some German nouns for fix casing after audio-to-text 2022-06-29 05:20:18 +02:00
niksedk
f715f26df7 Update dictionaries 2022-06-20 20:00:38 +02:00
niksedk
ae04b0958d Update dictionaries 2022-06-05 06:52:42 +02:00
niksedk
2720471f38 Update dictionaries 2022-05-25 20:04:28 +02:00
niksedk
fb8a0b988a Update dicteionaries 2022-05-21 06:34:02 +02:00
niksedk
08ad5eb88f Update dictionaries 2022-05-16 06:43:46 +02:00
niksedk
fb66eb8919 Add a few items to dictionaries 2022-05-14 21:08:50 +02:00
niksedk
0cfc60bfbe Update dictionaries 2022-05-14 12:29:35 +02:00
niksedk
9cbe659c78 Update en user words 2022-05-08 12:44:24 +02:00
niksedk
53c664b006 Add name 2022-05-08 09:48:59 +02:00
niksedk
3c41bd034f Update dictioanries 2022-05-07 12:26:14 +02:00
May Kittens Devour Your Soul
6647b2d29e
Update hrv_OCRFixReplaceList.xml 2022-04-29 19:21:44 +02:00
niksedk
7b0ac3e81c Update dictionaries 2022-04-24 14:43:00 +02:00
niksedk
6f02791f59 Add a little to ocr dictionaries 2022-04-23 20:42:36 +02:00
niksedk
b51d11dfa0 Add two words to english - machinewrapped :)
Related to #5868
2022-03-27 20:31:21 +02:00
niksedk
b9322b7ab0 Add "cancelled" to dictionary - thx Omair :) 2022-03-24 20:10:11 +01:00
niksedk
d63c55ea67 Add a few en-us words 2022-03-21 21:43:42 +01:00
niksedk
b827aa667f Add two new names 2022-03-21 20:58:54 +01:00
niksedk
9056d61f3f Add English name "Buddhahood" - thx Omair :) 2022-03-21 20:55:33 +01:00
niksedk
17e60433eb Remove wrong ocr replace rule - thx Omair :) 2022-03-21 20:46:12 +01:00
Ivandro Jao
e3d1117baf Remove names already present in "names.xml" 2022-03-16 23:53:15 +00:00
May Kittens Devour Your Soul
59d9b22b9f
Update hrv_OCRFixReplaceList.xml 2022-02-21 13:31:16 +01:00
May Kittens Devour Your Soul
060f4e085c
Update hrv_OCRFixReplaceList.xml 2022-02-14 10:51:54 +01:00
niksedk
2376c53988 Work a little on dictionaries 2022-02-02 21:19:37 +01:00
niksedk
64b9ead3a2 Update dictionaries a little 2022-01-30 09:45:26 +01:00
niksedk
57a2159cc1 Update dictionaries 2022-01-26 21:58:02 +01:00
Nikolaj Olsson
4a06dee4b6
Merge pull request #5735 from diomed/master
Updates for Croatian OCR
2022-01-26 19:13:46 +01:00
niksedk
efa952b520 Minor update 2022-01-26 15:51:41 +01:00
May Kittens Devour Your Soul
7d2600831f
Update hrv_OCRFixReplaceList.xml 2022-01-26 13:34:46 +01:00
May Kittens Devour Your Soul
e0bfd5fe8d
Update hrv_OCRFixReplaceList.xml 2022-01-26 12:27:28 +01:00
niksedk
8a084b6a17 Undo name 2022-01-25 14:07:07 +01:00
niksedk
9b4b9760ec Minor fixes 2022-01-25 06:40:51 +01:00
niksedk
86fb003192 Update dictionaries 2022-01-21 19:44:40 +01:00
niksedk
b536d16f4c Add German word split list 2022-01-17 08:01:21 +01:00
niksedk
fe1e11d0db Update names/nocr-db a little 2022-01-14 16:39:10 +01:00
niksedk
9563d6bbff Add a few extra words to the Macedonian word split list 2022-01-13 12:05:12 +01:00
niksedk
bdd160c162 Add two new word split lists (mkd + rus) 2022-01-11 16:43:34 +01:00
niksedk
dd830b7942 New Polish word split list - thx Janusz :) 2022-01-10 14:44:59 +01:00
niksedk
4e6e932401 Work on dictionaries 2022-01-10 11:08:01 +01:00
niksedk
796548f036 Work on dictionaries 2022-01-09 22:18:18 +01:00
niksedk
dd0a8e1a73 Try to improve assa properties - thx Leon :)
Related somewhat to #5684
2022-01-09 10:16:07 +01:00
niksedk
78dbf89011 Improve spell check regarding Yen symbol (¥) - thx Dnkhatri :) 2021-12-27 19:03:43 +01:00
niksedk
5ca26ec918 More work related to word-split-list 2021-12-26 20:10:49 +01:00
niksedk
6da49f0b23 Update change log 2021-12-26 08:41:05 +01:00
niksedk
8d286f6163 Work on dictionaries 2021-12-25 11:15:39 +01:00
niksedk
b85d77f48e Add Polish word split list 2021-12-23 16:36:07 +01:00
niksedk
bed4b50fdb Add more words to the English word-split-list 2021-12-22 21:57:14 +01:00
niksedk
db9cda1082 Update dictionaries 2021-12-22 14:40:24 +01:00
niksedk
6e34450925 Add Spanish split list 2021-12-20 20:39:13 +01:00
niksedk
7fa610fd39 Add French word split list 2021-12-20 20:28:31 +01:00
niksedk
58b75cf09c Add more words to split list 2021-12-20 18:37:10 +01:00
niksedk
514b1f509e More work on split list 2021-12-20 16:05:31 +01:00
niksedk
6b74c201b6 Minor improvements for the new word split
See https://github.com/SubtitleEdit/subtitleedit/discussions/5616
2021-12-20 12:12:14 +01:00
niksedk
6e93a8248f Change how names list works with split word list - thx Dnkhatri :) 2021-12-20 09:44:39 +01:00
niksedk
7ace645355 Improve Italian ocr replace list a little - thx tormento :)
See https://forum.doom9.org/showthread.php?p=1958951#post1958951
2021-12-19 13:05:52 +01:00
niksedk
5d6d2efacd Update change log + word lists 2021-12-18 19:28:12 +01:00
niksedk
fdafbaeff8 Improve words-without-space-split 2021-12-18 17:31:56 +01:00
niksedk
ac395f9b5d Improve words-without-space-split 2021-12-18 15:43:05 +01:00
niksedk
91d9f69431 Improve ocr string-split-when-space-is-missing - thx Dnkhatri :)
Related to #5616
2021-12-18 13:49:06 +01:00
niksedk
dd27e5fe3d Improve English OCR rule a little - thx tormento :)
Related to https://forum.doom9.org/showthread.php?p=1958951#post1958951
2021-12-15 16:08:41 +01:00
niksedk
fe26a640c5 Improve rule slightly for t5 2021-12-02 16:59:03 +01:00
niksedk
44e9165666 Fix OCR replace list entry with "|" > "I"
+ a little clean
2021-11-05 21:08:21 +01:00
May Kittens Devour Your Soul
7af23f741d
Update hrv_OCRFixReplaceList.xml 2021-08-27 16:45:25 +02:00
May Kittens Devour Your Soul
4366d1399d
Update hrv_OCRFixReplaceList.xml 2021-08-27 16:27:25 +02:00
May Kittens Devour Your Soul
d9d4d5d1fa
Update hrv_OCRFixReplaceList.xml 2021-07-02 10:56:48 +02:00
Nikolaj Olsson
bd792d4a29 Improve spell check slightly 2021-06-06 09:13:08 +02:00
Nikolaj Olsson
8789beb5f8 Fix #5095 2021-06-04 11:15:54 +02:00
Nikolaj Olsson
fb3c6ca018 Update en_names 2021-01-03 21:03:50 +01:00
Παναγιώτης
4a81e4d951
Update el_NoBreakAfterList.xml 2020-12-28 12:53:39 +02:00
May Kittens Devour Your Soul
09ef534444
Update hrv_OCRFixReplaceList.xml 2020-11-18 10:38:27 +01:00
May Kittens Devour Your Soul
0dd4070aa6
Update hrv_OCRFixReplaceList.xml 2020-11-18 10:37:26 +01:00
Nikolaj Olsson
ceffca3cd8 Add a few user words 2020-11-07 23:59:21 +01:00
Nikolaj Olsson
73c7ce57ee Add name 2020-11-07 07:39:20 +01:00
Nikolaj Olsson
6612d767d3 Update dictionaries 2020-10-10 10:10:40 +02:00
May Kittens Devour Your Soul
cfa8aad5a1
Update hrv_OCRFixReplaceList.xml 2020-10-02 14:29:15 +02:00
Nikolaj Olsson
03b49b1dcb Add Russian no-break-after list - thx Elheym :) 2020-10-01 13:39:38 +02:00
Nikolaj Olsson
49fa63cafc Update Greek no-break-after list - thx Lero91 :)
See https://github.com/SubtitleEdit/subtitleedit/issues/4393#issuecomment-700646141
2020-09-29 14:45:44 +02:00
Nikolaj Olsson
5e2bb456b2 Update ocr dictionaries 2020-08-02 13:57:01 +02:00
Nikolaj Olsson
49a0c3c942
Merge pull request #4257 from diomed/patch-4
Update hrv_OCRFixReplaceList.xml
2020-07-24 16:53:44 +02:00
Nikolaj Olsson
63edf983d4 Add "COVID-19" to names list 2020-07-19 09:05:17 +02:00
May Kittens Devour Your Soul
5445af33c7
Update hrv_OCRFixReplaceList.xml 2020-07-01 13:13:12 +02:00
May Kittens Devour Your Soul
c7df3f2fae
Update hrv_OCRFixReplaceList.xml 2020-06-29 10:46:26 +02:00
May Kittens Devour Your Soul
97de2da091
Update hrv_OCRFixReplaceList.xml 2020-06-29 09:32:03 +02:00
Nikolaj Olsson
e0ac8d33a6 Add name 2020-06-20 08:13:58 +02:00
Nikolaj Olsson
64589b90c3 Minor fix for OCR
space after "-" or "'" for nOCR/BIC + update dictionaries
2020-06-17 18:25:31 +02:00
Waldi Ravens
55b9af30f0 dictionaries: automated XML upkeep 2020-06-15 20:57:30 +02:00
Waldi Ravens
51763d2542 dictionaries: Fix Swedish OCRFixReplaceList 2020-06-15 20:55:39 +02:00
Waldi Ravens
92f88a63bc dictionaries: Update Portuguese no-break-after list - thx moob :) 2020-06-14 21:31:13 +02:00
Nikolaj Olsson
c78dda9571 Improve OCR replace list guessses 2020-06-14 20:23:35 +02:00
Waldi Ravens
a1c35e349e dictionaries: automated XML upkeep 2020-06-14 19:35:41 +02:00
Waldi Ravens
c705d7f4f1 dictionaries: Update Greek no-break-after list - thx Lero91 :) 2020-06-14 19:01:35 +02:00
Nikolaj Olsson
22eb7df74e Work on OCR fix engine
split words before j/y for guesses + update dictionaries
2020-06-14 17:01:44 +02:00
Nikolaj Olsson
d52a9994ad Work on OCR 2020-06-12 19:12:38 +02:00
Nikolaj Olsson
94754fc3de Work on OCR 2020-06-12 07:48:32 +02:00
May Kittens Devour Your Soul
4c64f8a3a9
Update hrv_OCRFixReplaceList.xml 2020-06-08 10:57:05 +02:00
Nikolaj Olsson
12b30549e0 Update dictionaries 2020-06-05 14:21:12 +02:00
xylographe
0d8140d728
Merge pull request #4200 from diomed/patch-2
Update hrv_OCRFixReplaceList.xml
2020-05-26 16:58:08 +02:00
Waldi Ravens
233b1eece3 Normalize EOLs in Git repository 2020-05-26 13:50:11 +02:00
May Kittens Devour Your Soul
6e60b7edee
Update hrv_OCRFixReplaceList.xml 2020-05-26 13:15:34 +02:00
Nikolaj Olsson
83fd957887 Work on OCR + work on #4195 2020-05-23 21:22:05 +02:00
Waldi Ravens
80fce956b9 dictionaries: Fix pol_OCRFixReplaceList.xml syntax 2020-05-22 11:03:41 +02:00
Nikolaj Olsson
dc4a52af1c Update change log 2020-05-21 15:51:26 +02:00
Nikolaj Olsson
dfad7c2e5e Add Polish OCR fix replace list - thx Janusz :) 2020-05-21 15:51:00 +02:00
Nikolaj Olsson
d4e42042b1 Update dictionaries (minor) 2020-05-20 21:08:10 +02:00
Nikolaj Olsson
7bf3c1a2db Update dictionaries (minor) 2020-05-20 14:32:07 +02:00
Nikolaj Olsson
001e361d7c Add language context menu to edit bic db + update OCR dictionaries 2020-05-18 15:03:26 +02:00
Nikolaj Olsson
a4310aec3d Work on OCR/italic 2020-05-17 23:06:01 +02:00
Nikolaj Olsson
44e686593a Improve italic detection for "Binary image compare" OCR - thx tormento :)
+ a few related improvements
See doom9 posts around http://forum.doom9.net/showthread.php?p=1910580#post1910580
2020-05-17 09:29:12 +02:00
Nikolaj Olsson
38a75d048d Minor OCR stuff 2020-05-16 12:52:00 +02:00
Omar Si
bf8b3678e1 Update ar_NoBreakAfterList.xml
Closes #4178
2020-05-11 15:22:34 +02:00
Waldi Ravens
6c38fff7ef Update ar_NoBreakAfterList.xml - thx OmrSi :) 2020-05-10 21:56:40 +02:00
Waldi Ravens
16867fc909 Update ar_NoBreakAfterList.xml - thx OmrSi :) 2020-05-10 21:04:36 +02:00
May Kittens Devour Your Soul
c1f977671d Update hrv_OCRFixReplaceList.xml
Closes #4176
2020-05-10 12:52:40 +02:00
Waldi Ravens
08f3674751 dictionaries: automated XML upkeep 2020-05-09 22:04:35 +02:00
Nikolaj Olsson
307c57b57a Add Greek no-break-after list - thx Lero91 :) 2020-05-09 20:10:19 +02:00
May Kittens Devour Your Soul
cc0a99c2a5
Update hrv_OCRFixReplaceList.xml 2020-05-08 09:49:19 +02:00
Nikolaj Olsson
1775b751f3 Remove wrontly committed backup file - thx xylographe :)
See comment 77f98581ff (commitcomment-39016183)
2020-05-07 21:00:31 +02:00
Nikolaj Olsson
77f98581ff Improve ocr dictionaries slightly 2020-05-07 07:57:02 +02:00
Nikolaj Olsson
138a313f6c Update Bulgarian "no-break-after-list" - thx Eva :) 2020-05-04 13:42:36 +02:00
Nikolaj Olsson
3beb5c53f4 Add/update OCR dictionaries 2020-05-03 19:40:24 +02:00
May Kittens Devour Your Soul
809fb420d6 Update hrv_OCRFixReplaceList.xml
Closes #4162
2020-05-03 12:48:52 +02:00
Nikolaj Olsson
4864b4933f Add Bulgarian no-break-after-list 2020-05-02 20:09:35 +02:00
May Kittens Devour Your Soul
9eb3b1fedc
Update hrv_OCRFixReplaceList.xml 2020-04-28 10:48:41 +02:00
nikolaj.olsson
aa7f24b094 Remove "+" from regex - thx ivandrofly :) 2020-04-25 07:55:19 +02:00
Nikolaj Olsson
eb21f3af76 Update English OCR fix list (minor) 2020-04-24 12:21:48 +02:00
Nikolaj Olsson
a3e42a4026 Minor fixes for OCR
Handle "1" as "I" i some situations + don't count "I" and "a" as wrong letters in English
2020-04-24 12:05:53 +02:00
Nikolaj Olsson
11820b2273 Add word to English OCR fix replace list 2020-04-23 19:55:13 +02:00
Nikolaj Olsson
215ab7a165 Update English OCR replace list 2020-04-23 09:52:58 +02:00
May Kittens Devour Your Soul
5bc924d73f Update hrv_OCRFixReplaceList.xml
Closes #4136
2020-04-21 21:03:12 +02:00
Waldi Ravens
c92ec03788 Update ar_NoBreakAfterList.xml - thx OmrSi :) 2020-04-17 23:54:35 +02:00
Waldi Ravens
e21ab99f74 Add Arabic no-break-after list - thx OmrSi :) 2020-04-17 21:01:35 +02:00
Nikolaj Olsson
9e96ad4434 Minor OCR fixes 2020-04-16 09:42:27 +02:00
May Kittens Devour Your Soul
2a66d70005
Update hrv_OCRFixReplaceList.xml 2020-04-12 19:50:04 +02:00
Nikolaj Olsson
b2e8d3bb97 Remove some hardcoded OCR rules + add some softcoded rules 2020-04-12 09:05:39 +02:00
Nikolaj Olsson
7d835c1496 Add two names 2020-04-12 07:04:33 +02:00
May Kittens Devour Your Soul
8b409e7d38 Update hrv_OCRFixReplaceList.xml
Closes #4093
2020-04-07 15:28:36 +02:00
May Kittens Devour Your Soul
e04cf61a38 Update hrv_OCRFixReplaceList.xml
Closes #4064
2020-03-30 15:13:12 +02:00
Nikolaj Olsson
08c3cd62ca Minor OCR update 2020-03-28 19:18:26 +01:00
May Kittens Devour Your Soul
5f94882710
Update hrv_OCRFixReplaceList.xml 2020-03-22 18:52:27 +01:00
Nikolaj Olsson
e7201b0fb3 Improve ocr fixes slightly - thx tormento :)
"l..." to "I..."
2020-03-21 16:24:49 +01:00
Nikolaj Olsson
37e57e35e5 Add "Sunday" to English names - thx Raistlin :) 2020-03-21 09:43:48 +01:00
May Kittens Devour Your Soul
639f9287be Update hrv_OCRFixReplaceList.xml
Closes #4050
2020-03-19 20:02:34 +01:00
Nikolaj Olsson
e29e769d31 Remove allowed word - thx GCRaistlin
<Word from="backseat" to="back seat" />
2020-03-18 15:18:20 +01:00
May Kittens Devour Your Soul
611ac74900
Update hrb_OCRFixReplaceList.xml 2020-03-15 18:57:21 +01:00