Commit Graph

539 Commits

Author SHA1 Message Date
May Kittens Devour Your Soul
debf460133 Update hrv_OCRFixReplaceList.xml 2017-01-18 00:48:07 +01:00
May Kittens Devour Your Soul
ed22973343 Update hrv_OCRFixReplaceList.xml 2017-01-18 00:45:32 +01:00
May Kittens Devour Your Soul
700a0027d0 Update hrv_OCRFixReplaceList.xml 2017-01-02 18:44:14 +01:00
Nikolaj Olsson
2b81d4af77 Updated a few words in ocr replace list - thx Boulder08 :) 2016-12-12 16:28:05 +01:00
May Kittens Devour Your Soul
978eb01aa1 Update hrv_OCRFixReplaceList.xml 2016-12-03 17:55:54 +01:00
May Kittens Devour Your Soul
403b5a0f38 Update hrv_OCRFixReplaceList.xml 2016-12-03 14:48:38 +01:00
May Kittens Devour Your Soul
0ee009c5fa Update hrv_OCRFixReplaceList.xml 2016-12-03 14:34:47 +01:00
Kruno H
ccda549f27 Update hrv_OCRFixReplaceList.xml 2016-11-14 11:43:55 +01:00
Nikolaj Olsson
2ec402752b Merge pull request #1993 from diomed/patch-4
Update hrv_OCRFixReplaceList.xml
2016-10-23 20:22:08 +02:00
Kruno H
aafa5bfcdf Update hrv_OCRFixReplaceList.xml 2016-10-17 16:28:32 +02:00
Kruno H
c4cac4ca8f Update hrv_OCRFixReplaceList.xml 2016-10-17 16:24:05 +02:00
Kruno H
ceb509e646 Update hrv_OCRFixReplaceList.xml 2016-10-12 11:28:40 +02:00
Nikolaj Olsson
992aef4c82 Fixed crash in "Binary image compare" + minor dictionary update - thx Zoltan :) 2016-10-11 19:03:47 +02:00
Kruno H
8c26422343 Update hrv_OCRFixReplaceList.xml 2016-10-09 17:52:41 +02:00
Nikolaj Olsson
621643ad2a Minor ocr additions 2016-10-09 10:51:51 +02:00
Kruno H
c37669ef4d Update hrv_OCRFixReplaceList.xml 2016-10-08 17:00:45 +02:00
Kruno H
d0a31aa580 Update hrv_OCRFixReplaceList.xml 2016-10-01 11:31:23 +02:00
Waldi Ravens
1aa9400b1d Updated eng_OCRFixReplaceList.xml 2016-09-27 21:42:31 +02:00
Kruno H
669456810c Update hrv_OCRFixReplaceList.xml
Closes #1979
2016-09-27 20:56:59 +02:00
aaaxx
e5c3157767 Update eng_OCRFixReplaceList.xml
Closes #1978

Edits
========================================

Should be spaced instead of hyphenated (probably joined by OCR):

- `<Word from="airstrike" to="air-strike" />`
- `<Word from="wallplant" to="wall-plant" />`

Typo in replacement:

- `<Word from="lfeelonelung" to="l feel one lung" />`
- `<Word from="lneed"        to="l need" />`
- `<Word from="lthink___"    to="l think..." />`
- `<Word from="ltold"        to="l told" />`
- `<Word from="lv\/asn't"    to="l wasen't" />`
- `<Word from="Voilé"        to="Voilá" />`
- `<Ending from="pshycol"    to="pshyco!" />`

Capital "i" is a more likely replacement:

- `<Word from="lt"      to="it" />`
- `<Word from="lt'II"   to="it'll" />`
- `<Word from="lt'Il"   to="it'll" />`
- `<Word from="lt'll"   to="it'll" />`
- `<Word from="lt's"    to="it's" />`
- `<Word from="lfstill" to="if still" />`

Vocative, always needs a comma:

- `<Word from="HeyJennifer" to="Hey Jennifer" />`

Removals
========================================

Spelling varies between dictionaries:

- `<Word from="kickflip"  to="kick-flip" />`
- `<Word from="voicemail" to="voice-mail" />`

British vs. American spelling:

- `<Word from="judgement"  to="judgment" />`
- `<Word from="fulfilment" to="fulfillment" />`

Typo, not an OCR error, so spellchecker should deal with it (it doesn't make sense to keep a list of all possible misspellings):

- `<Word from="Goddamit"     to="Goddammit" />`
- `<Word from="mischevious"  to="mischievous" />`
- `<Word from="perscribed"   to="prescribed" />`
- `<Word from="perscription" to="prescription" />`
- `<Word from="pshyco"       to="psycho" />`
- `<Word from="thoguht"      to="thought" />`

Spelling changes meaning:

- `<Word from="ahold"  to="a hold" />`
- `<Word from="google" to="Google" />`

Find and replace are the same:

- `<Word from="I thought" to="I thought" />`
- `<Word from="literally" to="literally" />`

Resulting punctuation seems unlikely:

- `<Word from="'Qkay_"         to="- Okay!" />`
- `<Word from="_Qkay-"         to="- Okay!" />`
- `<Word from="'Qkay"          to="- Okay" />`
- `<Word from="JOEY-"          to="Joey!" />`
- `<Word from="_NO__"          to="No--" />`

Other reason:

Replacement rule                              | Comment
:---------------------------------------------|:-------------------
`<Word from="cp"          to="op" />`         | doesn't seem useful
`<Word from="lnte"        to="inte" />`       | doesn't seem useful
`<Word from="gothere"     to="go there" />`   | could also be "got here"
`<Word from="ridonculous" to="ridiculous" />` | intentional mispronunciation
`<Word from="I02"         to="Pops" />`       | seems really implausible, and it could mess up IDs, codes, etc.
2016-09-27 15:10:58 +02:00
Kruno H
6792825dda Update hrv_OCRFixReplaceList.xml
Closes #1972
2016-09-24 19:17:58 +02:00
Waldi Ravens
d44323f8df Updated hrv_OCRFixReplaceList.xml 2016-09-21 14:33:39 +02:00
Waldi Ravens
e26c5acdf5 dictionaries: automated XML upkeep 2016-09-21 12:40:08 +02:00
Kruno H
dc01fe0b27 Update hrv_OCRFixReplaceList.xml
Closes #1965
2016-09-21 12:29:00 +02:00
Kruno H
6771480232 Update hrv_OCRFixReplaceList.xml
Closes #1959
2016-09-19 10:14:55 +02:00
aaaxx
1443e279a6 Removed licence/license rule: it's not a typo
In British English "licence" is a noun and "license" a verb.
2016-09-16 06:40:36 +02:00
Kruno H
9b7c9f3387 Update hrv_OCRFixReplaceList.xml
Closes #1947
2016-09-12 10:24:29 +02:00
Kruno H
b600fa9f4a Update hrv_OCRFixReplaceList.xml
Closes #1939
2016-09-03 20:40:09 +02:00
Kruno H
e024bdb896 Update hrv_OCRFixReplaceList.xml
Closes #1933
2016-08-30 19:48:40 +02:00
Kruno H
aeae39286b Update hrv_OCRFixReplaceList.xml
Closes #1921
2016-08-25 15:05:00 +02:00
Nikolaj Olsson
f9a2e99d54 Added a few words to the English OCR fix replace list 2016-08-21 20:20:36 +02:00
Waldi Ravens
5b312d4a3a dictionaries: automated XML upkeep 2016-08-20 19:29:34 +02:00
Kruno H
3bf9dbf4d8 Update hrv_OCRFixReplaceList.xml
Closes #1912
2016-08-20 19:18:41 +02:00
Kruno H
3c3c1b2748 Update hrv_OCRFixReplaceList.xml 2016-08-14 17:26:52 +02:00
Kruno H
d2a773b5d6 Update hrv_OCRFixReplaceList.xml 2016-08-12 22:18:04 +02:00
Kruno H
93f3c6a4bc Update hrv_OCRFixReplaceList.xml 2016-08-11 10:17:50 +02:00
Kruno H
1ac4345a96 Update hrv_OCRFixReplaceList.xml 2016-08-10 18:01:07 +02:00
Nikolaj Olsson
eca8f0546a Minor dictionary update 2016-08-07 15:25:26 +02:00
Kruno H
81eee7c457 Update srp_OCRFixReplaceList.xml 2016-08-06 22:31:28 +02:00
Kruno H
3405755824 Update hrv_OCRFixReplaceList.xml
Closes #1884
2016-08-06 20:01:40 +02:00
Kruno H
b5d2fc294a Update hrv_OCRFixReplaceList.xml
Closes #1881
2016-08-05 19:18:14 +02:00
Kruno H
1a62ec44f5 Update hrv_OCRFixReplaceList.xml 2016-08-05 19:12:13 +02:00
Kruno H
1ba7c34fc3 Update hrv_OCRFixReplaceList.xml 2016-08-05 19:12:13 +02:00
Kruno H
7bc6953e1e Update hrv_OCRFixReplaceList.xml
Closes #1877
2016-08-02 13:29:42 +02:00
Waldi Ravens
a9a9dd8a16 dictionaries: automated XML upkeep 2016-07-27 19:24:22 +02:00
Kruno H
a7cfe7a59b Update hrv_OCRFixReplaceList.xml 2016-07-27 16:49:00 +02:00
Kruno H
50cee57807 Update hrv_OCRFixReplaceList.xml 2016-07-26 17:16:33 +02:00
Kruno H
b08c1e1687 Update hrv_OCRFixReplaceList.xml 2016-07-19 18:59:31 +02:00
Kruno H
da030c3326 Update hrv_OCRFixReplaceList.xml 2016-07-19 11:31:01 +02:00
Kruno H
a26ebf877b Update hrv_OCRFixReplaceList.xml 2016-07-14 12:13:59 +02:00
Waldi Ravens
9cc9aa65a9 dictionaries: fixed hun_OCRFixReplaceList.xml RegEx rules 2016-06-30 18:48:33 +02:00
Waldi Ravens
b98ec3dde0 dictionaries: fixed hun_OCRFixReplaceList.xml syntax 2016-06-30 15:55:05 +02:00
Waldi Ravens
ea6a5d5356 dictionaries: updated hrv_OCRFixReplaceList.xml 2016-06-30 14:18:56 +02:00
Kruno H
ffa90af402 Update hrv_OCRFixReplaceList.xml 2016-06-29 15:55:18 +02:00
Nikolaj Olsson
59b161eba2 Added Hungarian ocr fix replace list - thx mia :) 2016-06-27 20:33:05 +02:00
Waldi Ravens
3348c0f59e dictionaries: automated XML upkeep 2016-06-23 20:14:53 +02:00
Kruno H
f011b017eb Update hrv_OCRFixReplaceList.xml 2016-06-23 18:48:56 +02:00
Kruno H
2e20f0c682 Update hrv_OCRFixReplaceList.xml 2016-06-23 12:40:05 +02:00
Kruno H
e24ba0b7d9 Update hrv_OCRFixReplaceList.xml 2016-06-23 11:36:53 +02:00
Kruno H
8493e48f29 Update hrv_OCRFixReplaceList.xml 2016-06-23 11:28:47 +02:00
Kruno H
ae502e4ab6 Update hrv_OCRFixReplaceList.xml 2016-06-23 10:43:50 +02:00
Kruno H
9de6df0ed8 Update hrv_OCRFixReplaceList.xml 2016-06-16 17:44:23 +02:00
Kruno H
7e01db88d7 Update hrv_OCRFixReplaceList.xml 2016-06-16 17:41:54 +02:00
Kruno H
58c2afcf31 Update hrv_OCRFixReplaceList.xml 2016-06-10 20:43:42 +02:00
xylographe
b0ce1684df Merge pull request #1775 from diomed/patch-1
Update hrv_OCRFixReplaceList.xml
2016-06-05 20:13:03 +02:00
Kruno H
702310ca43 Update hrv_OCRFixReplaceList.xml 2016-06-05 16:53:28 +02:00
Nikolaj Olsson
7171bd0973 Working on ocr 2016-06-04 15:52:49 +02:00
Kruno H
054e54549a Update hrv_OCRFixReplaceList.xml 2016-06-04 12:05:07 +02:00
Kruno H
5fc049bccb Update hrv_OCRFixReplaceList.xml 2016-06-04 11:58:19 +02:00
Kruno H
bffa2916c6 Update hrv_OCRFixReplaceList.xml 2016-06-03 16:03:26 +02:00
Waldi Ravens
125d2dceb6 dictionaries: automated XML upkeep 2016-05-29 13:02:16 +02:00
Kruno H
c9bfe95679 Update hrv_OCRFixReplaceList.xml 2016-05-27 16:14:29 +02:00
Kruno H
6c1bef76e9 Update hrv_OCRFixReplaceList.xml 2016-05-18 22:54:00 +02:00
Kruno H
520a74222f Update hrv_OCRFixReplaceList.xml 2016-05-17 11:00:40 +02:00
Kruno H
7e61a4a9f0 Update hrv_OCRFixReplaceList.xml 2016-05-16 15:26:32 +02:00
Nikolaj Olsson
7d09349e0b Some minor improvements for OCR via "Binary image compare" 2016-05-06 15:38:42 +02:00
Kruno H
18b9310209 Update hrv_OCRFixReplaceList.xml 2016-05-02 14:53:41 +02:00
Kruno H
555f01764e Update hrv_OCRFixReplaceList.xml 2016-05-01 12:39:39 +02:00
Kruno H
b3bd8a691a Update hrv_OCRFixReplaceList.xml 2016-04-30 16:07:55 +02:00
Kruno H
35f7845157 Update hrv_OCRFixReplaceList.xml 2016-04-26 15:21:18 +02:00
Kruno H
b9c00de4f5 Update hrv_OCRFixReplaceList.xml 2016-04-18 20:47:59 +02:00
Kruno H
967e3a54ec Update hrv_OCRFixReplaceList.xml 2016-04-18 18:52:27 +02:00
Nikolaj Olsson
b315ed5722 Merge pull request #1699 from diomed/patch-2
Update hrv_OCRFixReplaceList.xml
2016-04-16 12:38:36 +02:00
Kruno H
c1e267f064 Update hrv_OCRFixReplaceList.xml 2016-04-14 15:55:07 +02:00
Waldi Ravens
b8ebe12640 dictionaries: automated XML upkeep 2016-04-13 13:32:50 +02:00
Kruno H
24f69b436a Update hrv_OCRFixReplaceList.xml 2016-04-12 21:13:14 +02:00
Kruno H
c4edc1e6c5 Update hrv_OCRFixReplaceList.xml 2016-04-11 18:53:34 +02:00
Kruno H
ea5f424475 Update hrv_OCRFixReplaceList.xml 2016-04-09 17:05:13 +02:00
Kruno H
297e5de15d Update hrv_OCRFixReplaceList.xml 2016-04-04 12:21:48 +02:00
Kruno H
f998748b7b Update hrv_OCRFixReplaceList.xml 2016-04-04 10:30:51 +02:00
Nikolaj Olsson
61deccc05f Merge pull request #1652 from diomed/patch-3
Update hrv_OCRFixReplaceList.xml
2016-03-25 12:43:05 +01:00
Kruno H
5ec6188ccf Update hrv_OCRFixReplaceList.xml 2016-03-24 14:34:23 +01:00
Kruno H
03f1f7d7c0 Update eng_OCRFixReplaceList.xml 2016-03-23 17:40:31 +01:00
Kruno H
6c590ac997 Update hrv_OCRFixReplaceList.xml 2016-03-23 17:25:49 +01:00
Kruno H
00caef81ec Update eng_OCRFixReplaceList.xml 2016-03-22 20:35:44 +01:00
Kruno H
0e4e194e92 Update hrv_OCRFixReplaceList.xml 2016-03-19 19:56:28 +01:00
Kruno H
559c3ab014 Update hrv_OCRFixReplaceList.xml 2016-03-18 21:03:03 +01:00
Kruno H
5dc1c74427 Update hrv_OCRFixReplaceList.xml 2016-03-18 20:30:50 +01:00
Waldi Ravens
b3f7d6a816 dictionaries: automated XML upkeep 2016-03-15 11:30:30 +02:00
Kruno H
dcff6f2925 Update hrv_OCRFixReplaceList.xml 2016-03-13 20:18:03 +01:00