Commit Graph

483 Commits

Author SHA1 Message Date
May Kittens Devour Your Soul
0ee009c5fa Update hrv_OCRFixReplaceList.xml 2016-12-03 14:34:47 +01:00
Kruno H
ccda549f27 Update hrv_OCRFixReplaceList.xml 2016-11-14 11:43:55 +01:00
Nikolaj Olsson
2ec402752b Merge pull request #1993 from diomed/patch-4
Update hrv_OCRFixReplaceList.xml
2016-10-23 20:22:08 +02:00
Kruno H
aafa5bfcdf Update hrv_OCRFixReplaceList.xml 2016-10-17 16:28:32 +02:00
Kruno H
c4cac4ca8f Update hrv_OCRFixReplaceList.xml 2016-10-17 16:24:05 +02:00
Kruno H
ceb509e646 Update hrv_OCRFixReplaceList.xml 2016-10-12 11:28:40 +02:00
Nikolaj Olsson
992aef4c82 Fixed crash in "Binary image compare" + minor dictionary update - thx Zoltan :) 2016-10-11 19:03:47 +02:00
Kruno H
8c26422343 Update hrv_OCRFixReplaceList.xml 2016-10-09 17:52:41 +02:00
Nikolaj Olsson
621643ad2a Minor ocr additions 2016-10-09 10:51:51 +02:00
Kruno H
c37669ef4d Update hrv_OCRFixReplaceList.xml 2016-10-08 17:00:45 +02:00
Kruno H
d0a31aa580 Update hrv_OCRFixReplaceList.xml 2016-10-01 11:31:23 +02:00
Waldi Ravens
1aa9400b1d Updated eng_OCRFixReplaceList.xml 2016-09-27 21:42:31 +02:00
Kruno H
669456810c Update hrv_OCRFixReplaceList.xml
Closes #1979
2016-09-27 20:56:59 +02:00
aaaxx
e5c3157767 Update eng_OCRFixReplaceList.xml
Closes #1978

Edits
========================================

Should be spaced instead of hyphenated (probably joined by OCR):

- `<Word from="airstrike" to="air-strike" />`
- `<Word from="wallplant" to="wall-plant" />`

Typo in replacement:

- `<Word from="lfeelonelung" to="l feel one lung" />`
- `<Word from="lneed"        to="l need" />`
- `<Word from="lthink___"    to="l think..." />`
- `<Word from="ltold"        to="l told" />`
- `<Word from="lv\/asn't"    to="l wasen't" />`
- `<Word from="Voilé"        to="Voilá" />`
- `<Ending from="pshycol"    to="pshyco!" />`

Capital "i" is a more likely replacement:

- `<Word from="lt"      to="it" />`
- `<Word from="lt'II"   to="it'll" />`
- `<Word from="lt'Il"   to="it'll" />`
- `<Word from="lt'll"   to="it'll" />`
- `<Word from="lt's"    to="it's" />`
- `<Word from="lfstill" to="if still" />`

Vocative, always needs a comma:

- `<Word from="HeyJennifer" to="Hey Jennifer" />`

Removals
========================================

Spelling varies between dictionaries:

- `<Word from="kickflip"  to="kick-flip" />`
- `<Word from="voicemail" to="voice-mail" />`

British vs. American spelling:

- `<Word from="judgement"  to="judgment" />`
- `<Word from="fulfilment" to="fulfillment" />`

Typo, not an OCR error, so spellchecker should deal with it (it doesn't make sense to keep a list of all possible misspellings):

- `<Word from="Goddamit"     to="Goddammit" />`
- `<Word from="mischevious"  to="mischievous" />`
- `<Word from="perscribed"   to="prescribed" />`
- `<Word from="perscription" to="prescription" />`
- `<Word from="pshyco"       to="psycho" />`
- `<Word from="thoguht"      to="thought" />`

Spelling changes meaning:

- `<Word from="ahold"  to="a hold" />`
- `<Word from="google" to="Google" />`

Find and replace are the same:

- `<Word from="I thought" to="I thought" />`
- `<Word from="literally" to="literally" />`
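
Rules like the two above, where the search text and the replacement are identical, can be found mechanically. Below is a minimal sketch, assuming the replace list is XML whose rules are `<Word from="..." to="..." />` elements as quoted in this message. The function name `find_noop_rules`, the `<Rules>` wrapper element, and the sample data are hypothetical and not taken from the project.

```python
import xml.etree.ElementTree as ET


def find_noop_rules(xml_text):
    """Return (from, to) pairs of <Word> rules whose replacement equals the search text."""
    root = ET.fromstring(xml_text)
    noops = []
    # iter() walks the whole tree, so the nesting depth of <Word> does not matter.
    for rule in root.iter("Word"):
        src = rule.get("from")
        dst = rule.get("to")
        if src is not None and src == dst:
            noops.append((src, dst))
    return noops


# Hypothetical sample; the real file layout may differ.
SAMPLE = """<Rules>
  <Word from="I thought" to="I thought" />
  <Word from="literally" to="literally" />
  <Word from="lt's" to="It's" />
</Rules>"""

if __name__ == "__main__":
    for src, dst in find_noop_rules(SAMPLE):
        print(f"no-op rule: {src!r} -> {dst!r}")
```

The comparison is exact and case-sensitive, so a rule such as `lt's` to `It's` is not flagged.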

Resulting punctuation seems unlikely:

- `<Word from="'Qkay_"         to="- Okay!" />`
- `<Word from="_Qkay-"         to="- Okay!" />`
- `<Word from="'Qkay"          to="- Okay" />`
- `<Word from="JOEY-"          to="Joey!" />`
- `<Word from="_NO__"          to="No--" />`

Other reason:

Replacement rule                              | Comment
:---------------------------------------------|:-------------------
`<Word from="cp"          to="op" />`         | doesn't seem useful
`<Word from="lnte"        to="inte" />`       | doesn't seem useful
`<Word from="gothere"     to="go there" />`   | could also be "got here"
`<Word from="ridonculous" to="ridiculous" />` | intentional mispronunciation
`<Word from="I02"         to="Pops" />`       | seems really implausible, and it could mess up IDs, codes, etc.
2016-09-27 15:10:58 +02:00
Kruno H
6792825dda Update hrv_OCRFixReplaceList.xml
Closes #1972
2016-09-24 19:17:58 +02:00
Waldi Ravens
d44323f8df Updated hrv_OCRFixReplaceList.xml 2016-09-21 14:33:39 +02:00
Waldi Ravens
e26c5acdf5 dictionaries: automated XML upkeep 2016-09-21 12:40:08 +02:00
Kruno H
dc01fe0b27 Update hrv_OCRFixReplaceList.xml
Closes #1965
2016-09-21 12:29:00 +02:00
Kruno H
6771480232 Update hrv_OCRFixReplaceList.xml
Closes #1959
2016-09-19 10:14:55 +02:00
aaaxx
1443e279a6 Removed licence/license rule: it's not a typo
In British English "licence" is a noun and "license" a verb.
2016-09-16 06:40:36 +02:00
Kruno H
9b7c9f3387 Update hrv_OCRFixReplaceList.xml
Closes #1947
2016-09-12 10:24:29 +02:00
Kruno H
b600fa9f4a Update hrv_OCRFixReplaceList.xml
Closes #1939
2016-09-03 20:40:09 +02:00
Kruno H
e024bdb896 Update hrv_OCRFixReplaceList.xml
Closes #1933
2016-08-30 19:48:40 +02:00
Kruno H
aeae39286b Update hrv_OCRFixReplaceList.xml
Closes #1921
2016-08-25 15:05:00 +02:00
Nikolaj Olsson
f9a2e99d54 Added a few words to the English OCR fix replace list 2016-08-21 20:20:36 +02:00
Waldi Ravens
5b312d4a3a dictionaries: automated XML upkeep 2016-08-20 19:29:34 +02:00
Kruno H
3bf9dbf4d8 Update hrv_OCRFixReplaceList.xml
Closes #1912
2016-08-20 19:18:41 +02:00
Kruno H
3c3c1b2748 Update hrv_OCRFixReplaceList.xml 2016-08-14 17:26:52 +02:00
Kruno H
d2a773b5d6 Update hrv_OCRFixReplaceList.xml 2016-08-12 22:18:04 +02:00
Kruno H
93f3c6a4bc Update hrv_OCRFixReplaceList.xml 2016-08-11 10:17:50 +02:00
Kruno H
1ac4345a96 Update hrv_OCRFixReplaceList.xml 2016-08-10 18:01:07 +02:00
Nikolaj Olsson
eca8f0546a Minor dictionary update 2016-08-07 15:25:26 +02:00
Kruno H
81eee7c457 Update srp_OCRFixReplaceList.xml 2016-08-06 22:31:28 +02:00
Kruno H
3405755824 Update hrv_OCRFixReplaceList.xml
Closes #1884
2016-08-06 20:01:40 +02:00
Kruno H
b5d2fc294a Update hrv_OCRFixReplaceList.xml
Closes #1881
2016-08-05 19:18:14 +02:00
Kruno H
1a62ec44f5 Update hrv_OCRFixReplaceList.xml 2016-08-05 19:12:13 +02:00
Kruno H
1ba7c34fc3 Update hrv_OCRFixReplaceList.xml 2016-08-05 19:12:13 +02:00
Kruno H
7bc6953e1e Update hrv_OCRFixReplaceList.xml
Closes #1877
2016-08-02 13:29:42 +02:00
Waldi Ravens
a9a9dd8a16 dictionaries: automated XML upkeep 2016-07-27 19:24:22 +02:00
Kruno H
a7cfe7a59b Update hrv_OCRFixReplaceList.xml 2016-07-27 16:49:00 +02:00
Kruno H
50cee57807 Update hrv_OCRFixReplaceList.xml 2016-07-26 17:16:33 +02:00
Kruno H
b08c1e1687 Update hrv_OCRFixReplaceList.xml 2016-07-19 18:59:31 +02:00
Kruno H
da030c3326 Update hrv_OCRFixReplaceList.xml 2016-07-19 11:31:01 +02:00
Kruno H
a26ebf877b Update hrv_OCRFixReplaceList.xml 2016-07-14 12:13:59 +02:00
Waldi Ravens
9cc9aa65a9 dictionaries: fixed hun_OCRFixReplaceList.xml RegEx rules 2016-06-30 18:48:33 +02:00
Waldi Ravens
b98ec3dde0 dictionaries: fixed hun_OCRFixReplaceList.xml syntax 2016-06-30 15:55:05 +02:00
Waldi Ravens
ea6a5d5356 dictionaries: updated hrv_OCRFixReplaceList.xml 2016-06-30 14:18:56 +02:00
Kruno H
ffa90af402 Update hrv_OCRFixReplaceList.xml 2016-06-29 15:55:18 +02:00
Nikolaj Olsson
59b161eba2 Added Hungarian ocr fix replace list - thx mia :) 2016-06-27 20:33:05 +02:00
Waldi Ravens
3348c0f59e dictionaries: automated XML upkeep 2016-06-23 20:14:53 +02:00