Add greek support in word-phrase-utf.chr for searching in a Greek catalog (it can also contain latin records).
Developped in collaboration with Giannis Kourmoulis <ikourmou@lib.auth.gr>
Test plan :
- Install using CHR zebra indexing
- Index a greek catalog
- Look for results with mixed uppercase, lowercase and diacritics in title
Signed-off-by: Chris Cormack <chris@bigballofwax.co.nz>
Signed-off-by: Kyle M Hall <kyle@bywatersolutions.com>
Signed-off-by: Tomas Cohen Arazi <tomascohen@gmail.com>
-# Generic search equivanence character map for Latin languages (English, French, etc.)
+# Generic character map for Latin and Greek characters.
# Define the basic value-set. *Beware* of changing this without re-indexing
# your databases.
encoding utf-8
# basic character set
-lowercase {0-9}{a-z}
-uppercase {0-9}{A-Z}
+lowercase {0-9}{a-z}αβγδεζηθικλμνξοπρστυφχψω
+uppercase {0-9}{A-Z}ΑΒΓΔΕΖΗΘΙΚΛΜΝΞΟΠΡΣΤΥΦΧΨΩ
# Breaking characters
space {\001-\040}!"#$%&'\()*+,-./:;<=>?@\[\\]^_`\{|}~’{\x88-\x89}{\x98-\x9C}
equivalent oóòõôŏǫȯőǒȍȏ
equivalent Œœöø(oe)
equivalent ō(oo)
+equivalent αάΑΆ(α΄)(Α΄)
+equivalent εέΕΈ(ε΄)(Ε΄)
+equivalent ιίϊΙΊΪ(ι΄)(Ι΄)
+equivalent ηήΗΉ(η΄)(Η΄)
+equivalent οόΟΌ(ο΄)(Ο΄)
+equivalent υύϋΥΎΫ(υ΄)(Υ΄)
+equivalent ωώΩΏ(ω΄)(Ω΄)
+equivalent Σσς
# Supplemental mappings
# mapping diacritics to plain ASCII
map ɏ y
map Ɏ y
+## mapping diacritics to plain on greek characters
+map ά α
+map Ά α
+map έ ε
+map Έ ε
+map ί ι
+map Ί ι
+map ή η
+map Ή η
+map ό ο
+map Ό ο
+map ύ υ
+map Ύ υ
+map ώ ω
+map Ώ ω
+map Σ σ
+map Σ ς
+
# mapping blank string
map (^impr\s) @
map (^impr\.) @