X-Git-Url: http://git.rot13.org/?a=blobdiff_plain;f=TODO;h=124825489ba21a05bcf01edb34c7e42f89adea00;hb=d561ea4848994982474bcf1e3ee5fd7d71ec6fef;hp=439afc88dd7c0de087786057daceb65094ac3eb3;hpb=1615c1ed23383a0dc912ebfbb12bdf3f2fe0cc88;p=webpac2 diff --git a/TODO b/TODO index 439afc8..1248254 100644 --- a/TODO +++ b/TODO @@ -14,9 +14,32 @@ + add support for KinoSearch search library [2.12] + added new set-based normalizer which is pure perl code [2.13] + added --stats to report field and subfield usage [2.14] -- support arrays for normalize/path and lookup -- add Excel input format ++ add validator for input data [2.15] ++ add Excel input format [2.16] ++ remove WebPAC::Normalize::XML and promote WebPAC::Normalize::Set to WebPAC::Normalize [2.20] ++ support arrays for normalize/path [2.21] ++ add marc to normalize and create export MARC file [2.22] ++ implement indicators and repetable subfield in marc export [2.23] ++ add WebPAC::Output::MARC [2.24] ++ add config() and id() to WebPAC::Normalize ++ support local (by hostname) config files ++ implement marc_original_order to remap source records to marc [2.25] ++ fix statistics to use original data instead of data after modify_records ++ fix encoding and recoding issues (use UTF-8 as WebPAC native encoding) [2.26] ++ modify_file preserve order of translations in file [2.27] ++ modify_records regexpes now match just first occurence (repeat to get second...) ++ fix WebPAC::Output::MARC encoding troubles ++ generate reports (validation and stats) for each input ++ rewrite lookup support to use WebPAC::Normalize [2.28] ++ marc_leader shouldn't really be included in hash returned by data_structure ++ add checks for search directive in normalization to parser ++ lookup to another input file ++ support splitting of config yml to multiple files +- fix-length fields (<100) support +- validate should ignore modify_file +- add option to specify output marc path in config.yml - add dBase input format - remove delimiters characters from index and query entered - delete unused files in database directories - scoring for various fields in input/*.xml +- marclint - validate 035$9 as valid