X-Git-Url: http://git.rot13.org/?a=blobdiff_plain;f=TODO;h=8b9f4e8408e008622a620886fca34d718d47ec4a;hb=ac3e38c6121e07fdc54f874de8f22ba5ce5ba781;hp=8b51c7c3c5680e81a5dfff3b1b60b34c5a316e68;hpb=c947ac9857c7e9aee1cb0378c3186dd05e49a61f;p=webpac2 diff --git a/TODO b/TODO index 8b51c7c..8b9f4e8 100644 --- a/TODO +++ b/TODO @@ -14,10 +14,62 @@ + add support for KinoSearch search library [2.12] + added new set-based normalizer which is pure perl code [2.13] + added --stats to report field and subfield usage [2.14] -- add validator for input data [2.15] -- support arrays for normalize/path and lookup -- add Excel input format ++ add validator for input data [2.15] ++ add Excel input format [2.16] ++ remove WebPAC::Normalize::XML and promote WebPAC::Normalize::Set to WebPAC::Normalize [2.20] ++ support arrays for normalize/path [2.21] ++ add marc to normalize and create export MARC file [2.22] ++ implement indicators and repetable subfield in marc export [2.23] ++ add WebPAC::Output::MARC [2.24] ++ add config() and id() to WebPAC::Normalize ++ support local (by hostname) config files ++ implement marc_original_order to remap source records to marc [2.25] ++ fix statistics to use original data instead of data after modify_records ++ fix encoding and recoding issues (use UTF-8 as WebPAC native encoding) [2.26] ++ modify_file preserve order of translations in file [2.27] ++ modify_records regexpes now match just first occurence (repeat to get second...) ++ fix WebPAC::Output::MARC encoding troubles ++ generate reports (validation and stats) for each input ++ rewrite lookup support to use WebPAC::Normalize [2.28] ++ marc_leader shouldn't really be included in hash returned by data_structure ++ add checks for search directive in normalization to parser ++ lookup to another input file ++ support splitting of config yml to multiple files ++ add set and get to create in-memory lookup ++ support marc_remove('*') ++ fix-length fields (<100) support [2.29] ++ work without indexers installed or enabled [2.30] ++ simpliest possible conf/llin.yml as example ++ WebPAC::Output::JSON to produce data for Simile's exhibit ++ leader from WebPAC::Input::MARC is now available as rec('leader') ++ show i1 and i2 in statistcs specially (sorted first) ++ ISBN support in normalize ++ marc_template (WebPAC v1 in a command) [2.31] ++ marc_template must use marc_indicators data ++ indicator(field,nr) nr=1|2 for MARC input -> rec(field,'i1') or rec(field,'i2') ++ validator don't accept 0 as valid subfield ++ subroutines gets mungled in normalization files [2.33] ++ fix marc_template hadling of field value 0 ++ csv/excel output conf/normalize/csv_ff-peri.pl [2.34] ++ fix input/excel encoding problems ++ use UTF-8 as internal encoding ++ dump stats into var/stats.yml ++ WebPAC::Output::SWISH for indexing with swish-e [2.35] ++ output modules can now define add_row as opposed to add to get input row before normalization ++ WebPAC::Output::DBI to dump row from normalize into database (so each input record can create multiple rows) ++ implement attribute labels in webpac2.cgi +- WebPAC::Input::Excel iso-8859-1 encoding problem +- fix encoding for swish-e and/or webpac2.cgi +- HTML::FillInForm for html template of form [2.36] +- store display fields and use them in webpac.cgi +- marc_clone to copy records/fields/indicators from input marc +- rewrite WebPAC::Input to be based on Module::Pluggable or Moose +- check usage of config in normalize file (database, input name?) +- load_ds/save_ds should use on-disk hash to preserve inodes +- validate should ignore modify_file +- add option to specify output marc path in config.yml - add dBase input format - remove delimiters characters from index and query entered - delete unused files in database directories - scoring for various fields in input/*.xml +- marclint - validate 035$9 as valid