|
|
|
@2142
|
[2142]
|
05/05/08 23:30:12 |
karpet |
add some tokenizer tests and (doh!) include tokenizer.c |
|
|
|
@2141
|
[2141]
|
04/30/08 00:03:02 |
karpet |
port the ascii optimizations in words.c to tokenizer.c and expose some … |
|
|
|
@2140
|
[2140]
|
04/28/08 22:02:04 |
karpet |
alternate utf8-savvy tokenizer with iterator. initial naive benchmark … |
|
|
|
@2135
|
[2135]
|
04/19/08 22:05:33 |
karpet |
make warnings optional |
|
|
|
@2133
|
[2133]
|
04/17/08 21:18:55 |
karpet |
make test now works under linux. this was an issue with s[n]printf |
|
|
|
@2132
|
[2132]
|
04/16/08 08:14:47 |
karpet |
clarify/rename vars |
|
|
|
@2131
|
[2131]
|
04/15/08 23:35:08 |
karpet |
clarify parser warnings env var |
|
|
|
@2130
|
[2130]
|
04/15/08 23:12:59 |
karpet |
fix bug with XMLClassAttributes. all tests pass... for now. |
|
|
|
@2129
|
[2129]
|
04/15/08 10:29:48 |
karpet |
use our string_to_int instead of strtol() directly |
|
|
|
@2128
|
[2128]
|
04/15/08 10:04:58 |
karpet |
attributes for namespace-aware libxml2 were utterly broken. now the only … |
|
|
|
@2127
|
[2127]
|
04/15/08 10:03:53 |
karpet |
more test refactoring |
|
|
|
@2126
|
[2126]
|
04/15/08 10:03:14 |
karpet |
fix off-by-1 err |
|
|
|
@2125
|
[2125]
|
04/15/08 10:02:40 |
karpet |
new config type for stringlists |
|
|
|
@2124
|
[2124]
|
04/15/08 10:01:56 |
karpet |
test swish_header by default too |
|
|
|
@2123
|
[2123]
|
04/15/08 10:01:26 |
karpet |
add prototypes for stringlist and make some constant ints into powers of 2 |
|
|
|
@2122
|
[2122]
|
04/15/08 10:00:43 |
karpet |
add stringlist utility functions; still TODO is to make StringList utf-8 … |
|
|
|
@2121
|
[2121]
|
04/15/08 09:55:03 |
karpet |
fix mem leak and add more info to usage() |
|
|
|
@2118
|
[2118]
|
04/13/08 23:56:50 |
karpet |
restructure config tests |
|
|
|
@2117
|
[2117]
|
04/12/08 00:01:19 |
karpet |
more tests |
|
|
|
@2116
|
[2116]
|
04/12/08 00:00:40 |
karpet |
restructure tests and add substr to namedbuffer debugging |
|
|
|
@2114
|
[2114]
|
04/08/08 09:36:19 |
karpet |
oops. forgot indexer needs this too, till we can integrate with libswish3 … |
|
|
|
@2113
|
[2113]
|
04/08/08 09:35:49 |
karpet |
perl examples for xapian |
|
|
|
@2112
|
[2112]
|
04/07/08 21:48:43 |
karpet |
make output a little more swish-like |
|
|
|
@2111
|
[2111]
|
04/06/08 23:29:45 |
karpet |
xapian example can now search as well as index |
|
|
|
@2110
|
[2110]
|
04/03/08 22:44:09 |
karpet |
Refactor duplicate id checks to use hash instead of array. Fixes bug with … |
|
|
|
@2108
|
[2108]
|
03/31/08 23:47:51 |
karpet |
add header read/write to xapian example and fix some mem leaks |
|
|
|
@2106
|
[2106]
|
03/30/08 22:55:55 |
karpet |
test that all ids are unique |
|
|
|
@2105
|
[2105]
|
03/28/08 23:04:21 |
karpet |
whitespace only |
|
|
|
@2104
|
[2104]
|
03/28/08 23:00:02 |
karpet |
add some mem debugging and clean up swish_words example |
|
|
|
@2103
|
[2103]
|
03/27/08 23:35:21 |
karpet |
whitespace only. again.
I am now using gnu indent rather than the … |
|
|
|
@2102
|
[2102]
|
03/26/08 23:52:48 |
karpet |
test now for unique rather than == 0 |
|
|
|
@2101
|
[2101]
|
03/26/08 23:47:21 |
karpet |
whitespace only |
|
|
|
@2100
|
[2100]
|
03/26/08 20:40:57 |
karpet |
use swish hash function rather than raw libxml2 version |
|
|
|
@2099
|
[2099]
|
03/25/08 23:38:16 |
karpet |
fix deprecated header |
|
|
|
@2098
|
[2098]
|
03/25/08 23:37:04 |
karpet |
fix long-standing mem leak with stdin parsing |
|
|
|
@2097
|
[2097]
|
03/23/08 23:49:06 |
karpet |
write header |
|
|
|
@2096
|
[2096]
|
03/21/08 14:27:54 |
karpet |
add prop and meta id auto-init; fix debug scheme to use bitwise comparison |
|
|
|
@2090
|
[2090]
|
03/18/08 23:45:43 |
karpet |
xapian example |
|
|
|
@2087
|
[2087]
|
03/17/08 21:19:24 |
karpet |
init some docs |
|
|
|
@2047
|
[2047]
|
03/07/08 22:33:37 |
karpet |
document new config/header format |
|
|
|
@2046
|
[2046]
|
03/07/08 22:33:11 |
karpet |
more config refactoring |
|
|
|
@2045
|
[2045]
|
03/07/08 22:32:33 |
karpet |
new config support |
|
|
|
@2042
|
[2042]
|
03/02/08 22:52:11 |
karpet |
more refactoring of config/header |
|
|
|
@2041
|
[2041]
|
02/29/08 23:18:18 |
karpet |
major reconstruction of config object.
basically, let go of the naive idea … |
|
|
|
@2031
|
[2031]
|
02/25/08 22:06:18 |
karpet |
one less todo |
|
|
|
@2030
|
[2030]
|
02/25/08 21:58:22 |
karpet |
expand ref counting and clean up some unused code |
|
|
|
@2029
|
[2029]
|
02/25/08 21:15:06 |
karpet |
refactor stash into its own class for easier debugging |
|
|
|
@2028
|
[2028]
|
02/24/08 00:38:44 |
karpet |
refactor the ref counting. TODO more tests |
|
|
|
@2027
|
[2027]
|
02/23/08 22:31:16 |
karpet |
rename ParseData to ParserData; add more debug env vars; implement c ptr … |
|
|
|
@2019
|
[2019]
|
02/18/08 23:34:02 |
karpet |
split XS out into separate files; move more from XS to C; TODO figure out … |
|
|
|
@2018
|
[2018]
|
02/13/08 00:48:19 |
karpet |
fix bug in order of tokenize() args |
|
|
|
@2017
|
[2017]
|
02/12/08 00:39:13 |
karpet |
no longer used |
|
|
|
@2016
|
[2016]
|
02/12/08 00:36:44 |
karpet |
nits |
|
|
|
@2015
|
[2015]
|
02/12/08 00:36:10 |
karpet |
more tests |
|
|
|
@2014
|
[2014]
|
02/11/08 08:02:46 |
karpet |
new bindings to match API reorg |
|
|
|
@2013
|
[2013]
|
02/11/08 08:01:37 |
karpet |
reorg |
|
|
|
@2010
|
[2010]
|
02/10/08 22:26:06 |
karpet |
simplify API with top-level swish_3 struct |
|
|
|
@2009
|
[2009]
|
02/03/08 23:29:35 |
karpet |
rename some vars for clarity |
|
|
|
@1955
|
[1955]
|
11/13/07 23:31:51 |
karpet |
doc tweak; some config work |
|
|
|
@1952
|
[1952]
|
10/26/07 00:17:00 |
karpet |
rename messaging functions and add file, line and function name to output |
|
|
|
@1948
|
[1948]
|
10/23/07 09:39:22 |
karpet |
test doc maker now requires explicit number of files to make |
|
|
|
@1935
|
[1935]
|
05/07/07 22:12:38 |
karpet |
oops |
|
|
|
@1934
|
[1934]
|
05/07/07 22:11:18 |
karpet |
change stdin to any filehandle pointer and add more POD |
|
|
|
@1933
|
[1933]
|
05/06/07 23:33:47 |
karpet |
perl bindings split \003 into array of strings, libswish3 pod … |
|
|
|
@1931
|
[1931]
|
05/01/07 23:53:25 |
karpet |
tweak the metanames NB to separate text chunks with ctrl char \003 and … |
|
|
|
@1930
|
[1930]
|
04/30/07 23:08:43 |
karpet |
refactor to buffer all MetaNames as well as PropertyNames in NamedBuffer |
|
|
|
@1929
|
[1929]
|
04/24/07 22:31:17 |
karpet |
fix a number of bugs in the swish3 example script |
|
|
|
@1928
|
[1928]
|
04/23/07 11:58:51 |
karpet |
refactor SWISH::3::Parser class and ref_cnt system |
|
|
|
@1927
|
[1927]
|
04/20/07 17:54:55 |
karpet |
refactoring to create Analyzer class, and the ability to do regex … |
|
|
|
@1925
|
[1925]
|
04/04/07 16:36:13 |
karpet |
verify locale should also be global |
|
|
|
@1924
|
[1924]
|
04/03/07 23:30:21 |
karpet |
global init/cleanup functions to help reduce duplication |
|
|
|
@1923
|
[1923]
|
03/19/07 11:58:02 |
karpet |
expose the tokenizer into Perl space for benchmarking |
|
|
|
@1922
|
[1922]
|
03/15/07 23:24:22 |
karpet |
don't bother with the ucdata libraries after all. we'll make do with … |
|
|
|
@1921
|
[1921]
|
03/14/07 10:19:44 |
karpet |
reorg the perl namespaces and rename/rework some of the tokenizing to … |
|
|
|
@1920
|
[1920]
|
03/04/07 21:32:53 |
karpet |
reorg namespaces and add stubs for indexer, etc. |
|
|
|
@1919
|
[1919]
|
02/28/07 11:59:40 |
karpet |
link to wiki instead |
|
|
|
@1914
|
[1914]
|
02/28/07 09:29:18 |
karpet |
include perl bindings and some doc cleanup |
|
|
|
@1913
|
[1913]
|
02/27/07 22:57:38 |
karpet |
for all the world to see |