|
|
|
@2178
|
[2178]
|
09/26/08 23:59:57 |
karpet |
some versions of html parser were passing through extra whitespace.
seems … |
|
|
|
@2141
|
[2141]
|
04/30/08 00:03:02 |
karpet |
port the ascii optimizations in words.c to tokenizer.c and expose some … |
|
|
|
@2140
|
[2140]
|
04/28/08 22:02:04 |
karpet |
alternate utf8-savvy tokenizer with iterator. initial naive benchmark … |
|
|
|
@2133
|
[2133]
|
04/17/08 21:18:55 |
karpet |
make test now works under linux. this was an issue with s[n]printf |
|
|
|
@2125
|
[2125]
|
04/15/08 10:02:40 |
karpet |
new config type for stringlists |
|
|
|
@2110
|
[2110]
|
04/03/08 22:44:09 |
karpet |
Refactor duplicate id checks to use hash instead of array. Fixes bug with … |
|
|
|
@2108
|
[2108]
|
03/31/08 23:47:51 |
karpet |
add header read/write to xapian example and fix some mem leaks |
|
|
|
@2106
|
[2106]
|
03/30/08 22:55:55 |
karpet |
test that all ids are unique |
|
|
|
@2104
|
[2104]
|
03/28/08 23:00:02 |
karpet |
add some mem debugging and clean up swish_words example |
|
|
|
@2103
|
[2103]
|
03/27/08 23:35:21 |
karpet |
whitespace only. again.
I am now using gnu indent rather than the … |
|
|
|
@2101
|
[2101]
|
03/26/08 23:47:21 |
karpet |
whitespace only |
|
|
|
@2100
|
[2100]
|
03/26/08 20:40:57 |
karpet |
use swish hash function rather than raw libxml2 version |
|
|
|
@2098
|
[2098]
|
03/25/08 23:37:04 |
karpet |
fix long-standing mem leak with stdin parsing |
|
|
|
@2097
|
[2097]
|
03/23/08 23:49:06 |
karpet |
write header |
|
|
|
@2096
|
[2096]
|
03/21/08 14:27:54 |
karpet |
add prop and meta id auto-init; fix debug scheme to use bitwise comparison |
|
|
|
@2046
|
[2046]
|
03/07/08 22:33:11 |
karpet |
more config refactoring |
|
|
|
@2042
|
[2042]
|
03/02/08 22:52:11 |
karpet |
more refactoring of config/header |
|
|
|
@2041
|
[2041]
|
02/29/08 23:18:18 |
karpet |
major reconstruction of config object.
basically, let go of the naive idea … |
|
|
|
@2028
|
[2028]
|
02/24/08 00:38:44 |
karpet |
refactor the ref counting. TODO more tests |
|
|
|
@2027
|
[2027]
|
02/23/08 22:31:16 |
karpet |
rename ParseData to ParserData; add more debug env vars; implement c ptr … |
|
|
|
@2010
|
[2010]
|
02/10/08 22:26:06 |
karpet |
simplify API with top-level swish_3 struct |
|
|
|
@1955
|
[1955]
|
11/13/07 23:31:51 |
karpet |
doc tweak; some config work |
|
|
|
@1952
|
[1952]
|
10/26/07 00:17:00 |
karpet |
rename messaging functions and add file, line and function name to output |
|
|
|
@1927
|
[1927]
|
04/20/07 17:54:55 |
karpet |
refactoring to create Analyzer class, and the ability to do regex … |
|
|
|
@1925
|
[1925]
|
04/04/07 16:36:13 |
karpet |
verify locale should also be global |
|
|
|
@1924
|
[1924]
|
04/03/07 23:30:21 |
karpet |
global init/cleanup functions to help reduce duplication |
|
|
|
@1923
|
[1923]
|
03/19/07 11:58:02 |
karpet |
expose the tokenizer into Perl space for benchmarking |
|
|
|
@1921
|
[1921]
|
03/14/07 10:19:44 |
karpet |
reorg the perl namespaces and rename/rework some of the tokenizing to … |
|
|
|
@1913
|
[1913]
|
02/27/07 22:57:38 |
karpet |
for all the world to see |