|
|
|
@2188
|
[2188]
|
11/03/08 21:44:48 |
karpet |
first pass at the dom-specific property and metaname feature. |
|
|
|
@2187
|
[2187]
|
11/01/08 13:54:06 |
karpet |
change some var and sub names for clarity |
|
|
|
@2184
|
[2184]
|
10/24/08 15:05:31 |
karpet |
add raw tagstack to parser_data. this is to allow for metanames or … |
|
|
|
@2177
|
[2177]
|
09/23/08 21:26:30 |
karpet |
fix html parser to use actual libxml2 api; quiet a compiler warning |
|
|
|
@2176
|
[2176]
|
09/22/08 22:57:38 |
karpet |
all tests passing, all (known) leaks fixed |
|
|
|
@2162
|
[2162]
|
09/20/08 15:37:55 |
karpet |
refactor to simplify signatures for TokenIterator?. Now a TI always creates … |
|
|
|
@2158
|
[2158]
|
09/19/08 07:16:36 |
karpet |
yank words.c in favor of tokenizer.c -- benchmarking shows tokenizer.c is … |
|
|
|
@2155
|
[2155]
|
09/18/08 23:51:41 |
karpet |
avoid xmlStrncat because (a) it fails under linux and (b) the realloc is … |
|
|
|
@2153
|
[2153]
|
07/31/08 23:10:16 |
karpet |
fix the bump_word feature so that ->pos for Word or Token reflects … |
|
|
|
@2150
|
[2150]
|
07/29/08 21:35:42 |
karpet |
ditch SWISH_META_CONNECTOR and SWISH_PROP_CONNECTOR in favor of … |
|
|
|
@2148
|
[2148]
|
07/21/08 23:51:24 |
karpet |
change top-level tokenizer functions to use same signature so that we can … |
|
|
|
@2140
|
[2140]
|
04/28/08 22:02:04 |
karpet |
alternate utf8-savvy tokenizer with iterator. initial naive benchmark … |
|
|
|
@2132
|
[2132]
|
04/16/08 08:14:47 |
karpet |
clarify/rename vars |
|
|
|
@2131
|
[2131]
|
04/15/08 23:35:08 |
karpet |
clarify parser warnings env var |
|
|
|
@2130
|
[2130]
|
04/15/08 23:12:59 |
karpet |
fix bug with XMLClassAttributes. all tests pass... for now. |
|
|
|
@2129
|
[2129]
|
04/15/08 10:29:48 |
karpet |
use our string_to_int instead of strtol() directly |
|
|
|
@2128
|
[2128]
|
04/15/08 10:04:58 |
karpet |
attributes for namespace-aware libxml2 were utterly broken. now the only … |
|
|
|
@2116
|
[2116]
|
04/12/08 00:00:40 |
karpet |
restructure tests and add substr to namedbuffer debugging |
|
|
|
@2108
|
[2108]
|
03/31/08 23:47:51 |
karpet |
add header read/write to xapian example and fix some mem leaks |
|
|
|
@2104
|
[2104]
|
03/28/08 23:00:02 |
karpet |
add some mem debugging and clean up swish_words example |
|
|
|
@2103
|
[2103]
|
03/27/08 23:35:21 |
karpet |
whitespace only. again.
I am now using gnu indent rather than the … |
|
|
|
@2101
|
[2101]
|
03/26/08 23:47:21 |
karpet |
whitespace only |
|
|
|
@2100
|
[2100]
|
03/26/08 20:40:57 |
karpet |
use swish hash function rather than raw libxml2 version |
|
|
|
@2098
|
[2098]
|
03/25/08 23:37:04 |
karpet |
fix long-standing mem leak with stdin parsing |
|
|
|
@2096
|
[2096]
|
03/21/08 14:27:54 |
karpet |
add prop and meta id auto-init; fix debug scheme to use bitwise comparison |
|
|
|
@2041
|
[2041]
|
02/29/08 23:18:18 |
karpet |
major reconstruction of config object.
basically, let go of the naive idea … |
|
|
|
@2030
|
[2030]
|
02/25/08 21:58:22 |
karpet |
expand ref counting and clean up some unused code |
|
|
|
@2028
|
[2028]
|
02/24/08 00:38:44 |
karpet |
refactor the ref counting. TODO more tests |
|
|
|
@2027
|
[2027]
|
02/23/08 22:31:16 |
karpet |
rename ParseData? to ParserData?; add more debug env vars; implement c ptr … |
|
|
|
@2010
|
[2010]
|
02/10/08 22:26:06 |
karpet |
simplify API with top-level swish_3 struct |
|
|
|
@2009
|
[2009]
|
02/03/08 23:29:35 |
karpet |
rename some vars for clarity |
|
|
|
@1952
|
[1952]
|
10/26/07 00:17:00 |
karpet |
rename messaging functions and add file, line and function name to output |
|
|
|
@1934
|
[1934]
|
05/07/07 22:11:18 |
karpet |
change stdin to any filehandle pointer and add more POD |
|
|
|
@1931
|
[1931]
|
05/01/07 23:53:25 |
karpet |
tweek the metanames NB to separate text chunks with ctrl char \003 and … |
|
|
|
@1930
|
[1930]
|
04/30/07 23:08:43 |
karpet |
refactor to buffer all MetaNames? as well as PropertyNames? in NamedBuffer? |
|
|
|
@1928
|
[1928]
|
04/23/07 11:58:51 |
karpet |
refactor SWISH::3::Parser class and ref_cnt system |
|
|
|
@1927
|
[1927]
|
04/20/07 17:54:55 |
karpet |
refactoring to create Analyzer class, and the ability to do regex … |
|
|
|
@1924
|
[1924]
|
04/03/07 23:30:21 |
karpet |
global init/cleanup functions to help reduce duplication |
|
|
|
@1921
|
[1921]
|
03/14/07 10:19:44 |
karpet |
reorg the perl namespaces and rename/rework some of the tokenizing to … |
|
|
|
@1913
|
[1913]
|
02/27/07 22:57:38 |
karpet |
for all the world to see |