... We faced a similar situation -- we index content that resides in a database. So we built a script to dump out the database content to HTML files, giving ...
773
Paul J. Lucas
ypjl
May 5, 2011 7:46 pm
... No. Even for files that are indexed, you can not search for them by their path or file names. - Paul...
772
terrasim_fac
May 5, 2011 7:44 pm
Apologies if this has been discussed elsewhere, but I've searched through and was unsuccessful finding anything, so thought I would ask. Is there a way to...
771
Paul J. Lucas
ypjl
Jun 5, 2010 3:16 pm
... All the documentation is in the config.h file itself. ... I can not know all possible systems. It's your system: you're supposed to know or figure it out....
770
hgpreader
Jun 5, 2010 3:03 pm
This groups looks pretty dead since last September but here goes: I've been unable to get the latest swish++-6.5 to compile having problems with the built in...
769
Gyepi SAM
gyepi@...
Jul 26, 2009 5:07 pm
... If you know, at search time, which sites or subdomains need to be searched, then those values should probably be provided as metadata to the indexer and...
768
opticabo@...
opticabo...
Jul 26, 2009 4:23 am
... Actually this idea and my extractpath question were dumb. I've realised I should use an individual index for each site ("subdomain"). It makes more sense...
767
opticabo@...
opticabo...
Jul 25, 2009 3:43 pm
... Well at the moment I'm just using an associative array in the code which maps the categories to the sites, and I wouldn't actually implement the example I...
766
Gyepi SAM
gyepi@...
Jul 23, 2009 6:10 pm
... Keep in mind that swish++ is an indexing and search tool. The problems you describe relate to data management and require a different set of tools. If a...
765
opticabo@...
opticabo...
Jul 22, 2009 12:38 am
I don't mind coding this if there's no inherent reason the feature isn't there already. Makes more sense to add the feature to the code which returns the row...
764
opticabo@...
opticabo...
Jul 21, 2009 4:27 am
... Thanks, tbh after I posted I looked at the last date and didn't expect a reply. Reading some of the other posts made me feel guilty actually :-) As for...
763
Gyepi SAM
gyepi@...
Jul 18, 2009 11:58 pm
... Assuming you store the meta data in a database, an elegant solution is to encode each site's record id in the file that swish++ indexes; When search...
762
opticabo@...
opticabo...
Jul 15, 2009 8:56 pm
Hi, I'm evaluating swish++/swishe for searching a malleable subset of a collection of external domains. In other words they can use checkboxes to select the...
761
paul@...
ypjl
May 8, 2007 4:03 pm
... That's correct. (If it were to reference the file, the man page would say it did.) ... The .new is intended to replace the original. Hence, there's no ...
760
glen_a_stewart
May 7, 2007 11:37 pm
When an incremental index (.new) exists, it doesn't seem to be referenced by default when using the search++ command. What I expected, was that the...
759
paul@...
ypjl
Jan 23, 2007 5:31 pm
... Do you mean: index a single e-mail address as a single word? No because you'd have to include '@' and '.' as word characters which isn't something you...
758
ignotus@...
Jan 23, 2007 5:19 pm
Hello! I use swish++ to index my email archives, and I frueqently do queries like ( from = foobar@... ) to get all the mails originating from a particular...
757
IceQuake
runderwo_99
Dec 1, 2006 6:46 pm
... I am using tar to generate the archive listing, not to extract the files. Then the archive listing is indexed. The problem with chaining the gz/tar...
756
paul@...
ypjl
Dec 1, 2006 6:54 am
... It occurred to me that you can't use FilterFile for tar files. FilterFile expects to filter one type of file into another type of *file* (singular), not a...
755
paul@...
ypjl
Nov 30, 2006 10:44 pm
... Double-extensions are not supported (nor do they need to be). From ... To be your exactly case, replace "pdf" in the quoted text above with "tar". More...
754
runderwo_99
Nov 30, 2006 10:20 pm
It seems that any filter installed for *.gz takes precedence over any filter installed for *.tar.gz. (In the former case, the file should be decompressed with...
753
paul@...
ypjl
Oct 20, 2006 5:10 am
... Read FAQ #2. - Paul...
752
rbeentje
Oct 20, 2006 12:31 am
Dear Swish++, I've successfully indexed some documents, including meta information, but I can't see how to return the meta information in the search results....
751
Paul J. Lucas
ypjl
Jul 15, 2006 5:03 am
... That's not what a regular search does. A regular search for a word in a file simply returns whether the word is in that file at least once (along with a...
750
eric_jobidon
Jun 26, 2006 8:32 pm
Has anyone tried to perform hit-highlighting of PDF files using SWISH++? What I am referring to is the creation of a "pseudo-xml" file (as specified by Adobe...
749
Paul J. Lucas
ypjl
Apr 3, 2006 4:21 pm
... man grep - Paul...
748
thorsten.kleindienst
thorsten.kle...
Apr 3, 2006 3:10 pm
I found in the docs that filtering the files and directories to be searched is planed for future releases, so it is currently not possible do sth like this: ...
747
Paul J. Lucas
ypjl
Mar 15, 2006 5:59 am
* The definition of MAKEDEPEND was wrong. This broke dependency-file generation when not using g++. * Indexing of some ID3 tags was broken....
746
Paul J. Lucas
ypjl
Mar 7, 2006 3:43 am
... You're free to use something other than SWISH++. ... Right. You're free to write your own code to do this yourself from a file. ... Not if you do it...
745
Brian May
bamay1
Mar 7, 2006 3:32 am
Unfortunately this didn't send the first time; retry... ... -- Brian May <bam@...>...