a friend showed regain to me, and I think it can be very usefull for me. It is really helpfull. I want to use it for searching some directories on a disc on my local machine, just reports, data analyses, and things like this.
However I need to understand the regain-behavior a little bit better, especially what is the best / safest / mosrt robust and reliable way to control regain.
Here are the details to my installation:
a) Operating System: Window 7 (64Bit)
b) Java-Version: 1.6.0_18
c) Regain Version: Desktop 2.0.4 (2.0.4 Stable/regain_v2.0.4-STABLE_desktop_win.exe)
Question 1 (Main-Question):
The Problem I have is to understand:
How to add directories to the list of searched directories, i.e. how to do this correctly? Regain does not behave as I would expect it to behave...
1.) I tried to add a directory_1 [see below for name] to the list of searched dirs. This works fine. the directory appears two times in the file CrawlerConfiguration.xml. Also the searching is succesfull.
2.) Then I try to add a directory_2 to the list of searched dirs. The name of the new directory is added two times in the file CrawlerConfiguration.xml, it is added in <startlist> and <whitelist>. However, the number of documents added to the index is not increased, and the search for documents in directory_2 is not successfull. This is unexpected for me.
3.) However, now the real strange thing: If I
(a) remove the searchindex and clean up the CrawlerConfiguration.xml
(b) add the directory_2 as the first directory to the list of searched dirs, and
(c) add then add directory_1 as the second directory to the list of searched dirs
everything works as I would expect it. This means in step 3c regain really adds documents to the list of searched docs and a later search is successful.
How can this be explained????
The names of the directories are:
directory_1 = c:/js/diary/WORK_test
directory_2 = c:/js/diary/WORK_test2
Question 2 :
Is there any advanced method for controlling regain, I mean a method alternative to the regain-interface via browser? If yes, where can I find a good set of most-frequently used commands for a beginner?
Many thanks for any good advice. Let me know if you need any more detail.