How to preserve indexed Files

Suggestions, questions oder problems with regain

Moderator: thtesche

How to preserve indexed Files

Postby Isidro García » Fri Mar 06, 2009 8:06 am

Hello;

I saw that if I delete files and them launch crawler the index is updated removing the entries indexed.
Is there a posilibility to ignore this situation ?.

The reason is that I put all the files in directories to index them and finally remove from this directory (space question).

regars

Iga
Isidro García
 
Posts: 1
Joined: Fri Mar 06, 2009 7:53 am

Postby Til » Fri Mar 06, 2009 3:21 pm

This could be done with partial indexing.

You could define - let't say - 10 while list entries for 10 different directories and give those entries different name attributes. See: http://regain.murfman.de/wiki/doku.php? ... telist_tag

Then you can copy your first bunch of files in the first directory and run the crawler with:
Code: Select all
java -jar regain-crawler.jar -onlyEntries name1

Then copy the seond bunch to your second directory and call:
Code: Select all
java -jar regain-crawler.jar -onlyEntries name2

regain will now leave the entries matching the white list entry with name1 untouched.
User avatar
Til
Site Admin
 
Posts: 242
Joined: Wed Jan 26, 2005 8:54 pm

Re: How to preserve indexed Files

Postby repcon_iga » Tue May 12, 2009 3:09 pm

Hello;

Well, It could be a excelent solution, but I means an incremental Index..

For example;

I am indexing a directory with 10000 files. When the process ends I delete all the files to preserve space.
The next time It could be modify one file or create another one.. The right way is to put them in a directory and tell the crawler not to delete the files contained in the index... only to update with the new and modified files.. The better could be to say too what files to deleted...

How can I simulate this ?

Regards

Isidro
repcon_iga
 
Posts: 1
Joined: Tue May 12, 2009 3:05 pm

good news

Postby getitdunk » Wed Jan 13, 2010 12:28 am

Yes, this is a good article. Thank you for sharing. I've learned something. I'm waiting for more.
getitdunk
 
Posts: 7
Joined: Fri Jan 08, 2010 3:49 am


08.23.24

Postby ximike2010 » Wed Aug 25, 2010 6:26 am

ximike2010
 
Posts: 301
Joined: Thu Aug 05, 2010 3:40 am


08.23.54

Postby ximike2010 » Wed Sep 01, 2010 12:37 am

ximike2010
 
Posts: 301
Joined: Thu Aug 05, 2010 3:40 am


Return to regain

Who is online

Users browsing this forum: Yahoo [Bot] and 3 guests

cron