Change "+" white space link problem

Suggestions, questions oder problems with regain

Moderator: thtesche

Change "+" white space link problem

Postby dimi » Wed Oct 05, 2011 9:57 am

Dear Developer
i found Reagin after a lot of crawling on google, and thanks is very great product!!
I have installed the server version on windows server machine to index my document on share folder.
The only problem i have found is when i found a document that have a name with white space like "Test document.txt" the indexed link change the space with "+" like "Test+document.txt" so when i click on title u r l found the browser dont found the document with "+" as know the "white space" is substituted with "%20".
I have try to check some Apache modrewrite function but TomCat is not the same...
How i can change this Behavior?
Many thanks for possible solution
dimi
 
Posts: 3
Joined: Wed Oct 05, 2011 9:46 am

Re: Change "+" white space link problem

Postby DaveStannard » Mon Nov 07, 2011 5:14 pm

Hi, we have the same issue here.

The U R L links to the files on the search response have pluses "+" instead of spaces "%20".

ie:
Code: Select all
<a href="file://///servername/docs/special+projects+division/working/things+and+stuff_123.doc">things and stuff.doc</a>

(The links are all to local files, on a drive mapping that I have used a rewriteRule to change to a server U R L. But if I remove the rewriteRule the problem is still there.)


This is happening in:
  • regain_v1.7.9-PREVIEW_server on
  • tomcat 6.0.26 running on xp SP3, and
  • tomcat 6.0.16 running on Win 2003 server

Looking in the lucene indexes with Luke shows that the pluses in the source data:

eg, path_sort is:
Code: Select all
C%3a/docs/special+projects+division/working/


Any ideas

Regards

Dave
DaveStannard
 
Posts: 2
Joined: Mon Nov 07, 2011 4:53 pm

Re: Change "+" white space link problem

Postby dimi » Tue Nov 08, 2011 3:45 pm

Hi Dave
Until now i have thought the only one with this problem. I hope for some help from the Regain Team.
Thanks so much to all
dimi
 
Posts: 3
Joined: Wed Oct 05, 2011 9:46 am

Re: Change "+" white space link problem

Postby DaveStannard » Tue Nov 08, 2011 3:57 pm

Solved it. It's the useFileToHtt pBridge setting; I had it set to true.

I set the following in SearchConfiguration.xml:
Code: Select all
<useFileToHtt pBridge>false</useFileToHtt pBridge>

and all the links are good.

(excuse the space in "Htt p", the forum is filtering for links)
DaveStannard
 
Posts: 2
Joined: Mon Nov 07, 2011 4:53 pm

Re: Change "+" white space link problem

Postby dimi » Wed Nov 09, 2011 1:13 pm

Hi Dave
So you have solved with change from "false" to "true"? I have already this flag to true but i still have "+" between the word, you a have also rebuild the index or restart some services?
Many thanks for help
dimi
 
Posts: 3
Joined: Wed Oct 05, 2011 9:46 am

Re: Change "+" white space link problem

Postby benjamin » Sat Dec 03, 2011 9:00 am

A + in a Link should not be the issue, as it is part of the HTTP-Standard. However, maybe this depends on the platform? File-Links are not treated by Tomcat, but by Firefox/IE/... itself. And these file Links are known to be difficult to set up (as browsers has security policies regarding file links served by http pages.)

The FileBridge-True - Option has no effect in 1.7.9, unfortunately.
benjamin
 
Posts: 65
Joined: Wed May 25, 2011 9:19 am

Re: Change "+" white space link problem

Postby MalorieVillarreal » Thu Apr 11, 2013 2:42 am

Until now i have thought the only one with this problem. I hope for some help from the Regain Team.


_______________________
Wholesale DVD from our DVD Sales Online store offers you the cheapest DVD and discount DVD!
MalorieVillarreal
 
Posts: 1
Joined: Wed Apr 10, 2013 7:24 am
Location: http://www.hotterdvdau.com/


Return to regain

Who is online

Users browsing this forum: No registered users and 1 guest

cron