[smila-dev] SMILA as Search engine

Hi Andreas,

Honestly, I have no experience with OSGi frameworks, and on the server I only installed the zip package for Linux, so I don't actually know how to add the boiler package to the running version. Fortunately I can build jar packages, so I chose a different way to solve my problem:

1. I added the inline tag <noindex> (not <!--noindex-->) to the Theseus web pages. This is not the best way, because the W3C validator doesn't like it.
2. In org.eclipse.smila.processing.pipelets.HtmlToTextPipelet.java I added "noindex" to the constant "DEFAULT_REMOVE_CONTENT_TAGS":
   (   private static final String[] DEFAULT_REMOVE_CONTENT_TAGS = { "applet", "frame", "object", "script", "style", "noindex" };  )
3. Then I built the jar package and replaced the one on the server with it.
4. I recrawled the site and everything is fine :-)
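
To illustrate what the change in step 2 achieves, here is a minimal sketch of how such a tag list can be used to drop an element together with its content. This is not the actual HtmlToTextPipelet code: the class name RemoveContentTagsSketch, the method removeContentTags and the regex-based approach are assumptions made only for this example. Only the array itself is taken from step 2.

```java
import java.util.regex.Pattern;

// Hypothetical stand-alone sketch, NOT the SMILA implementation.
class RemoveContentTagsSketch {

    // Same list as in step 2, with "noindex" appended.
    private static final String[] DEFAULT_REMOVE_CONTENT_TAGS =
        { "applet", "frame", "object", "script", "style", "noindex" };

    // Removes every listed element including everything between its
    // opening and closing tag (assumed behaviour, implemented with a regex).
    static String removeContentTags(String html) {
        String result = html;
        for (String tag : DEFAULT_REMOVE_CONTENT_TAGS) {
            Pattern p = Pattern.compile(
                "<" + tag + "\\b[^>]*>.*?</" + tag + "\\s*>",
                Pattern.CASE_INSENSITIVE | Pattern.DOTALL);
            result = p.matcher(result).replaceAll("");
        }
        return result;
    }

    public static void main(String[] args) {
        String html = "<p>Project news</p>"
            + "<noindex><ul><li>Home</li></ul></noindex>"
            + "<p>More text</p>";
        // The navigation wrapped in <noindex> is dropped, the rest is kept.
        System.out.println(removeContentTags(html));
    }
}
```

With a list like this, everything wrapped in <noindex>...</noindex> (for example the navigation) never reaches the index, while the remaining page text is kept.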

I also have some notes:
1. The MimeType field in the search form is not working correctly. If I choose "Picture" or "PDF" in the "Document type" field, everything is fine. The MimeType for web pages (text/html) does not work, but if I change the value to just "html", it works.
2. The "Extension" field does not work. That's why I left it out: http://www.theseus-programm.de/en/75_smila.php?tpl=advanced
3. In my opinion the score value is not calculated correctly. A score above 100% should be impossible, yet I get such values, especially when I search in the advanced form only for pictures (280%) or PDFs (396%).


But I still have questions:
1. Is it possible to show titles for images or PDFs? (Not important, but nice to have.)
2. If I search for pictures in the advanced form, I receive 184 images in the English version, and I can see them all if I page through to the last result page.
If I add a search term, e.g. "bmwi", I have two problems:
   a) I receive more results (200). I think that if SMILA can't deal with image titles, the number of results should stay the same, or maybe I shouldn't receive any results at all, because I guess there is no image called "bmwi".
   b) If I try to go to one of the last result pages, SMILA crashes. At first I thought it was the fault of my form, but the advanced form delivered with SMILA gives the same result.

Okay, that's enough for now.

Thanks in advance.

bye René