Skip to main content

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] [List Home]
[smila-user] AW: Funny questions about SMILAs

Hi Andreas,
 
to answer the questions its important to add further information to the question.
 
Please add for all questions the data dictionary definition of the index. In SMILA each field may have an own analyzer and the search functionalty is quiet dependend on the analyzer used for a given field.
 
1) Analyzer dependent... Keep in mind that stemming (e.g. english one) may be used dependent on the analyzer... German umlauts may just be rubbish to them.
 
2) yes... just use the constructor definitions (via. DD; take a look onto the schema and just parameterize it as the constructors are used in Java)
 
3) Stemming
 
4) strange... i think they ought to be white spaces...
 
Kind Regards,
 
Georg

Von: smila-user-bounces@xxxxxxxxxxx [smila-user-bounces@xxxxxxxxxxx] im Auftrag von Andreas.Schultz@xxxxxxxxxxx [Andreas.Schultz@xxxxxxxxxxx]
Gesendet: Montag, 1. Februar 2010 10:09
An: smila-user@xxxxxxxxxxx; smila-dev@xxxxxxxxxxx
Betreff: [smila-user] Funny questions about SMILAs

Hi all,

 

it would be kind of you to help me concerning the following topics:

 

1)       How does SMILA work with German special characters like ö,ä,ü,ß.

I tried request with “Schueler”/ “Schüler” and the result was nearly the same.

But when I tried “über” / “ueber” the second request does not return any response.

So please tell me why Schueler and Schüler as part of a request seem to be identical, but über and ueber not!?

2)       Is the Lucene- StandardAnalyzer in a way configurable which allows to alter/add/delete/ etc. stop-words?

3)       Does the Lucene- StandardAnalyzer provide a normalization?

4)       Using “\n”, “\r” or “\t” as a search request leads to a search result which is not empty. Could this be disabled?

 

Best

Andreas Schultz
Senior Software Developer

- - - - Bitte beachten Sie meine neuen Kontaktdaten - - - -


Empolis GmbH  |  Meisenstr. 90 | 33607 Bielefeld  |  Germany
AN ATTENSITY GROUP COMPANY
Phone +49 (0)521 55 785 413|  Fax +49 (0)521 55 785 121
andreas.schultz@xxxxxxxxxxx

 

www.empolis.com
Sitz Kaiserslautern  |  Amtsgericht Kaiserslautern HRB 30711  |  Geschäftsführer: Dr. Stefan Wess, Dr. Peter Tepassé

 

………………………………………………………………………………………………………………………………………………………………………………………………………..

Know. Right. Now.

Das ist unsere Philosophie. Empolis, an Attensity Group Company, bietet eine integrierte Suite von Geschäftsanwendungen,

die mit Hilfe patentierter semantischer Informations-Technologien die exponentiell wachsende Menge unstrukturierter
Daten analysiert, interpretiert und automatisiert verarbeitet. Entscheider, Experten, Mitarbeiter und Kunden erhalten so
stets situations- und aufgabengerecht genau das Wissen, das für ihre Arbeit relevant ist.

………………………………………………………………………………………………………………………………………………………………………………………………………..

Abonnieren Sie unseren monatlichen Newsletter: http://www.empolis.de/newsletter.html

 


Back to the top