[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] [List Home]
[smila-dev] SMILA 1.2 released!

Today we released SMILA 1.2!

 

The main new features of this release are:

* Apache Tika integration - extracting text from (binary) documents (see http://wiki.eclipse.org/SMILA/Documentation/TikaPipelet)

* Scalable JDBC crawling (see http://wiki.eclipse.org/SMILA/Documentation/Importing/Crawler/JDBC#Splitting)

* Web-Crawling enhancements (see http://wiki.eclipse.org/SMILA/Documentation/Importing/Crawler/Web)

* Remote-Crawling (see http://wiki.eclipse.org/SMILA/Documentation/Importing/RemoteCrawling)

 

With our cluster setup tutorial (http://wiki.eclipse.org/SMILA/Documentation/HowTo/How_to_setup_SMILA_in_a_cluster) we made the first step to set our focus on the clustering capabilities of SMILA, this will be a key aspect in the next release. Stay tuned!

 

Thanx to all committers!

Andreas