Skip to main content

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] [List Home]
[smila-user] Handling Streaming Ressources vs JMS

Hi,

I'm thinking about giving SMILA a try for an indexing and text analysis project analyzing lots of realtime information such as Twitter's data.
Of course I started looking into SMILA's architecture (http://wiki.eclipse.org/SMILA/Architecture_Overview) wether it would be possible to handling streaming resources.

Regarding the Architecture Overview, is it really necessary to use JMS between the crawling and analysis?
I'm going to start over with a dataset of 500GB raw text messages and could imagine going up to 4-5TB - imho this would create an overhead when handling with JMS.

Looking forward hear your experiences!

Regards,

Hannes

--

https://www.xing.com/profile/HannesCarl_Meyer
http://de.linkedin.com/in/hannescarlmeyer
http://twitter.com/hannescarlmeyer

Back to the top