Skip to main content

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] [List Home]
[smila-dev] voting against HASH calculating specification

Hi,

I want to discuss old problem again. It's about HASH calculating.
This problem relates to http://wiki.eclipse.org/SMILA/Specifications/CrawlerAPIDiscussion09 discussion.

It was specified that HASH should be calculated on Crawler Controller side automatically by configuration.
In my opinion it's absolutely unacceptable for distributed systems.
I'll argue it by the next sample.

There are distributed system with 2 nodes. CrawlerController and FileCrawler are communicating remotely.
FileCrawler is configured to calculate HASH by the file content.
Let's imagine that FileCrawler is monitoring video archive and crawling procedure is started automatically every hour.

Can you imagine that happens in this situation? Complete video archive will send remotely every hour )).

--
Ivan



Back to the top