Skip to main content

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] [List Home]
RE: [smila-dev] Lucene indexing performance

For performance reasons it's best to keep the instances of searcher, indexer and reader on Lucene indices. Not to do --> performance penalty.

But I am not sure whether a 80% decrease could be argumented that way. 

When working concurrently on those objects (searcher, indexer, ...) the issue is that when parallel changed there may be different views on an index... 

I do not currently know the results. Specs and so on has to be read for that purposes. Things I know is that there is a issue with accessing documents in index and positioning things... (get doc with #874534534)

Kind regards,

Georg


-----Original Message-----
From: smila-dev-bounces@xxxxxxxxxxx [mailto:smila-dev-bounces@xxxxxxxxxxx] On Behalf Of Ivan Churkin
Sent: Mittwoch, 13. Mai 2009 14:09
To: Smila project developer mailing list
Subject: Re: [smila-dev] Lucene indexing performance

btw, Lucene IndexWriter supports multiple threads access...

Regards, Ivan

Daniel.Stucky@xxxxxxxxxxx wrote:
> Hi all,
>
> during an index build (over 150.000 documents) we noticed that indexing
> speed gets slower as the index increases in size. Compared to the first
> hour of execution, the 2nd hour was only capable of indexing 80% of the
> load that was indexed in the first hour.
>
> I took a look at the Lucene integration code (by brox) and found, that
> for each index update (add or delete) a new IndexWriter is created and
> closed. This assures that the document is committed for IndexReaders and
> the index is flushed, but I guess that it's bad for performance.
>
> What were the reasons for implementing it that way ? Wouldn't it be
> possible to reuse an IndexWriter, flushing the index either by Memory
> usage or number of documents added/deleted ?
>
> Bye,
> Daniel
> _______________________________________________
> smila-dev mailing list
> smila-dev@xxxxxxxxxxx
> https://dev.eclipse.org/mailman/listinfo/smila-dev
>   

_______________________________________________
smila-dev mailing list
smila-dev@xxxxxxxxxxx
https://dev.eclipse.org/mailman/listinfo/smila-dev




Back to the top