266971 – [repo] Why two queries: CompositeQuery and CompoundQuery

Status:	RESOLVED FIXED

Alias:	None

Product:	Equinox
Classification:	Eclipse Project
Component:	p2
Version:	3.5
Hardware:	PC Windows XP

Importance:	P3 normal
Target Milestone:	3.6 M4
Assignee:	Ian Bull
QA Contact:

URL:
Whiteboard:
Keywords:	api

Depends on:
Blocks:	291763

Reported:	2009-03-04 00:10 EST by John Arthorne
Modified:	2009-10-29 18:53 EDT
CC List:	2 users

See Also:

Attachments
Rename composite query to piped query (17.68 KB, patch) 2009-03-07 18:35 EST, Ian Bull	no flags
mylyn/context/zip (4.04 KB, application/octet-stream) 2009-03-07 18:35 EST, Ian Bull	no flags
Updated patch (18.08 KB, patch) 2009-10-15 13:41 EDT, Ian Bull	no flags

Description John Arthorne

2009-03-04 00:10:43 EST

With Ian's query work we ended up with two very similar classes: CompositeQuery and CompoundQuery. From looking at their API I can't really see the difference between them, or why I would choose one over the other (except CompoundQuery supports both AND and OR, and CompositeQuery only does AND). The new CompositeQuery is only used in two places, and most of our code still uses the old CompoundQuery. Why do we need two? Can we get rid of one and consolidate on the other?

Comment 1 Andrew Niefer

2009-03-04 08:26:54 EST

I had noticed a difference, particularly when using queries that are not IMatchQuery instances.

CompoundQuery basically ran both queries on the full data set and then ANDed or ORed the results together.

CompositeQuery ran the first query on the full data, then ran the second on the results of the first.

The difference being something like O(n^2) vs O(log(n))  (I think)
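
To make the two evaluation strategies concrete, here is a minimal sketch in plain Java. It does not use the p2 query classes; `EvaluationCountSketch`, `run`, `compoundAnd`, `piped`, and the `examined` counter are hypothetical names introduced only for illustration. It counts how many elements each strategy examines for two simple filters, and makes no claim about the asymptotic cost of the real implementations.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Predicate;

public class EvaluationCountSketch {
    /** Number of elements examined since the counter was last reset (illustration only). */
    public static int examined = 0;

    /** Runs one filter-style query over the input and counts every element it looks at. */
    public static <T> List<T> run(Predicate<T> query, List<T> input) {
        List<T> out = new ArrayList<>();
        for (T item : input) {
            examined++;
            if (query.test(item)) {
                out.add(item);
            }
        }
        return out;
    }

    /** Compound-style AND: both queries see the full input, then the results are intersected. */
    public static <T> List<T> compoundAnd(Predicate<T> a, Predicate<T> b, List<T> input) {
        List<T> left = run(a, input);
        List<T> right = run(b, input);
        left.retainAll(right); // the cost of the intersection itself is not counted here
        return left;
    }

    /** Piped style: the second query only sees what the first one returned. */
    public static <T> List<T> piped(Predicate<T> a, Predicate<T> b, List<T> input) {
        return run(b, run(a, input));
    }

    public static void main(String[] args) {
        List<Integer> data = List.of(1, 2, 3, 4, 5, 6, 7, 8, 9, 10);
        Predicate<Integer> even = n -> n % 2 == 0;
        Predicate<Integer> big = n -> n > 6;

        examined = 0;
        List<Integer> viaCompound = compoundAnd(even, big, data);
        int compoundWork = examined;

        examined = 0;
        List<Integer> viaPiped = piped(even, big, data);
        int pipedWork = examined;

        System.out.println(viaCompound + " examined=" + compoundWork); // [8, 10] examined=20
        System.out.println(viaPiped + " examined=" + pipedWork);       // [8, 10] examined=15
    }
}
```

For filters like these the two strategies return the same elements; the piped form simply hands a smaller input to the second query.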

Comment 2 Ian Bull

2009-03-06 18:39:48 EST

A CompoundQuery runs each query, and you can choose to AND or OR the results. There are some performance tricks if you have match queries, because we can do this work in parallel.  

A CompositeQuery runs the first query and uses its results as the input to the second query. For example: give me all the EMF features, then give me the 3 most recent of those. This yields the three most recent EMF features. This type of query cannot be performed using a CompoundQuery.
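
The EMF example can be sketched in plain Java to show why the two strategies give different answers once a query depends on its whole input. This is an illustration under assumptions, not the p2 API: `Feature`, `emfOnly`, `threeMostRecent`, `piped`, `compoundAnd`, `sampleData`, the ids, and the release numbers are all hypothetical.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.function.UnaryOperator;
import java.util.stream.Collectors;

public class PipedVsCompoundSketch {
    /** Hypothetical stand-in for a feature: an id plus a release number. */
    static final class Feature {
        final String id;
        final int release;
        Feature(String id, int release) { this.id = id; this.release = release; }
        @Override public String toString() { return id + "@" + release; }
    }

    /** "All the EMF features": a match-style query that judges each element on its own. */
    static List<Feature> emfOnly(List<Feature> in) {
        return in.stream().filter(f -> f.id.startsWith("emf.")).collect(Collectors.toList());
    }

    /** "The 3 most recent": depends on the whole input, so it is not a per-element match. */
    static List<Feature> threeMostRecent(List<Feature> in) {
        return in.stream()
                .sorted(Comparator.comparingInt((Feature f) -> f.release).reversed())
                .limit(3)
                .collect(Collectors.toList());
    }

    /** Piped style: the output of the first query is the input of the second. */
    static List<Feature> piped(UnaryOperator<List<Feature>> first,
                               UnaryOperator<List<Feature>> second,
                               List<Feature> in) {
        return second.apply(first.apply(in));
    }

    /** Compound-style AND: both queries run on the full input and the results are intersected. */
    static List<Feature> compoundAnd(UnaryOperator<List<Feature>> a,
                                     UnaryOperator<List<Feature>> b,
                                     List<Feature> in) {
        List<Feature> left = new ArrayList<>(a.apply(in));
        left.retainAll(b.apply(in));
        return left;
    }

    /** Made-up sample data: four EMF features and two newer GEF features. */
    static List<Feature> sampleData() {
        return List.of(
                new Feature("emf.a", 1), new Feature("emf.b", 2),
                new Feature("emf.c", 3), new Feature("emf.d", 4),
                new Feature("gef.x", 5), new Feature("gef.y", 6));
    }

    public static void main(String[] args) {
        List<Feature> all = sampleData();
        // Three most recent EMF features.
        System.out.println(piped(PipedVsCompoundSketch::emfOnly,
                PipedVsCompoundSketch::threeMostRecent, all)); // [emf.d@4, emf.c@3, emf.b@2]
        // EMF features that also happen to be among the three most recent overall.
        System.out.println(compoundAnd(PipedVsCompoundSketch::emfOnly,
                PipedVsCompoundSketch::threeMostRecent, all)); // [emf.d@4]
    }
}
```

With this data the AND of the two queries keeps only the EMF features that are among the three newest overall, which is a different question from "the three newest EMF features".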

Comment 3 Ian Bull

2009-03-07 02:41:47 EST

So to answer the question, why two queries?  They do separate things.  

Some of this, including examples, was discussed in bug 260012.

Comment 4 John Arthorne

2009-03-07 15:28:06 EST

After re-reading the comments in bug 260012 I completely understand the different uses again. However, I and others keep hitting this, and I think the problem is that the query class names aren't expressive enough. I can't tell any meaningful difference between CompoundQuery and CompositeQuery from the class names, so each time I use one I have to look closely at the distinctions again. Maybe CompositeQuery should be called something like PipedQuery to make the distinction clearer?

Comment 5 Ian Bull

2009-03-07 16:39:39 EST

(In reply to comment #4)
> After re-reading the comments in bug 260012 I completely understand the
> different uses again. However I and others keep hitting this, and I think the
> problem is that the query class names aren't expressive enough. I can't tell
> any meaningful difference between CompoundQuery and CompositeQuery from the
> class names, so each time I use one I have to look closely at the distinctions
> again. Maybe CompositeQuery should be called something like PipedQuery to make
> the distinction clearer?
> 
+1. I agree, the names are not great (I have spent a lot of time in these classes, and I still have to look at the code to remember which one is which). I like PipedQuery.

Comment 6 Ian Bull

2009-03-07 18:35:23 EST

Created attachment 127933
Rename composite query to piped query

I thought that PDE Build used CompositeQuery, but I don't see that now. This patch renames CompositeQuery to PipedQuery and updates the references I could find in the SDK (it only seems to affect p2).

Comment 7 Ian Bull

2009-03-07 18:35:27 EST

Created attachment 127934
mylyn/context/zip

Comment 8 Ian Bull

2009-04-16 13:25:44 EDT

Is this something we want for 3.5, or should we do this early in 3.6 (in time for API review)? I'll mark this 4.0 so we don't lose it.

Comment 9 John Arthorne

2009-04-16 14:15:34 EDT

Not for 3.5.

Comment 10 Ian Bull

2009-10-15 13:41:34 EDT

Created attachment 149664
Updated patch

Since the query stuff moved, the old patch became stale. This updates the patch.

Comment 11 Ian Bull

2009-10-29 18:51:00 EDT

I have released this fix to the API Cleanup Branch. There do not appear to be any clients outside p2 in the SDK.