Community
Participate
Working Groups
Currently about 30% of the index database consists of short strings. Most of those strings are Java identifiers that have common prefixes. We should use some sort of compression scheme that avoids storing the common prefixes more than once. I'd suggest using a trie search structure. This would embeds the string itself implicitly within the search structure and would merge common prefixes. Assuming this saves half the memory used by short strings, this would save us 15% of the total database size. Note: this is a low priority item. We should only look into it after we've fixed all the index-related UI freezes and fragmentation issues.
I'll assign a milestone after all the higher-priority work is done.