Summary: | [performance] Scanner is sending 2 messages per identifier character | ||
---|---|---|---|
Product: | [Eclipse Project] JDT | Reporter: | Philipe Mulet <philippe_mulet> |
Component: | Core | Assignee: | Philipe Mulet <philippe_mulet> |
Status: | VERIFIED FIXED | QA Contact: | |
Severity: | normal | ||
Priority: | P3 | CC: | frederic_fusier, Tod_Creasey |
Version: | 3.1 | Keywords: | performance |
Target Milestone: | 3.1 M6 | ||
Hardware: | PC | ||
OS: | Windows XP | ||
Whiteboard: |
Description
Philipe Mulet
2005-03-25 04:20:06 EST
Added support for treating obvious characters specially, using an array of 128 characters mapping to char natures (LETTER, DIGIT, SPACE, SEPARATOR). With this support, there is no more need to go through slow path when compiling some decent set of sources (JCL 1.4). Early measurements show a 47% performance improvement in pure tokenizing (not considering retrieving identifier sources) when repeating 800 times tokenizing Parser.java (>300k of sources). Before on my machine, we did tokenize 3.5M tokens/sec, now it is over 5M tokens/sec. This seems to improve full build scenario by 1-2%. Need to get some specific performance tests for it. Verified in I20050330-0500 |