Saxon is implementing Unicode codepoint collation simply by doing a Java string compare, and this is incorrect in the case of strings containing characters whose Unicode codepoint value is greater than 65535. This kind of bug is irritating, because hardly anyone is going to be affected by it, yet fixing it will impose a cost on everyone because string comparison will be slower.   more »