Re: tor is great
More users wouldn't appreciatively effect the false positive rate because of the metrics being used provide a pretty good indication of the flow that is occurring. Just the flow start and flow end timestamps in conjunction with byte and packet counters can probably correctly identify IP address correlations no matter how high the volume. The only way to solve this problem is to make those items different on both sides of ToR. That is a very difficult proposition.
More entrance and exit nodes makes it harder to install monitoring on all those points. If you have access to the data, its just as easy.