Some disclaimer here: I'm a reliability engineer so I analyze industrial processes instead of any type of sport. Normally I have access to lots of historical data, which is good because I'm normally looking for small differences. When you look for small differences, you have to throw out results that are far from the mean (or sometimes median). At work, I would throw out the 3-1's because they're far from the mean and I can always get more data. Here, it isn't that simple and I'm sure if you ran a bunch of tests with similarly skilled players, you would probably get some 3-1 results. On the whole, it still seems like the effect of the player is *probably* more than the effect of the faction in a 3-1 match.
I'm a physicist that has to derive some secondary quantities from a limited amount of data that is known to be noisy and inaccurate/biased (although the degree and trend of the noise and bias is commonly unknown). So I'm kind of used to the situation that you have to make the best out of a potentially strictly not sufficient data set. The upside to this that everybody in this business knows of the uncertainties of the result (although those are hard to quantify), or so you'd hope. So, I guess we approach something like this from a different angle
We do throw out outlying data as well, but really, even 3-0 are not that unusual and 3-1s should be as common as 3-2s if the outcome of a match is totally random (perfect match of player skills and everything is totally balanced). Sounds strange?
Ok, after one match the result will be 1-0 or 0-1 with a 50:50 split. After two matches the options are 2-0, 1-1, 1-1 and 0-2. After three matches, you have 3-0, 2-1, 1-2, 2-1, 2-1, 1-2, 1-2 and 0-3, so 2 out of 8 or 25% of matches will have ended in either 3-0 or 0-3 (last post I correctly stated that the chance of 3-0 is 12.5%, but forgot that the 0-3 would go in the same bucket...).
So, the remaining 75% of rounds will be at 2-1 or 1-2 after three rounds. Again, if the chance of winning is 50:50 for either player, we will have a 50:50 split between 3-1/1-3 or 2:2 results (which would end up in 3:2/2:3, obviously).
As a result, if match outcomes are completely random, 3-0/0-3 are expected in 25% of rounds, 3-1/1-3 in 37.5% and 3-2/2-3 in the final 37.5%. This was kind of unexpected for me, but the thing is that we do not actually play 5 games. If one player has three wins, the series stops. I guess we tend to sort of extrapolate that after 5 games the 3-0 would certainly end up in a 5-0, but potentially the other player could have won games 4 and 5.
If we look at the actual results, we see that we have 20 3-0/0-3 (which obviously are way too many to say that these where even matches), 7 3-1/1-3 and 5 3-2/2-3 so the 3-1/1-3 are not much of an outlier, really. Actually, if the player skills would be similar and all rounds end in 3:2s, you can be pretty certain that something else was unbalanced...
To make a more informed decision, I'd need faction, map, pick, and match length at a minimum. Match length would be especially helpful because equal players usually seem to battle longer.
Sounds reasonable, but if you break it down that much you would end up with a small single digit number of matches per case. Match length seems like a good indicator, but on the other hand you have cases where a player throws the towel early because he lost a crucial unit (we even saw this in the last two rounds). Still, if you are interested, I could give you this data. And yeah, I guess statiticans would probably tell us to "Get more data!".
Also, I wasn't trying to say that OST definitely isn't UP, just that there isn't enough data to say it with such certainty. Doomlord might be right but this tournament isn't a good proof either way.
Well, seems like we overall agree then
Problems with automatch stats are, well: First up, I think there is a considerable skill gap even in the top 50 of players; I didn't check but I wouldn't be surprised if all players that made it to the main round are in the top 100 with at least one faction. And back when I did the post on the automatch stats, I actually looked into what I get when using only games of the top players; you quickly get to the point that you are left with very few matches. I don't recall the exact numbers (might be like 100 matches across all factions, a larger map pool (with uncertain starting positions) for the top 50 or so over three months) but there were so few that I dropped the idea of putting that in the article.
Another issue was pointed out by Momo4Sho in a similar discussion: Top players will often try out new strategies in automatch games, so they might not actually play to win. This is different in tournament games.
So, at the end of the day I guess this is actually close to my work enviroment where decisions have to be made based on multiple different pieces of information (which here could be player feedback, tournament stats, automatch stats...), none of which are conclusive by themselve but together hopefully form a somewhat consistent picture.