Hey, chiming in here again..
I've got to say again, good work Oli and thanks for sticking with this and trying to get us a better system. I know we're all gonna have objections here and there to various parts, but I hope I speak for all that we are looking forward to anything that's an improvement over what we have and we thank you for all the hard work! We're all opinionated and just trying to make sure we get the best ranking system(s) for the community.
I agree with both sides of the PPSC/WTA debate here in part, and thus have felt all along that we need multiple systems to cover the variety of play styles/game types/etc. we have here. The more this debate goes on, i don't think we can have just one ranking system that gives us solid math on who is the best player here. There are too many play styles and too many variants for one single overarching ranking system to be completely effective (and accepted).
Thus I'm going to second what someone above mentioned (can't remember who as I go back and look). I propose separating (or at least an option to separate) PPSC standings and WTA standings (a completely different game and truest imo to the W/D/L results that ELO is designed for and the game of diplomacy as well for that matter). In PPSC, maximizing your centers in a survive is not a loss or equivalent to other smaller survives, but is debatable whether it should still be less than a draw, even though there are a number of players here who don't play that way. I personally feel it should still be less than a draw, but it is a popular set of victory conditions and not everyone played this way, so there will be much objection from those players if we change the scoring to the way we feel the game should have been played. In that case I feel that if we want to change things to how they should have been, we should start over fresh with those standings.
Also, am I missing something or are we no longer able to see the experimental rankings? http://www.vdiplomacy.com/halloffame_elo.php doesn't work for me anymore.
Oli,
To specifically answer your question about high player count games and number of players that we interact with on those games (as WWIV, and to a lesser extent Known World 901 and Modern II and other large maps account for almost half of my games), I would say that I at a minimum talk to everyone on my continent and all my neighbors and their neighbors, and have significant relations with probably 1/2 to 3/4ths of those through the early game. Mid game, I'm usually interacting significantly with another 4-8 players I didn't interact with in the early game, and if the game makes it all the way to what I would usually call the end game (though most WWIV games don't make it that far in my experience), I'm usually talking and interacting with everyone left. So out of 35 on a WWIV, I'd say I interact with anywhere from 15-25 of those players in a significant way, even though I often try to talk to most everyone at the beginning. I think this estimate is high though as I interact a lot more and a lot farther out than most on large map.
On a side note, for players like me who play much more of these high player count games, I'd like to interject once again that a high center count performance, with a high number of draws and large survives repeatedly on WWIV games, appears (from stats I've seen at least) to be 'penalized' or less statistically significant in comparison to people who play a lot of 1v1 games. Once again, I'm not arguing for changing the overall stats, but that we really need standings per variant (and I'm also now recommending per game type WTA and PPSC, in addition to the 5% or 10% bet of skill that Devonian? proposed a while back). If in the end, the overall standings on this site are biased toward smaller games (as a natural result of the more explicit win/draw/loss nature of smaller games) than the larger games, that is fine by me (mostly because I don't know how to address it), but I definitely feel it won't be a valid answer to reduce or scale down the score adjustment for large games based on approximately how many people on average a player would really interact with.
I'm not sure how to address the relevancy of established players vs newcomers who show up immediately and high in the rankings without having challenged very many of the experienced crowd. Would having an overall 'all time' ranking that it takes a while to get listed on (say 3 months?) and a monthly 'rolling' ranking of top 100 or top 200 or something be a way to do this?