Forum - vDiplomacy

Forum

A place to discuss topics/games with other webDiplomacy players.

Page 91 of 164

First

Previous

Next

Last

)
08 Jul 13 UTC

Thread is too old to reply to

Best Diplomacy Website

Hey guys, I was wondering what your most preferred Diplomacy website?
I am playing in playdiplomacyonline website as well but honestly I prefer this one more since it is more tactical and does not punish you for making wrong clicks.. What do you guys think?

8 replies

Mod

(P)

)
25 May 13 UTC
(+2)

Thread is too old to reply to

New feature, very early development-stage....

Interactive map.
You can use you mouse to make give orders to your armies.

43 replies

)
06 Jul 13 UTC

Thread is too old to reply to

Germany 1648

We have a Germany 1648 starting in 16h somebody please join we need one more player

1 reply

)
06 Jul 13 UTC

Thread is too old to reply to

WW4 gunboat starting in 24 hours - players needed

Please consider joining gameID=14993. We've got half the players, just need some more.

2 replies

Mod

(B)

)
28 Jun 13 UTC

Thread is too old to reply to

variant test time

http://lab.vdiplomacy.com/board.php?gameID=100

3 replies

NigelFarage (1238

)
03 Jul 13 UTC

Thread is too old to reply to

Classic-Total Domination

I've created a classic-build anywhere map, with an EOG of 34 SCs (i.e., all of the SCs in the game). To play, you have to agree to certain rules (in comments) beforehand. Password is in comments.

Game link: gameID=15041

6 replies

Lukas Podolski (1234

)
02 Jul 13 UTC

Thread is too old to reply to

Replacement needed

gameID=14661 as Turkey
not a very good position, but is not completely dead

1 reply

Mod

(P)

)
09 Jan 13 UTC
(+3)

Thread is too old to reply to

Input of an alternate scoring system needed...

As the Dpoints are not an ideal way to represent a players game-strenght I'm thinking about implementing an alternate rating system (in addition to the traditional Dpoints)
Any math experts here?

Page 6 of 25

First

Previous

Next

Last

Oli (977 )
14 Jan 13 UTC

Usually the multiplier for the new rating is between 10 for experienced players and 30 for new players. All my calculations use 32 for everybody. Maybe the value-difference wouldn't be that big if I used 10 for all matches too...

Devonian (1887 )
14 Jan 13 UTC

On the wiki, I don't understand programming language, can this be translated to mathematical formulas?

http://www.vdiplomacy.com/wiki/index.php?title=Rating

Oli (977 )
14 Jan 13 UTC

Better now?

Oli (977 )
14 Jan 13 UTC

PS: Most of the math is taken from here:
http://en.wikipedia.org/wiki/Elo_rating_system#Theory

Devonian (1887 )
14 Jan 13 UTC

Yes, that is better, but I think there is a problem in this part:
Rr=SC1/(SC1 + SC2)
I don't know the exact problem yet, but I will look at it again later.

I'll have to look at the other two formula's later also.

Oli (977 )
14 Jan 13 UTC

Rr=SC1/(SC1 + SC2)
Describes the relation of your SCs to the SCs of your opponent.
If both have 10 SCs at the end of the game thats 0.5
If A has 5 and B has 15 it's 0.25...

Devonian (1887 )
14 Jan 13 UTC

Hmm... that might not be it, but there has to be something wrong in the formula, or the programming of it.

At the current ratings, and based on the formula, I have a .0000001% chance of beating Bozo. Ouch! I hope I don't have to play him 100 million times to get a win. Actually Bozo has a mathematical certainty of winning every game he plays against anyone.

G-Man (2516 )
15 Jan 13 UTC

Yeah, didn't you know Bozo cheats! (Just kidding Bozo!!)

Retillion (2304 )
15 Jan 13 UTC

Yes, that's what I was trying to say : that is certainly not an Elo rating because when you calculate an Elo, the result is 1 point for a victory, 1/2 for a draw and 0 for a defeat.

Here, in the formula 1, 1/2 or 0 are replaced by SC1 / (SC1+SC2) which is a mistake. I mean, you can calculate what you want but then, that is not an Elo rating.

Then again, an Elo rating can ONLY be used in 2 player-games.

Retillion (2304 )
15 Jan 13 UTC

Here is a little story that illustrates that an apparently nice and correct calculation can give erroneous results :

3 friends go to a bar and have a drink. They each give the waiter one 10 $ note. That's a total of 30 $, right ? The boss of the bar tells the waiter that the bill is 25 $. The waiter doesn't know how to share to 5 $ between the three customers and gives them back each 1 $ and keeps for himself 2 $ as a tip.
BUT then, each customer has finally paid 9 $, for a total of 27 $ and the waiter has kept 2 $. That's a total of 29 $ ! Indeed 3 x 9 + 2 = 29 ! WOW ! What happened to the 30th dollar ???

fasces349 (1007 )
15 Jan 13 UTC

Not quite retillion:
Say its 3 players, one with an ELO of 100, one with 200 and one with 300.

If the map is 18 scs, then it is expected that:
Player A will get 3 scs
Player B will get 6 scs
Player C will get 9 scs

If Player A gets above 3, his rating will increase, if he gets below 3, his rating will decrease. For Player B its 6 and Player C its 9.

Elo works for more then just 1v1 game.

Also the $2 tip was because the cost of the drinks was $25 not $27 and so they paid $27 giving him the $2 tip. While humans can easily make that mistake, a computer will not. That said there is such think as typos in a code, a computer follows the rule of GIGO

Devonian (1887 )
15 Jan 13 UTC

Oli,

You are using similar, but not identical, terms as the wiki, and I think it caused you to miss some. I went back to the original wiki and re-wrote the formula's using the original terms, and added a missing formula. Try these:

Initial true strength:
RA=Player A true strength
RB= Player B true strength

Intermediate term used to simplify the math:
QA=10^(RA / 400)
QB=10^(RB / 400)

Expected Results: (you have only one of these)
EA=QA/(QA + QA)
EB=QB/(QB + QB)

Actual Results: (Not from the original wiki)
SA=SC1/(SC1+SC2)
SB=SC2/(SC1+SC2)

Revised true strength:
R'A=RA+32*(SA-EA)
R'B=RB+32*(SB-EB)

Devonian (1887 )
15 Jan 13 UTC

For points won, I think this would be the only change:

Actual Results: (Not from the original wiki)
SA=PW1/(PW1 + PW2)
SB=PW2/(PW1 + PW2)

Devonian (1887 )
15 Jan 13 UTC

For Win/Draw/Survive/Defeat, I think this would do it:

Check all combinations:
IF S1='Won' SA = 1; SB=0 (4 combinations possible)
IF S1='Drawn' AND S2='Drawn' SA = 0.5; SB=0
IF S1='Drawn' AND S2='Defeated' SA = 1; SB=0
IF S1='Survived' AND S2='Won' SA = 0; SB=1
IF S1='Survived' AND S2='Drawn' SA = 0; SB=1 (is this possible?)
IF S1='Survived' AND S2='Survived' SA = 0.5; SB=.5
IF S1='Survived' AND S2='Defeated' SA = 1; SB=0
IF S1='Defeated' AND S2 ='Defeated' SA = 0.5; SB=.5
IF S1='Defeated' AND S2<>'Survived' SA = 0; SB=1 (you had a typo here)
IF S1='Defeated' AND S2<>'Drawn' SA = 0; SB=1 (New combination)
IF S1='Defeated' AND S2<>'won' SA = 0; SB=1 (New combination)

Devonian (1887 )
15 Jan 13 UTC

Oh, and one more ...
IF player1= "Devonian" SA=1.

fasces349 (1007 )
15 Jan 13 UTC

Devonian, I noticed a typo in your last post, for future reference its spelled Fasces349 not Devonian

Oli (977 )
15 Jan 13 UTC

@Retillion:The WIn/Draw/Lost-score is still strange but uses 1=win, 0.5=draw 0=loss formula.
There must be an error somewhere else...

Oli (977 )
15 Jan 13 UTC

To check the formula I'll rate only this game: gameID=12010 later today with every player has a base-rating of 1500 and check the results.

Devonian (1887 )
15 Jan 13 UTC

That would be good. How about also doing it with a game that ended in a draw.

Decima Legio (1987 )
15 Jan 13 UTC

Premise: I have read the related material
http://en.wikipedia.org/wiki/Elo_rating_system
https://sites.google.com/site/phpdiplomacytournaments/theghost-ratingslist
http://www.vdiplomacy.com/wiki/index.php?title=Rating

I do appreciate the work done/in progress, but I think we’re unnecessarily making it difficult.

The weak point here and source of countless discussion is the way the “Edif” is defined.
Compared to the game typology, it’s not clear what “expected result” and “real result” are supposed to represent. I understand the math, but not the logic behind those assumptions. Yet it’s not clear what those “result” concepts are in words definition. Translating those uncertain word definitions into algebraic definition looks to me as an hazardous mental operation. It’s not chess, where the word “result” is clear since we have either a victory, a loss or a fairly unlikely draw. In chess translating “result” in a number between 0 and 1 makes sense. In diplomacy there are thousands of possible game results.

I am still favorable to Devonian’s initial suggestion, which in retrospect is even simpler than the Ghostrating definition.
Because of this, I’m spending one more coin to sustain that proposal (Devonian, did you change your mind? Don’t waste this support!):
During each game, instead of an Elo-differential, we have a skill-pot to divide running in parallel to a Dpoints-pot.
The pot is composed by the sum of users’ bet, which is not arbitrary, being it a fixed % of the current skill rating (say 5%). This happens regardless the D-point arbitrary bet. So everyone pays in advance in proportion to the own measured skill…
…and we don’t need to elaborate a questionable “Edif” in order to split the pot. The pot is divided accordingly to the game style, PPSC or WTA, using the current pot distribution system.
What’s the cause why this thread was started? Dpoints are not an accurate measure of skill.
Why? The main reasons are because the game-bet is arbitrary, because currently beating a champion is worth the same as beating a noob and because the points are re-inflated when users drop below 100.
I think this system overcomes many problems of numerical interpretation, first of all the word “result”, either “real” or “expected”.
This system will reflect the skill distribution criteria that have actually been used by the players during past games.
This skill-distribution system is robust, simple and already exists... it’s the current one.
Nobody runs out of points, since the skill-bet each time is 5% of your skill-points. There’s no need for skill-points inflation.
If we use an Elo-based formulation we have to tune the coefficient 400, the coefficient 32, the formula of the “real result”.
Using this proposal instead the only coefficient we have to tune is the fixed skill %, assumed 5% each game: make it smaller if you want to slow down skill-rating change after a match; make it bigger if you want to speed up skill-rating change.

Devonian (1887 )
15 Jan 13 UTC

Decima,

"I am still favorable to Devonian’s initial suggestion, which in retrospect is even simpler than the Ghostrating definition.
Because of this, I’m spending one more coin to sustain that proposal (Devonian, did you change your mind? Don’t waste this support!): "

No, I didn't change my mind! I also think my original suggestion was much easier. And I thought that was going to be how it was implimented. I was sad to see that the first iteration was not how I suggested.

However, when it appeared that my suggestion was not the direction it was going, I figured I would at least make sure the math was right. I think the math behind it is complicated, and if done wrong, can give wrong results. It is also hard to conceptualize since it uses high level exponential calculations using fractions and simplified normal distributions.

If Oli is willing to go back to my original suggestion I would be very pleased! It is much simpler, it is easy for everyone to understand, and it makes sense. I also think it should be rather easy to program since the math is very basic.

One big advantage is that everyone will know exactly what they will risk by beginning a game. (Per your example, 5% of their skill-points). And, they can rather easily calculate their potential gain under various winning conditions.

Devonian (1887 )
15 Jan 13 UTC

For reference, my proposal was on 1-9-2013, and is on the first page of this thread.

Captainmeme (1400 )
15 Jan 13 UTC

I am a little confused about:
Algorithm:
Q1=10^(E1 / 400)
Q2=10^(E2 / 400)

Does that not mean that if I were E1, and someone with, say, 1600 Elo was E2, Q1 would be 10^10 whereas Q2 would be 10^4? Therefore, Q1/(Q1+Q2) would essentially just be Q1/Q1, as Q2 would be neglible in comparison...

Sorry if I've missed something here - I'm not in maths mode at the moment - but it seems a little odd.

Decima Legio (1987 )
15 Jan 13 UTC

Right.
Using 400 as coefficient means: "It then follows that for each 400 rating points of advantage over the opponent, the chance of winning is magnified ten times in comparison to the opponent's chance of winning."

Devonian (1887 )
15 Jan 13 UTC

Captain,
Yes. That is why the formula is wrong.

Oli (977 )
15 Jan 13 UTC

I did some test with a different k-Faktor.
As in the wiki stated I used a factor of 32, but it looks like most organisations actially use a k-factor of 10. Changing this to 10 gives the result you can see now...
(And I will give Devonians algorithm a try if I have some more time too)

Oli (977 )
15 Jan 13 UTC

BTW: Even now with the altered k-factor I think we can agree that a rating based on SC count at the end of the game is not that meaningful.
I like the Won=1, Draw=0.5, Lost=0 most (And this is what's used in the last row), as it's easy to understand too.

Spreading the rating based on pointsWon is roughly the same, but it's not that elegant.

Devonian (1887 )
15 Jan 13 UTC

Oli,

Did you change the formula's also? The formula's are the real problem.

With the new K-factor, I now have a 0.0002% chance at beating Bozo, which means I will need to play him 500,000 times to get a victory! At least Bozo does not have a mathmatical certainty of winning every game against every player, but his expected results are still too optimistic for him. (The reason the odds have changed is because the formula is adding less error in each calculation, but unless the formula is changed, there will still be error.)

Devonian (1887 )
15 Jan 13 UTC

Correction. Based on the expected results, he still does have a mathematical certainty of beating every player in every game he plays.

drano019 (2710 )
15 Jan 13 UTC

Question:

I fully admit I haven't followed much of this conversation once it got into algorithms and the like, but from what I gather, there's still issues with how the system is calculating things. Is this so?

Because if not, it seems odd that a player with 21 defeats, 5 wins, 18 draws, and 13 survives would be ahead of a player with 3 defeats, 4 wins, 22 draws, and 0 survives. 18 less defeats, 1 less draw, 4 more draws, and 13 less survives (which if I understand right are considered "losses") is a lower rank?

Page 6 of 25

First

Previous

Next

Last

734 replies

)
26 Jun 13 UTC

Thread is too old to reply to

EUROPE 1939-GAME (bet set to 49)

gameID=14955

A nice map taking place in a very interesting time. Come and join, I hope we all are good communicators!

4 replies

SandgooseXXI (1294

)
28 Jun 13 UTC

Thread is too old to reply to

Country switch

Just a question on this. Say I take over a game where a player is literally a year from burning to nothing and gets the defeat, is that put on my record?!

8 replies

)
27 Jun 13 UTC

Thread is too old to reply to

Sitter needed

From tomorrow until 11th July

4 replies

KICEMEN17 (1075

)
27 Jun 13 UTC

Thread is too old to reply to

Sitter Needed June 30th-July 6th

Hello all. I'm gone on a trip from June 30th- July 6th. If anyone could watch over my account I would be very grateful. I'm in 6 games, pretty solid position in each. I hate to ask for an extend in all these games, as I see it unfair to the players. The reason I'm in many games (I've known about this trip) is because I thought where I was going had internet. This is not correct!

16 replies

)
27 Jun 13 UTC

Thread is too old to reply to

Seeking sitter(s) for Known World 901 anon gunboats

Friday through Monday morning. Great positions! PM if interested.

http://www.vdiplomacy.com/board.php?gameID=14585
http://www.vdiplomacy.com/board.php?gameID=14313

1 reply

The Ambassador (2276

(B)

)
26 Jun 13 UTC

Thread is too old to reply to

Aussie politics

Been quiet of late...

(More below)

22 replies

)
26 Jun 13 UTC

Thread is too old to reply to

World Wide Gunboat looking for 17 players. Quick turns. no meta....

http://vdiplomacy.net/board.php?gameID=14985
http://vdiplomacy.net/board.php?gameID=14984
Thanks!

0 replies

)
17 Jun 13 UTC

Thread is too old to reply to

Competitive Gunboat

Looking for some experienced players who want to play WTA gunboat game. Post if interested, will select variants based on responses.

15 replies

Mod

)
24 Jun 13 UTC

Thread is too old to reply to

Leif not a cheater as far as I know

In a now closed and locked thread Leif replied to something I said.

11 replies

Mod

(B)

)
26 Jun 13 UTC

Thread is too old to reply to

yay!

Go Rudd!

1 reply

)
19 Jun 13 UTC

Thread is too old to reply to

Chaos anyone?

1 reply

)
25 Jun 13 UTC

Thread is too old to reply to

High Stakes Star

Why are all the games I am playing in marked with a High Stakes Star .. even though they are all relatively low stakes including one of 3 DPoints?

4 replies

SandgooseXXI (1294

)
24 Jun 13 UTC

Thread is too old to reply to

How many games you playin?

The number of games Sandgoose is in...is too damn high!

23 replies

)
25 Jun 13 UTC

Thread is too old to reply to

WTA Gunboat gameID=14966

-buck the tiger's odds-
Fall of the American Empire, 35

buy-in, 16 hour phases
experienced and reliable players- join up!

0 replies

KICEMEN17 (1075

)
25 Jun 13 UTC

Thread is too old to reply to

Featured Games

Can someone explain to me why every single game I'm in is starred as a featured game? Some are like, 5 point buy ins.... Is anyone else seeing this?

3 replies

)
21 Jun 13 UTC

Thread is too old to reply to

MODs protecting cheaters! And punishing the victim´s (ME) - revealed

gameID=13373.

76 replies

)
23 Jun 13 UTC

Thread is too old to reply to

Buttergoose Tournament - Urgent Announcement

A player has been banned so Iran is in CD in the Round 1 game (gameID=14592) of the tournament. in order for the tournament to progress fairly, we strongly desire a replacement for Iran. Rules to the tournament are here: thread=41653

3 replies

President Eden (1588

)
22 Jun 13 UTC

Thread is too old to reply to

New feature proposal: No-contest voting option

In Gumers's thread I proposed a no-contest vote option, which would essentially act as a cancel which keeps games on the record for later review. Oli and/or other devs, how feasible would it be to get such a voting option?

15 replies

fasces349 (1007

)
22 Jun 13 UTC

Thread is too old to reply to

Sitter wanted

On Monday I will be leaving on vacation and may not have internet access. I'm not doing particularly well in any of my games but if anyone wants to take over my spots for whatever reason, PM.
gameID=11622
gameID=14493
gameID=14018

0 replies

)
21 Jun 13 UTC

Thread is too old to reply to

MODs protecting cheaters!

I cant believe this is actualy happening and I´ll wait for their answers and final decision before exposing the facts here!

9 replies

)
21 Jun 13 UTC

Thread is too old to reply to

EUROPE 1939-GAME WITH HIGH BET

5 out of 8 have joined now. We need 3 more to join. The bet is set to 100. COME ON NOW, join what surely will be a quality-game!

gameID=14834

0 replies

)
19 Jun 13 UTC

Thread is too old to reply to

1st Crusade

I need a few more testers for the second test of the first crusade map. http://lab.vdiplomacy.com/board.php?gameID=96
Thanks! :)

8 replies

Page 91 of 164

First

Previous

Next

Last