Really love this. I agree with some of the comments here that adding encourageme...

zone411 · 2025-04-07T17:03:53 1744045433

> Could you run some analysis on how often “p1” wins vs “p8”?

I checked average finishing position by assigned seat number early on, but there weren't enough games to show a statistically significant effect. I just reviewed the data again, and now with many more games it looks like there might be something there (P1 doing better than P8). I'll run additional analysis and include it in the write-up if anything emerges. For those who haven't looked at the logs: the conversation order etc. are randomized each round.
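For anyone who wants to try this kind of check themselves, here is a rough sketch of one way to do it, not the actual analysis code. It assumes each game is a dict mapping a seat label like "P1" to a finishing position (1 = best); that layout, the function names, and the toy data generator are all made up for illustration. It computes the mean finish per seat, then runs a paired sign-flip permutation test on the per-game P8 minus P1 difference.

```python
import random
from statistics import mean


def mean_finish_by_seat(games):
    """Average finishing position per seat (1 = best).

    `games` is a hypothetical format: a list of dicts, seat label -> position.
    """
    seats = sorted({seat for game in games for seat in game})
    return {s: mean(g[s] for g in games if s in g) for s in seats}


def paired_sign_flip_test(games, seat_a="P1", seat_b="P8", n_perm=10_000, seed=0):
    """Two-sided sign-flip permutation test on per-game finish differences.

    Returns (observed mean of seat_b - seat_a, approximate p-value).
    A positive observed value means seat_a tends to finish better.
    """
    rng = random.Random(seed)
    diffs = [g[seat_b] - g[seat_a] for g in games if seat_a in g and seat_b in g]
    observed = mean(diffs)
    hits = 0
    for _ in range(n_perm):
        # Under the null (seat does not matter), each difference's sign is arbitrary.
        flipped = mean(d if rng.random() < 0.5 else -d for d in diffs)
        if abs(flipped) >= abs(observed) - 1e-12:
            hits += 1
    # Add-one correction keeps the estimate away from an impossible p = 0.
    return observed, (hits + 1) / (n_perm + 1)


def toy_games(n, seed=1):
    """Synthetic games with uniformly random finishes. Not real results."""
    rng = random.Random(seed)
    seats = [f"P{i}" for i in range(1, 9)]
    games = []
    for _ in range(n):
        order = seats[:]
        rng.shuffle(order)
        games.append({seat: pos for pos, seat in enumerate(order, start=1)})
    return games


if __name__ == "__main__":
    games = toy_games(200)
    print(mean_finish_by_seat(games))
    print(paired_sign_flip_test(games))
```

The paired version compares P1 and P8 within the same game, which seems like a reasonable fit since finishing positions in one game are not independent of each other. With only a few games the p-value will bounce around a lot, which matches the "not enough games yet" situation.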

> My follow up thought is that it would be interesting to let llms choose a name at the beginning

Oh, interesting idea!

vessenes · 2025-04-07T20:19:35 1744057175

Cool. Looking forward to hearing more from you guys. This ties to alignment in a lot of interesting ways, and I think over time will provide a super useful benchmark and build human intuition for LLM strategy and thought processes.

I now have more ideas; I'll throw them in the GitHub though.