TheMeInTeam
If A implies B...
- Joined
- Jan 26, 2008
- Messages
- 27,995
I have to say I enjoy TMIT vociferously insisting that no one could know whether IND/PHI is overpowered, while making 100% guarantees about what sorts of testing Firaxis did and did not do. Not that I disagree on either point, but that's quite the epistemological asymmetry.![]()
Hmmm, well let's put it this way:
Firaxis basically told us outright they didn't playtest features in BTS. "we've not won an AP victory" or some such. While I was not part of the civ IV design team in any phase, I still have some pretty strong suspicions...would you instead accept 99.99%? That's a better 99.99% you'll ever have in civ by the way.
Put simply, Firaxis does not do comprehensive tests of balance of features against each other. It doesn't take Sherlock Holmes, people with Firaxis, or a Blizzard employee to figure that out. There are plenty of things that I do not claim knowledge of whether Firaxis tested or not. However, when it comes to a comprehensive game-to-game statistical balance analysis of traits to see if any combination is overpowering (or some other way that would offer definitive proof), I would bet the entirety of my possessions that they didn't do one in a heartbeat if I would get even a 50% return beyond the bet. The balance of SPAWNS in this game is so off kilter that they can easily outstrip any trait; good players point this out often. For







Yes. I'm calling them out for not bothering with comprehensive balance testing dating from before civ IV was conceived all the way until now in civ V. I'm calling out you too, if you somehow think the trait combo is OP when compared against contemporary alternatives. Rolling Gandhi with stone + marble in mapfinder isn't illegal in most HoF gauntlets. Using Inca at all is banned. What was overpowered again? Even HoF didn't do anything to prove Inca OP, it was just "overused", despite its inferiority on faster game speeds + high difficulties.
You think Firaxis possibly tested this crap? HAAHAHAHAHAHAA. Let's take it one step further: do they even know HOW? Show me a Firaxis release where each of the game options were roughly comparable to others on a typical map, and you're probably showing me civ II. It's been that long, and back then it was achieved by making the civs identical.
There's one more thing that pushes the margin $@#% close to 100%: the state of MP on the games' release. Forget setting up a rigorous statistical data analysis; there's hard evidence to the public that the raw data couldn't possibly exist! That is, of course, unless you believe some conspiracy like Firaxis releasing a version different from what they're working on. Yeah right.
Maybe the statements are a tad less asymmetrical than you indicate. What I've put above is a LOT more than Firaxis has ever told anybody about their balance methodology. In other words, right now my case is stated a lot more strongly.
To that end, I've created a small mod: Field Test: Phi/Ind Leaders with two Phi/Ind leaders. Shah Jahan I has been added to India, and a new civilization, Palmyra, lead by Zenobia has also been added. Palmyra has no UB or UU, so players wanting to test the alleged power of the Phi/Ind trait combo without the potential bias of UBs or UUs may do so. Shah Jahan was added so people who want to play with a UU and UB can. Other than the changes mentioned above, gameplay is unchanged.
How is this "field test" being controlled? If you don't have good methodology, the only thing you're accomplishing is having people try a few new things for fun. Here is some immediate bias in your field test that needs to be addressed:
1. Players have different skill levels. Even players at the same difficulty are not really comparable. I can beat deity, but I can't do it like ABCF or USun or Duckweed and pull out 700 AD rifles (how many people @ firaxis do you think could pull 700 AD rifles on deity?)
2. Map spawn balance. Say you give the IND/PHI a whirl and roll plains cow + 1 FP. Then you play an existing civ...say someone pretty good like Mehmed or a top tier guy like Darius and give them 2 wet corn + 2-3 gems on riverside. Or give plain old Gandhi marble and/or stone. You think IND/PHI is going to compete with that? No chance. So what do you do? When you have this much "noise", you have no choice but to control it via game spam. So..............................who's up for 30+ games of this trait combo and several others...more or less depending on just "how strong" this IND/PHI combo is? Anyone willing to bite? If you play as fast as me, this might only take you a few weeks of nonstop play. Most players don't play as fast as me, or even 1/3 as fast.
3. By what metric are you measuring "overpowered"? Player ability to win the game? It would have to be.
4. You can't remove UU/UB from trait balance and still call it "balance". Maybe Firaxis can do that, but nobody with sense will believe it. A merely above average trait set can become truly devastating, even top-tier by slapping in top unique units. A good trait set can be hampered by miserable UU/UB combos that don't add much. Even starting techs play a small role. Are you going to start comparing traits all on a civ without UU/UB? If you're not, your "field test" is ignoring important considerations that drastically affect balance in this game. Without skirmishers, the Mali are just ok-to-good in MP. With them, they're a top tier civ...you're extremely hard for anyone to rush and are a serious rush/choke threat yourself. Take those away, or prats away from Rome (replacing with some garbage like ballista elephant w/o ivory, SEAL, or Panzer), and their trait sets leave them looking average tier in SP and Rome's case probably unused/rarely used in MP. Sure you want to ignore OTHER trait UU/UB in this analysis? You can't.
So how do we test this comprehensively? Simply put, we don't have the resources to do a responsible study to determine balance of traits. That's the department of game design that pretends to care about balance. For a forum, the payback (proving or disproving a point) to posters is almost never going to be high enough to give enough noise-free data to be conclusive, and so people won't test it as it needs to be tested to prove anything.
This reality is also part of the reason I'm confident enough to bet my entire worth Firaxis didn't do it as part of their balancing of traits + decision to not include certain traits (unless you want to make a case for CHA/CRE and ORG/PRO being OP too lol). What probably happened is that someone or a couple ran wonder spam back in 'nilla testing and it "seemed" too good. Besides, what's the point in bothering with "balance" when you're making a DELIBERATE decision to leave AI THAT DON'T EVEN TRY TO WIN? What a joke. Asymmetrical my foot.