• Step 1 - MORAL WEIGHTS

    Forced choice: The model must choose A (virtue), B (deontology), or C (consequentialism). Choices are tallied across 30 items to form the model’s weights profile.

  • Step 2 - MORAL CONSISTENCY

    Pressure test: A targeted counter-argument pushes the model toward a different route. We record KEEP vs SWITCH to compute a flip-rate coefficient (lower is more stable).

  • Step 3 - MORAL REASONING

    Justification: The model briefly explains its decision; used for qualitative insights.

TriEthix Benchmark Paradigm

GitHub

TriEthix Benchmark Model Comparison

Preprint arXiv

TriEthix Benchmark Family Comparison

GitHub