Update README

add IRV smith-set resolution method
2026-06-05 16:22:15 -07:00 · 2026-06-05 14:49:08 -07:00 · 2026-06-05 14:48:08 -07:00 · 2026-06-05 14:39:41 -07:00 · 2026-06-04 15:23:03 -07:00 · 2026-06-04 15:07:22 -07:00
7 changed files with 195 additions and 26 deletions
--- a/README.md
+++ b/README.md
@ -3,7 +3,7 @@

 The [Smith set](https://en.wikipedia.org/wiki/Smith_set) is the minimal set of election candidates which can beat all others pairwise
 (by simple majority ranking preference) - if there is a single winner in the set they are 
-guaranteed the standard [Condorcet i.e. Majority winner](https://en.wikipedia.org/wiki/Condorcet_winner) (they beat all others pairwise). 
+guaranteed the standard [Condorcet i.e. Majority winner](https://en.wikipedia.org/wiki/Condorcet_winner) (they beat all others pairwise).

 `smithy` identifies the Smith set via graph Strongly Connected Component (SCC) analysis of
 the pairwise majority graph using [`rustworkx`](https://www.rustworkx.org/). 
@ -11,9 +11,12 @@ Pairwise majority comparisons scale quadratically in the number of candidates an
 in the number of ballots, while the SCC and condensation graph analysis is 
 approximately quadratic in the number of candidates for the dense tournament graphs typical 
 of Condorcet elections. Internally, repeated ballots are compressed/cache-counted before 
-pairwise evaluation to improve performance over duplicate rankings.
+pairwise evaluation to improve performance over duplicate rankings.
 This is all overkill for small elections, but is fun.

+Optionally, `smithy` can try to further resolve a nontrivial Smith set (a majoritarian tie or cycle) by running all-paths IRV within the set - at least reducing to an IRV winner set (the set of candidates that win at least one IRV elimination path) within the Smith set that are not only pairwise competitive but can also build competitive plurality
+coalitions within the set; in practice this often resolves cycles and is likely to result in a unique delegate (if they win all IRV elimination paths) which can claim a plurality coalition amongst the majority winners. This should identify a winning candidate as the best possible focal point for voluntary coordination; if your elections have other priorities you may want to resolve nontrivial Smith sets differently.
+

 ## Usage
 `smithy` can be used a few different ways: as a CLI command, via a web form upload, 
@ -33,14 +36,34 @@ executable should give you access to the `smithycmd` CLI command.
 this in your shell as `uv run src/smithycmd.py [...]` from within the repo after cloning it locally.

 In either case, the command expects the same argument structure:
-TODO
+```bash
+$ uv run src/smithycmd.py --help
+Usage: smithycmd.py [OPTIONS] BALLOTS
+
+  Compute the Smith set from a box of ranked-choice ballots -- .csv or
+  .xls(x).
+
+  The Smith set is the minimal set of candidates which can beat all others
+  pairwise (simple ranking majority) - if there is a single winner in the set,
+  they are guaranteed the Condorcet i.e. Majority winner.
+
+Options:
+  -b, --show-ballots  Show relevant ballots (after selections).
+  -p, --pretty        Pretty-print output.
+  --help              Show this message and exit.
+```
+where the ballots file should be a `.csv` or `.xls`(`x`) whose columns are candidates and whose
+rows are numerical RCV ballot rankings (lower number is best, 1st, 2nd, 3rd, etc.).
+(see the [web deployment](https://pages.uoregon.edu/tgorordo/files/smithy/src/cgi/form.html)
+or `test/` for some examples).

 If you want a 'None' option in your election, it should be included as a candidate.

 While plain output is the default, so that the command can easily be used in a unix-pipe
 or `stdio` workflows, it can also pretty-print its output for your reading pleasure
 using [`rich`](https://rich.readthedocs.io/en/stable/introduction.html) if you pass it the 
-`--pretty` (or `-p`) flag.
+`--pretty` (or `-p`) flag. Whether to show the input ballots during output is also
+controlled by a corresponding flag.


 ### UOregon CGI Application
@ -49,9 +72,9 @@ hosts a [CGI form](https://service.uoregon.edu/TDClient/2030/Portal/KB/ArticleDe
 which accepts a `.csv` or `.xls`(`x`) upload of a ballot-box and responds with the resulting
 Smith set. The application can typically handle ~100s of candidates and ~10ks of votes 
 before running into hosting resource limitations and timing out.
-
+
 No ballot data is retained on the server after execution.
-
+

 ### Python Package
 The `smithy` python package primarily provides the function `smith_set(ballot_box_df)` 
@ -85,6 +108,8 @@ pip install .
 [this brief guide might be helpful](https://pages.uoregon.edu/tgorordo/courses/uoph410-510a_Image-Analysis/setup.html)).


+(The core algorithm is also pretty dead simple, and you could just copy it over into your project too).
+
 ## Development
 Current development tooling is based on the [`uv`](https://docs.astral.sh/uv/) Python package and
 project manager. A [`justfile`](https://github.com/casey/just) lists some common useful
--- a/src/cgi/form.html
+++ b/src/cgi/form.html
@ -87,7 +87,7 @@ e.g.:</p>
  </tr>
 </table>
 where this reads that the first voter had preferences "Alice" over "Bob" over "Charlie", the second voter "Bob" over "Alice" over "Charlie", etc.
-(This example is constructed to demonstrate a situation without a singular majority winner - its Smith set a tie between "Alice" and "Bob")</p>
+(This example is constructed to demonstrate a situation without a singular majority winner - its Smith set a tie between "Alice" and "Bob"). Rankings may contain ties by e.g. repeating a numerical rank.</p>
 
 <h3>Output: Smith Set Format</h3>
 <p>The form will return a list of the Smith-set winners sorted lexicographically (they are all equally good majority winners).
--- a/src/smithy/init.py
+++ b/src/smithy/init.py
@ -3,6 +3,7 @@ import rustworkx as rwx
 from itertools import combinations

 from .rcv import pmg_from_rcv
+from .irv import irv_from_rcv


 def ss_from_pmg(pmg: rwx.PyDiGraph) -> list[str]:
@ -67,3 +68,12 @@ def smith_set(df: pl.DataFrame, ballotkind="rcv") -> list:
        raise NotImplementedError(
            f"`smith_set` ballotkind={ballotkind} is not implemented."
        )
+
+
+def irv_set(df: pl.DataFrame, ballotkind="rcv") -> list:
+    if ballotkind == "rcv":
+        return irv_from_rcv(df)
+    else:
+        raise NotImplementedError(
+            f"`irv_set` ballotkind={ballotkind} is not implemented."
+        )
--- a/src/smithy/irv.py
+++ b/src/smithy/irv.py
@ -0,0 +1,102 @@
+import polars as pl
+import numpy as np
+
+
+def irv_from_rcv(ballots: pl.DataFrame, method: str = "bigslow") -> list[str]:
+    """
+    Compute the set of all-paths IRV winners from an RCV ballot.
+
+    parameters
+    ---
+    ballots: pl.DataFrame
+    An RCV table of ballots.
+
+    method: str
+    Either "bigslow" or "smallfast" for selecting an internal method for counting
+    first-choices during IRV rounds. Defaults to "bigslow" but you can use "smallfast"
+    so long as the table of ballots is expected to fit in a reasonable numpy array (after compression).
+
+    returns
+    ---
+    winners: list[srt]
+    A lexicographically sorted list of IRV winners. If a candidate wins every elimination path
+    then this set will contain only one entry, otherwise it will contain all candidates that win
+    at least one IRV elimination path.
+    """
+    compressed = ballots.group_by(ballots.columns).len().rename({"len": "count"})
+    return sorted(_irv_winners(compressed, method=method))
+
+
+def _fst_counts_bigslow(compressed: pl.DataFrame) -> pl.DataFrame:
+
+    surviving = [c for c in compressed.columns if c != "count"]
+
+    fstcexpr = (
+        pl.concat_list([pl.col(c) for c in surviving])
+        .list.arg_min().map_elements(lambda i: surviving[i], return_dtype=pl.String).alias("first_choice")
+    )
+
+    tally = (
+        compressed.with_columns(fstcexpr)
+        .group_by("first_choice")
+        .agg(pl.col("count").sum())
+        .filter(pl.col("first_choice").is_not_null())
+    )
+    return tally
+
+
+def _fst_counts_smallfast(compressed: pl.DataFrame) -> pl.DataFrame:
+
+    surviving = [c for c in compressed.columns if c != "count"]
+
+    a = compressed.select(surviving).to_numpy()
+
+    cs = compressed["count"].to_numpy()
+
+    fstc_idxs = np.argmin(a, axis=1)
+
+    tally = {c: 0 for c in surviving}
+    for i, c in zip(fstc_idxs, cs):
+        tally[surviving[i]] += int(c)
+
+    return pl.DataFrame(
+        {"first_choice": surviving, "count": [tally[c] for c in surviving]}
+    )
+
+
+def _irv_round(compressed: pl.DataFrame, method="bigslow"):
+
+    if method == "bigslow":
+        count_fn = _fst_counts_bigslow
+    elif method == "smallfast":
+        count_fn = _fst_counts_smallfast
+    else:
+        raise NotImplementedError(
+            f"Error: _fst_counts method={method} not implemented."
+        )
+
+    tally = count_fn(compressed)
+
+    eliminate = tally.filter(pl.col("count") == pl.col("count").min())[
+        "first_choice"
+    ].to_list()
+
+    for e in eliminate:
+        surviving = [c for c in compressed.columns if c not in ("count", e)]
+        yield (
+            compressed.select(surviving + ["count"])
+            .group_by(surviving)
+            .agg(pl.col("count").sum())
+        )
+
+
+def _irv_winners(compressed, method="bigslow"):
+
+    surviving = [c for c in compressed.columns if c != "count"]
+    if len(surviving) == 1:
+        return set(surviving)
+
+    winners = set()
+    for branch in _irv_round(compressed, method=method):
+        winners |= _irv_winners(branch, method=method)
+    return winners
--- a/src/smithy/rcv.py
+++ b/src/smithy/rcv.py
@ -3,7 +3,7 @@ import rustworkx as rwx
 from itertools import combinations


-def pmg_from_rcv_polars(ballots: pl.DataFrame) -> rwx.PyDiGraph:
+def pmg_from_rcv_bigslow(ballots: pl.DataFrame) -> rwx.PyDiGraph:
    """
    Build a pairwise majority winner graph from a box of Ranked-Choice Ballots.

@ -34,10 +34,20 @@ def pmg_from_rcv_polars(ballots: pl.DataFrame) -> rwx.PyDiGraph:
    pairs = list(combinations(candidates, 2))

    for a, b in pairs:
-        exprs.extend([
-            pl.when(pl.col(a) < pl.col(b)).then(pl.col("count")).otherwise(0).sum().alias(f"{a}>{b}"),
-            pl.when(pl.col(b) < pl.col(a)).then(pl.col("count")).otherwise(0).sum().alias(f"{b}>{a}")
-        ])
+        exprs.extend(
+            [
+                pl.when(pl.col(a) < pl.col(b))
+                .then(pl.col("count"))
+                .otherwise(0)
+                .sum()
+                .alias(f"{a}>{b}"),
+                pl.when(pl.col(b) < pl.col(a))
+                .then(pl.col("count"))
+                .otherwise(0)
+                .sum()
+                .alias(f"{b}>{a}"),
+            ]
+        )

    results = compressed.select(exprs).row(0, named=True)

@ -52,7 +62,8 @@ def pmg_from_rcv_polars(ballots: pl.DataFrame) -> rwx.PyDiGraph:

    return pmg

-def pmg_from_rcv_numpy(ballots: pl.DataFrame) -> rwx.PyDiGraph:
+
+def pmg_from_rcv_smallfast(ballots: pl.DataFrame) -> rwx.PyDiGraph:
    """
    Build a pairwise majority winner graph from a box of Ranked-Choice Ballots.

@ -79,17 +90,17 @@ def pmg_from_rcv_numpy(ballots: pl.DataFrame) -> rwx.PyDiGraph:

    compressed = ballots.group_by(ballots.columns).len().rename({"len": "count"})
    counts = compressed["count"].to_numpy()
-    
+
    arr = compressed.drop("count").to_numpy()
    results = ((arr[:, :, None] < arr[:, None, :]) * counts[:, None, None]).sum(axis=0)

    for i, a in enumerate(candidates):
        for j in range(i + 1, len(candidates)):
            b = candidates[j]
-            
+
            a_wins = results[i, j]
            b_wins = results[j, i]
-            
+
            if a_wins > b_wins:
                pmg.add_edge(nodes[a], nodes[b], int(a_wins - b_wins))
            elif b_wins > a_wins:
@ -97,10 +108,11 @@ def pmg_from_rcv_numpy(ballots: pl.DataFrame) -> rwx.PyDiGraph:

    return pmg

-def pmg_from_rcv(ballots: pl.DataFrame, method="numpy") -> rwx.PyDiGraph:
-    if method == "polars":
-        return pmg_from_rcv_polars(ballots)
-    elif method == "numpy":
-        return pmg_from_rcv_numpy(ballots)
+
+def pmg_from_rcv(ballots: pl.DataFrame, method="bigslow") -> rwx.PyDiGraph:
+    if method == "bigslow":
+        return pmg_from_rcv_bigslow(ballots)
+    elif method == "smallfast":
+        return pmg_from_rcv_smallfast(ballots)
    else:
-        raise NotImplementedError(f"`pmg_from_rcv` method={method} not implemented.")
+        raise NotImplementedError(f"`pmg_from_rcv` method={method} not implemented.")
--- a/src/smithycmd.py
+++ b/src/smithycmd.py
@ -15,11 +15,16 @@ from rich.table import Table
 from rich.panel import Panel

 import polars as pl
-from smithy import smith_set
+from smithy import smith_set, irv_set


@click.command()
@click.argument("ballots", type=click.Path(exists=True, dir_okay=False))
+@click.option(
+    "--try-resolve-irv",
+    is_flag=True,
+    help="Try to reduce or resolve the Smith set by running all-paths IRV on the set.",
+)
@click.option(
    "--show-ballots",
    "-b",
@ -27,7 +32,7 @@ from smithy import smith_set
    help="Show relevant ballots (after selections).",
 )
@click.option("--pretty", "-p", is_flag=True, help="Pretty-print output.")
-def cli(ballots: str, show_ballots=False, pretty=False) -> None:
+def cli(ballots: str, try_resolve_irv=False, show_ballots=False, pretty=False) -> None:
    """
    Compute the Smith set from a box of ranked-choice ballots -- .csv or .xls(x).

@ -56,7 +61,7 @@ def cli(ballots: str, show_ballots=False, pretty=False) -> None:
        df = df.with_columns(
            [
                pl.col(c)
-                .cast(pl.Utf8)
+                .cast(pl.String)
                .str.strip_chars()
                .cast(pl.Int64, strict=False)
                .fill_null(0)
@ -66,6 +71,9 @@ def cli(ballots: str, show_ballots=False, pretty=False) -> None:

        # Compute Smith set
        smiths = smith_set(df)
+        if len(smiths) > 1 and try_resolve_irv:
+            irv_ballots = df.select(smiths)
+            smiths = irv_set(irv_ballots)

        if show_ballots and pretty:
            preview = Table(title="Ballot Box")
@ -88,7 +96,9 @@ def cli(ballots: str, show_ballots=False, pretty=False) -> None:
            console.print(
                Panel.fit(
                    "\n".join(f"• {c}" for c in smiths),
-                    title="Resulting Smith Set",
+                    title="Resulting IRV-resolved Smith Set"
+                    if (try_resolve_irv)
+                    else "Resulting Smith Set",
                    border_style="green",
                )
            )
--- a/src/smithygui.py
+++ b/src/smithygui.py
@ -1,3 +1,13 @@
+# /// script
+# requires-python = ">=3.13"
+# dependencies = [
+#     "click>=8.4.1",
+#     "polars>=1.41.0",
+#     "pyside6>=6.11.1",
+#     "rich>=15.0.0",
+#     "rustworkx>=0.17.1",
+# ]
+# ///
 from PySide6.QtWidgets import QApplication, QWidget

 app = QApplication(sys.argv)
Author	SHA1	Message	Date
Thomas (Tom) C. Gorordo	6fcdae8792	Update README	2026-06-05 14:49:08 -07:00
Thomas (Tom) C. Gorordo	c324dc9c9f	Update README	2026-06-05 14:48:08 -07:00
Thomas (Tom) C. Gorordo	e0d0134fdd	add IRV smith-set resolution method	2026-06-05 14:39:41 -07:00
Thomas (Tom) C. Gorordo	fc0569e252	Update README.md	2026-06-04 15:23:03 -07:00
Thomas (Tom) C. Gorordo	51d651bee6	Update README.md	2026-06-04 15:07:22 -07:00
Thomas (Tom) C. Gorordo	bc146d523d	Update README.md	2026-06-04 13:23:05 -07:00
Thomas (Tom) C. Gorordo	17c121f0af	Update README.md	2026-06-04 13:04:49 -07:00
Thomas (Tom) C. Gorordo	e9c263a8c5	Update README.md	2026-06-04 12:48:24 -07:00
Thomas (Tom) C. Gorordo	fa24eb5758	Update README.md	2026-06-04 12:35:21 -07:00
Thomas (Tom) C. Gorordo	55f64d1a32	Update README.md	2026-06-04 12:34:54 -07:00
Thomas (Tom) C. Gorordo	3dc8a3cb46	Update README.md	2026-06-04 12:31:35 -07:00
Thomas (Tom) C. Gorordo	845ac38533	Update README.md	2026-06-04 12:30:02 -07:00
Thomas (Tom) C. Gorordo	b24920c15a	Update README.md	2026-06-04 12:19:39 -07:00
Thomas (Tom) C. Gorordo	24243aad2f	Update README.md	2026-05-27 23:04:51 -07:00
Thomas (Tom) C. Gorordo	f762fd5a23	Update form.html	2026-05-25 23:09:23 -07:00
Thomas (Tom) C. Gorordo	57caad7f27	Update README	2026-05-25 22:31:39 -07:00
Thomas (Tom) C. Gorordo	1a6de605ad	Update README	2026-05-25 22:28:32 -07:00
Thomas (Tom) C. Gorordo	d1db77c466	Revise README Updated README to improve clarity and add usage details.	2026-05-25 22:25:52 -07:00
Thomas (Tom) C. Gorordo	18405644d3	sync	2026-05-25 22:23:57 -07:00