Work in progress decision selection bias

1. Basic Setup and Notation

  1. Random Variable for Proposal Effect
    Let $X$ be the (random) net benefit or “effect” of the proposal if it is ultimately implemented. If the proposal is never implemented, the payoff is 0.

  2. Modes of Implementation
    There are three ways the proposal can end up being implemented (or “selected”):
    • (a) Random selection (with probability $p(a)$).
    • (b) “Accidental” selection by crossing a decision threshold in the market (with probability $p(b)$).
    • (c) “Intentional” selection on merits (with probability $p(c)$).

    By definition,
    \(p(a) + p(b) + p(c) \;=\; 1.\)

  3. Liquidity Events
    If the proposal is edging near the decision threshold, traders may try to sell shares at that threshold price. Let
    • $p_\ell$ be the probability of sufficient liquidity such that traders can exit at or near the threshold price,
    • $\,1 - p_\ell$ be the probability of insufficient liquidity, forcing holders to keep their shares, thus earning (or losing) the actual effect $X$.
  4. Upwards Bias
    Let $\text{upwardsBias} > 1$ (or $\ge 1$) be a factor representing the conditional optimism embedded in prices if the market is scored only when the proposal is chosen.

2. Share Expected Value (EV)

From your bullet points, an investor’s willingness to pay for one share in the sub‐market can be written:

\[\text{ShareEV} = p(a)\,\mathbb{E}[X] \;+\; p(b)\, \Big[ p_\ell \cdot (\text{threshold price}) \;+\; (1 - p_\ell)\,\mathbb{E}[X] \Big] \;+\; p(c)\, \bigl(\mathbb{E}[X] \times \text{upwardsBias}\bigr).\]

One immediate consequence is that if everyone can reliably exit at the threshold price, then $p_\ell \approx 1$, which heavily influences how $p(b)$ and the share’s overall EV behave.


3. Limit Orders Near the Threshold

You noted that if traders place limit orders just below the threshold price, they can ensure any “accidental crossing” gets matched by sell orders:


4. Defining Decision Selection Bias More Directly

Suppose we let:

Then a decision selection bias can be measured as:
\(\Delta \;=\; \mu_c \;-\; \mu.\)
A positive $\Delta$ means the chosen proposals look “too good” compared to the unconditional average, purely because they were selected in states of the world favoring them. This can happen even if $\mu_c$ < $\mu$ in a bigger, unconditional sense, but the market only sees the slice where the internal signal favored choosing it.

As the skew grows, $\Delta$ grows. Proposals can be adversarially constructed to have a high skew by e.g. having a bi-modal distribution of outcomes selected from using a random variable


5. Extended Markets, Volatility, and Implied Volatility

No matter how long you wait for $\sigma_{\text{emp}} \to 0$, the important event—“price crossing threshold”—remains a possibility with some probability. The sub‐token is always partially pricing in that event, so you can’t eliminate the embedded “jump risk” from the market.


6. Randomization vs. Market Reversion

You discuss partial randomization strategies ($p(a) > 0$) to reduce selection bias:

  1. Randomization Only (no reversion)
    • Even though randomizing a fraction of decisions can reduce confounding, it can also reduce overall decision accuracy. If confounding (i.e., $\Delta$) increases when you increase the fraction of “intentionally chosen” decisions, you can end up in a bind. In your terms, you can’t just keep adding random picks to fix confounding without hurting accuracy in other ways.
  2. Randomization With Reversion
    • If every non‐randomized decision has to revert the market or otherwise pay out in a different mode, you can completely remove the standard form of selection bias (since the markets that truly decide are only the random ones).
    • But that means 100% random picks in that subset, so you do end up implementing some suboptimal proposals purely at random, trading off bias removal vs. increased risk of bad proposals.

Formally, you can define:
\(p(a) = \text{prob. of random selection}, \quad p(\text{reversion}) \text{ (some function of the final threshold crossing)},\)
and then track how each piece influences the final expected utility, selection bias, and trader PnL variance.


7. UpwardsBias from Conditional Pricing

You note that if the proposal is only scored/observed in worlds where it’s chosen, the price “bakes in” a conditional optimism:
\(\text{Price} \;\approx\; \mathbb{E}\bigl[X \,\big|\; \text{chosen} \bigr].\)
In a typical unconditional market, the price might be $\mathbb{E}[X]$. But here, ignoring the possibility of not being selected often inflates the conditional price. Denote this by:
\(\text{upwardsBias} \;=\; \frac{\mathbb{E}[X \,\mid\, \text{chosen}]}{\mathbb{E}[X]}.\)
Any new private signal that “this time the proposal really is good” only appears if the proposal is chosen, further pushing the conditional expectation above the unconditional one.


8. Partial Equilibria

You mention that fully uninformative or fully informative are unstable equilibria. One way to put it formally:

As you shift $\alpha$ and $\beta$, each attempts to pull the other in conflicting directions: if prices get more accurate, the slices that lead to a decision can be more heavily self‐selected, thereby raising bias. If you squash bias (by randomization, reversion, or other methods), you reduce the chance that signals can refine the decision, hurting accuracy. This interplay tends to produce partial‐informative equilibria.


9. Final Notes on Formal Partial Randomization Strategies

A possible parametric approach:

You can write expected utility, trader PnL variance, and bias as functions $f(\rho,\omega)$, then look for an optimal interior point $(\rho^\ast,\omega^\ast)$. From your bullets, it seems that in some cases $\rho^\ast = 0$ unless you revert the corresponding markets, but a small $\rho$ can be beneficial to keep trader risk lower, encouraging more participation.


Potential Points Needing Clarification

  1. Definition of “Accidental”
    How exactly is “accidental crossing” triggered? Do you have a formula for how the threshold is set, or does it adjust in real time based on a specific rule?

  2. Exact Role of Limit Orders
    You suggest that with enough limit orders, $p_\ell$ can become effectively 1. Is there any friction or cost that might prevent large blocks of limit orders at the threshold?

  3. Functional Form of UpwardsBias
    If you want a closed‐form expression for $\text{upwardsBias}$, you might define $\text{upwardsBias} = 1 + \gamma$, where $\gamma$ depends on some measure of correlation or private information advantage. Or keep it as an abstract factor.

  4. Definition of “Confounding Increases as a Function of Decision Accuracy”
    You might want a direct equation linking a measure of confounding (e.g., gap between unconditional and conditional expectations) to the probability that the final decision is correct. In some models, that gap widens the more precisely the final chosen proposals are singled out by good signals.

If you found this interesting, have feedback or are working on something related, let’s chat: twitter (@0xdist) or schedule a 20 min call

Distbit

Distbit

Interested in econ, cryptoecon, agents, finance, epistemology, liberty.