22 Regression discontinuity designs

Over the last twenty years or so, the regression discontinuity design (RDD) has seen a dramatic rise in popularity among researchers. Cunningham (2021) documents a rapid increase in the number of papers using RDD after 1999 and attributes its popularity to its ability to deliver causal inferences with “identifying assumptions that are viewed by many as easier to accept and evaluate” than those of other methods. Lee and Lemieux (2010, p. 282) point out that RDD requires “seemingly mild assumptions compared to those needed for other non-experimental approaches … and that causal inferences from RDD are potentially more credible than those from typical ‘natural experiment’ strategies.”

Another attractive aspect of RDD is the availability of quality pedagogical materials explaining its use. The paper by Lee and Lemieux (2010) is a good reference, as are chapters in Cunningham (2021) and Angrist and Pischke (2008). Given the availability of these materials, we have written this chapter as a complementary resource, focusing more on practical issues and applications in accounting research. The goal of this chapter is to provide a gateway for you to either using RDD in your own research (should you find a setting with a discontinuity in a treatment of interest) or in reviewing papers using (or claiming to use) RDD.

A number of phenomena of interest to accounting researchers involve discontinuities. For example, whether an executive compensation plan is approved is a discontinuous function of shareholder support (e.g., Armstrong et al., 2013) and whether a debt covenant is violated is typically a discontinuous function of the realization of accounting ratios. So it is not surprising that RDD has attracted some interest of accounting researchers, albeit in a relatively small number of papers. Unfortunately, it is also not surprising that RDD has not always yielded credible causal inferences in accounting research, as we discuss below.

Tip

The code in this chapter uses the packages listed below. For instructions on how to set up your computer to use this code, go to the support page for this book. The support page also includes Quarto templates for the code and exercises below.

import polars as pl
import pyfixest as pf

from rdrobust import rdrobust

from era_pl import load_data, load_parquet, rdplot

22.1 Sharp RDD

Sharp RDD exploits the use of sharp, somewhat arbitrary, cut-offs by various decision-makers or regulations. For example, a university might admit students reaching a certain GPA or a regulator might require compliance for all firms with sales of more than $25 million. In some cases, the decision-maker in question is effectively using the cut-off to determine the assignment of observations to a treatment that may be of interest to researchers. Denoting a treatment indicator for observation $i$ as $D_i$, we might have all cases where $x_i$ is at or above some threshold $x_0$ receiving treatment, while all other cases do not.

\[ D_i = \begin{cases} 1 & x_i \geq x_0 \\ 0 & x_i < x_0 \end{cases} \] For example, the first application of RDD (Thistlethwaite and Campbell, 1960) examined the effect of National Merit Scholarships, which were awarded to students who scored 11 or higher on a qualifying exam.¹

Apart from the sharp threshold for treatment assignment, the basic identifying assumption of RDD is encapsulated in the continuity of the function $f(\cdot)$ in the following expression relating the running variable ($x_i$) and treatment ($D_i$) to the outcome of interest ($y_i$).

\[ y_i = f(x_i) + \rho D_i + \eta_i \] Given a small positive number $\Delta$, we can approximate the value of $y_{0i}$ (the outcome when not treated) when $x_i = x_0$ by averaging over values just to the left of $x_0$

\[ \mathbb{E}[y_{0i} | x_i = x_0] \approx \mathbb{E}[y_{i} | x_0 - \Delta < x_i < x_0]\] Similarly, we can approximate the value of $y_{1i}$ (the outcome when treated) when $x_i = x_0$ by averaging over values just to the right of $x_0$. \[ \mathbb{E}[y_{1i} | x_i = x_0] \approx \mathbb{E}[y_{i} | x_0 \leq x_i < x_0 + \Delta]\] This approximation gets tighter and tighter if we move $\Delta$ closer to zero and we can conclude that

\[ \begin{aligned} \lim_{\Delta \to 0}\left( \mathbb{E}[y_{i} | x_0 \leq x_i < x_0 + \Delta] - \mathbb{E}[y_{i} | x_0 - \Delta < x_i < x_0] \right) &= \mathbb{E}[y_{1i} | x_i = x_0] - \mathbb{E}[y_{0i} | x_i = x_0] \\ &= \mathbb{E}[y_{1i} - y_{0i} | x_i = x_0] \\ &= \rho \end{aligned} \] There are two noteworthy points for this analysis. First, we have relied on the continuity of the expectation to the left and right of $x_0$, which is given by the continuity of $f(x_i)$ in $x_i$ to the left of $x_0$ and the continuity of $f(x_i) + \rho$ in $x_i$ to the right of $x_0$. Second, we recover an estimate of the effect of interest $\rho$ even though we have no random assignment.

RDD exploits the fact that whether an observation is just to the left or right of $x_0$ is effectively random. In practice, this implicit assumption of our analysis may not hold and below we discuss how we might detect deviations from this assumption and what they mean for causal analysis.

Of course, another noteworthy feature of the analysis above is that we assumed we could set $\Delta$ arbitrarily close to zero and still have enough data to estimate the two conditional expectations we used to estimate $\rho$. In practice, we will not have this luxury and thus there is a trade-off between getting precise estimates (using more data) and reducing bias (restricting analysis to data as close to $x_0$ as possible).

22.1.1 Use of polynomial regressions

Studies exploiting RDD have often used polynomial functions to approximate $f(x_i)$ with regressions of the following form:

\[ \begin{aligned} y_i &= \alpha + \rho D_i + \beta_1 x_i + \beta_2 x_i^2 + \dots \beta_p x_i^p \\ &\qquad + \gamma_2 D_i x_i + \gamma_2 D_i x_i^2 + \dots \gamma_p D_i x_i^p \end{aligned} \]

In other words, separate polynomials of degree $p$ are fitted on either side of the threshold and the estimated effect is given by $\rho$. Discussion of this approach can be found in Angrist and Pischke (2008, p. 255) and Lee and Lemieux (2010, p. 326).

However, more recent work by Gelman and Imbens (2019) suggests that the use of polynomial-based regressions of the form above “is a flawed approach with three major problems: it leads to noisy estimates, sensitivity to the degree of the polynomial, and poor coverage of confidence intervals. We recommend researchers instead use estimators based on local linear or quadratic polynomials or other smooth functions.” Here “poor coverage of confidence intervals” means excessive rejection of true null hypotheses, which is of particular concern with “noisy estimates”, as the rejection of null hypotheses is precisely what many researchers seek in methods.

Fortunately, canned regression routines for implementing the methods recommended by Gelman and Imbens (2019) are widely available and easy to use.

22.1.2 Manipulation of the running variable

While RDD makes relatively mild assumptions, in practice these assumptions may be violated. In particular, manipulation of the running variable (i.e., the variable that determines whether an observation is assigned to treatment) may occur and lead to biased estimates of treatment effects.

For example, Hoekstra (2009) studies the effect of attending a flagship state university on future income using a discontinuous probability of admission and enrolment around an admission cut-off in terms of SAT points. A concern that might arise in the setting of Hoekstra (2009) is that if students knew the precise cut-off, then the more energetic and determined students who just missed out on the first attempt might retake the SAT to cross the threshold. This would create problems for causal inference, as the students just to the right of the cut-off could be significantly different from those just to the left of the cut-off in terms of variables plausibly associated with the outcome variable. These differences would undermine the validity of the estimates of effects from RDD.²

Fortunately, methods exist for researchers to examine their data for this possibility (see, e.g., Listokin, 2008; McCrary, 2008), which would manifest as “too few” students just to the left of the cut-off and “too many” just to the right.

22.2 Fuzzy RDD

Fuzzy RDD applies when the cut-off $x_0$ is associated not with a sharp change from “no treatment for any” to “treatment for all”, but with a sharp increase in the probability of treatment:

\[ P(D_i = 1 | x_i) = \begin{cases} g_1(x_i) & x_i \geq x_0 \\ g_0(x_i) & x_i < x_0 \end{cases}, \text{ where } g_1(x_0) > g_0(x_0) \]

Note that sharp RDD can be viewed as a limiting case of fuzzy RDD in which $g_1(x_i) = 1$ and $g_0(x_i) = 0$.

The best way to understand fuzzy RDD is as instrumental variables meets sharp RDD. Suppose that shifting from just below to $x_0$ to $x_0$ or just above increases the probability of treatment by $\pi$, then $x_0$ is effectively random in that small range, it has an effect on treatment so long as $\pi$ is sufficiently different from $0$, and its only effect on $y$ over that small range occurs only through its effect on treatment.

We can estimate the effect of treatment, $\rho$, by noting that

\[ \rho = \lim_{\Delta \to 0} \frac{\mathbb{E}[y_{i} | x_0 \leq x_i < x_0 + \Delta] - \mathbb{E}[y_{i} | x_0 - \Delta < x_i < x_0]}{\mathbb{E}[D_{i} | x_0 \leq x_i < x_0 + \Delta] - \mathbb{E}[D_{i} | x_0 - \Delta < x_i < x_0]} \]

Hoekstra (2009) is an excellent paper for building intuition with regard to fuzzy RDD and we recommend you read it.

22.3 Other issues

One issue with RDD is that the effect estimated is a local estimate (i.e., it relates to observations close to the discontinuity). This effect may be very different from the effect at points away from the discontinuity.

Another critical element of RDD is the bandwidth used in estimation. Gow et al. (2016) encourage researchers using RDD to employ methods that exist to estimate optimal bandwidths and the resulting estimates of effects. Credible results from RDD are likely to be robust to alternative bandwidth specifications.

Finally, one strength of RDD is that the estimated relation is often effectively univariate and easily plotted. As suggested by Lee and Lemieux (2010), it is highly desirable for researchers to plot both the underlying data and the fitted regression functions around the discontinuity. Inspection of the plots in Hoekstra (2009) provides some assurance that the estimated effect is “there”. In contrast, the plots below using data from Bloomfield (2021) seem somewhat less compelling.

22.4 Sarbanes-Oxley Act

The Sarbanes-Oxley Act (SOX), passed by the US Congress in 2002 in the wake of scandals such as Enron and WorldCom, was arguably the most significant shift in financial reporting in recent history. Iliev (2010, p. 1163) writes “the law’s main goal was to improve the quality of financial reporting and increase investor confidence.”

A key element of SOX is section 404 (SOX 404). As discussed in Iliev (2010), SOX 404 generated significant controversy, with particular concerns being expressed about the costs of compliance, especially for smaller firms. In response to such concerns, the SEC required only companies whose public float exceeded $75 million in either 2002, 2003, or 2004 to comply with SOX 404 for fiscal years ending on or after 15 November 2004.³ Smaller companies were not required to submit management reports until fiscal 2007 and these did require auditor attestation until 2010.

Iliev (2010) exploits the discontinuity in treatment around $75 million in public float and RDD to evaluate the effect of SOX 404 on audit fees and earnings management, finding evidence that SOX 404’s required management reports increased audit fees, but reduced earnings management.

Some of these data are included in the iliev_2010 data frame included with the era_pl package.

iliev_2010 = load_data("iliev_2010")
iliev_2010

shape: (7_214, 9)

gvkey	fyear	fdate	pfdate	pfyear	publicfloat	mr	af	cik
str	i32	date	date	i32	f64	bool	bool	i32
"028712"	2001	2001-12-31	2002-03-21	2002	21.523	false	false	908598
"028712"	2002	2002-12-31	2002-06-28	2002	11.46	false	false	908598
"028712"	2003	2003-12-31	2003-06-30	2003	13.931	false	false	908598
"028712"	2004	2004-12-31	2004-06-30	2004	68.161	false	false	908598
"028712"	2005	2005-12-31	2005-06-30	2005	26.437	false	false	908598
…	…	…	…	…	…	…	…	…
"013354"	2001	2001-11-30	2002-03-13	2002	124.543557	false	false	807707
"013354"	2002	2002-11-30	2002-05-30	2002	168.886196	false	true	807707
"013354"	2003	2003-11-30	2003-05-30	2003	159.259586	false	true	807707
"013354"	2004	2004-11-30	2004-05-28	2004	244.171332	true	true	807707
"013354"	2005	2005-11-30	2005-05-31	2005	265.510375	true	true	807707

The variables af and mr are indicators for a firm being an accelerated filer and for a firm filing a management report under SOX 404, respectively.

Iliev (2010, p. 1169) explains the criteria for a SOX 404 management report being required as follows:

All accelerated filers with a fiscal year ending on or after November 15, 2004 had to file a MR and an auditor’s attestation of the MR under Section 404. I denote these firms as MR firms. Companies that were not accelerated filers as of their fiscal year ending on or after November 15, 2004 did not have to file an MR in that year (non-MR firms). Those were companies that had a public float under $75 million in their reports for fiscal years 2002 (November 2002 to October 2003), 2003 (November 2003 to October 2004), and 2004 (November 2004 to October 2005).

The code producing the following data frame mr_req_df attempts to encode these requirements.

mr_req_df = (
    iliev_2010
    .filter(
        pl.col("fdate").is_between(
            pl.date(2002, 11, 1), pl.date(2005, 10, 31)
        )
    )
    .with_columns(
        max_float=pl.col("publicfloat").max().over(["gvkey", "fdate"])
    )
    .filter(pl.col("fdate") >= pl.date(2004, 11, 15))
    .with_columns(mr_required=pl.col("max_float") >= 75)
)

Interestingly, as seen in Table 22.1, there appear to be a number of firms that indicated that they were not accelerated filers and did not have to file a management report, despite appearing to meet the criteria.

(
    mr_req_df
    .filter(pl.col("mr_required"), ~pl.col("mr"), ~pl.col("af"))
    .select("cik", "fdate", "pfdate", "publicfloat")
    .sort("publicfloat", descending=True)
    .head(6)
)

Table 22.1: Sample of firm-years appearing to meet criteria, but no SOX 404 report

shape: (6, 4)

cik	fdate	pfdate	publicfloat
i32	date	date	f64
1028205	2005-06-30	2005-08-15	279.101648
29834	2004-12-31	2005-03-16	184.901979
1085869	2004-12-31	2005-02-28	172.091616
906780	2004-12-31	2005-03-01	171.662126
943861	2004-12-31	2005-03-24	168.5346
1003472	2004-12-31	2005-03-09	141.98528

Use the firm with a CIK 1022701 as an example, we see that the values included in iliev_2010 are correct. However, digging into the details of the requirements for accelerated filers suggests that the precise requirement was a float of “$75 million or more as of the last business day of its most recently completed second fiscal quarter.” Unfortunately, this number may differ from the number reported on the first page of the 10-K, introducing possible measurement error in classification based solely on the values in publicfloat. Additionally, there may have been confusion about the precise criteria applicable at the time.

Note that Iliev (2010) uses either the variable mr or (for reasons explained below) public float values from 2002, and so does not rely on a precise mapping from public float values over 2002–2004 to mr in his analysis.

22.4.1 Bloomfield (2021)

The main setting we will use to understand the application of RDD is that of Bloomfield (2021), who predicts that reducing reporting flexibility will lower managers’ ability to obfuscate poor performance, thus lowering risk asymmetry, measured as the extent to which a firm’s returns co-vary more with negative market returns than with positive market returns. Bloomfield (2021, p. 869) uses SOX 404 as a constraint on managers’ ability to obfuscate poor performance and “to justify a causal interpretation” follows Iliev (2010) in using “a regression discontinuity design (‘RDD’) to implement an event study using plausibly exogenous variation in firms’ exposure to the SOX 404 mandate.”

Bloomfield (2021) conducts two primary analyses of the association between SOX 404 adoption and risk asymmetry, which are reported in Tables 2 and 4.

Table 4 reports what Bloomfield (2021) labels the “main analysis”. In that table, Bloomfield (2021, p. 884) “follow[s] Iliev’s [2010] regression discontinuity methodology and use a difference-in-differences design to identify the causal effect of reporting flexibility on risk asymmetry. … [and] use[s] firms’ 2002 public floats—before the threshold announcement—as an instrument for whether or not the firm will become treated at the end of 2004.”

All analyses in Table 4 of Bloomfield (2021) include firm and year fixed effects. Panel B includes firm controls, such as firm age and size, while Panel A omits these controls. Like many papers using such fixed-effect structures, inferences across the two panels are fairly similar and thus here we focus on the analysis of the “full sample” without controls.

To reproduce the result of interest, we start with bloomfield_2021, which is provided in the era_pl package based on data made available by Bloomfield (2021). The data frame bloomfield_2021 contains the fiscal years and PERMCOs for the observations in Bloomfield (2021)’s sample. We then compile data on public floats in 2002, which is the proxy for treatment used in Table 4 of Bloomfield (2021).

bloomfield_2021 = load_data("bloomfield_2021")

float_data = (
    iliev_2010
    .filter(
        pl.col("publicfloat") == 
        pl.col("publicfloat").min().over(["gvkey", "fyear"])
    )
    .filter(pl.col("pfyear").is_in([2002, 2004]))
    .group_by("gvkey", "pfyear")
    .agg(pl.col("publicfloat").mean().alias("float"))
    .pivot(on="pfyear", index="gvkey", values="float")
    .select(
        "gvkey",
        pl.col("2002", "2004").name.prefix("float")
    )
)

To use the data in float_data we need to link PERMCOs to GVKEYs and use ccm_link to create float_data_linked. To maximize the number of successful matches, we do not condition on link-validity dates.

ccmxpf_lnkhist = load_parquet("ccmxpf_lnkhist", "crsp")

ccm_link = (
    ccmxpf_lnkhist
    .filter(
        pl.col("linktype").is_in(["LC", "LU", "LS"]),
        pl.col("linkprim").is_in(["C", "P"]),
    )
    .rename({"lpermco": "permco"})
    .select("gvkey", "permco")
    .unique()
    .collect()
)

float_data_linked = float_data.join(ccm_link, on="gvkey", how="inner")

To help us measure risk asymmetry over the twelve months ending with a firm’s fiscal year-end, we get data on the precise year-end for each firm-year from comp.funda and store it in firm_years. We also collect data on sox as this is also found on comp.funda.

funda = load_parquet("funda", "comp")

funda_mod = (
    funda
    .filter(
        pl.col("indfmt") == "INDL",
        pl.col("datafmt") == "STD",
        pl.col("consol") == "C",
        pl.col("popsrc") == "D",
    )
)

firm_years = (
    funda_mod
    .select("gvkey", "fyear", "datadate", "auopic")
    .with_columns(
        sox=(
            pl.col("auopic").cast(pl.Float64) > 0
        ).fill_null(False)
    )
    .select("gvkey", "fyear", "datadate", "sox")
    .collect()
)

We then link this table with bloomfield_2021 using ccm_link to create risk_asymm_sample, which is simply bloomfield_2021 with datadate replacing fyear.

risk_asymm_sample = (
    bloomfield_2021
    .join(ccm_link, on="permco", how="inner")
    .join(firm_years, on=["fyear", "gvkey"], how="inner")
    .select("permco", "gvkey", "fyear", "datadate")
    .unique()
    .with_columns(
        period_start=(
            pl.col("datadate").dt.offset_by("-12mo")
            + pl.duration(days=1)
        )
    )
)

To calculate risk asymmetry using the approach described in Bloomfield (2021), we need data on market returns. Bloomfield (2021, pp. 877–878) “exclude[s] the highest 1% and lowest 1% of market return data. … This trimming procedure improves the reliability of the parameter estimates.”

factors_daily = load_parquet("factors_daily", "ff")

mkt_rets = (
    factors_daily
    .select("date", "rf", "mktrf")
    .collect()
    .with_columns(pl.col("mktrf").era.truncate().alias("mktrf"))
    .filter(pl.col("mktrf").is_not_null())
)

Finally, we calculate risk asymmetry using the approach described in Bloomfield (2021). After calculating the regression statistics (slope and nobs) grouped by sign_ret (and permco and datadate), we pivot the data so that beta_plus and beta_minus appear in the same rows.

dsf = load_parquet("dsf", "crsp")

risk_asymmetry = (
    dsf
    .join_where(
        risk_asymm_sample.lazy(),
        pl.col("permco") == pl.col("permco_right"),
        pl.col("date") >= pl.col("period_start"),
        pl.col("date") <= pl.col("datadate"),
    )
    .join(mkt_rets.lazy(), on="date", how="inner")
    .with_columns(
        retrf=pl.col("ret") - pl.col("rf"),
        sign_ret=pl.col("mktrf") >= 0,
    )
    .filter(pl.col("retrf").is_not_null())
    .group_by("permco", "gvkey", "datadate", "sign_ret")
    .agg(
        (pl.cov("retrf", "mktrf") / pl.col("mktrf").var()).alias("slope"),
        pl.len().alias("nobs"),
    )
    .collect()
    .pivot(
        on="sign_ret",
        index=["permco", "gvkey", "datadate"],
        values=["slope", "nobs"],
    )
    .rename({
        "slope_false": "beta_minus",
        "slope_true": "beta_plus",
        "nobs_false": "nobs_minus",
        "nobs_true": "nobs_plus",
    })
    .with_columns(
        nobs=pl.sum_horizontal(
            pl.col("nobs_minus", "nobs_plus")
            .fill_null(0)
        )
    )
    .select(
        "permco", "gvkey", "datadate",
        "beta_minus", "beta_plus", "nobs"
    )
)

We combine all the data from above into a single data frame for regression analysis. Bloomfield (2021, p. 878) says that “because of the kurtosis of my estimates, I winsorize $\hat{\beta}$, $\hat{\beta}^{+}$, and $\hat{\beta}^{-}$ at 1% and 99% before constructing my measures of risk asymmetry.”

cutoff = 75

reg_data = (
    risk_asymmetry
    .join(
        float_data_linked.select(
            "permco", "gvkey", "float2002", "float2004"
        ),
        on=["permco", "gvkey"],
        how="left",
    )
    .join(
        firm_years.select("gvkey", "datadate", "sox"),
        on=["gvkey", "datadate"],
        how="left",
    )
    .with_columns(
        treat=(pl.col("float2002") >= cutoff).fill_null(True),
        post=pl.col("datadate") >= pl.date(2005, 11, 1),
        year=pl.col("datadate").dt.year(),
    )
    .with_columns(
        pl.col("beta_minus", "beta_plus").era.winsorize()
    )
    .with_columns(risk_asymm=pl.col("beta_minus") - pl.col("beta_plus"))
)

We then run two regressions: one with firm and year fixed effects and one without. The former aligns with what Bloomfield (2021) reports in Table 4, while the latter better aligns with the RDD analysis we conduct below.

fms = [
    pf.feols("risk_asymm ~ I(treat * post)", data=reg_data, vcov="iid"),
    pf.feols(
        "risk_asymm ~ I(treat * post) | permco + year",
        data=reg_data,
        vcov="iid",
    ),
]

pf.etable(
    fms,
    signif_code=[0.01, 0.05, 0.1],
    custom_model_stats={"Firm and year FE": ["No", "Yes"]},
)

Table 22.2: Effect of SOX 404 on risk asymmetry

	risk_asymm
	(1)	(2)
coef
I(treat * post)	-0.188 (0.066)	-0.337 (0.1)
Intercept	0.264 (0.023)
fe
year	-	x
permco	-	x
stats
Firm and year FE	No	Yes
Observations	1,853	1,852
R²	0.004	0.197
Format of coefficient cell: Coefficient (Std. Error)

print(
    "The results reported above confirm the result reported in @Bloomfield:2021va. "
    "While our estimated coefficient of "
    f"(${float(fms[1].coef()['I(treat * post)']):.3f}$) is not identical to that in @Bloomfield:2021va "
    "($-0.302$), it is close to it and has the same sign and similar statistical significance."
)

The results reported above confirm the result reported in Bloomfield (2021). While our estimated coefficient of ($-0.337$) is not identical to that in Bloomfield (2021) ($-0.302$), it is close to it and has the same sign and similar statistical significance.

However, the careful reader will note that Table 4 presents “event study difference-in-differences results” rather than a conventional RDD analysis. Fortunately, we have the data we need to perform conventional RDD analyses ourselves.

First, we ask whether the 2002 float (float2002) being above or below $75 million is a good instrument for treatment (the variable sox derived from Compustat). This is analogous to producing Figure 1 in Hoekstra (2009), which shows a clear discontinuity in the probability of treatment as the running variable crosses the threshold.

In the case of Bloomfield (2021), we have treatment represented by sox, which is derived from Compustat’s auopic variable. One difference between Iliev (2010) and Bloomfield (2021) is that the data in Iliev (2010) extend only to 2006, while data in the sample used in Table 4 of Bloomfield (2021) extend as late as 2009. This difference between the two papers is potentially significant for the validity of float2002 as an instrument for sox, as whether a firm has public float above the threshold in 2002 is likely to become increasingly less predictive of treatment as time goes on.

reg_data_fs = reg_data.filter(pl.col("post"))
rd_fs_2002 = rdrobust(
    reg_data_fs["sox"],
    reg_data_fs["float2002"],
    c=cutoff,
    masspoints="off",
)

In the following analysis, we use binned means and the local polynomial fit from rdrobust to reproduce the kind of plot provided by rdplot() in the R version.

rdplot(
    reg_data_fs,
    y="sox",
    x="float2002",
    cutoff=cutoff,
    y_label="SOX 404 report indicator",
    x_label="Public float in 2002",
)

Plot of mean value of SOX 404 report indicator (y-axis) by bins based on 2002 public float. Plot also shows fitted curves on each side of the $75 million cutoff. There is a noticeable jump in the fitted curves at the cutoff, but there is no other clear relationship evident from the plot and the jump is plausibly an artifact of overfitting observations near the cutoff. — Figure 22.1: SOX 404 report indicator and 2002 public float

What we observe in Figure 22.1 is that, in contrast to Figure 1 of Hoekstra (2009), there is no obvious discontinuity in the probability of treatment (sox) around the cut-off, undermining the notion that float2002 is a viable instrument for sox.

What happens if we use float2004 as the partitioning variable?

rd_fs = rdrobust(
    reg_data_fs["sox"],
    reg_data_fs["float2004"],
    c=cutoff,
    masspoints="off",
)

rdplot(
    reg_data_fs,
    y="sox",
    x="float2004",
    cutoff=cutoff,
    y_label="SOX 404 report indicator",
    x_label="Public float in 2004",
);

Figure 22.2

From Figure 22.2, we see that things now look a little better and we confirm this statistically next:

rd_fs

Call: rdrobust
Number of Observations:                   866
Polynomial Order Est. (p):                  1
Polynomial Order Bias (q):                  2
Kernel:                            Triangular
Bandwidth Selection:                    mserd
Var-Cov Estimator:                         NN

                                Left      Right
------------------------------------------------
Number of Observations           479        387
Number of Unique Obs.            479        387
Number of Effective Obs.         193        154
Bandwidth Estimation           6.988      6.988
Bandwidth Bias                10.036     10.036
rho (h/b)                      0.696      0.696

Method             Coef.     S.E.   t-stat    P>|t|       95% CI      
-------------------------------------------------------------------------
Conventional       0.241    0.065    3.706   2.110e-04     [0.113, 0.368]
Robust                 -        -    3.273   1.063e-03         [0.1, 0.4]

The probability jump at the cut-off is $0.241$ with a standard error of $0.065$.

This suggests we might run fuzzy RDD with float2004 as the running variable. We can do this using rdrobust() as follows:

rd_ss = rdrobust(
    reg_data_fs["risk_asymm"],
    reg_data_fs["float2004"],
    fuzzy=reg_data_fs["sox"],
    c=cutoff,
    masspoints="off",
)
rd_ss

Call: rdrobust
Number of Observations:                   866
Polynomial Order Est. (p):                  1
Polynomial Order Bias (q):                  2
Kernel:                            Triangular
Bandwidth Selection:                    mserd
Var-Cov Estimator:                         NN

                                Left      Right
------------------------------------------------
Number of Observations           479        387
Number of Unique Obs.            479        387
Number of Effective Obs.         211        169
Bandwidth Estimation           7.836      7.836
Bandwidth Bias                11.964     11.964
rho (h/b)                      0.655      0.655

Method             Coef.     S.E.   t-stat    P>|t|       95% CI      
-------------------------------------------------------------------------
Conventional      -0.746    0.679   -1.098   2.720e-01    [-2.077, 0.585]
Robust                 -        -   -0.801   4.231e-01    [-2.179, 0.915]

The estimated effect at the cut-off is $-0.746$, which is not significantly different from zero.

We can also run a kind of intent-to-treat RDD as follows and can also plot this as in Figure 22.3.

rd_rf = rdrobust(
    reg_data_fs["risk_asymm"],
    reg_data_fs["float2004"],
    c=cutoff,
    masspoints="off",
)
rd_rf

Call: rdrobust
Number of Observations:                   866
Polynomial Order Est. (p):                  1
Polynomial Order Bias (q):                  2
Kernel:                            Triangular
Bandwidth Selection:                    mserd
Var-Cov Estimator:                         NN

                                Left      Right
------------------------------------------------
Number of Observations           479        387
Number of Unique Obs.            479        387
Number of Effective Obs.         174        142
Bandwidth Estimation           5.959      5.959
Bandwidth Bias                 8.552      8.552
rho (h/b)                      0.697      0.697

Method             Coef.     S.E.   t-stat    P>|t|       95% CI      
-------------------------------------------------------------------------
Conventional      -0.202    0.177   -1.144   2.526e-01    [-0.548, 0.144]
Robust                 -        -   -0.958   3.381e-01    [-0.602, 0.207]

rdplot(
    reg_data_fs,
    y="risk_asymm",
    x="float2004",
    cutoff=cutoff,
    y_label="Risk asymmetry",
    x_label="Public float in 2004",
)

Plot of mean value of risk asymmetry (y-axis) by bins based on 2004 public float. Plot also shows fitted curves on each side of the $75 million cutoff. There is a noticeable drop in the fitted curves at the cutoff, but there is no other clear relationship evident from the plot and the drop is plausibly an artifact of overfitting observations near the cutoff. As seen in the statistical analysis, the drop is not statistically significant. — Figure 22.3: Risk asymmetry and 2004 public float

However, in neither the fuzzy RDD nor the intent-to-treat RDD do we reject the null hypothesis of no effect of SOX on risk asymmetry.⁴

22.5 RDD in accounting research

While RDD has seen an explosion in popularity since the turn of the century, relatively few papers in accounting research have used it. Armstrong et al. (2022) identify seven papers using RDD and we identify an additional five papers not included in their survey, suggesting that papers using RDD represent less than 1% of papers published in top accounting journals since 1999. In this subsection, we examine each of these papers, most of which fall into one of the categories below, to illustrate and discuss how RDD is used in practice in accounting research.

22.5.1 Quasi-RDD

In some settings, RDD would appear to be applicable because there is sharp assignment to treatment based on the realization of a continuous running variable exceeding a pre-determined threshold. For example, debt covenants are often based on accounting-related variables that are continuous, but covenant violation is a discontinuous function of those variables. But a bigger issue in practice is that the running variables are difficult to measure precisely and the thresholds are often not easily observed.

As a result, papers that claim to be “using a regression discontinuity design” simply regress the outcome of interest on a treatment indicator and some controls, perhaps restricting the sample to observations closer to violating covenants. While some have called this quasi-RDD, this is arguably simply not-RDD.

22.5.2 Russell 1000/2000 and stuff (mostly taxes)

Another popular area applying RDD is the burgeoning literature studying the effect of institutional ownership on effective tax rates. Khan et al. (2016), Chen et al. (2019), and Bird and Karolyi (2017) exploit the discontinuous relationship between market capitalization and assignment to either the Russell 1000 or the Russell 2000 index to test for the effect of institutional ownership on tax planning.

Unfortunately, subsequent research shows that the data on index assignments supplied by Russell yielded significant differences in pre-treatment outcome variables on either side of the notional discontinuity.

22.5.3 Other papers using RDD

Kajüter et al. (2018) and Manchiraju and Rajgopal (2017) use RDD in the context of event studies of the classic capital-market kind. Two additional papers in accounting research apply RDD in corporate governance settings. Finally, Figure 1 of Li et al. (2018), which we covered in Chapter 21, presents RDD plots and we include some discussion questions on this analysis below.

22.6 Further reading

As discussed above, this chapter is intended to complement existing materials, and we focus more on practical issues and applications in accounting research. The quality of pedagogical materials for RDD is high. Lee and Lemieux (2010) remains an excellent reference as is Chapter 6 of Angrist and Pischke (2008). Chapter 6 of Cunningham (2021) and Chapter 20 of Huntington-Klein (2021) cover RDD.

22.7 Discussion questions

There are many discussion questions below and we expect that instructors will only assign a subset of these. You should read enough of the papers to be able to answer questions you have been assigned below.

22.7.1 Hoekstra (2009)

What is the treatment in Hoekstra (2009)? What alternative (control) is it being compared with? Is identifying the “treatment” received by the control group always as difficult as it is in Hoekstra (2009)? Provide examples from other papers or settings in your answer.
Which approach makes the most sense in Hoekstra (2009)? Sharp RDD? Or fuzzy RDD? Why?
RDD inherently estimated a “local” treatment effect. Which group of potential students is the focus of Hoekstra (2009)? Can you think of other groups that we might be interested in learning more about? How might the actual treatment effects for those groups differ and why?

22.7.2 Bloomfield (2021)

Compare the data in iliev_2010 and float_data for the two firms shown in the output below. What choices has Bloomfield (2021) made in processing the data for these two firms? Do these choices seem to be the best ones? If not, what alternative approach could be used?

iliev_2010.filter(pl.col("gvkey") == "001728")

shape: (5, 9)

gvkey	fyear	fdate	pfdate	pfyear	publicfloat	mr	af	cik
str	i32	date	date	i32	f64	bool	bool	i32
"001728"	2001	2001-12-31	2002-03-01	2002	96.98183	false	false	225051
"001728"	2002	2002-12-31	2002-06-28	2002	69.074991	false	false	225051
"001728"	2003	2003-12-31	2003-06-27	2003	71.202402	false	false	225051
"001728"	2005	2005-01-01	2004-07-03	2004	105.826661	true	true	225051
"001728"	2005	2005-12-31	2005-07-01	2005	87.145568	true	true	225051

float_data.filter(pl.col("gvkey") == "001728")

shape: (1, 3)

gvkey	float2002	float2004
str	f64	f64
"001728"	83.028411	null

iliev_2010.filter(pl.col("gvkey") == "028712")

shape: (5, 9)

gvkey	fyear	fdate	pfdate	pfyear	publicfloat	mr	af	cik
str	i32	date	date	i32	f64	bool	bool	i32
"028712"	2001	2001-12-31	2002-03-21	2002	21.523	false	false	908598
"028712"	2002	2002-12-31	2002-06-28	2002	11.46	false	false	908598
"028712"	2003	2003-12-31	2003-06-30	2003	13.931	false	false	908598
"028712"	2004	2004-12-31	2004-06-30	2004	68.161	false	false	908598
"028712"	2005	2005-12-31	2005-06-30	2005	26.437	false	false	908598

float_data.filter(pl.col("gvkey") == "028712")

shape: (1, 3)

gvkey	float2002	float2004
str	f64	f64
"028712"	16.4915	68.161

The code treat = (float2002 >= cutoff).fill_null(True) above is intended to replicate Stata code used by Bloomfield (2021): generate treat = float2002 >= 75. Why does the Python/Polars code appear to be more complex? What does Stata do that Python does not do? (Hint: If you don’t have access to Stata, you may find Stata’s documentation helpful.)
Bloomfield (2021)’s Stata code for the post indicator reads generate post = fyear - (fyr > 5 & fyr < 11) >= 2005, where fyear is fiscal year from Compustat and fyr represents the month of the fiscal-year end. The code above sets post = datadate >= "2005-11-01". Are the two approaches equivalent? Do we seem to get the right values from post using either approach?
In the text of the paper, Bloomfield (2021) claims to “use firms’ 2002 public floats … as an instrument for whether or not the firm will become treated” and to “follow Iliev’s [2010] regression discontinuity methodology”. Evaluate each of these claims, providing evidence to support your position.
Bloomfield (2021), inspired by Iliev (2010), uses float2002 rather than float2004 as the running variable for treatment. What issues would you be concerned about with float2004 that might be addressed using float2002? Provide some evidence to test your concerns. What implications do you see for the fuzzy RDD analysis we ran above using float2004 as the running variable?
Why do you think Bloomfield (2021) did not include RDD analyses along the lines of the ones we have done above in his paper?
In Table 4, Bloomfield (2021, p. 884) uses “a difference-in-differences design to identify the causal effect of reporting flexibility on risk asymmetry.” As we say in Chapter 21, a difference-in-differences estimator adjusts differences in post-treatment outcome values by subtracting differences in pre-treatment outcome values. Why might differences in pre-treatment outcome values between observations on either side of the threshold be particularly problematic in RDD? Does the use of firm and year fixed effects address this problem? Or does it just suppress it?

22.7.3 Boone and White (2015)

What is the treatment in Boone and White (2015)? (Hint: Read the title.) Most of the analyses in the paper use a “sharp RD methodology”. Does this make sense given the treatment? Why or why not?
In Section 5, Boone and White (2015) suggest that while “pre-index assignment firm characteristics are similar around the threshold, one concern is that there could be differences in other unobservable firm factors, leading to a violation of the necessary assumptions for the sharp RD methodology.” Is the absence of “differences in other unobservable firm factors” the requirement for sharp (rather than fuzzy) RD?
What implications, if any, does the discussion on pp. 94–95 of Bebchuk et al. (2017) have for the arguments of Boone and White (2015)?
What is the treatment variable implied by the specification in Equation (1) in Boone and White (2015)?

22.7.4 Manchiraju and Rajgopal (2017)

Identify some issues in applying RDD in Manchiraju and Rajgopal (2017)? What steps do the authors take to address these issues?
What is the treatment in Manchiraju and Rajgopal (2017)? Is this the same treatment variable as analysed in prior research?

22.7.5 Ertimur et al. (2015)

Consider Figure 3. How persuasive do you find this plot as evidence of a significant market reaction to majority support in shareholder proposals on majority voting? What aspects of the plot do you find persuasive or unpersuasive?
If shareholders react to successful shareholder proposals on majority voting so positively, why do so many shareholders vote against such proposals?
Ertimur et al. (2015, p. 38) say “our analyses suggest that high votes withheld do not increase the likelihood of a director losing their seat but often cause boards to respond to the governance problems underlying the vote, suggesting that perhaps director elections are viewed by shareholders as a means to obtain specific governance changes rather than a channel to remove a director.” How do you interpret this statement? Do you find it convincing?

22.7.6 Li et al. (2018)

Figure 1 of Li et al. (2018) presents RDD plots. How does the running variable in Figure 1 differ from that in other RDD analyses you have seen? What would you expect to be the relation between the running variable and the outcome variable? Would this vary from the left to the right of the cut-off? Do you agree with the decision of Li et al. (2018, p. 283) to “include high-order polynomials to allow for the possibility of nonlinearity around the cut off time”?
What is the range of values of “distance to IDD adoption” reported in Figure 1? What is the range of possible values given the sample period of Li et al. (2018) and data reported in Appendix B of Li et al. (2018, p. 304)?
Li et al. (2018, p. 283) say that “the figures in both panels show a clear discontinuity at the date of IDD adoption.” Do you agree with this claim?

Angrist, J.D., Pischke, J.-S., 2008. Mostly harmless econometrics: An empiricist’s companion. Princeton University Press.

Armstrong, C., Kepler, J.D., Samuels, D., Taylor, D., 2022. Causality redux: The evolution of empirical methods in accounting research and the growth of quasi-experiments. Journal of Accounting and Economics 74, 101521. https://doi.org/10.1016/j.jacceco.2022.101521

Armstrong, C.S., Gow, I.D., Larcker, D.F., 2013. The efficacy of shareholder voting: Evidence from equity compensation plans. Journal of Accounting Research 51, 909–950. https://doi.org/10.1111/1475-679X.12023

Bebchuk, L.A., Cohen, A., Hirst, S., 2017. The agency problems of institutional investors. Journal of Economic Perspectives 31, 89–102. https://doi.org/10.1257/jep.31.3.89

Bird, A., Karolyi, S.A., 2017. Governance and taxes: Evidence from regression discontinuity. The Accounting Review 92, 29–50. https://doi.org/10.2308/accr-51520

Bloomfield, M.J., 2021. The asymmetric effect of reporting flexibility on priced risk. Journal of Accounting Research 59, 867–910. https://doi.org/10.1111/1475-679X.12346

Boone, A.L., White, J.T., 2015. The effect of institutional ownership on firm transparency and information production. Journal of Financial Economics 117, 508–533. https://doi.org/10.1016/j.jfineco.2015.05.008

Chen, S., Huang, Y., Li, N., Shevlin, T., 2019. How does quasi-indexer ownership affect corporate tax planning? Journal of Accounting and Economics 67, 278–296. https://doi.org/10.1016/j.jacceco.2018.01.001

Cunningham, S., 2021. Causal inference: The mixtape. Yale University Press.

Ertimur, Y., Ferri, F., Oesch, D., 2015. Does the director election system matter? Evidence from majority voting. Review of Accounting Studies 20, 1–41. https://doi.org/10.1007/s11142-014-9284-9

Gelman, A., Imbens, G., 2019. Why high-order polynomials should not be used in regression discontinuity designs. Journal of Business & Economic Statistics 37, 447–456. https://doi.org/10.1080/07350015.2017.1366909

Gow, I.D., Larcker, D.F., Reiss, P.C., 2016. Causal inference in accounting research. Journal of Accounting Research 54, 477–523. https://doi.org/10.1111/1475-679X.12116

Hoekstra, M., 2009. The effect of attending the flagship state university on earnings: A discontinuity-based approach. The Review of Economics and Statistics 91, 717–724. https://doi.org/10.1162/rest.91.4.717

Huntington-Klein, N., 2021. The effect: An introduction to research design and causality. Chapman; Hall/CRC Press.

Iliev, P., 2010. The effect of SOX Section 404: Costs, earnings quality, and stock prices. The Journal of Finance 65, 1163–1196. https://doi.org/10.1111/j.1540-6261.2010.01564.x

Kajüter, P., Klassmann, F., Nienhaus, M., 2018. The effect of mandatory quarterly reporting on firm value. The Accounting Review 94, 251–277. https://doi.org/10.2308/accr-52212

Khan, M., Srinivasan, S., Tan, L., 2016. Institutional ownership and corporate tax avoidance: New evidence. The Accounting Review 92, 101–122. https://doi.org/10.2308/accr-51529

Lee, D.S., Lemieux, T., 2010. Regression discontinuity designs in economics. Journal of Economic Literature 48, 281–355. https://doi.org/10.1257/jel.48.2.281

Li, Y., Lin, Y., Zhang, L., 2018. Trade secrets law and corporate disclosure: Causal evidence on the proprietary cost hypothesis. Journal of Accounting Research 56, 265–308. https://doi.org/10.1111/1475-679X.12187

Listokin, Y., 2008. Management always wins the close ones. American Law and Economics Review 10, 159–184. https://doi.org/10.1093/aler/ahn010

Manchiraju, H., Rajgopal, S., 2017. Does corporate social responsibility (CSR) create shareholder value? Evidence from the Indian Companies Act 2013. Journal of Accounting Research 55, 1257–1300. https://doi.org/10.1111/1475-679X.12174

McCrary, J., 2008. Manipulation of the running variable in the regression discontinuity design: A density test. Journal of Econometrics 142, 698–714. https://doi.org/10.1016/j.jeconom.2007.05.005

Thistlethwaite, D.L., Campbell, D.T., 1960. Regression-discontinuity analysis: An alternative to the ex post facto experiment. Journal of Educational Psychology 51, 309–317. https://doi.org/10.1037/h0044319

While Thistlethwaite and Campbell (1960) seems to be very commonly cited as an example of sharp RDD, it appears that only a minority of students who scored higher than the cut-off were awarded National Merit Scholarships, making it strictly an example where fuzzy RDD should be applied if the treatment of interest is receipt of a National Merit Scholarship.↩︎
Hoekstra (2009, p. 719) takes care to explain why this is not a concern in his setting.↩︎
Iliev (2010, p. 1165) describes public float as “the part of equity not held by management or large shareholders, as reported on the first page of the company 10-K.” See also https://www.sec.gov/news/press/2004-158.htm.↩︎
Note that we see even weaker evidence of an effect if we replace float2004 with float2002 in the intent-to-treat analysis.↩︎