Data & Code



Data Sets

Utah Diffs-in-Diffs Talk

Here are the slides, data and example code for my survey of recent diff-in-diffs literature for the Utah Winter Strategy Conference.

Intellectual Property Disclosures Data

These data are based on public disclosures of standards-related intellectual property at thirteen Standard Setting Organizations. The data are used in several of my papers, and include information on SSO, company, disclosure date, patent and/or application numbers, and licensing terms (when available). If you use these data, please cite our NBER Working Paper.

LEED Differentiation

Code, data and documentation for Marc Rysman, Tim Simcoe and Yanfei Wang, "Differentiation Strategies in the Adoption of Environmental Standards: LEED from 2000-2014" (Working Paper)

UPC Adoption

Although we cannot share the raw UPC registration files, we have posted our concordance between 1987 4-digit manufacturing and wholesale SIC codes and the merchandise line codes from the 1977, 1982, 1987 and 1992 Census of Retail Trade. We have also posted data and code to reproduce Figure 6 in the paper along with data and code for creating the revenue-share weights described in Appendix A. If you use these data, please cite the NBER Working Paper.

eBay and Innovation

Data and code for working paper on Patent Policy and American Innovation after eBay.

Standard Setting Committees (IETF Publication Process Data)

Link to a zip file containing Stata code, IETF publication data and documentation for "Standard Setting Committees: Consensus Governance for Shared technology Platforms" (forthcoming AER). The files in this directory can be used to generate all Tables and Figures in the working paper. Please cite me if you use these data or programs.

Missing Names Data

Link to a zip file containing data (csv and Stata format) and code for the paper "Status, Quality and Attention: What's in a (Missing) Name?", Management Science, Feb 2011, 274--290, co-authored with David Waguespack.

Patent Citation Age Profiles (Journal of Applied Econometrics Data Archive)

The Stata code, data and documentation for Mehta Rysman and Simcoe (Identifying the Age Profile of Patent Citations: New Estimates of Knowledge Diffusion) are available via the data archive for the Journal of Applied Econometrics. The link takes you to the main page for the archive, To find our materials, you will need to search on any of the author last names.

Stata Code

IV Fixed Effects Poisson (Code from R&D Tax Credits Paper)

These links provide Stata code for the GMM estimators and simulation described in my paper about the Canadian R&D tax credit with Ajay Agrawal and Carlos Rosell.

Stata Code for Robust Standard Errors in the Fixed Effects Poisson (in Stata type “ssc install xtpqml”) N.B. this code is superseded be the "vce(robust)" option in updates to Stata 11.

Wooldridge (JOE 1999) shows that the fixed effects Poisson estimator produces consistent estimates of the parameters in an unobserved components multiplicative panel data model under very general conditions. In fact, all that is required is an assumption about the conditional mean of the dependent variable. This is quite useful for two reasons. First, it implies that fixed effects Poisson estimation is appropriate for any non-negative dependent variable—not just count data that follow a Poisson distribution. Second, the estimator is robust to arbitrary patterns of serial correlation. In spite of these obvious attractions, the fixed-effect Poisson estimator does not appear to be widely used in practice. This is partly because statistical software does not generally allow computation of the appropriate (robust) standard errors for inference. This ado file runs Stata’s pre-packaged fixed effects Poisson estimator and then computes the robust standard errors suggested by Wooldridge (1999).

Stata Code for Multinomial Agglomeration/Dispersion Test (in Stata type “ssc install mtad”)

A Stata r-class routine for calculating the index of agglomeration (or disperion) proposed in Rysman and Greenstein (2003). This statistic is closely related to the Ellison and Glaeser (1998) “dartboard index” can be used to test for dispersion.

NOTE: This Feb 2017 version supersedes the old MTAD code, which computes incorrect p-values!

Forms from 1992 Census of Retail Trade Download here.