Skip to content
This repository was archived by the owner on May 6, 2026. It is now read-only.

Latest commit

 

History

History
212 lines (173 loc) · 8.88 KB

File metadata and controls

212 lines (173 loc) · 8.88 KB

A comparison of some structural models of private information arrival

Jefferson Duarte, Edwin Hu, and Lance Young

Paper

Latest version available at SSRN: https://ssrn.com/abstract=2564369

Previous Versions

Data

https://www.dropbox.com/s/45b42e89gaafg0n/cpie_data.zip?dl=1

The data contains several files:

  • cpie_daily.csv
  • eo_yearly.csv
  • dy_yearly.csv
  • gpin_yearly.csv
  • owr_yearly.csv

Daily

cpie_daily.csv is sorted by permno and date:

permnoyeardatecpie_pincpie_dycpie_gpincpie_owrcpie_mechret_oret_dy_en_buysn_sellsturn
100571993199301040.00500.6080.0110.850-0.007-0.014-0.354711
100571993199301050.00000.0030.0710.8800.0110.0120.93202
100571993199301060.00070.2120.0140.910-0.016-0.014-0.38358
100571993199301070.02570.1310.4330.8200.0010.0150.268412
100571993199301080.09620.9460.0030.7210.001-0.007-0.5741014
100571993199301110.00110.0090.2810.6900.007-0.0030.41516
100571993199301120.94480.2260.5280.9710.0160.0220.11141226
100571993199301130.45940.9730.0010.8210.008-0.020-0.6241216
100571993199301140.00060.1010.0440.880-0.008-0.015-0.33448

cpie_pin corresponds to $CPIE_{PIN}$ in the paper (the PIN model).

cpie_dy corresponds to $CPIE_{DY}$ in the paper (the DY model).

cpie_gpin corresponds to $CPIE_{GPIN}$ in the paper (the GPIN model).

cpie_owr corresponds to $CPIE_{OWR}$ (the Odders-White and Ready (2008) model).

cpie_mech is the $CPIE_{Mech}$, which is a dummy variable defined as: \begin{equation} CPIE_{Mech,j,t}=% \begin{cases} 0, & \text{ if }turn_{j,t}<\overline{turn}_{j}
1, & \text{ if }turn_{j,t}\geq \overline{turn}_{j},% \end{cases} \end{equation}

ret_d, ret_o, and y_e correspond to $(r_d,r_o,y_e)$.

n_buys and n_sells corresponds to the $B$ and $S$, and turn is the sum, corresponding to $turn$.

PIN Model

pin_yearly.csv is sorted by permno and year:

permnoyearaebesud
1005719930.23015.46975.743310.50680.6052
1005719940.08106.94496.696934.40760.3984
1005719950.269214.237116.849333.27530.8156
1006419930.250245.704541.335371.47080.6673
1006419940.283525.892927.724040.06220.6076
1006419950.165632.757938.967594.37070.8213
1006419960.191027.730539.937294.27330.8373
1007119930.275515.270714.384822.80940.6077
1007119940.200012.831014.113524.66150.6733

a is $α$, eb is $ε_B$, es is $ε_S$, u is $μ$, and d is $δ$.

DY Model

dy_yearly.csv is sorted by permno and year:

permnoyearaebesubusdtnsbss
1005719930.384.673.5510.635.000.210.207.677.74
1005719940.304.524.8410.819.210.510.0726.4332.65
1005719950.4311.929.4626.5910.600.400.2421.5925.63
1006419930.3635.9837.1431.1069.810.760.1582.1930.60
1006419940.4916.7424.2521.8345.730.880.2427.1021.28
1006419950.3727.6831.8439.6443.770.590.09103.7839.22
1006419960.2126.4638.5478.89124.340.870.02226.61623.26
1007119930.3611.9111.4611.6517.200.600.2023.3514.43
1007119940.449.4611.7611.6515.280.740.1030.5121.51

a is $α$, eb is $ε_B$, es is $ε_S$, ub is $μ_B$, us is $μ_S$, d is $δ$, tn is $θ$, sb is $Δ_B$, and ss is $Δ_S$.

GPIN Model

gpin_yearly.csv is sorted by permno and year:

permnoyeararpetadth
1005719930.306.510.621.001.000.44
1005719940.172.270.851.000.400.51
1005719950.282.890.921.000.480.54
1006419930.197.250.930.750.570.53
1006419940.168.480.870.780.480.50
1006419950.185.880.930.770.760.44
1006419960.112.410.970.900.590.44
1007119930.216.460.830.790.600.52
1007119940.135.010.850.880.800.44

a is $α$, r is $r$, p is $p$, eta is $η$, d is $δ$, and th is $θ$.

OWR Model

owr_yearly.csv is sorted by permno and year:

permnoyearasuszsispdspo
1005719930.78080.18860.47900.02260.00590.0101
1005719940.41350.15480.51360.02980.00740.0108
1005719950.77460.13680.43190.02510.00770.0000
1006419930.29910.12270.34570.02480.00850.0072
1006419940.62790.15160.31940.02220.00700.0047
1006419950.75260.18650.34400.02410.00730.0009
1006419960.65430.13910.34810.02990.00640.0000
1007119930.78850.13900.41460.01480.00450.0054
1007119940.64550.14170.44270.01320.00610.0050

a is $α$, su is $σ_u$, sz is $σ_z$, si is $σ_i$, spd is $σ_{pd}$, and spo is $σ_{po}$.

**New data**

2013–2019 GPIN and OWR estimates:
https://www.dropbox.com/s/6xaa3x5zbmvyyq1/pin-est-1319.zip?dl=1

Based on requests from other researchers we have updated our estimates beyond the sample period in our paper. These estimates may be used as starting points for your own estimation, or used as-is. We have done some basic quality checks with the estimates, but not to the full extent of the 1993–2012 sample from the paper. In the paper we also only used NYSE-listed stocks. If you have any questions, comments, suggestions, or find any issues please feel free to contact me. If you use these estimates in your work, please cite to our paper and website so that future researchers can find our work.

  • Based on WRDS DTAQ Intraday Indicators.
  • CRSP shrcd 10, 11 and exchcd 1, 2, 3, 4.
  • Unlike the paper we do not remove distribution/event days for the OWR.
  • Estimates are based on up to five random starting points.

./xom-gpin-2018.png

Code

Python Environment: https://www.dropbox.com/scl/fi/m3u1i5aoejf7ltoo30tl6/environment.sh?rlkey=s44j5sbqn5m7ri5hlxhk67xlw&st=fbxwbvxu&dl=1

Code is now available on its own GitHub page. A detailed description of the code is also available here.