Wednesday, December 21, 2016

"Market Anomalies and Data Mining: Some Pretty Tough Love from Data "

From True Economics:
Investment anomalies (or in other words efficacy of exogenous factors in determining abnormal returns to investment) are a matter of puzzle for traditional investment analysis. In basic terms, we normally think about the investment as an undertaking that offers no ‘free lunch’ - if markets are liquid, deep and, once we control for risk factors, taxes and transaction costs, an average investor cannot expect to earn an above-market return. Put differently, there should be no ways to systematically (luck omitting) beat the market.

Anomalies represent the case where some factors do, in fact, generate such abnormal returns. There is a range of classic anomalies, most commonly known ones being Small Firms Outperform, January Effect, Low Book Value, Under-dogs or Discounted Assets or Dogs of the Dow, Reversals, Days of the Week, etc. In fact, there is an entire analytics industry built around markets that does one thing: mine for factors that can give investors a leg up on competition, or finding anomalies.

One recent paper have identified a list of some 314 factors that were found - in the literature - to generate abnormal returns. As noted by John Cochrane: “We thought 100% of the cross-sectional variation in expected returns came from the CAPM, now we think that’s about zero and a zoo of new factors describes the cross section.”

A recent paper published by NBER and authored by Juhani Linnainmaa and Michael Roberts (see link below) effectively tests this Cochrane’s proposition. To do this, the authors “examine cross-sectional anomalies in stock returns using hand-collected accounting data extending back to the start of the 20th century. Specifically, we investigate three potential explanations for these anomalies: unmodeled risk, mispricing, and data-snooping.” In other words, the authors look into three reasons as to why anomalies can exist:
  1. Unmodeled risk reflects the view that some of risk premium paid out in the form of investment returns is not captured by traditional models of risk-return relations;
  2. Mispricing reflects the view that markets’ participants routinely and over long run can misplace risk; and
  3. Data-snooping view implies that anomalies generate returns in the historical data that do not replicate in forward-looking implementation because these anomalies basically arise from ad hoc empirical data mining.
The authors argue that “each of these explanations generate different testable implications across three eras encompassed by our data: (1) pre-sample data existing before the discovery of the anomaly, (2) in-sample data used to identify the anomaly, and (3) post-sample data accumulating after identification of the anomaly.”...