RFS Advance Access originally published online on October 15, 2003
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Rev Fin 2004; 17:339-378
The Review of Financial Studies Vol. 17, No. 2, pp. 339378 © 2004 The Society for Financial Studies; all rights reserved.
Conditioning Information and Variance Bounds on Pricing Kernels
Columbia University and NBER
University of California, Los Angeles
Address correspondence to Geert Bekaert, Columbia Business School, Uris Hall, Room 802, 3022 Broadway, New York, NY 10027, or e-mail: gb241{at}columbia.edu
| Abstract |
|---|
|
|
|---|
Gallant, Hansen, and Tauchen (1990)
Hansen and Jagannathan (1991)
HJ bounds are computed by projecting the pricing kernel unconditionally on the space of available asset payoffs and computing the standard deviation of the projected pricing kernel. The larger this standard deviation, the stronger the restrictions on asset pricing models. The standard consumption-based asset pricing model with time-additive preferences dramatically fails to lie inside the feasible region defined by the HJ bounds computed using a variety of asset returns. However, the pricing kernels in more recent models, such as the nonseparable utility model in Heaton (1995)
or incomplete markets model of Constantinides and Duffie (1996)
, satisfy the bounds.
In this article we study the use of conditioning information to effectively increase the dimension of the available asset payoffs and hence to improve the bounds.1 Gallant, Hansen, and Tauchen (1990
; hereafter GHT) show how to use conditioning information efficiently. The procedure is in principle straightforward. They construct an infinite space of available payoffs combining conditioning information and a primitive set of asset payoffs. The variance of the unconditional projection of the pricing kernel onto that space is the efficient HJ bound, which we will term the GHT bound.2
The GHT bound depends on the first and second conditional moments of the asset payoffs. The GHT procedure has not been used very much in practice, and researchers have mostly resorted to a simpler technique of embedding conditioning information in the computation of HJ bounds. They simply scale returns with predictive variables in the information set, augment the space of available payoffs (and corresponding prices) with the relevant scaled payoffs or returns, and compute a standard HJ bound for the augmented space [see, e.g., Hansen and Jagannathan (1991)
, Cochrane and Hansen (1992)
, Bekaert and Hodrick (1992)
, and many others]. This procedure is much simpler to implement than GHT since it does not require knowledge of conditional moments at all.
In this article we provide a formal bridge between the optimal but relatively unknown GHT bound and the ad hoc scaling methods prevalent in the literature. We prove two main results. First, we answer the following question: When scaling a return with a function of the conditioning information, what is the function that maximizes the HJ bound? The solution is an application of variational calculus. The resultant optimal scaling factor is decreasing in the conditional variance but is not monotonic in the conditional mean. Second, we show that our bound, which we term the optimally scaled bound, is as tight as the GHT bound when the conditional moments are known.
The optimally scaled bound has three important properties. First, it is efficient. Rather than arbitrarily scaling returns with an instrument, our procedure optimally exploits conditioning information leading to sharper bounds. We also use this property to explore the relation between improvements in HJ bounds due to conditioning information and the presence of return predictability.
Second, it is robust to misspecification of the conditional mean and variance. Whereas the GHT bound is also efficient, it is only correct when the conditional moments are accurate. If they are misspecified, the resulting bound may be larger than the variance of the true pricing kernel. Since the optimal bound we derive is a standard HJ bound, it always provides a bound to the variance of the true pricing kernel even if incorrect proxies to the conditional moments are used.
Third, the optimally scaled bound is a useful diagnostic for the specification of the first and second moments of asset returns. Our bound only attains the maximum when the first and second conditional moments are correctly specified. If they are not, the Hansen-Jagannathan frontier is not even a parabola, so that misspecification is visually clear. We also suggest a diagnostic test that can be used to formally compare the fit of alternative specifications of the conditional mean and variance. Given the nonnegligible modeling and parameter uncertainty regarding the first and second conditional moments of asset returns, this property of our bound is likely to be important in many finance applications.
We organize the article into three parts. Section 1 starts by clarifying the relation between standard HJ bounds, the GHT bound, ad hoc scaled bounds, and our optimally scaled bound. We then prove our two main results, deriving an optimal scaling function and showing that the resulting bound reaches the GHT bound when the conditional moments are correctly specified. Section 2 discusses the three main properties of our optimally scaled bound. We end the section by comparing our work to that of Ferson and Siegel (2001)
. They derive and study the optimal scaling factor in the setting of mean-variance frontiers. Since there is a well-known duality between Hansen-Jagannathan frontiers and the mean-variance frontier, their results are similar to ours but there are also some important differences.
Section 3 contains an empirical illustration. We estimate both an asymmetric GARCH-in-mean model and a regime-switching model on U.S. consumption growth, and bond and stock returns, and test the restrictions of the standard consumption-based asset pricing model. This generalization of the Hansen and Singleton (1983)
model provides a natural null and alternative model for the first and second moments, whereas the GARCH and regime-switching models provide two nonnested specifications for the conditional mean and variance. We use these models to explore the role of misspecification and robustness in the behavior of the various bounds. We briefly discuss future potential applications of our results in a concluding section.
| 1. Incorporating Conditioning Information into Variance Bounds |
|---|
|
|
|---|
In this section we first review the standard HJ bound while setting up notation in Section 1.1. In Section 1.2 we briefly review the standard way of using conditioning information. Section 1.3 reviews the GHT bound. Section 1.4 introduces the optimally scaled bound.
1.1. Unconditional variance bounds
Let there be a set of assets with payoff vector rt+1 and price vector pt. When the payoff is a (gross) return, the price equals one. Let the vector yt denote the set of conditioning variables in the economy and let It be the
algebra of the measurable functions of yt, that is, It is the information set. The pricing kernel mt+1 prices the payoffs correctly if
![]() | (1) |
![]() | (2) |
![]() | (3) |
![]() | (4) |
![]() | (5) |
![]() | (6) |
. We denote the bound
as
2(v; rt+1) (or
2(v)), since it depends on the mean of the kernel and the first two moments of rt+1:
![]() | (7) |
![]() | (8) |
![]() | (9) |
2(v)) is the HJ frontier. Note that if q equals one and there exists a risk-free asset such that rf = (E[mt+1])-1, then
2(v; rt+1) is proportional to the square of the Sharpe ratio on the set of assets. Hence a sharper HJ bound corresponds to a better risk-return trade-off on the available assets.
To facilitate comparison with the derivations in GHT (1990)
, we provide an alternative formulation of
in terms of the uncentered moments of rt+1:
![]() | (10) |
![]() | (11) |
unconditionally prices the returns follows immediately by substituting Equation (10) into Equation (1). The relation between w and v is apparent from taking the expected value of
in Equation (10):
![]() | (12) |
The intuition behind Equation (10) is rather straightforward. Rewrite the equation as
![]() |
on this space augmented with a constant payoff. The coefficient multiplying w is the residual of the projection of a unit payoff onto the rt+1 space and hence orthogonal to that space. Consequently, w is the projection coefficient of
onto that residual. The two parts together constitute the projection of
onto the rt+1 space augmented with a constant payoff.
1.2 Scaled variance bounds
The presence of the conditioning variable yt allows construction of an infinite-dimensional payoff space [see Hansen and Richard (1987)
]. Let zt = f(yt), where f is a measurable function and zt is an n x 1 vector. Scaled returns are simply assets with payoffs equal to
and prices
(where J is an n x 1 vector of ones), and do not raise any difficulty in computing standard HJ bounds.
Such scaling has an intuitive interpretation when excess returns,
, are scaled as in Bekaert and Hodrick (1992)
and Cochrane (1996)
. The gross "scaled" return,
, can then be interpreted as a "managed" portfolio with z'tJ being the time-varying proportion of the investment allocated to the risky assets.
Scaling likely only improves the HJ bound if the weight zt has information on future returns. In the literature, one sets zt = Gyt, where G is a selector matrix of ones and zeros selecting the variables in yt believed to predict rt+1 or to capture the time variation in the expected returns.
Most studies stack actual returns with scaled returns [see, e.g., Bekaert and Hodrick (1992)
and Cochrane and Hansen (1992)
], considering the system
. This amounts to considering many different zt's where each zt is represented by a selection matrix with only one nonzero element, selecting a particular instrument out of the available instruments. It is fairly unlikely that this is the optimal way to select from the set of information variables. Therefore we sometimes refer to the bounds resulting from this ad hoc approach to scaling as "naive bounds."
1.3 The GHT variance bound
GHT (1990)
show how to use conditioning information efficiently. Recall that a scaled asset is a one-dimensional asset,
, where zt is an n-dimensional vector whose entries are measurable functions of yt (so they belong to It). The space of all such scaled payoffs is an infinite-dimensional conditional Hilbert space
. GHT directly project the pricing kernel onto this space augmented by a riskless payoff. They show that the projected pricing kernel is4
![]() | (13) |
t is the conditional variance-covariance matrix of the returns, and w is given by
![]() | (14) |
![]() | (15) |
![]() | (16) |
![]() | (17) |
,
![]() | (18) |
1.4 The optimally scaled variance bound
The approach in this article is different. Consider the family of infinitely many one-dimensional scaled payoff spaces
indexed by zt. There is an HJ bound
2(v; rt+1) associated with each scaling vector zt, which only depends on the unconditional moments of
,
![]() | (19) |
. The optimally scaled bound is the highest such HJ bound:
![]() | (20) |
Proposition 1. The solution to the maximization problem
![]() | (21) |
![]() | (22) |
![]() | (23) |
![]() | (24) |
![]() | (25) |
![]() | (26) |
Proof. The appendix contains a formal proof. The proof proceeds in two steps. First, the optimal functional form is solved for. Second, the remaining constant parameter characterizing the function is solved for in a separate maximization.
Not surprisingly, the optimal scaling factor depends on the conditional distribution function only through the first and second conditional moments. Whereas the optimal scaling factor is decreasing in the conditional variance
t, it is not monotonic in the conditional mean µt. The nonmonotonicity is easy to understand using the duality with the mean-variance frontier. Consider two independent risky assets with a different expected return but identical variance. In this case, the minimum variance portfolio is the equally weighted portfolio. Also, the inefficient part of the frontier goes through a point where the expected return is the return on the lowest yielding asset and all funds are invested in that asset. When, without loss of generality, the expected return on the best-yielding asset is raised, the minimum variance point is raised as well, but the inefficient part of the frontier still intersects the point where all is invested in the lowest-yielding asset. The part of the new frontier beyond that point is below the old frontier.
Both bounds
and
depend on the conditional mean and the conditional variance of the payoffs. When these moments are known to researchers, the relation between
and and
is described by the following proposition:
Proposition 2. For an n-dimensional payoff rt+1 with price vector pt, conditional mean µt, and conditional variance-covariance matrix
t,
![]() | (27) |
P (the GHT bound represents the most efficient way of using conditional information), it follows that
![]() | (28) |
has the form described in the proposition. Now consider the variance of
:
![]() | (29) |
, the law of iterated expectations, and simplifying algebra, it follows that
![]() | (30) |
This result is at first surprising. Our optimally scaled bound is a standard HJ bound for a scaled return. Since the scaling factor depends on v, the mean of the pricing kernel, the optimally scaled bound is the ratio of a quartic polynomial in v over a quadratic polynomial in v which is generally not a quadratic polynomial in v. Nevertheless, when evaluated at the true conditional moments, the quartic polynomial in the numerator becomes the square of the quadratic polynomial in the denominator, and the optimally scaled bound becomes quadratic in v. The optimal scaled frontier becomes a parabola, identical to the GHT frontier. Since this insight is useful later on, we prove it explicitly.
![]() | (31) |
![]() |
![]() |
Proof. First note that
, the GHT bound. Thus from Proposition 2, we know that
![]() |
![]() | (32) |
and collecting terms, the result follows. This corollary provides the basis for a diagnostic test in Section 2.3. | 2. The Optimally Scaled Bound: Discussion |
|---|
|
|
|---|
Three important properties make the optimally scaled bound very useful in applied work. First, if there is time variation in expected returns and volatility, the optimally scaled bound should be sharper than standard ad hoc bounds. In Section 2.1 we explore the relation between predictability and the optimally scaled bound. Second, Section 2.2 discusses how the optimally scaled bound is robust in that it always is a valid lower bound to the pricing kernel, which is not the case for the GHT bound. Third, Section 2.3 suggests how the optimally scaled bound could form the basis of a diagnostic test for the correct specification of the first and second moments. Finally, we discuss how our work relates to two recent articles by Ferson and Siegel (2001
2.1 Efficiency and predictability
Whereas the optimally scaled bound uses conditioning information efficiently, it would be useful to derive conditions under which scaling improves the bound. In particular, one would hope that predictable variation in returns would result in sharper HJ bounds. Unfortunately it is difficult to derive sufficient conditions, but it is straightforward to derive a necessary condition. Let us, without loss of generality, focus on a univariate return space. If the scaling factor zt is uncorrelated with the first and second conditional moment of rt+1 (i.e.,
), then scaling the return with zt will decrease the HJ bound. To see this, note that
![]() |
. Intuitively, scaling by an independent random variable just adds noise to the return. Conversely the scaling factor has to be correlated with the future return for the scaled HJ bound to improve relative to the standard bound. In other words, when the return is scaled with a conditioning variable (e.g., a stock return with its lagged dividend yield), the variable must predict the return in order for the HJ bound to improve. This is intuitively clear: when a variable predicts an asset return, it may be possible to create managed portfolios that improve the risk-return trade-off as measured by the Sharpe ratio, and it is well-known that HJ bounds and Sharpe ratios are closely related.
This intuition remains intact for the case where two-dimensional spaces of the form
![]() | (33) |
, we know for sure that
![]() | (34) |
Proposition 3. Suppose there is an asset vector with payoff rt+1, price pt. Let It denote the
algebra of the measurable functions of the conditioning variables yt. Then the solution
to the maximization problem
![]() | (35) |
![]() | (36) |
The proof is given in the appendix.
Now suppose µt and
t are constants (i.e., there is no predictable variation in conditional means or variances), then
is a constant and rt+1 and
are linearly dependent. It follows that
. But since our bound is optimal, this implies
. Conversely, for the bound to improve, zt must predict rt+1. In the empirical illustrations below, we will use standard scaling in the "stacked" space as indicated above. Apart from our optimally scaled bounds, we will also report "stacked" optimally scaled bounds,
, which ought to be identical to the optimally scaled bounds when the conditional moments are known.
Our work here is related to Kirby (1998)
, which is the only article we are aware of that provides an explicit link between linear predictability and HJ bounds. More specifically, he shows that the Wald test of the null of no predictability in a linear regression is proportional to the standard HJ measure. He then uses this insight to investigate whether several asset pricing models are consistent with the evidence on predictability. Our work suggests that if the predictability is correctly described by a linear predictive model, our optimally scaled return should lead to a sharper HJ bound, and hence sharper restrictions on these asset pricing models. Furthermore, our framework can also accommodate nonlinear predictive relations.
2.2 Robustness
The GHT bound is given by
, where mt+1 depends on the conditional mean µt and the conditional variance
t of the returns. In practice, these conditional moments are not known. We use a proxy for them and thus a proxy
for
. In that case, the proxy for the GHT bound,
, may either underestimate or overestimate
. When it overestimates,
fails to be a lower bound for the variance of valid pricing kernels. On the other hand, the optimally scaled bound is
, where zt depends on the first two conditional moments. When the conditional moments are unknown,
is unknown and so is
. However, for every zt,
, remains a lower bound to the variance of all pricing kernels, since
is an HJ bound. Hence, even when using a proxy for the conditional moments to get a proxy
for
, the resultant optimally scaled bound remains a valid lower bound to the variance of pricing kernels.
This robustness property is important since conditional moments are notoriously difficult to estimate from the data. GHT (1990)
propose to use the seminonparametric (SNP) method to estimate conditional moments. The SNP method approximates the conditional density using a Hermite expansion, where a standardized Gaussian density is multiplied with a squared polynomial. In their preferred model, the leading term is a linear vector-autoregressive (VAR) model with autoregressive conditionally heteroscedastic (ARCH) volatility. In GHT's application on stock and bond returns, the conditioning set is restricted to contain only past returns, and SNP estimation may be adequate. However, when the data-generating process for returns contains jumps or regime switches and involves other predictive variables, such as dividend yields or term spreads, it is not clear that the SNP approach provides a good approximation.5 The risk of overestimating the variance bound can be avoided by applying our method.
Given an empirical specification for the conditional moments, our "optimally" scaled bound is as easy to implement as the original HJ bounds, since we only need to compute unconditional moments. For example, if we deem the time variation in the conditional mean to be more important than the time variation in the conditional variance, we obtain valid bounds by just replacing the conditional variance with the unconditional variance. The resulting bound will not be optimal if there truly is time variation in the conditional variance. However, it may still be sharper than using arbitrary scaling.
2.3 Diagnostics
The fact that optimally scaled bounds computed from misspecified conditional moments remain valid bounds that are best when the true conditional moments are used suggests an interesting application of our procedure. We can use the optimally scaled bound to diagnose the accuracy of competing models for the first two conditional moments. There are several ways in which misspecification of the conditional moments may manifest itself. First, it need not be the case that
. Hence, misspecified conditional moments may reveal themselves through poorly performing optimally scaled bounds relative to the conditional, "naively" scaled or stacked optimally scaled bounds.
Second, and most strikingly, the HJ bound need not be a parabola, since its numerator is a quartic in v and its denominator a quadratic in v. That is, misspecification should be visibly clear from graphing the optimal bound and we will illustrate this behavior in the empirical section below.
This reasoning also makes it possible to develop a general diagnostic test for the first and second conditional moments of asset returns.6 To develop such a test, let's revisit Corollary 1 in Section 1.4. The optimally scaled bound can be written as
, where B is the GHT bound, and correct moment specification implies A = B. This suggests a simple diagnostic test. The GHT bound is a quadratic in v where the coefficients are nonlinear functions of the three unconditional moments a, b, and d, defined above. For the parabola A to coincide with B for all v's, it should be the case that its coefficients are equal to the coefficients in B. Rewrite
as E[f1t + f2tv + f3tv2] and denote the estimated constants a, b, and d by
,
, and
. It is straightforward to derive
![]() |
![]() | (37) |
that ensures that all orthogonality conditions are of a similar order of magnitude and the fifth condition is the rescaled version of
. The restriction
does not yield any conditional moments restrictions since returns do not enter this expression.
There are three parameters to be estimated, so that there are two overidentifying restrictions, which can be tested using the usual statistic
, where gT is the mean of gt, T is the number of observations, and W is a suitable weighting matrix; for example, obtained from a Newey and West estimate (1987) of the inverse of the spectral density matrix of gt at frequency zero. Note that whatever the dimensionality of returns, the test is always a
2(2) and can be used to compare the performance of nonnested models for the first and second conditional moments. Of course, in a formal application, the sampling error in the parameters generating µt and
t should be taking into account. In a generalized method of moments (GMM) context, this can be easily accomplished employing a sequential GMM procedure, as in Bekaert (1994)
, Burnside (1994)
, and Heaton (1995)
.
The economic intuition for the test is straightforward. In a standard unconditional HJ framework, the HJ bound, which is the variability of the projected pricing kernel, can be viewed as a quadratic form in the deviations from risk-neutral pricing [see Hansen and Jagannathan (1991)
]. Let's consider
. If a risk-free asset exists, then v is the inverse of the risk-free rate and A can be seen as the deviation from risk-neutral pricing for the portfolio with weights zt (the optimally scaled portfolio), since the first term is the expected actual price and the second term is the risk-neutral price. Note that the portfolio weights do not need to add up to one. B on the other hand is simply the variability of the optimally scaled portfolio and at the same time the variability of the GHT kernel. If the scaling is done with the correct moments, the variability of the scaled return exactly equals the deviation of risk-neutral pricing.
This suggests another useful diagnostic statistic that could be used to compare alternative models. One could simply select two economically relevant v's (v1 and v2, say) and create a quadratic form using the following orthogonality conditions:
![]() | (38) |
To fully explore the properties of diagnostic tests based on the optimally scaled bound is beyond the scope of the present article. In our empirical illustration we report the test developed in Equation (37) for a number of different cases, including cases with simulated data where the true first and second moments model is known.
2.4 Relation to Ferson and Siegel (2001
, 2003)![]()
Ferson and Siegel have two contemporaneous articles that are related to the present article. Ferson and Siegel (2001)
solve for unconditionally minimum variance portfolios while using conditioning information efficiently. They provide explicit solutions for the portfolio weights as a function of the conditional means and volatilities of the available asset returns, both when a risk-free asset exists and when it does not. Since there is a duality between HJ frontiers and mean standard deviation frontiers, the Ferson and Siegel portfolios have some similarities to our optimally scaled returns, as Ferson and Siegel note in a final section. However, there are also many differences between our analyses so that our respective articles should really be viewed as complements rather than substitutes.
First, the HJ bounds derived from the Ferson and Siegel procedure are not as sharp as our bounds, because of the restriction that the portfolio weights have to sum to one. To appreciate the potential effect of this restriction, consider the extreme case where a researcher examines HJ bounds using one asset return (the equity return, for example) and multiple instruments (dividend yields, default spreads, and short rates, for example). One would imagine that conditioning information should be very valuable in increasing the HJ bound, but the Ferson and Siegel bound would equal the unconditional bound, since the conditioning information is useless with only one return, which forces the weight to be one for all t. Second, the optimality proof in Ferson and Siegel basically guesses the right solution for the optimal portfolio weight and verifies that it is correct, so no variational analysis is used. Third, Ferson and Siegel do not attempt to link their results to the GHT optimal HJ bound, and finally, they assume the conditional moments to be correctly specified.
Whereas our focus is mostly on the relation between GHT bounds and our optimally scaled bound, Ferson and Siegel extensively analyze the form of the weight function, providing extensive intuition on the nonlinear relation between the optimal portfolio weight and the magnitude of the expected return. In particular, extreme values for the expected return for a risky asset decrease the optimal weight on that asset, providing an interesting form of conservativeness to optimal scaling. This result applies to our bounds too, since it derives from the influence of the expected return on the uncentered second moment.
Ferson and Siegel (2003)
is a purely empirical article that provides useful information about the small sample properties of alternative methods to embed conditioning information into HJ bounds. They compare the naively scaled (multiplicative) bounds, the GHT bound, and a bound based on their unconditionally efficient portfolios. Perhaps not surprisingly, all bounds suffer from significant biases that increase the bounds relative to their true values. They conclude that the parsimony of the Ferson and Siegel (2001)
bounds enhances their attractiveness in small samples and that they are often close to optimal (i.e., close to the GHT bound). The analysis in Ferson and Siegel (2003)
also assumes correct specification of the conditional moments. In a sense, our results strengthen their conclusions since we show that the use of optimally scaled bounds is robust to misspecification. We show that this remains true even in the presence of significant nonlinearities as generated by a regime-switching model.
| 3. Empirical Application |
|---|
|
|
|---|
3.1 The econometric model
3.1.1 An extension of Hansen and Singleton (1983)
Let
be the logarithm of the stock return (i = s) and the bond return (i = b) and let Xt be the logarithm of gross consumption growth. Define
![]() |
Our first specification has two important features. First, we impose a parsimonious factor structure on the conditional covariance matrix inspired by Engle, Ng, and Rothschild (1990)
. Second, we allow negative shocks to have a different effect on the conditional variance than positive shocks, that is, we accommodate asymmetric volatility as in Glosten, Jagannathan, and Runkle (1993)
and Bekaert and Wu (2000)
. The presence of asymmetry in stock return volatility is well known, but in a previous version of this article [Bekaert and Liu (1998)] we also document asymmetry in the conditional variance of quarterly consumption growth. While it is intuitively plausible that uncertainty about future consumption growth is higher in a recession than in a boom, we could not find articles in the business cycle literature that document this phenomenon. In the finance literature, the available empirical evidence is mixed. Ferson and Merrick (1987)
report U.S. consumption volatility to be higher in a nonrecession sample relative to a recession sample. Kandel and Stambaugh (1990)
find peaks in the standard deviation of U.S. consumption growth to occur at the end of recessions or immediately after them.
For the multivariate setup, we begin by parameterizing an unconstrained model:
![]() | (39) |
![]() | (40) |
![]() | (41) |
i > 0, volatility displays the well-known asymmetric property. The et vector contains the fundamental shocks to the system. The error terms of the system are linked to et through
t. A parsimonious factor structure arises by assuming that
t is time invariant and upper triangular:
![]() | (42) |
![]() | (43) |
![]() | (44) |
ijt with i, j = x, b, s. Since the consumption-based asset pricing model introduces elements of the conditional variance-covariance matrix in the conditional mean, the unconstrained model should allow the conditional covariance matrix to affect the conditional mean as well. Therefore we let
![]() | (45) |
![]() |
be the CRRA and let ß be the discount factor. The model implies
![]() |
as the price of risk, with the risk being the covariance with consumption. With our particular GARCH structure, the model further simplifies to
![]() |
![]() |
R contains 14 parameters,
![]() |
3.1.2 A nonlinear dynamic model
Although the above unconstrained model features nonlinearities in the volatility dynamics, the conditional mean is linear in the information variables. There are two reasons to explore nonlinear models more explicitly. First, empirical research has documented regime-switching behavior in both consumption growth and equity return data [see, Whitelaw (2000)
and Ang and Bekaert (2002)
]. Second, if nonlinear predictability is present, it can be easily accommodated in our optimally scaled bound, whereas naively scaled bounds are not likely to reflect it.
Consequently we formulate a regime-switching version of the unconstrained VAR model of Section 3.1.1. With St a discrete regime variable that can take on the values of one or two, we assume
![]() | (46) |
![]() | (47) |
![]() | (48) |
![]() | (49) |
![]() |
3.2 Data and estimation results
Our consumption measure is the sum of per capita real nondurables and services consumption in the United States. These data were downloaded from DATASTREAM. The stock return is the quarterly value-weighted dividend-inclusive index return on the New York Stock Exchange (NYSE), taken from Wharton's website (http://wrdsx.wharton.upenn.edu). The interest rate is the U.S. three-month Treasury-bill rate taken from the Federal Reserve website. We use a dataset on weekly secondary market rates (averages of daily) and use the rate closest to the end of the month. All data run from the second quarter of 1959 to the end of 1996.
Table 1 shows the results from the unconstrained estimation. Despite the presence of very large coefficients on the GARCH-in-mean term, consumption growth and bond returns show strong autocorrelation as they do univariately. Although the standard errors for the GARCH-in-mean coefficients seem very small, they should be interpreted with much caution. Standard errors computed from the cross-product of the first derivatives of the likelihood are quite large and more adequately represent the uncertainty regarding these parameter estimates. In fact, the likelihood function is very flat with respect to these parameters, and a number of locals exist where the GARCH-in-mean parameters are in fact positive. This is not that surprising. Much work on GARCH-in-mean models for stock returns [see Bekaert and Wu (2000)
for a survey] has stressed the weakness of a positive relation between stock return volatility and its conditional mean. In this model, stock and bond returns are linked to consumption volatility, which in turn drives asset return volatility. The much smaller magnitude of consumption volatility relative to stock return volatility explains the large coefficients we find relative to the GARCH-in-mean literature for stock returns. When we estimate a univariate GARCH-in-mean model for stock returns, we find a GARCH-in-mean parameter of 6.29 with a large standard error of 5.23. Note that there is virtually no GARCH in the volatility dynamics, but strong asymmetry, with the coefficient on positive shocks being slightly negative. This is somewhat problematic since the conditional variance may theoretically become negative, although it never does in-sample.
|
Not surprisingly, the constrained model (see Table 2) is rejected by a likelihood ratio test. The chi-square test statistic is 75.32 with a p-value of 0.000 (there are eight restrictions). The CRRA is estimated to be 14.675 and the discount factor ß is 1.071. Although the latter is greater than one, we know from Kocherlakota's (1996)
|
Table 3 contains the estimation results for the regime-switching model. The construction of the likelihood function for such models is by now standard [see, for instance, Hamilton (1994)
|
Whereas the identification of regimes its typically driven by differences in volatilities across regimes, we find the volatility of consumption shocks to be very similar in the two regimes. However, regime 1 is a regime with overall high consumption growth, whereas consumption growth in regime 2 is often negative, although it depends positively and significantly on the bond return. In this recession regime, expected asset returns are high, consistent with the conventional wisdom. In the recession regime, bond returns are negatively serially correlated. Bond returns have a small, insignificant consumption beta, whereas stock returns have a large, positive, and statistically significant consumption beta. We verified by simulation that this model matches the first and second moments of the data very well.
When we allow the transition probabilities to depend on the past bond return, we find d1 = 0.3785 with a standard error of 1.0048 and d2 = 13.08 with a standard error of 8.65. A likelihood ratio test rejects the restriction of constant transition probabilities at the 5% level.
For the sake of completeness, we should mention that we estimated two other regime-switching models that we decided not to use in the empirical illustration. First, we estimated the model in Table 3, subject to a set of restrictions imposed by the consumption-based asset pricing model with CRRA preferences. This model was strongly rejected by the data and the CRRA coefficient was negative. Hansen and Singleton (1983)
also found convex utility for some of their specifications. Furthermore, we estimated a model using only data on bond and stock returns. By freeing the regime variable from having to fit regimes in consumption growth, we conjectured that this model would provide a better fit with the return data. However, this was not the case: the joint model estimated in Table 3 provides sharper HJ bounds!
3.3 The HJ bounds
This section illustrates the performance of our optimally scaled bound along the three dimensions that we discussed in Section 2: efficiency, robustness, and diagnostics.
We have four candidates for the computation of the conditional moments we need in deriving the optimally scaled and GHT bounds: the VAR model for stock and bond returns and consumption growth in its unconstrained and constrained form, and the regime-switching model with constant and time-varying transition probabilities. We will also use these models as data-generating processes in simulations. Simulations both serve to illustrate the effect of misspecifications where the conditional moments are known and to help interpret data results that may be sensitive to sampling error in our short sample. Simulations use 10,000 observations.8 Table 4 provides a complete guide to the figures we produce. Of importance is that we always focus on both stock returns, and bond returns, and naive scaling uses the past bond and stock returns as instruments for both returns.
|
|
|
|
|
|
3.3.1 Efficiency
Figure 1 uses the unconstrained VAR model and the two regime-switching models to compute the conditional moments in the optimally scaled bounds. Also on the graph are the unconditional and naively scaled HJ bounds. Three results stand out. First of all, the difference between the unconditional and scaled bounds reveals considerable predictability. The main source of the predictability is the autoregressive component in bond returns.
|
Second, the difference between the various scaled bounds is small, but the arbitrarily scaled bound is even somewhat sharper for small v's than is the optimally scaled bound computed from the unconstrained VAR. This can be due to either misspecification of the conditional moments or chance (sampling error). To examine this issue more closely, we first produce the same graphs for a long simulated sample from the unconstrained model in Figure 2. We also show the GHT bound. As should be the case, the GHT and optimally scaled bound are virtually on top of one another and dominate ad hoc scaling, but only slightly. In other words, in a world where the unconstrained model generates the data, naive scaling will closely approximate the efficient use of the conditioning information. In fact, since the unconstrained model describes the data rather well, the dominance of the naively scaled bound in Figure 1 may be simply due to sampling error, which we confirmed by performing simulations using 151 data points only.
|
It is no mystery why the use of the true conditional moments adds little in this setting. The feature of the data that arbitrary scaling would most likely fail to capture is the GARCH-in-mean feature, which happens to be weak in quarterly data. The importance of optimal scaling in generating sharper HJ bounds is likely more dramatic when strong non-linearities are present. This brings us to the third important result captured by Figure 1: the bounds generated by both regime-switching models are indeed sharper than the naively scaled bound, with the sharpest bound delivered by the most nonlinear model, the model with time-varying transition probabilities. Hence there appear to be nonlinear conditional mean effects in the data, but it is not surprising that they are not terribly strong in this quarterly dataset. To investigate this further, Figure 3 simulates data from the regime-switching model with time-varying transition probabilities and produces the unconditional, naively scaled, optimally scaled, and GHT bounds, the latter two using the true model to compute conditional moments. As expected, the optimally scaled bound and the GHT bound are practically on top of one another, but there is now a bit more of a wedge between naive and optimal scaling. To further demonstrate that naive scaling produces inefficient bounds when nonlinear predictability is present, we simulate data from a regime-switching model with stronger nonlinear predictability. In Figure 4, we use the parameter estimates from Table 3, but multiply the parameters governing the state dependence of the transition probabilities (d1 and d2) by 10. The wedge between the naively scaled and the efficient optimally scaled and GHT bounds now gets much larger.
|
|
3.3.2 Diagnostics
In Figure 5, which uses real data and the constrained VAR model to generate the conditional moments, two results stand out. First, the stacked optimally scaled bound gets pretty close to the naively scaled bound, despite the misspecification of the conditional moments. Of course, the constrained model manages to reproduce the most important aspect of the predictability, namely the autoregressive component in bond returns, so this result is not so surprising. What may strike some readers as surprising is the second main fact: the optimally scaled bound is not a parabola. As we indicated above, if the moments are correctly specified it ought to be. Since we know the model is rejected, the optimally scaled bounds seem to provide a striking alternative specification test. Of course, it is again possible that some quirk in the constrained model, coupled with sampling error, generates this result. This is not the case. Figure 6 uses data simulated from the constrained model. Since the model for conditional moments is correctly specified in this case, we now obtain a smooth parabola. We also produced these bounds for a number of simulated samples of length 151 and never found the same "strange" behavior.
|
|
To illustrate the diagnostic power of the optimally scaled bound more starkly, we can use simulations and our estimated data-generating processes to generate misspecified bounds. Figure 7 uses observations simulated from the model that best fits the data: the regime-switching model with time-varying transition probabilities. We show the optimally scaled bounds using the true model and the three other models to compute the conditional moments. In the last three cases, the moments are of course misspecified. The three models not constrained by the consumption-based asset pricing model generate similar optimally scaled bounds with no obviously visible misspecification. Of course, the bound from the true model is the sharpest. However, the optimally scaled bound computed using constrained VAR moments shows striking nonparabolic behavior near the trough of the graph. The bound is also far from efficient, which may suggest that misspecification may lead to very inefficient bounds. In this rather stylized example, this is of course true by construction, but the graph also shows that misspecified models (the unconstrained VAR and the regime-switching model with constant transition probabilities) that get the dynamics "almost" right yield rather tight bounds. Moreover, in the case of misspecification, one can always do better by using the stacked optimal bound. Figure 8 illustrates this by repeating Figure 7 with optimally scaled stacked bounds. Of course, for the true model, the optimally scaled and the optimally scaled stacked bound are identical. The constrained VAR model still generates a nonparabolic optimally scaled bound, but the bound is now much closer to the true bounds.
As another illustration, Figure 9 generates data satisfying our worst model, the constrained VAR model, but computes the optimally scaled bound using moments according to the true and our three other, now misspecified models. Now, all misspecified models generate nonparabolic optimally scaled bounds.
Finally, Table 5 produces the diagnostic test of Section 2.3 [see Equation (37)], ignoring the sampling error in the original parameters, but taking the sampling error in estimating the a, b, and d constants into account. All test values are
2(2) and the p-values are in parentheses. We first produce the test for the data, using all four of our models to compute the conditional moments. The test rejects the constrained model, as did the likelihood ratio test, at the 1% level. However, the diagnostic test also provides a test of the first and second moment specification embedded in the other models, including the regime-switching models. Here the test fails to reject in each case with p-values of over 90%. Testing the constrained or unconstrained VAR models versus the regime-switching models would be a difficult task because of the presence of nuisance parameters under the null. Our specification test clearly shows that the models that do not impose the restrictions of the consumption-based asset pricing model provide a reasonable specification of the first and second moments, but that the constrained VAR model does not.
|
Our simulated samples offer a controlled environment to examine the performance of the test. We consider 16 cases simulating data from the four different models and computing the moments according to each model (which will be misspecifying the conditional moments in three of the four cases). Given the size of the simulated samples (10,000 observations), we expect to reject when the moments are clearly misspecified, as in the constrained VAR model. The three other models all generate very similar first and second moments, but for the test to be useful (i.e., consistent) it must be able to distinguish these small differences in conditional moments and reject when given enough observations. This indeed happens. Focusing on the simulation rows and columns in Table 5, the moments are correctly specified along the diagonal, and there the test yields small, insignificant test values. For all off-diagonal elements, the moments are misspecified and the test should reject. It does in each and every case at less than the 1% significance level. For entries involving the constrained model, the test generates very large test values. We conclude that the test is well-behaved.
3.3.3 Robustness
So far we have not focused on the GHT bounds very much. Generally, optimally scaled bounds do not perform much worse or better than the GHT bound. Moreover, our simulations reveal that the GHT bounds quite often overestimate the variance of the true pricing kernel. A first example is in Figure 10. In Figure 10, we generate data from the regime-switching model with time-varying transition probabilities. When we correctly specify the conditional moments, the GHT bound mimics the behavior of the optimally scaled bound, which was graphed in Figure 7: it starts out high on the left (a little over 1) and ends up at a level of about 0.6 on the right. When we use the unconstrained VAR and the constant transition probabilities regime-switching model, the GHT bounds are not too different. Nevertheless, on the right-hand side of the graph they slightly overestimate the true variability of the pricing kernel. When we use misspecified moments from the constrained VAR model, the GHT bound generates values that are way too high for the bounds on the right-hand side. When we use the constrained VAR model to generate truth in Figure 11, a similar phenomenon appears. This time, the GHT bound overestimates to varying degrees on the left-hand side of the graph for all three misspecified models.
This lack of robustness is a serious drawback to the GHT bound. The optimally scaled bound never exceeds the true GHT bound, but manages to be quite close to it. Figures 7
9 amply demonstrate this fact. The top HJ bound in these figures corresponds to the GHT bound in either Figure 10 or Figure 11. Importantly, when the moments are misspecified, the optimally scaled bound always remains below the true bounds and the misspecification shows up in nonparabolic behavior of the bound. The latter is particularly apparent in Figure 7. This surprising robustness should make the optimally scaled bound the preferred method of incorporating conditioning information efficiently.
| 4. Conclusion |
|---|
|
|
|---|
With the continued interest of the finance profession in the use of (unconditional) HJ bounds on the one hand, and the growing evidence of time variation in conditional means and variances of asset returns on the other hand, it becomes important to optimally incorporate conditioning information in these bounds. Our article provides a bridge between the insightful, but complex analysis of GHT (1990)
There are potentially many interesting applications of our framework. First, the bounds can be used to reexamine the predictability of asset returns and to examine which instruments yield the sharpest restrictions on asset return dynamics. In our application here, using the optimally scaled bound does not sharpen the bounds dramatically. However, Ferson and Siegel (2003)
show cases where the efficient use of conditioning information substantially increases the efficient volatility bound.
Second, the bounds can also yield information on expected return and conditional variance modeling and serve as a diagnostic tool to judge the performance of dynamic asset pricing models. The reason is that the optimal scaling function depends on the conditional mean and conditional variance of the returns and that the resulting HJ bound is best when they represent the true conditional moments. We use this property of the optimally scaled bound to develop a GMM-based specification test for the first and second moments, but much more needs to be done. We ignored the sampling error in the parameter estimates of the original models, and did not examine the small sample properties of the test.9
Third, using the duality with the mean-variance frontier, the optimally scaled bound can be used in dynamic models of optimal asset allocation that seek to maximize an unconditional mean-variance criterion. Fourth, the bounds could be used in developing performance measures for portfolio managers. In the standard mean-variance paradigm, there is no role for a portfolio manager since the optimal portfolio weights are fixed over time. In a dynamic setting, with changing conditional information, the role of the portfolio manager is to adjust the portfolio weights according to the arrival of information, preferably optimally.
| Appendix |
|---|
|
|
|---|
Proof of Proposition 1. The problem we would like to solve is
![]() |
is bounded from above by the GHT bound
and from below by zero. Note that
![]() |
t are the conditional mean vector and conditional covariance matrix of the set of returns, respectively. So the above problem is reduced to the problem (we omit the subscript t in the derivation),
![]() | (50) |
![]() | (51) |
![]() | (52) |
![]() | (53) |
(y) is the multivariate distribution function of y. This is a variation-like problem and we adapt the calculus of variation technique to solve it. Let g(y) = f (y) +
h(y), where
> 0, the first-order condition with respect to
gives
![]() |
![]() | (54) |
(y) of y does not appear explicitly. Solving for f from Equation (54), we obtain
![]() | (55) |
![]() |
and
are constants. Further, note that the scaling by a constant does not change the HJ bound, so we can solve f only up to a constant. We can thus let
= 1. With the functional form of the scaling factor known, we can determine the constant
[note that -
is w in Equation (10)] by solving a standard maximization problem (instead of a functional problem):
![]() | (56) |
![]() | (57) |
![]() | (58) |
. The first-order condition in
gives
![]() | (59) |
![]() | (60) |
b -
vd) (this is not a problem because
is a minimum since it leads to
), we have
![]() |
![]() |
![]() | (61) |
![]() | (62) |
![]() | (63) |
![]() |
![]() |
) f(y)] and (E[(p - vµ)'f(y)])2 are convex in f(y) and there is an interior point, this is a convex programming problem and there is a minimum. In fact, one can easily verify that the solution is the one we obtained above.
Proof of Proposition 3. Note that the pricing kernel written in terms of scaled assets formed using rt+1 and
can always be written as
for some
. So we have
![]() |
![]() |
![]() |
| Footnotes |
|---|
We would like to thank Andrew Ang, Luca Benzoni, Michael Brandt, Qiang Dai, Darrell Duffie, Wayne Ferson, Mark Garmaise, Ming Huang, Jun Pan, John Scruggs, seminar participants at Stanford University, the European Finance Association meeting in Barcelona, the American Finance Association meeting in Washington, D.C., and especially John Heaton (the editor) for valuable suggestions. Kenneth Singleton and an anonymous referee provided extensive guidance, comments, and insights. Geert Bekaert thanks the National Science Foundation for financial support.
1 Other methods have been proposed to improve HJ bounds. Snow (1991)
studies the restrictions on the higher moments of the pricing kernel. Balduzzi and Kallal (1997)
tighten the bounds by using the risk premiums that the pricing kernel assigns to arbitrary sources of risk. ![]()
2 While GHT study both conditional as well as unconditional projections, we will only study unconditional projections. ![]()
3 Equation (10) can be derived from Equation (4) directly using the identity (F + gg')-1 = F-1 - F-1 g(
+ g 'F-1g)-1g'F-1 with F = µµ' +
, g = µ, and
the identity matrix. ![]()
4 Note that the projection is an unconditional not a conditional projection. ![]()
5 More and more research reveals that some of the predictable patterns detected in returns, even in linear settings, may be spurious, [e.g., see Kirby (1997)
]. ![]()
6 We thank the referee for stimulating our thinking on this issue. The test in Kirby (1998)
diagnoses the performance of several asset pricing models with respect to linear predictability but does not accommodate heteroscedasticity. ![]()
7 See Bekaert (1996)
for an elaboration of this point. ![]()
8 We simulate 10,100 observations but discard the first 100 observations to reduce dependence on initial conditions. Such dependence is unavoidable in the graphs using short sample data. Our sample estimates of the HJ bounds may also be subject to the finite sample bias documented in Ferson and Siegel (2003)
, but the number of asset returns we use is much smaller than theirs. ![]()
9 See Hansen, Heaton, and Yaron (1996)
for a study of the small sample properties of GMM estimators. ![]()
10 We would like to thank Darrell Duffie for suggesting this proof. ![]()
| References |
|---|
|
|
|---|
Ang, A., and G. Bekaert, 2002, "International Asset allocation with Regime Shifts," Review of Financial Studies, 15, 11371187.[Abstract]
Balduzzi, P., and H. Kallal, 1997, "Risk Premia and Variance Bounds," Journal of Finance, 52, 19131949.
Bekaert, G., 1994, "Exchange-Rate Volatility and Deviations from Unbiasedness in a Cash-in-Advance Model," Journal of International Economics, 36, 2952.
Bekaert, G., 1996, "The Time Variation of Risk and Return in Foreign Exchange Markets: A General Equilibrium Perspective," Review of Financial Studies, 9, 427470.
Bekaert, G., and R. Hodrick, 1992, "Characterizing Predictable Components in Excess Returns on Equity and Foreign Exchange Markets," Journal of Finance, 47, 467509.[CrossRef]
Bekaert, G., and J. Liu, "Conditioning Information and Variance Bounds on Pricing Kernels," Working Paper 6880, NBER.
Bekaert, G., and M. Urias, 1996, "Diversification, Integration and Emerging Market Closed-End Funds," Journal of Finance, 51, 835869.
Bekaert, G., and G. Wu, 2000, "Asymmetric Volatility and Risk in Equity Returns," Review of Financial Studies, 13, 142.[Abstract]
Burnside, C., 1994, "Hansen-Jagannathan Bounds as Classical Tests of Asset-Pricing Models," Journal of Business and Economic Statistics, 12, 5779.
Cecchetti, S. G., P.-S. Lam, and N. C. Mark, 1994, "Testing Volatility Restrictions on Intertemporal Marginal Rates of Substitution Implied by Euler Equations and Asset Returns," Journal of Finance, 49, 123152.
Chen, Z., and P. J. Knez, 1995, "Measurement of Market Integration and Arbitrage," Review of Financial Studies, 5, 287325.
Chen, Z., and P. J. Knez, 1996, "Portfolio Performance Measurement: Theory and Applications," Review of Financial Studies, 9, 511555.
Cochrane, J. H., 1996, "A Cross-Sectional Test of an Investment-Based Asset Pricing Model," Journal of Political Economy, 104, 572621.[CrossRef]
Cochrane, J. H., and L. P. Hansen, 1992, "Asset Pricing Explorations for Macroeconomics," O. J. Blanchard and S. Fischer (eds.), 1992 NBER Macroeconomics Annual, MIT Press, Cambridge, MA.
Constantinides, G. M., and D. Duffie, 1996, "Asset Pricing with Heterogeneous Consumers," Journal of Political Economy, 104, 219240.[CrossRef][Web of Science]
Dahlquist, M., and P. Söderlind, 1999, "Evaluating Portfolio Performance with Stochastic Discount Factors," Journal of Business, 72, 347383.[CrossRef]
De Santis, G., 1995, "Volatility Bounds for Stochastic Discount Factors: Tests and Implications from International Stock Returns," working paper, University of Southern California.
Engle, R. F., D. M. Lilien, and R. P. Robins, 1987, "Estimating Time Varying Risk Premia in the Term Structure: The Arch-M Model," Econometrica, 55, 391407.[CrossRef]
Engle, R. F., V. K. Ng, and M. Rothschild, 1990, "Asset Pricing with a FACTOR-ARCH Covariance Structure: Empirical Estimates for Treasury Bills," Journal of Econometrics, 45, 213237.[CrossRef]
Ferson, W. E., and J. J. Merrick, Jr., 1987, "Non-stationarity and Stage-of-the-Business-Cycle Effects in Consumption-Based Asset Pricing Relations," Journal of Financial Economics, 18, 127146.[CrossRef]
Ferson, W. E., and R. W. Schadt, 1996, "Measuring Fund Strategy and Performance in Changing Economic Conditions," Journal of Finance, 51, 425461.
Ferson, W., and A. Siegel, 2001, "The Efficient Use of Conditioning Information in Portfolios," Journal of Finance, 56, 967982.[CrossRef]
Ferson, W., and A. Siegel, 2003, "Stochastic Discount Factor Bounds with Conditioning Information," Review of Financial Studies, 16, 567595.[Abstract]
French, K. R., G. W. Schwert, and R. F. Stambaugh, 1987, "Expected Stock Returns and Volatility," Journal of Financial Economics, 19, 329.
Gallant, R., and G. Tauchen, 1989, "Seminonparametric Estimation of Conditionally Constrained Heterogeneous Processes: Asset Pricing Applications," Econometrica, 57, 10911120.
Gallant, R., L. P. Hansen, and G. Tauchen, 1990, "Using Conditional Moments of Asset Payoffs to Infer the Volatility of Intertemporal Marginal Rates of Substitution," Journal of Econometrics, 45, 141179.[CrossRef]
Glosten, L., R. R. Jagannathan, and D. E. Runkle, 1993, "On the Relation between the Expected Value and the Volatility of the Nominal Excess Return on Stocks," Journal of Finance, 48, 17791801.[CrossRef]
Hamilton, J., 1994, Time Series Analysis, Princeton University Press, Princeton, NJ.
Hansen, L. P., J. Heaton, and E. G. J. Luttmer, 1995, "Econometric Evaluation of Asset Pricing Models," Review of Financial Studies, 8, 237274.
Hansen, L. P., J. Heaton, and A. Yaron, 1996, "Finite-Sample Properties of Some Alternative GMM Estimators," Journal of Business and Economic Statistics, 14, 262280.
Hansen, L. P., and R. Jagannathan, 1991, "Implications of Security Market Data for Models of Dynamic Economies," Journal of Political Economy, 99, 225262.[CrossRef]
Hansen, L. P., and S. F. Richard, 1987, "The Role of Conditioning Information in Deducing Testable Restrictions Implied by Dynamic Asset Pricing-Models," Econometrica, 55, 587613.
Hansen, L. P., and K. Singleton, 1983, "Stochastic Consumption, Risk Aversion, and the Temporal Behavior of Asset Returns," Journal of Political Economy, 91, 249265.[CrossRef]
Heaton, J. C., 1995, "An Empirical Investigation of Asset Pricing with Temporally Dependent Preference Specifications," Econometrica, 63, 681717.[CrossRef]
Kaminsky, G., and R. Peruga, 1990, "Can a Time-Varying Risk Premium Explain Excess Returns in the Forward Market for Foreign Exchange?," Journal of International Economics, 28, 4770.
Kandel, S., and R. F. Stambaugh, 1990, "Expectations and Volatility of Consumption and Asset Returns," Review of Financial Studies, 3, 207232.
Kirby, C., 1997, "Measuring the Predictable Variation in Stock and Bond Returns," Review of Finanical Studies, 10, 579630.
Kirby, C., 1998, "The Restrictions on Predictability Implied by Rational Asset Pricing Models," Review of Financial Studies, 11, 343382.[Abstract]
Kocherlakota, N. R., 1996, "The Equity Premium: It's Still a Puzzle," Journal of Economic Literature, 34, 4271.
Newey, W. K., and K. D. West, 1987, "A Simple, Positive Semidefinite, Heteroskedasticity and Autocorrelation Consistent Covariance Matrix," Econometrica, 55, 703708.
Snow, K. N., 1991, "Diagnosing Asset Pricing Models Using the Distribution of Asset Returns," Journal of Finance, 46, 955983.
White, H., 1982, "Maximum Likelihood Estimation of Misspecified Models," Econometrica, 50, 125.[CrossRef][Web of Science]
Whitelaw, R. F., 2000, "Stock Market Risk and Return: An Equilibrium Approach," Review of Financial Studies, 13, 521547.[Abstract]
This article has been cited by other articles:
![]() |
T. Bollerslev, G. Tauchen, and H. Zhou Expected Stock Returns and Variance Risk Premia Rev. Financ. Stud., November 1, 2009; 22(11): 4463 - 4492. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. E. Ferson and A. F. Siegel Testing Portfolio Efficiency with Conditioning Information Rev. Financ. Stud., July 1, 2009; 22(7): 2735 - 2758. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Chabi-Yo Conditioning Information and Variance Bounds on Pricing Kernels with Higher- Order Moments: Theory and Evidence Rev. Financ. Stud., January 1, 2008; 21(1): 181 - 231. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

































































































