christoph braun O P T I M A L TA X AT I O N O F H U M A N C A P I TA L Optimal Taxation of Human Capital Inauguraldissertation zur Erlangung des akademischen Grades Doctor rerum politicarum an der Wirtschafts- und Sozialwissenschaftlichen Fakultät der Technischen Universität Dortmund vorgelegt von Diplom-Volkswirt Christoph Braun aus Siegen Tag der Einreichung: 28.01.2011 Tag der Disputation: 04.05.2011 dissertation committee: Prof. Dr. Wolfram F. Richter (Chairman) Prof. Dr. Andreas Schabert Prof. Dr. Ludger Linnemann location: Dortmund faculty: Technische Univeristät Dortmund Wirtschafts- und Sozialwissenschaftliche Fakultät Dean: Prof. Dr. Wolfgang B. Schünemann Christoph Braun: Optimal Taxation of Human Capital, Ph.D. Dissertation, © January 2011 Für meine Tochter Eleonore. A C K N O W L E D G M E N T S This dissertation was written while I was a Ph.D student at the Ruhr Graduate School in Economics. The whole enterprise benefited from the help of Professor Wolfram F. Richter. His support, guidance, and patience cannot be overestimated. I owe the greatest intellectual debt to him. Over the course of time, I have discussed this work with many people at home, at work, at conferences, and at many other occasions. For some, it has hopefully been interest- ing and maybe even stimulating; for others, it may have sometimes been hard to follow my ideas. I would like to express my gratitude to all people who listened to me, who supported me, and who talked with me. Financial support by the Wissenschaftsgemeinschaft Gott- fried Wilhelm Leibniz is gratefully acknowledged. Last but not least, I thank my beloved wife, Judith, for her continuous support and encouragement. Finishing this enterprise would not have been possible without her. I dedicate this dissertation to my daughter, Eleonore. She is my fountain of joy who always reminds me that seeing her laughing and growing up is even more fascinating and mysterious than thinking about economics. v C O N T E N T S 1 introduction 1 1.1 Human Capital and Economic Well-Being 1 1.1.1 Notion of Human Capital 2 1.1.2 Macro Growth Literature 5 1.1.3 Micro Labor Literature 8 1.2 Human Capital and Fiscal Policies 8 1.3 Starting Point of this Dissertation 11 1.4 Related Literature and Shortcomings 16 1.5 Summary of the Dissertation 19 1.A Tables 24 1.B A Simple Example 34 2 optimal taxation of education with an in- itial endowment of human capital 37 2.1 Introduction 37 2.2 The Model 40 2.2.1 Individual’s Problem 40 2.2.2 The Government 42 2.2.3 Firm’s Problem 43 2.2.4 Competitive Equilibrium 43 2.2.5 First-Best Solution 44 2.2.6 Second-Best Solution 46 2.2.7 Results 50 2.3 Conclusion 58 2.A Second-order Conditions 60 3 taxing human capital : a good idea 63 3.1 Introduction 63 3.2 The Model 66 vi 3.2.1 Individual’s Problem 66 3.2.2 Firm’s problem 71 3.2.3 Government’s problem 71 3.2.4 Competitive Equilibrium 72 3.2.5 Social Planner’s Problem – First-Best Analysis 73 3.2.6 Ramsey Problem – Second-Best Anal- ysis 75 3.3 Conclusion and Discussion 85 3.A Proof of Proposition 3.1 87 3.B Derivation of (3.35) 88 4 efficient human capital policy with over- lapping generations and endogenous growth 89 4.1 Introduction 89 4.2 The model and the planner’s first-best problem 96 4.3 Balanced growth 101 4.4 Optimal taxation in the standard OLG model with selfish individuals 105 4.5 Efficient and effective subsidization of educa- tion 116 4.6 Optimal taxation in the OLG model with altruistic individuals 118 4.7 Summary 131 5 efficient subsidization of human capital accumulation with overlapping generations and endogenous growth : a numerical ex- ample 137 5.1 Introduction 137 5.2 Restrictions on the Utility and Production Functions 138 vii 5.2.1 Utility Function 138 5.2.2 Production Function 139 5.2.3 Human Capital Investment Function 140 5.3 Calibration 141 5.4 Endogenous Government Consumption 147 5.5 Ramsey Problem 149 5.6 Discussion and Conclusion 157 5.A First-Order Derivatives of W 160 6 conclusion 162 bibliography 167 viii L I S T O F TA B L E S Table 1.1 The contribution of education to eco- nomic growth (percentage), 1950-1962 24 Table 1.2 Cross-country growth regressions 25 Table 1.3 Effective rate of subsidization for an individual obtaining tertiary educa- tion as part of initial education, ISCED 5/6 (2006), in OECD countries 29 Table 1.4 Regressions of individual earnings on schooling s and experience x (1959 annual earnings of white, nonfarm men) 31 Table 1.5 Growth effects of tax reform 32 Table 1.6 Comparison of steady states under al- ternative tax regimes 33 Table 5.1 Initial equilibrium 146 Table 5.2 Initial choice of parameters 147 Table 5.3 Results 155 ix 1 I N T R O D U C T I O N This dissertation is on optimal taxation of human capital. Why it is intriguing to examine this topic shall be the subject of what follows. I will first introduce and briefly discuss the notion of human capital. Then I will give a summary of the empirical and theoretical findings on (i) the link between human capital and economic well-being and (ii) how tax policies affect the accumulation of human capital. By then, I hope, the reader will be convinced that taxation matters for human capital and hence it matters for economic well-being. Taking this as given, it is fruitful to further pursue and to tackle the leading question of this dissertation: How should the tax system be optimally de- signed to promote the accumulation of human capital to maximize economic well-being? The basic approach to this question will then be presented along with a discussion of related literature. This introduc- tion concludes with a summary of my contributions to the literature on optimal taxation of human capital. 1.1 human capital and economic well-being Education policy ranks high on the political agenda. The following statements are taken from German and US Amer- ican politics: 1 • “Education is a key to personal prosperity, social jus- tice, and wealth.” (Coalition agreement of the CDU, CSU and FDP, p. 6, 2009) • “Growth. Education. Cohesion. Leading Germany to new Strength.” (Title of Federal Chancellor Angela Merkel’s inaugural policy statement, 2009) • “The [] challenge we must address is the urgent need to expand the promise of education in America. In a global economy where the most valuable skill you can sell is your knowledge, a good education is no longer just a pathway to opportunity - it is a pre-requisite.” (President Barack Obama’s speech at a joint session of the United States Congress, 2009) Put a little bit less flowery, one constituent of economic well-being is human capital. A deep and sound understand- ing of what human capital is and how it affects economic well being is therefore indispensable. In this section I would like to first discuss the notion of human capital and then sur- vey literature that attempts at identifying the links between human capital and economic well being. 1.1.1 Notion of Human Capital The term human capital was introduced into economics by Theodore W. Schultz, who was awarded the Sveriges Riksbank Prize in Economic Sciences in Memory of Al- fred Nobel joint with Sir Arthur Lewis in 1979 “for their pioneering research into economic development research with particular consideration of the problems of developing countries”. Schultz’s seminal contribution does not give a 2 formal definition, the usage of the term human capital is motivated by the following example: Much of what we call consumption constitutes investment in human capital. Direct expendi- tures on education, health, and internal migra- tion to take advantage of better job opportunities are clear examples. Earnings foregone by ma- ture students attending school and by workers acquiring on-the-job training are equally clear examples. Yet, nowhere do these enter our na- tional accounts. The use of leisure time to im- prove skills and knowledge is widespread and it too is unrecorded. In these and similar ways the quality of human effort can be greatly improved and its productivity enhanced. I shall contend that such investments in human capital accounts for most of the impressive rise in real earnings per worker. (Schultz, 1961a, p. 1) Gary S. Becker, who was awarded the Sveriges Riksbank Prize in Economic Sciences in Memory of Alfred Nobel “for having extended the domain of microeconomic analysis to a wide range of human behaviour and interaction, including nonmarket behaviour” in 1992, also did not give a formal definition of what he meant by human capital. He writes that expenditures on education, training, medical care, etc., are investments in capital. However, these produce human, not physical or financial, capital because you cannot separate a person from his or her knowledge, skills, health, or 3 values the way it is possible to move financial and physical assets while the owner stays put. (Becker, 1993, p. 16) Both Schultz and Becker have an analogy to the stock of physical capital in mind when they speak of human capital. The way savings act as a flow that increases the stock of physical capital, which then can be used in production, it is only natural also to speak of human capital as a stock variable in which various types of investments are made, of which education is only one example next to many others. Although the other types of investments in human capital are also important, education is the type of investment that will be further studied below. Thus, in a narrower sense, this dissertation could be equally titled as “Optimal Taxation of Education”. From the great many ways how the stock of human capital can be increased, it becomes only evident that is rather difficult to precisely determine its size. Whereas it is relatively easy to determine the value of tangible assets, which is the stock of physical capital holdings, it is relatively complicated to determine the value of the stock of human capital, which is part of the intangible assets. Empirical studies therefore differ with respect to how the stock of human capital is measured. The rather abstract term “well-being” in the beginning is used to indicate that it is not clear how and to which kind of well-being human capital contributes. President Obama’s quote rather refers to the individual, microeconomic view, whereas the two quotes from German politics rather refer to the societal, macroeconomic view. At the macroeconomic level well-being refers to the rate of growth in national 4 income or the level of national income, which may also be expressed in per capita terms. The social rate of return to investments in human capital is another measure. At the microeconomic level one studies the private rate of return to investments in human capital. There is a vast literature that examines the links between human capital and economic well-being at the macro and micro level. Many excellent surveys of each strand of the literature are already available. The macro literature is re- viewed by Topel (1999) and Sianesi and Reenen (2003). Card (1999) and Harmon, Oosterbeek, and Walker (2003) review the micro literature. Finally, Krueger and Lindahl (2001) offer a discussion that tries to bridge the gap between the macro and micro approaches. In what follows, I will draw on these surveys, only sketch the most important contribu- tions and report major results. This is meant only to show that the quoted politicians are right about that human capi- tal is one of the most important constituents of economic well-being. 1.1.2 Macro Growth Literature Very early attempts have been made by Schultz (1961b), Denison (1962, 1967). They differ with respect to the method- ology applied to measure what constitutes human capital.1 The starting point of both Denison’s and Schultz’s work is the growth accounting exercise pioneered by Solow (1957). In Denison’s study, the production factor labor is adjusted to account for different schooling levels to have a mea- 1 Bowman (1964) and Psacharopoulos (1973, 111-118) provide surveys and discussions of their approaches. 5 sure of quality. He simply asks: “What was the division of growth among the sources?” (Denison, 1967, p. 296) For instance, the United States grew by 3.36% between 1950 and 1962. Education’s contribution to growth was 0.49%. As a result, education’s contribution to growth amounted to 15%. Further results can be found in table 1.1 in appendix 1.A. Other contributions in the tradition of growth accounting, but using other measures of the stock of human capital, include Jorgenson and Fraumeni (1992), Mankiw, Romer, and Weil (1992), Hall and Jones (1999) and Klenow and Rodríguez-Clare (1997). Work devoted to actually identifying the correlations be- tween different regressors of human capital and measures of economic well being has been initiated by Barro (1991). To honor his seminal contribution, these cross-country growth regressions are sometimes referred to as Barro-regressions. Sianesi and Reenen (2003) have nicely compiled a table that tries to ease the interpretation and comparability of studies done by Barro (1991), Levine and Renelt (1992), Murphy, Shleifer, and Vishny (1991), Barro (1998), Hanushek and Kim (1995), Gemmell (1996), Judson (1998), Englander and Gurney (1994), Barro and Lee (1994), Benhabib and Spiegel (1994), Mankiw, Romer, and Weil (1992), de la Fuente and Doménech (2006) and Bassanini and Scarpetta (2001). Ta- ble 1.2 shows the results. The overall message from these studies is that a 1 percentage increase in human capital is significantly correlated to an increase in GDP. For instance, Barro (1991) finds that 1 percentage point increase in pri- mary (secondary) school enrolment rates is significantly 6 correlated to a 2.5 (3.0) percentage points increase in per capita GDP growth rates. Another macro related approach is pursued by the OECD in its ‘Education at a Glance’ series, which is based on in- vestment theory. The discount rate is the basis that allows to compare cash flows over time. The internal rate of re- turn is the discount rate at which the discounted streams of benefits and costs are equal. The net present value is calculated by setting the discount rate at some required rate (OECD, 2010, p. 137). Both the internal rate of return and the net present values are thus measures of the profitability of an investment, which then can be easily compared to other investments. The data provided by the OECD can be used to compute what will be called the effective rate of subsidization of education in chapter 4. It measures the extent to which the private rate of return exceeds the social rate of return. Table 1.3 reports the results obtained from the most recent 2010 issue of ‘Education at a Glance’. The evidence whether the private rate of return exceeds the social rate of return is mixed. In a slight majority of coun- tries education is effectively subsidized. Interesting are the cases of Finland, Hungary, Italy, Korea, New Zealand, and the United Kingdom where males and females are treated differently. Belgium and Germany are clear cases in which the social rate of return is larger than the private rate of return, which potentially indicates that tertiary education may not be sufficiently subsidized. 7 1.1.3 Micro Labor Literature Another measure of economic well being is the wage rate earned by an individual. Beginning with the seminal study by Mincer (1974) a vast literature has emerged that provides estimates of the monetary return to education. He found that the returns to schooling and experience were around 10% and 8%, respectively. See table 1.4 in appendix 1.A for the details. Willis (1986) and Psacharopoulos (1994) provide more examples. Ammermüller and Weber (2005) report results for Germany between 1985 and 2002 and find that the returns to education are in the range from 8% to 10% in West Germany and between 7% and 8% in East Germany. To sum up, the positive impact of human capital on economic well being is a well-established fact although estimates of its magnitude differ. A deeper understanding of how the accumulation of human capital is affected is therefore necessary. 1.2 human capital and fiscal policies Easily one can think of many different things that matter for human capital: Families, nutrition, health, the school and university system, taxes and subsidies, and many other phe- nomena and institutions shape the environment in which the accumulation of human capital takes place. Exploring each single issue is undoubtedly an interesting task. What I would like to focus on in the following is the role of fis- cal policies and its impact on the accumulation of human capital. Schultz (1961a) has a rather pessimistic view: 8 Our tax laws everywhere discriminate against human capital. Although the stock of such cap- ital has become large and even though it is ob- vious that human capital, like other forms of reproducible capital, depreciates, becomes obso- lete, and entails maintenance, our tax laws are all but blind on these matters. Whether Schultz is right or wrong has been the subject of numerous theoretical and empirical studies. Myles (2007) provides a comprehensive review of both strands of the literature.2 A priori it is not clear how fiscal policies affect the accumulation of human capital. On the one hand, in- dividual decisions are affected by taxes. Returns to certain economic decisions such as saving, education and labor are reduced, which may be detrimental to individual prosperity and growth. But on the other hand, tax revenue may be spent on institutions, e.g. school quality, that partly offset the negative individual incentives effect and furthermore provide for a conducive and stimulating environment. The studies by Lucas (1990), King and Rebelo (1990), Jones, Manuelli, and Rossi (1993) and Pecorino (1993) do not take into account how tax revenue is spent and only analyze the effects a tax reform has on growth, consumption and the size of the stock of physical capital. These simulation studies demonstrate a clear negative relationship between rising taxes on labor and capital income and growth. See table 1.5 in appendix 1.A for the details. For instance, the model by Lucas (1990) is most closely related to the models studied in the present dissertation. Lucas predicts that the tax rates on capital and labor income change from 36% and 2 Follow-ups are Myles (2009a, 2009b,2009c). 9 40% to 0% and 46%, respectively. The growth rate slightly decreases from 1.50% to 1.47%. But even more important is that these changes bring about increases of the capital stock and consumption of about 33% and 6%, indicating the large positive welfare effects of such a tax reform. The model by Trostel (1993) is closely related to the present dissertation as it explicitly models the accumu- lation of human capital. Time spent on education together with goods are the investments into human capital. More precisely, the accumulation of human capital is described by a Ben-Porath (1967) type production function. For the base- line calibration the result is that a one percent increase in the labor income tax rate causes the stock of human capital to decrease by about 0.39%. Thus, theory clearly shows the negative impact of taxation on the accumulation of human capital. Heckman, Lochner, and Taber (1998) further pursue Trostel’s idea and study a model similar to that by Auerbach and Kotlikoff (1987). Two policy experiments are conducted. First, the experiment is a move from a tax system with a progressive tax on labor income and a 15-percent flat tax on capital income towards a system in which the tax rate on capital income remains at the initial level and a flat tax on labor income is set equal to 7.7% such that it balances the budget. In the second experiment only consumption is taxed at a flat rate of 10%. The tax reforms entail significant increases in the stock of human capital. For instance, the stock of college human capital increases at 2.82% and 1.85% in the flat tax and flat consumption tax experiment, respec- tively, compared to the benchmark case. One can see that the flat tax is more favorable to human capital accumulation than the flat consumption tax, which is more pro-capital. 10 Table 1.6 in 1.A provides further details. Other contributions that empirically study the effect of progressive taxation on schooling and on-the-job training include Dupor, Lochner, Taber, and Wittekind (1996), Heckman, Lochner, and Taber (1999) and Taber (2002). Work done by Blankenau and Simpson (2004) and Ciriani (2007) do take into account how the government spends the tax revenue. The evidence provided is mixed. One has to closely bear in mind on which activities resources are spent. The government may directly provide education or it may subsidize privately provided education. Distortionary taxation may even nullify the positive effect of financing education. To conclude this section, theoretical and empirical work indicates that the way how the returns to saving, educa- tion and labor are taxed and subsidized has effects on the accumulation of human capital in the short and long run. 1.3 starting point of this dissertation The discussion so far has shown that (i) human capital is an important ingredient of economic well-being and (ii) taxation may have adverse effects on the accumulation of human capital. The objective of this dissertation is to provide a normative analysis of these relationships and attempts to provide answers to the question of how the tax system should be optimally designed to promote the accumulation of human capital to maximize economic well- being. The following chapters present several models of optimal taxation that approach this question from different perspectives. The basic economic problem in each model 11 is the same: There is a single individual, endowed with perfect foresight, that lives for a given number of periods. In each period of time it has to make a consumption-leisure decision. It may choose to consume today or tomorrow and thus to save in form of physical capital, which is used as a means of production. Total time available may be spent on working, education and leisure. Working today increases income from labor today whereas time spent on education is the means to accumulate human capital which in turn increases productivity tomorrow and thus labor earnings. Besides foregone earnings, education involves some direct cost, which the government may choose to subsidize. Hence, saving and education are two possibilities to smooth con- sumption over time. Because the government levies linear taxes on the returns to capital and labor to finance its expen- ditures, the individual’s decisions how much to save and how much time to spend on working and education are distorted. How theses taxes and the subsidy should be set to meet the government’s revenue needs is the chief concern of this dissertation. The results depend on the time horizon of the individual, the way human capital is accumulated, and how the individual internalizes the effects of his own education decisions. Particular attention will be paid to how the optimal tax system affects the education decision. The benchmark case is that the individual devotes time to education until the point where the social marginal benefit to education equals the social marginal cost of education. When this point is achieved, education efficiency is said to prevail. All models are set up in the spirit of Ramsey (1927). The basic methodological approach is the same in all models. 12 In what follows, I will lay out the approach and draw on the expositions in Chari and Kehoe (1999), Ljungqvist and Sargent (2004, pp. 490) and Christiano (2010). First, the individual’s and firm’s problem is set up. How the individ- ual and the firms behave given prices and a tax system is described by a set of first-order conditions, which are the best-response functions. A competitive equilibrium consists of a feasible allocation, a price system, and a government policy, such that given a price system and the government policy, the allocation solves the individual’s and firm’s prob- lem, and the government policy satisfies the government’s budget constraint given the allocation and the price sys- tem. The set C collects all competitive equilibria resulting from different government policies. The government then aims at designing a linear tax system that maximizes the individual’s utility while taking the feasibility constraint and the individual’s and firm’s competitive equilibrium behavior into account. Put differently, as each government policy gives rise to a different competitive equilibrium, the Ramsey problem is to choose the one that yields the highest utility. There are two major approaches how to incorporate the individual’s competitive equilibrium behavior: The primal and the dual approach. I first would like to shortly describe the dual approach before I turn to the primal approach, which will be used later on. The dual approach takes all equations describing the competitive equilibrium into ac- count and solves the maximization problem by choosing the allocation and prices. This approach may be intuitively more appealing. But the major disadvantage is that the op- timization problem involves solving for a large number of 13 variables, which may be a cumbersome task. Instead of fol- lowing this direct way, the primal approach asks to choose the optimal allocation that is consistent with competitive equilibrium behavior and then to solve for the government policy and prices that support this optimal allocation as a competitive equilibrium. The key to solving this problem is to use the so-called implementability constraint that sum- marizes the individual’s competitive equilibrium behavior. The implementability constraint is the individual’s bud- get constraint after having substituted out all after-tax prices using the sufficient and necessary individual’s first-order conditions. It is also possible not to use all first-order con- ditions and then to add these conditions separately. This then mixes the dual and the primal approach. The other constraint is the economy’s resource constraint. Let the set R consist of all allocations that satisfy the aforementioned constraints. The Ramsey problem is to choose an allocation, which will be called Ramsey allocation, from the set R that yields the highest utility. The key result is that one can find a price system and a government policy that implement the Ramsey allocation as a competitive equilibrium, or put shorter, that the two sets C and R are equal. First, one has to show that any allo- cation that is in the set C is also in the set R. This is fairly intuitive because the implementability constraint is directly derived from the individual’s competitive equilibrium con- ditions. Because the individual’s and government’s budget constraints are satisfied the resource constraint is satisfied by Walras’s law. This proves the first inclusion. Then, second, one needs to show that any allocation satis- fying the implementability and resource constraint satisfies 14 competitive equilibrium behavior. This means one has to find prices and a government policy, namely the tax rates, such that the allocation that is in R is also in C. To do this, use the individual’s and firm’s first-order conditions, eval- uate them at the Ramsey allocation, and determine prices and tax rates such that the allocation is in line with the individual’s and firm’s competitive equilibrium behavior as described by the first-order conditions. By construction, the Ramsey allocation satisfies the individual’s budget con- straint and the economy’s resource constraint. By Walras’ law, the government’s budget constraint is satisfied as well. The preceding discussion also helps to clarify the notion of the implementability constraint. The aim is to look for prices and tax rates that implement the Ramsey allocation as a competitive equilibrium. Any Ramsey allocation, which by construction satisfies the implementability constraint, can be implemented as a competitive equilibrium via an appropriate choice of tax rates. To sum up, Ljungqvist and Sargent (2004, p. 491) provide the following recipe: 1. Derive the first-order conditions of the individual’s and firm’s problem. These are the best-response func- tions given prices and tax-rates. Solve for these prices and tax-rates as functions of the allocation. 2. Substitute out any prices and tax rates in the indi- vidual’s intertemporal budget constraint. This leaves one with a constraint only taking the allocation as its argument. 15 3. Solve the Ramsey problem by maximizing the indi- vidual’s utility subject to the implementability and resource constraint. 4. Use the Ramsey allocation to find the tax rates and prices. Given the optimal tax rates and prices, the Ramsey alloca- tion is feasible and simultaneously solves the individual’s and firm’s problem. As a result, one can state the following proposition, which fully characterizes the methodological foundation of this dissertation: Proposition 1.1 (taken from Chari and Kehoe (1999), Propo- sition 1). The allocations in a competitive equilibrium satisfy the resource constraints and the implementability constraint. Fur- thermore, given allocations which satisfy these constraints, we can construct policies and prices which, together with the given allocations, constitute a competitive equilibrium. To get a deeper understanding of this rather abstract description of the primal approach, appendix 1.B provides a simple and fully fleshed out example taken from Christiano (2010). 1.4 related literature and shortcomings The models presented in this dissertation are simple. Many important issues have not been considered, which I would like to comment on now. It will be assumed that the government commits to a policy chosen at the outset of time. Reoptimization during the course of time is ruled out. The well-known time in- consistency (Kydland and Prescott (1977), Fischer (1980)) 16 problems are assumed away.3 Boadway, Marceau, and Marc- hand (1996) and Andersson and Konrad (2003) address these issues. Two forms of market failure are not discussed: (i) Borrow- ing constraints and (ii) incomplete insurance markets. The presence of market failure is generally taken as an argu- ment favoring government intervention if the government has the means at its disposal to yield a better outcome than the market solution. If the individual when young is borrowing constrained, he is not able to finance consumption and the direct cost of education by borrowing against his future income when old. Credit markets for young individuals are thus imperfect. The assumption that individuals are borrowing constrained has been reassessed in several models by Kane (1994), Card (1999) and Ellwood and Kane (2000) who argue that low- income earners are indeed borrowing constrained which then explains their lower participation rates in tertiary edu- cation. But the more important question actually is in which way individuals are constrained. Certainly, in the short run individuals may be borrowing constrained which is why the government must intervene. Cameron and Heckman (2001) and Carneiro and Heckman (2002) argue that long- term constraints serve better to explain the relative absence of children of low-income earners in tertiary education. Incomplete insurance markets are an issue if the individ- ual is subject to idiosyncratic shocks that are uninsured. Instead it is assumed that the individual is endowed with perfect foresight. This means there is no uncertainty about the state of the world tomorrow. The individual knows 3 See also Golosov and Tsyvinski (2008) for a short discussion. 17 today what will be the output of the production function tomorrow given his savings and time devoted to working. He also knows what will be the size of the stock of human capital tomorrow given the time spent on education today. The chance of failing or not being able to fully use the stock of human capital is absent. Put shorter, human capital is not risky. Contributions tackling this issue include Wigger and von Weizsäcker (2001), Krebs (2003), da Costa and Maestri (2007), Anderberg (2009) and Jacobs, Schindler, and Yank (2010). There are two approaches to study questions of opti- mal taxation, which have been pioneered by Ramsey (1927) and Mirrlees (1971). Both approaches derive different re- sults regarding the optimal taxation of human capital. The Mirrlees approach4 rests on the assumption that the govern- ment is not able to observe the individuals’ skills. Hence, it has to take an informational asymmetry into account and the aim then is to set up a tax system that is incentive- compatible such that the high-skill individual does not mimic the low-skill individual. Contributions by Boven- berg and Jacobs (2005), Jacobs (2005), Jacobs and Bovenberg (2010a, 2010b) and Grochulski and Piskorski (2010) put emphasis on endogenous human capital accumulation. A positive tax wedge on saving is used to set correct incen- tives.5 Ramsey models have been studied by Lucas (1990), Yuen (1991), Bull (1993), Jones, Manuelli, and Rossi (1997), Chari and Kehoe (1999), Erosa and Gervais (2002), Barbie 4 Useful surveys include Mirrlees (1986) and Golosov, Tszvinski, and Werning (2007). The latter survey also contains a discussion of the pros and cons of the Ramsey and Mirrlees approach on which Diamond (2007) and Judd (2007) comment. 5 A wedge is the difference between an individual marginal rates of sub- stitution and marginal rates of transformation. These wedges determine optimal taxes. See Kocherlakota (2005) for a discussion. 18 and Hermeling (2006) and Richter (2009). Both strands of the literature provide results on education efficiency. But results differ with respect to the role of capital income tax- ation. Chamley (1986) and Judd (1985) show that within Ramsey models there should be no wedge between the intertemporal rate of substitution and the marginal rate of transformation, or, alternatively, that the capital income tax rate is zero in the long run.6 The converse is true in Mir- rlees models if skills evolve stochastically over time.7 The difference is that within Ramsey models the prime objective is achieving allocative efficiency, that is, not to distort the intertemporal allocation of consumption. This objective is pursued independent of whether the models feature hu- man capital or not (see Jones, Manuelli, and Rossi (1997)). Within Mirrlees models, capital income is taxed to discour- age savings and therefore to make invests in human capital more attractive. This result is driven by the informational asymmetry between the government and the individuals. 1.5 summary of the dissertation Chapter 2 discusses contributions by Bovenberg and Jacobs (2005) and Richter (2009), who set up two-period models of the Mirrlees and of the Ramsey type and derive the so-called education efficiency theorem: In a second-best op- timum, the education decision is undistorted. The before- and after-tax rates of return to education are equal. This 6 When one imposes certain restrictions on the tax rates, capital income may be taxed as Chari and Kehoe (1999) show. 7 Otherwise, if the distribution of skills over time is constant, the intertem- poral margin is not distorted as Golosov, Kocherlakota, and Tsyvinski (2003) show. 19 result crucially depends on the way the accumulation of human capital is modeled, which is as follows. In the first period, the individual spends time on education, which enters a human capital production function as the only input. The output increases the stock of human capital in the second period. Bovenberg and Jacobs (2005) and Richter (2009) assume that the stock of human capital in the sec- ond period equals only the output of the human capital production function, which is assumed to be isoelastic with respect to education. This means that the function express- ing the stock of human capital in the second period and the human capital production function are the same. By contrast, I assume that in the first period the individual is endowed with an initial stock of human capital. The chapter then studies the effect that the initial stock of human capital has on optimal taxation. The stock of human capital in the second period is assumed to be the sum of the output of the isoelastic human capital production function, which takes education as the only input, and the initial stock of human capital net of depreciation. This implies that the elasticity of the function expressing the stock of human capital in the second period is increasing. Then the education efficiency theorem no longer holds. In a second-best optimum, the discounted marginal social return to education is smaller than the marginal social cost. The individual overinvests in human capital relative to the first best. As a result, the government effectively subsidizes the return to education. Chapter 3 tackles the following question asked by Jones, Manuelli, and Rossi (1997): “Is physical capital special?” Using the Ramsey approach, they add human capital to an optimal taxation model with physical capital. By model- 20 ing human capital almost symmetrically to physical capital they show that in a stationary state all taxes are zero. What drives this zero-tax result is that the human capital pro- duction function features constant returns to scale with respect to the stock of human capital. But they acknowl- edge that if the human capital production function violates this assumption, the stationary-state labor tax will not be zero. This chapter takes up the issue of modeling human capital almost symmetrically to physical capital. I drop the constant-returns-to-scale assumption. The human capital production function does not include the current stock of human capital and only includes the individual’s time de- voted to education. The individual has to pay for verifiable direct costs, e.g., tuition fees, that depend on the amount of education. The government may choose to subsidize this cost. I derive two results: First, optimal taxation in the stationary state prescribes not taxing capital income. The zero-capital-tax result holds despite the presence of human capital. The education decision depends only on how the labor tax and the education subsidy interact with each other. This relates to the second result, stating that in the opti- mum the marginal social return to education is larger than the marginal social cost. The so-called Education Efficiency Theorem (Richter, 2009) does not hold. From the inequality between the marginal social return and the marginal social cost it follows that education is effectively taxed. Turning to the underlying tax rates, it results that the cost of edu- cation is not fully tax-deductible, the labor income tax rate is higher than the rate of subsidization. As a consequence, the individual underinvests in human capital relative to the first best. 21 Chapter 4, which is joint work with Wolfram F. Richter, studies a model with overlapping generations and endoge- nous growth. Individuals live for two periods. They decide on education, saving, and nonqualified labor in their youth. They supply qualified labor when old. The productivity of qualified labour increases in the stock of human capital inherited from preceding generations, and it also increases in own educational investments. Individuals either may be perfect altruists with respect to descendent generations or may behave selfishly. Assuming selfish individuals yields the result that it is second best not to distort education if the human capital investment function is isoelastic in edu- cation. If, however, the elasticity of the investment function is increasing, which happens when the human capital stock does not depreciate completely if just one generation fails to invest, it is second best at balanced growth to subsidize education even relative to the first best. Assuming altruistic individuals changes some conclusions, but not all. Altruists internalize the positive effect that education has on descen- dents’ productivity. For all generations except the first one the accumulation of human capital should not be distorted, and this result is obtained for arbitrary utility and human capital investment functions. The accumulation of physical capital should not be distorted if the taxpayer’s utility is weakly separable between consumption and non-leisure and homothetic in consumption. Furthermore, qualified and nonqualified labour should be taxed uniformly across the life cycle when utility exhibits balanced growth path properties. Chapter 5 provides a numerical analysis of an important result derived in chapter 4. Two effects have been identified 22 that serve to justify the subsidization of education if indi- viduals are selfish. First, the most well-known justification is to internalize intergenerational external effects of edu- cation. Second, distortionary labor taxation has a negative effect on education and thereby on growth. For this reason, education should be subsidized relative to the first best if the elasticity of the human capital investment function is increasing. The chapter numerically analyzes the impact an increasing elasticity has on optimal taxation and studies the importance of the intergenerational external effect and of the distortionary labor taxation effect as reasons to justify the subsidization of education. As it turns out, the case for subsidizing education to account for distortionary labor taxation is rather weak. The still dominant justification for subsidizing education is to internalize intergenerational externalities. 23 1.a tables Table 1.1: The contribution of education to economic growth (per- centage), 1950-1962 Country Growth rate Growth rate National Education explained income by education United States 3.36 0.49 15 Belgium 3.03 0.43 14 Denmark 3.36 0.14 4 France 4.70 0.29 6 Germany 7.26 0.11 2 Italy 5.95 0.40 7 Netherlands 4.52 0.24 5 Norway 3.47 0.24 7 United Kingdom 2.38 0.29 12 Source: Denison (1967, tables 21-1–21-20) 24 Table 1.2: Cross-country growth regressions StudyDependentVariableHumanCapitalProxyFlow/StockEstimatedCoefficientIllustrationofImpact Barro(1991)growthrateofreal percapitaGDP annual1960–85 schoolenrolmentrate: numberofstudents enrolledinthedesignated gradelevels(primaryand secondaryrespectively) relativetothetotal populationofthecorres- pondingagegroupin1960 initialflow mean: prim60:0.78 sec60:0.23 prim¼0.025 sec¼0.030 A1percentagepoint increaseinprimary (secondary)school enrolmentratesis associatedwitha2.5 (3.0)percentagepoints increaseinpercapita GDPgrowthrate. Levineand Renelt(1992) growthrateofreal percapitaGDP annual 1960–89 secondaryschoolenrol- mentratein1960 initialflowhigh¼3.71 base¼3.17 low¼2.5 A1percentagepoint increaseinsecondary schoolenrolmentrate isassociatedwitha between2.5and3.7 percentagepoints increaseinpercapita GDPgrowthrate. Murphy, Schleiferand Vishny(1991) growthrateofreal percapitaGDP between1970–85 primaryschoolenrol- mentratein1960 initialflowfullsample: 0.022 (OECD:not significant) A1percentagepoint increaseinprimary schoolenrolmentrate isassociatedwitha 2.2percentagepoints increaseinpercapita GDPgrowthrate. Barro(1997)growthrateofreal percapitaGDP over1965–75, 1975–85, 1985–90 averageyearsofattain- mentformalesaged25 andoverinsecondary andhigherschoolsat thestartofeachperiod initialstocksin 1965,75and85 meanin1990¼ 1.9years 0.012Anextrayearofmale upper-levelschooling isassociatedwitha 1.2percentagepoint increaseinpercapita GDPgrowthrate. (continued) 25 StudyDependentVariableHumanCapitalProxyFlow/StockEstimatedCoefficientIllustrationofImpact Hanushek andKim(1995) growthrateofreal percapitaGDP between60–90 averageyearsofsecondary schoolingofadultmale populationatbeginning ofperiod initialstock0.36Anextrayearofmale secondaryschoolingis associatedwitha0.36 percentagepointincrease inpercapitaGDP growthrate. Gemmel(1996)growthrateof realpercapitaGDP annual60–85 constructedhumancapital stockin1960andhuman capitalannualaverage growthratesatprimary, secondaryandtertiary levels. Thesemeasuresareboth enteredintheequation simultaneously. initialstock mean: prim¼72.8 sec¼19.5 tert¼4.0 annualflows mean: prim¼2.5 sec¼3.7 tert¼2.7 Fullsample: primstock¼0.81 primflow¼2.68 PoorestLDCs: primstock¼0.91 primflow¼4.19 IntermediateLDCs: secstock¼1.09 OECD: tertstock¼1.10 tertflow¼5.89 ForOECD: A1percentincrease intertiaryhuman capitalstockis associatedwitha1.1 percentagepoint increaseinpercapita GDPgrowthrate. A1percentagepoint increaseintertiary humancapitalgrowth isassociatedwitha 5.9percentagepoints increaseinpercapita GDPgrowthrate. Judson(1998)growthrateofreal GDP 5-yearsaverages, 1960–90 growthofherconstructed measureofhumancapital stock periodflows10.8 low-efficiency countries ¼3.0 high-efficiency ¼12.9 A1percentagepoint increaseinhuman capitalgrowthis associatedwithan 11percentagepoints increaseinGDP growthrate. (continued) 26 Englanderand Gurney(1994) growthoflabour productivity(and totalfactor productivity)over fourtimeperiods schoolenrolmentrates: numberofstudents enrolledinsecondary schoolrelativetothe totalpopulationofthe correspondingagegroup inbeginningofperiod initialflowOECD: 1.45–1.78 A1percentagepoint increaseinsecondary schoolenrolmentrate isassociatedwith around1.5percentage pointincreasein productivitygrowth. BarroandLee (1994) lnGDP perworker averageyearsof secondaryschoolingof adultmalepopulation atbeginningofperiod initialstock0.014Anextrayearofmale secondaryschoolingis associatedwitha 1.4percentincreasein perworkerGDPgrowth. Benhabiband Spiegel(1994) lnGDPpercapitahumancapitalstock estimatesfromKyriacou: averagelevelofloghuman capitalovertheperiod (logofaveragelevelof humancapital; logofaveragelevels) averagestock0.12–0.17A1percentincreasein thestockofhuman capitalisassociated witha12to17percent increaseinpercapita GDPgrowth. Mankiw, Romer andWeil (1992) lnGDPper working-age person averagepercentageof working-agepopulationin secondaryschool,1960–85 periodflow0.66 impliedoutput elasticitywith respecttohuman capitalstock¼0.28 A1percentincreasein theaveragepercentage ofworking-agepopu- lationinsecondary schoolisassociated witha0.66percent increaseinGDPper working-ageperson. A1percentincrease inhumancapitalstock isassociatedwitha0.28 percentincreaseinGDP (continued) 27 StudyDependentVariableHumanCapitalProxyFlow/StockEstimatedCoefficientIllustrationofImpact delaFuente andDome´nech (2000) GDPperworker Annual, 1960–90 OECD averagenumberof yearsofschooling oftheadult population stock mean’90¼10.49 (UK’90¼10.94) impliedoutput elasticitywith respecttohuman capitalstock¼0.27 A1percentincreasein humancapitalstockis associatedwitha0.27 percentincreaseinGDP. Atthesamplemean,an increaseinaverage educationbyoneyear wouldraiseoutputper capitabyca.3percent. Bassaniniand Scarpetta (2001) GDPper working-age person Annual,1971–98 OECD averagenumberof yearsofschooling oftheadult population stock mean¼10.15 impliedoutput elasticitywith respecttohuman capitalstock¼0.57 A1percentincreasein humancapitalstockis associatedwitha0.57 percentincreaseinGDP. Atthesamplemean,an increaseinaverage educationbyoneyear wouldraiseoutputper capitabyca.6percent. Source: Sianesi and Reenen (2003, p. 177-180, table 2) 28 Table 1.3: Effective rate of subsidization for an individual ob- taining tertiary education as part of initial education, ISCED 5/6 (2006), in OECD countries Country Male Female Australia −0, 32 −0, 05 Austria 0, 07 0, 04 Belgium −0, 32 −0, 23 Canada 0, 14 0, 17 Czech Republic 0, 23 0, 19 Denmark −1, 46 −0, 22 Finland −0, 03 0, 07 France na na Germany −0, 34 −0, 32 Hungary 0, 01 −0, 31 Italy 0, 17 −0, 26 Korea −0, 03 0, 07 Netherlands −0, 16 −0, 15 New Zealand −0, 14 0, 10 Norway 0, 11 0, 34 Poland 0, 20 0, 22 Portugal 0, 18 0, 01 Spain 0, 26 0, 31 Sweden 0, 07 0, 62 Turkey 0, 41 0, 44 United Kingdom 0, 05 −0, 11 United States na na OECD average 0, 06 0, 07 Source: OECD (2010, chapter A8), own calculations The OECD provides the following pieces of information that are used to compute the effective rate of subsidization: • Private net present value, PrivNPV • Public net present value, PubNPV • Private direct costs, PrivDC 29 • Private foregone earnings, PrivFE • Public direct costs, PubDC • Public foregone earnings, PubFE The private rate of return is defined as follows: PRR = PrivNPV PrivDC + PrivFE The social rate of return is defined as follows: SRR = PrivNVP + PubNPV PrivDC + PrivFE + PubDC + PubFE The effective rate of subsidization seff then is: seff = PRR − SRR PRR 30 Table 1.4: Regressions of individual earnings on schooling s and experience x (1959 annual earnings of white, nonfarm men) Equation forms R2 1. lny = 7.58+ 0.070s 0.067 (43.8) 2. lny = 6.20+ 0.107s+ 0.081x− 0.0012x2 0.285 (72.3) (75.5) (−55.8) Source: Mincer (1974, p. 92, table 5.1) 31 Table 1.5: Growth effects of tax reform le ve l ef fe ct . In c on tr as t, K in g an d R eb el o (1 99 0) a nd J on es e t al . (1 99 3) f in d ve ry st ro ng g ro w th a nd le ve l e ff ec ts . K in g an d R eb el o co ns id er th e ef fe ct o f an in cr ea se in th e ca pi ta l ta x by 1 0% w he re as J on es e t al . m ir ro r L uc as b y el im in at in g th e ca pi ta l ta x. W ha t d is tin gu is he s th e K in g an d R eb el o an al ys is is th at th ey h av e ph ys ic al c ap ita l en te ri ng i nt o th e pr od uc tio n of h um an c ap ita l. Jo ne s et a l. em pl oy a h ig he r va lu e fo r th e el as tic ity o f la bo ur s up pl y th an o th er s tu di es . T he m od el o f Pe co ri no ( 19 93 ) ha s th e fe at ur e th at c ap ita l i s a se pa ra te c om m od ity to th e co ns um pt io n go od . T hi s pe rm its di ff er en t f ac to r in te ns iti es in th e pr od uc tio n of h um an c ap ita l, ph ys ic al c ap ita l a nd th e co ns um pt io n go od . C om pl et e el im in at io n of t he c ap ita l ta x ra is es t he g ro w th r at e, i n co nt ra st to th e fi nd in g of L uc as . T he im po rt an ce o f ea ch o f th e el em en ts in e xp la in in g th e di ve rg en ce b et w ee n th e re su lts i s st ud ie d in S to ke y an d R eb el o (1 99 5) . U si ng a m od el t ha t en co m pa ss es th e pr ev io us t hr ee , t he y sh ow t ha t th e el as tic ity o f su bs tit ut io n in p ro du ct io n m at te rs lit tle f or th e gr ow th e ff ec t b ut d oe s ha ve im pl ic at io ns f or th e le ve l e ff ec t - w ith a h ig h el as tic ity o f su bs tit ut io n, a t ax s ys te m t ha t tr ea ts i np ut s as ym m et ri ca lly w ill b e m or e di st or tio na ry . T he e lim in at io n of t he d is to rt io n th en l ea ds t o a si gn if ic an t w el fa re in cr ea se . T he i m po rt an t fe at ur es a re t he f ac to r sh ar es i n pr od uc tio n of h um an c ap ita l an d ph ys ic al c ap ita l, th e in te rt em po ra l el as tic ity o f su bs tit ut io n in u til ity a nd t he el as tic ity o f la bo ur s up pl y. S to ke y an d R eb el o co nc lu de t ha t th e em pi ri ca l ev id en ce pr ov id es s up po rt f or v al ue s of t he se p ar am et er s w hi ch j us tif y L uc as ' c la im t ha t th e gr ow th e ff ec t i s sm al l. C ap ita l 0 % L ab or 0% G ro w th 2 .7 4% C ap ita l 0 % L ab or 0% G ro w th 4 .0 0% C ap ita l 3 0% L ab or 20 % G ro w th 0 .5 0% C ap ita l 0 % L ab or 4 6% G ro w th 1 .4 7% Fi na l P os iti on A dd iti on al O bs er va tio ns In iti al T ax R at es a nd G ro w th R at e U til ity Pa ra m et er s Fe at ur es A ut ho r C ap ita l a nd co ns um pt io n di ff er en t g oo ds , co ns um pt io n ta x re pl ac es in co m e ta xe s C ap ita l 4 2% L ab or 20 % G ro w th 1 .5 1% σ = 2 α = 0. 5 Pr od uc tio n of h um an ca pi ta l r eq ui re s ph ys ic al c ap ita l Pe co ri no (1 99 3) 10 % in cr ea se in ca pi ta l s to ck 29 % in cr ea se in co ns um pt io n C ap ita l 2 1% L ab or 31 % G ro w th 2 .0 0% σ = 2 α= 4 .9 9 α ca lib ra te d gi ve n σ T im e an d ph ys ic al ca pi ta l p ro du ce hu m an c ap ita l Jo ne s, M an ue lli an d R os si ( 19 93 ) L ab or s up pl y is in el as tic C ap ita l 2 0% L ab or 2 0% G ro w th 1 .0 2% σ = 2 α = 0 Pr od uc tio n of h um an ca pi ta l r eq ui re s ph ys ic al c ap ita l (p ro po rt io n = 1/ 3) K in g an d R eb el o (1 99 0) 33 % in cr ea se in ca pi ta l s to ck 6% in cr ea se in co ns um pt io n C ap ita l 3 6% L ab or 4 0% G ro w th 1 .5 0% σ = 2 α = 0. 5 Pr od uc tio n of h um an ca pi ta l d id n ot re qu ir e ph ys ic al ca pi ta l L uc as ( 19 90 ) C ap ita l 0 % L ab or 0% G ro w th 2 .7 4% C ap ita l 0 % L ab or 0% G ro w th 4 .0 0% C ap ita l 3 0% L ab or 20 % G ro w th 0 .5 0% C ap ita l 0 % L ab or 4 6% G ro w th 1 .4 7% Fi na l P os iti on A dd iti on al O bs er va tio ns In iti al T ax R at es a nd G ro w th R at e U til ity Pa ra m et er s Fe at ur es A ut ho r C ap ita l a nd co ns um pt io n di ff er en t g oo ds , co ns um pt io n ta x re pl ac es in co m e ta xe s C ap ita l 4 2% L ab or 20 % G ro w th 1 .5 1% σ = 2 α = 0. 5 Pr od uc tio n of h um an ca pi ta l r eq ui re s ph ys ic al c ap ita l Pe co ri no (1 99 3) 10 % in cr ea se in ca pi ta l s to ck 29 % in cr ea se in co ns um pt io n C ap ita l 2 1% L ab or 31 % G ro w th 2 .0 0% σ = 2 α= 4 .9 9 α ca lib ra te d gi ve n σ T im e an d ph ys ic al ca pi ta l p ro du ce hu m an c ap ita l Jo ne s, M an ue lli an d R os si ( 19 93 ) L ab or s up pl y is in el as tic C ap ita l 2 0% L ab or 2 0% G ro w th 1 .0 2% σ = 2 α = 0 Pr od uc tio n of h um an ca pi ta l r eq ui re s ph ys ic al c ap ita l (p ro po rt io n = 1/ 3) K in g an d R eb el o (1 99 0) 33 % in cr ea se in ca pi ta l s to ck 6% in cr ea se in co ns um pt io n C ap ita l 3 6% L ab or 4 0% G ro w th 1 .5 0% σ = 2 α = 0. 5 Pr od uc tio n of h um an ca pi ta l d id n ot re qu ir e ph ys ic al ca pi ta l L uc as ( 19 90 ) F ig ur e 4. 2: G ro w th E ff ec ts o f T ax R ef or m T he se s im ul at io ns m od el s pr od uc e a va ri et y of r es ul ts b ut d o no t cl ar if y th e pr ec is e fa ct or s th at a re r es po ns ib le o r pr ov id e m uc h in si gh t i nt o th e ge ne ra l o ut co m e. A n um be r of a na ly tic al r es ul ts a re p ro vi de d in M ile si -F er re tti a nd R ou bi ni ( 19 98 ). T he m od el th ey c on si de r en co m pa ss es m os t o f th os e de sc ri be d ab ov e. I t h as s ep ar at e pr od uc tio n te ch no lo gi es f or p hy si ca l an d hu m an c ap ita l. In te re st in gl y, i t co ns id er s Source: Myles (2009a, p. 28, table 2) 32 Table 1.6: Comparison of steady states under alternative tax regimes Percentage difference from benchmark progressive case Flat Flat tax consumption tax Stock of physical capital −0.79 19.55 Stock of college human capital 2.82 1.85 Stock of highschool human capital 0.90 0.08 Aggregate output 1.15 4.98 Source: Heckman, Lochner, and Taber (1998, table 1) 33 1.b a simple example The individual’s problem is to maximize utility u(c, l) over the choice of consumption, c, and labor supply, l, subject to the budget constraint c 6 w(1− τ)l where w is the wage rate and τ is the labor tax rate. The first-order condition ucw(1− τ) + ul = 0 along with the budget constraint define the optimal choices of c and l as functions of the tax rate τ, that is, c˜ = c(τ) and l˜ = l(τ). The Ramsey problem is to find a government policy τ that maximizes the individual’s utility u(c(τ), l(τ)) subject to the budget constraint g 6 wl(τ)τ. The Ramsey equilib- rium consists of an optimal government policy τ∗ and an optimal allocation {c∗, l∗} such that the allocation is an op- timal choice of the individual given τ∗, that is, c∗ = c(τ∗) and l∗ = l(τ∗), and τ∗ solves the Ramsey problem. Given a government policy τ, let C denote the set of allocations that constitute a competitive equilibrium. Put formally: C = { (c, l) : ∃τ s.t. ucw(1− τ) + ul = 0, c = w(1− τ)l,g 6 wl(τ)τ } To employ the Ramsey approach, one has to derive the implementability constraint. Replace (1− τ)w in the indi- 34 vidual’s budget constraint using the first-order condition (1− τ)w = − ul uc . (1.1) Then the implementability constraint reads ucc+ ull = 0. (1.2) The set R consists of all allocations that satisfy the econ- omy’s resource constraint, c + g 6 wl, and the imple- mentability constraint: R := {(c, l) : c+ g 6 wl,ucc+ ull = 0} Proposition 1.2. C = R Proof. First, prove that C j R. (c, l) ∈ C satisfies the imple- mentability constraint (1.2) by construction. If one combines the individual’s and government’s budget constraint, one derives the resource constraint. Second, show that R j C. Choose τ such that (1.1) is satisfied. Then multiply (1.1) by l and use (1.2): (1− τ)wl = − ull uc = c The individual’s budget constraint is satisfied. If one combines the individual’s budget constraint and the re- source constraint, one ends up with the government’s bud- get constraint which is also a direct consequence of Walras’s law. 35 The proposition shows that it is possible to find a tax rate τ that implements the optimal allocation (c, l) as a competitive equilibrium. 36 2 O P T I M A L TA X AT I O N O F E D U C AT I O N W I T H A N I N I T I A L E N D O W M E N T O F H U M A N C A P I TA L 2.1 introduction Bovenberg and Jacobs (2005) and Richter (2009) set up two- period models of the Mirrlees and of the Ramsey type and derive the so-called education efficiency theorem: In a second- best optimum, the education decision is undistorted. The before- and after-tax rates of return to education are equal. This result crucially depends on the way the accumulation of human capital is modeled, which is as follows. In the first period, the individual spends time on education, which enters a human capital production function as the only in- put. The output increases the stock of human capital in the second period. Bovenberg and Jacobs (2005) and Richter (2009) assume that the stock of human capital in the second period equals only the output of the human capital produc- tion function, which is assumed to be isoelastic with respect to education.1 This means that the function expressing the stock of human capital in the second period and the human capital production function are the same. A debatable im- 1 Jacobs and Bovenberg (2010b) further analyze the human capital pro- duction function’s properties and find that a constant elasticity is crucial for their results in Bovenberg and Jacobs (2005). Richter (2009) refers to the so-called power law of learning, a result from cognitive psychology that provides evidence in favor of a constant elasticity. See Ritter and Schooler (2001) for more details. 37 plication of modeling the law of motion for human capital this way is that the stock of human capital in the second period is zero if the individual does not spend any time on education in the first period. Then effective labor supply is zero, because it is modeled as the product of raw labor supply and the then existing stock of human capital. Con- sequently, the individual does not earn any labor income. Put more briefly, if the individual wants to reap benefits of human capital, it first has to spend time on education. Or, as an alternative interpretation of this implication, consider a two-period overlapping-generations (OLG) model. When young, the individual accumulates human capital, which it uses when old. When old, it passes on the then existing stock of human capital to the newly born young individual so that it can further increase the stock by spending time on education. But when a young individual stops devoting time to education, the stock of human capital in the second period is zero. Consequently, the old individual could not pass on human capital to the newly born young individual. By contrast, I assume that in the first period the individ- ual is endowed with an initial stock of human capital. The present paper then studies the effect that the initial stock of human capital has on optimal taxation. The stock of human capital in the second period is assumed to be the sum of the output of the isoelastic human capital production func- tion, which takes education as the only input, and the initial stock of human capital net of depreciation. This implies that the elasticity of the function expressing the stock of human capital in the second period is increasing. With this specifi- cation, the stock of human capital in the second period does not drop to zero even when the individual stops spending 38 time on education in the first period, or, referring back to the OLG interpretation, the old individual can then pass on human capital even when it may not have spent time on education when young. Then the education efficiency theorem no longer holds. In a second-best optimum, the discounted marginal social return to education is smaller than the marginal social cost. The individual overinvests in human capital relative to the first best. As a result, the government effectively subsidizes the return to education. The general-equilibrium model used here comprises a single individual, a firm, and a government. The individual lives for two periods, in which it faces a consumption-labor- leisure choice. In the first period, it chooses how much time to devote to work and education. In the second period, it only decides how much to work. Time spent on education is transformed into human capital by means of a human capital production function. The individual combines its raw labor supply with the then existing stock of human capital, giving the effective labor supply. It chooses to lend capital to a firm, which takes it as an input, jointly with the effective labor supply, and pays a return. The firm produces a single consumption good. Time spent on education brings about disutility and comes at the cost of forgone earnings and some direct costs such as tuition fees. All actions of the individual are assumed to be fully observable. The government levies linear taxes on the individual’s income from work and saving to finance an exogenously given stream of expenditures. Furthermore, it may choose to sub- sidize the direct cost of education. The question then is how to optimally choose linear taxes and the subsidy to maxi- mize the individual’s utility given exogenous government 39 expenditures and subject to the individual’s competitive equilibrium behavior. 2.2 the model 2.2.1 Individual’s Problem The individual solves the following maximization problem: L = U(C0,L0 + E) +βU(C1,L1) + λ0 ( ω0L0H0 + R τ 0K0 −C0 −K1 −ϕE ) +βλ1 ( ω1L1H1 + R τ 1K1 −C1 ) + µ ( G(E) + (1− δH)H0 −H1 ) . (2.1) The individual’s utility function is strictly increasing in consumption, Ct, and strictly decreasing in the nonleisure times L0 + E and L1. It is strictly concave in both arguments and time-separable. Savings serve as a means to smooth consumption over time. They pay the net rate of return Rτt ≡ (1−τ K t )rt+1−δK, where τKt is a linear tax on the gross rate of return rt, and δK is the rate at which the stock of capital Kt depreciates. Raw labor supply Lt is combined with the stock of human capital Ht accumulated so far. Effective labor supply LtHt earns the net wage rate ωt ≡ (1 − τLt )wt, where τ L t is a linear tax on the gross wage rate wt. Let ϕ ≡ (1− τH)f be the direct cost of education net of the subsidy τH, where f is an exogenous (fee) parameter. The endowments of the initial stocks of human capital, H0, and capital, K0, are given. β is the private discount factor. 40 The law of motion H1(E) = G(E) + (1− δH)H0 (2.2) governs the evolution of the stock of human capital, which depreciates at the rate δH 6 1. Here G is the human capital production function, which takes time E as its only input factor. It is isoelastic: G(E) = aEγ (2.3) with 0 < γ < 1. The coefficient a > 0 is a shift parameter. (2.2) and (2.3) imply that the elasticity η of the function of the stock of human capital H1(E) is strictly increasing as long as the initial stock of human capital does not fully depreciate.2 By setting H0 = 0 or δH = 1, one obtains the model underlying the analysis in Bovenberg and Jacobs (2005) or Richter (2009). Then the elasticity of H1(E) equals γ. To have a well-behaved problem, it does not suffice to apply the Inada conditions. An analysis of the second-order conditions, which is done in Appendix 2.A, reveals that moreover one has to assume that the utility function is sufficiently concave to compensate for the lack of concavity of the law of motion (2.2) for human capital. Put formally, the requirement says that γ < υ1/(1+ υ1), where γ is the elasticity of the human capital production function (2.3), and υ1 = L1UL1L1/UL1 , which captures the concavity of the utility function with respect to second-period labor supply. 2 Proof. ddEη ≡ d dE EH ′1(E) H1(E) = γG ′(E)H1(E)−G(E) H1(E)2 > 0 for δH < 1. 41 This condition will show up again when labor taxation is analyzed in section 2.2.7.3. Let UCt and ULt denote the partial derivatives with re- spect to consumption and nonleisure time, taking the cor- responding period t variables as arguments. Maximization over consumption, time spent on working, and investments in human and physical capital yields the following first- order conditions: Ct : UCt = λt, t = 0, 1, (2.4) Lt : −ULt = ωtHtλt, t = 0, 1, (2.5) E : −UL0 + λ0ϕ = µG ′(E), (2.6) H1 : λ1βω1L1 = µ, (2.7) K1 : λ0 = λ1βR τ 1. (2.8) By eliminating µ and using all first-order conditions, the following optimality condition results: ω1L1G ′(E) Rτ1 = ϕ+ω0H0. (2.9) The individual chooses education up to the point where the discounted marginal (private) return ω1L1G ′(E)/Rτ1 equals the marginal (private) cost ϕ+ω0H0, which is sum of the direct cost and the forgone earnings. 2.2.2 The Government The government uses linear taxes to finance an exogenously given stream of government expenditures {gt}1t=0. Its budget constraints are 42 g0 + τ HfE = τK0 r0K0 + τ L 0w0L0H0, (2.10) g1 = τ K 1 r1K1 + τ L 1w1L1H1. (2.11) 2.2.3 Firm’s Problem The stock of physical capital Kt and the individual’s ef- fective labor supply, Zt ≡ LtHt, enter the firm’s constant- returns-to-scale production function F(Kt,Zt). Factors are paid their marginal products: FKt ≡ ∂ ∂Kt F(Kt,Zt) = rt, t = 0, 1, (2.12) FZt ≡ ∂ ∂Zt F(Kt,Zt) = wt, t = 0, 1. (2.13) 2.2.4 Competitive Equilibrium A competitive equilibrium consists of a feasible allocation { {Ct,Lt,Kt,Ht}1t=0,E } , a price system {wt, rt}1t=0, a government policy { {gt, τKt , τ L t } 1 t=0, τ H}, an exogenously given direct cost of education f, and initial stocks of human and physical capital, H0 and K0, respec- tively. The feasible allocation and the price system solve the 43 individual’s and firm’s problems. The government policy satisfies the budget constraints (2.10) and (2.11). 2.2.5 First-Best Solution Studying the first-best problem serves to establish a bench- mark case. The planner chooses consumption, investments in physical and human capital, and the allocation of time to solve the following maximization problem: L = U(C0,L0 + E) +βU(C1,L1) + θ0 ( F(K0,L0H0) + (1− δK)K0 −C0 −K1 − fE− g0 ) (2.14) + θ1β ( F(K1,L1H1) + (1− δK)K1 −C1 − g1 ) (2.15) + µ ( G(E) + (1− δH)H0 −H1 ) . He maximizes the individual’s discounted sum of utilities subject to the per-period resource constraints (2.14) and (2.15) and the law of motion for human capital. The first-order conditions are Ct : UCt = θt, t = 0, 1, (2.16) Lt : −ULt = θtFZtHt, t = 0, 1, (2.17) E : µG ′(E) = −UL0 + θ0f, (2.18) K1 : θ0 = θ1β ( FK1 + 1− δK ) ≡ θ1βR s 1, (2.19) H1 : θ1βFZ1L1 = µ. (2.20) 44 Proposition 2.1. The discounted marginal social return to education equals the marginal social cost: FZ1L1G ′(E) Rs1 = f+ FZ0H0. (2.21) Proof. Eliminate θ0, µ, and UL0 in the condition (2.18) using (2.17), (2.19), and (2.20). The social planner chooses education up to the point where the discounted marginal social return FZ1L1G ′(E)/Rs1 equals the marginal social cost f+ FZ0H0, which is sum of the direct cost and the loss in marginal productivity of first period’s labor supply. Proposition 2.1 therefore suggests the following definition to gauge education efficiency. Definition 2.1. Education efficiency is achieved if the discounted marginal social return to education equals the marginal social cost, which is the direct cost of education plus the loss in marginal productivity of the first period’s labor supply. In the first best, there is no wedge between the discounted marginal social return and the marginal social cost of education. The efficiency condition (2.21) can be further used to as- sess under which circumstances a competitive equilibrium implies education efficiency in the sense of Definition 2.1. The wedge between the discounted marginal social return 45 and the marginal social cost of education can be manipu- lated as follows: ∆ ≡ FZ1L1G ′(E) Rs1 − ( f+ FZ0H0 ) (2.22) = Rτ1 ( ϕ+ω0H0 ) Rs1 [ FZ1 ω1 − Rs1 ( f+ FZ0H0 ) Rτ1(ϕ+ω0H0) ] . The last equality follows from the individual’s optimality condition (2.9). Education efficiency holds if and only if the bracketed factor vanishes. Therefore, FZ1 Rs1 ( f+ FZ0H0 ) = ω1 Rτ1 ( ϕ+ω0H0 ) . (2.23) Put verbally, if and only if before- and after-tax rates of re- turn are equal, education efficiency prevails in a competitive equilibrium. The wedge ∆ is positive (negative) if and only if education is effectively taxed (subsidized). Richter (2009) uses the condition (2.23) to assess education efficiency. 2.2.6 Second-Best Solution The Ramsey problem is to choose a government policy that maximizes the individual’s utility subject to the individ- ual’s and the firm’s competitive equilibrium behavior, given initial stocks of human and physical capital, H0 and K0, and direct cost of education f. The primal approach is adopted to study the Ramsey problem of optimal taxation (Atkinson and Stiglitz (1980), Chari and Kehoe (1999)). The difference to the dual approach is how it incorporates the individ- ual’s competitive equilibrium behavior. The individual’s first-order conditions serve to eliminate all prices and taxes 46 in the intertemporal budget constraint. As a result, this constraint then fully captures how the individual behaves in a competitive equilibrium. Given the allocation, the first- order conditions yield the prices and taxes that implement the second-best outcome as a competitive equilibrium. By contrast, the dual approach includes all constraints sepa- rately, which requires optimizing over the allocations and prices. To derive the so-called implementability constraint, first the intertemporal budget results after combining the per- period budget constraints from the individual’s problem (2.1) by eliminating K1 and using (2.9) to eliminate direct cost ϕE: Rτ0K0 +ω0H0(L0 + E) + 1 Rτ1 ω1L1H1 (1− η) = C0 + 1 Rτ1 C1 (2.24) with η ≡ η(E,H1) ≡ H ′1(E)E H1(E) = G ′(E)E H1(E) , which is the nondecreasing3 elasticity of the function ex- pressing the stock of human capital in the second period. The LHS of (2.24) is the individual’s income side. Rτ0K0 is the value of the initial stock of physical capital. The RHS is the expenditure side. 3 See footnote 2. 47 Using the individual’s first-order conditions (2.4), (2.5), and (2.8), the intertemporal budget constraint (2.24) can be written as A = UC0C0 +βUC1C1 +UL0(L0 + E) + βUL1L1 (1− η) (2.25) with A ≡ UC0R τ 0K0, (2.26) which is a function of the endogenous variables C0, L0, E, and τK0 , and of the exogenous variables K0 and H0. The allocations that the individual’s problem imply for a given government policy satisfy the implementability constraint (2.25) and the per-period resource constraints (2.14) and (2.15) (see Proposition 1 in Chari and Kehoe (1999)). The government commits to a specific policy chosen at the outset of period 0, meaning that it does not reoptimize during the course of time. The Ramsey problem reads L = U(C0,L0 + E) +βU(C1,L1) + θ0 ( F(K0,L0H0) + (1− δK)K0 −C0 −K1 − fE− g0 ) +βθ1 ( F(K1,L1H1) + (1− δK)K1 −C1 − g1 ) + µ ( G(E) + (1− δH)H0 −H1 ) +φ ( UC0C0 +βUC1C1 +UL0(L0 + E) + βUL1L1 (1− η) −A ) . 48 The following assumption simplifies the derivation of the first-order conditions. Assumption 2.1. The utility function U is additively separable in consumption and nonleisure, that is, UCtLt = 0, t = 0, 1. The first-order conditions for the Ramsey problem are C0 : UC0 − θ0 +φ ( UC0C0C0 +UC0 −AC0 ) = 0, (2.27) C1 : UC1 − θ1 +φ ( UC1C1C1 +UC1 ) = 0, (2.28) L0 : UL0 + θ0FZ0H0 +φ ( UL0L0(L0 + E) +UL0 −AL0 ) = 0, (2.29) L1 : UL1 + θ1FZ1H1 +φ ( UL1L1L1 +UL1 ) (1− η) = 0, (2.30) E : UL0 − θ0ϕ+ µG ′(E) +φ ( UL0L0(L0 + E) +UL0 −βUL1L1 dη dE ) = 0, (2.31) K1 : −θ0 +βθ1 ( FK1 + 1− δK ) = 0, (2.32) H1 : βθ1FZ1L1 − µ−φβUL1L1 dη dH1 = 0. (2.33) Maximizing over τK0 would be the same as taxing away the return to the initial stock of capital, which essentially is a lump-sum tax.4 Assuming τK0 = 0 rules out this form of taxation. 4 To see this point, maximize the Lagrangian over τK0 : ∂L ∂τK0 = φUC0FK0K0 Introducing a lump-sum tax, namely τK0 , enhances welfare, as less distortionary taxation is necessary. φ measures the cost of using distor- tionary taxation. The other three factors are positive. Therefore, φ > 0. Optimally, τK0 should be chosen such that all government expenditures could be financed. Then we would have φ = 0 and the present problem 49 2.2.7 Results 2.2.7.1 Taxation of Physical Capital To study the case of taxation of physical capital, the fol- lowing assumption limits the analysis to a specific type of utility functions. Assumption 2.2. The instantaneous utility function shall have the following form: U(Ct, ·) =    C1−σt −1 1−σ − V(·), t = 0, 1, 0 6 σ 6= 1. lnCt − V(·), t = 0, 1, σ = 1. V is strictly increasing and strictly convex. It is a function of L0 + E and L1, respectively. 1/σ is the intertemporal elasticity of substitution in consumption. Proposition 2.2. Given Assumption 2.2 with σ > 0, if Rτ0K0 > 0, then τ K 1 > 0. Proof. Combine the conditions (2.27), (2.28), and (2.32): UC0 βUC1 1+φ (1− σ) −φ UC0C0 UC0 Rτ0K0 1+φ(1− σ) = Rs1. To determine τK1 , use the individual’s conditions (2.4) and (2.8): 1 < 1+φ (1− σ) −φ UC0C0 UC0 Rτ0K0 1+φ(1− σ) = Rs1 Rτ1 = 1+ τK1 r1 Rτ1 . coincides with the first-best problem. This renders the whole analysis uninteresting. See also Jones, Manuelli, and Rossi (1997, p. 111). 50 (2.34) The proposed result follows immediately. Assumption 2.2 is necessary because it allows one to compare the denominator and numerator in (2.34), which would not be possible if the coefficient σ were not constant. Proposition 2.2 is a well-known result in macroeconom- ics.5 Taxation of the return to physical capital in the first period was ruled out by assumption, and therefore the gov- ernment was not able to extract the profit coming from the initial stock of physical capital. In period 1, the positive tax on capital income is due to this initial stock. One may view this capital tax as an attempt to take away part of the return to capital, which was ruled out in the first period. The proof of Proposition 2.2 highlights also that taxa- tion of physical capital income depends on the individual’s preferences for consumption. To emphasize this point, sup- pose that the individual’s utility from consumption is linear, σ = 0. Then UC0C0 = 0, and τ K 1 = 0 results. The preceding analysis furthermore shows that the results are derived despite and not because of the presence of human capital in the model. This points out that taxes on the return to physical capital are not a vehicle to provide education incentives. As the next section will show, the wedge between the discounted marginal social return and the marginal social cost of education does not vanish with the optimal capital tax rate. 5 See, for instance, Proposition 7 in Chari and Kehoe (1999). 51 2.2.7.2 Taxation of Human Capital Proposition 2.3. The discounted marginal social return to education is smaller than the marginal social cost: FZ1L1G ′(E) Rs1 < f+ FZ0H0. (2.35) Proof. The first-order conditions (2.29) and (2.31) imply β θ1 θ0 FZ1L1G ′(E) − ( f+ FZ0H0 ) = − φ θ0 { βL1UL1 ( −G ′(E) dη dH1 − dη dE ) +AL0 } . (2.36) By (2.32), βθ1/θ0 equals the social discount factor 1/Rs1. As a result, the LHS of (2.36) is the wedge ∆ as defined by (2.22). To prove the inequality, one has to determine the sign of the factor in curly brackets. The first term in it can be rearranged using the law of motion (2.2) for human capital, the specific functional form (2.3) of G, and the individual’s optimality condition (2.9). Therefore, ∆ = − φ θ0 { γ H1 ( 1− δH ) H0 ︸ ︷︷ ︸ #1 ( ϕ+ω0 ) UC0 + AL0︸︷︷︸ #2 } < 0. From the definition (2.26), AL0 = UC0FK0Z0H0K0. 6 As long as H0 > 0, the factor in the curly brackets is positive. It further increases as the human capital depreciation rate δH decreases. φ is positive for the reason explained above; see footnote 4, p. 49. 6 Recall that τK0 = 0 was assumed. 52 Corrolary 2.1. In the second-best optimum, the individual overinvests in human capital relative to the first best. Proof. One can show that the discounted marginal return to ed- ucation is a decreasing function of E, ceteris paribus. The marginal cost is constant. As a result, the individual overin- vests in human capital relative to the first best. The results qualify the education efficiency theorem and show under which circumstances it does not hold. To begin, Proposition 2.3 holds in any case if there is an initial stock of human capital, H0 > 0. Then at least #2 does not vanish. The source of distortion is the term AL0 , which is due to the endogeneity of the first-period interest rate FK0 . It is an initial endowment effect similar to the one discussed in the context of physical capital taxation. In Richter (2009) and Bovenberg and Jacobs (2005) this effect is not present, because they use a partial equilibrium analysis in which the interest rate is fixed, which means FK0Z0 = 0 and AL0 = 0 results. Regarding #1, if H1 ≡ G(E) = aEγ, which follows from setting δH = 1 or H0 = 0 in the law of motion (2.2) for hu- man capital, the elasticity of the function H1(E) is constant, that is, dηdE = dη dH1 = 0. This means the positive product (1− δH)H0 is one source of distortion, because as long as the initial stock of human capital does not fully depreciate, the elasticity η is increasing. This is the essence of Remark 2 in Richter (2009). Both effects #1 and #2 would vanish with H0 = 0, and the education efficiency theorem would result. This case 53 corresponds to ruling out labor supply in the first period, as is done, for instance, in Jacobs and Bovenberg (2010a). The preceding results allow one to show that education is effectively subsidized. Use (2.35) and (2.9) to derive FZ1 Rs1 ( f+w0H0 ) < ω1 Rτ1 ( ϕ+ω0H0 ) ⇔ ∆ < 0. (2.37) The before-tax rate of return to education is smaller than the after-tax rate of return, which means that the wedge ∆ between the discounted marginal social return and the the marginal social cost is negative. Therefore, the following proposition results: Proposition 2.4. Education is effectively subsidized relative to the first best. The private rate of return to education is larger than the social rate of return. Physical and human capital are two assets, which the in- dividual can hold to smooth consumption over time. Above, on page 51, it is explained that it is optimal to tax the re- turn to physical capital in the second period. Turning to human capital, the individual disposes of an initial stock of human capital H0. The return to it can only be taxed in a distorting way, because the tax rate τL0 does not have the characteristic of a lump-sum tax. Consequently, this tax is an imperfect instrument, in the sense that it distorts the labor decision, to extract the return to the initial stock of hu- man capital. For this reason, in the second period, when the individual reaps the fruits of education, I therefore would have expected the government to at least partly skim off the additional return that could be attributed to the initial stock of human capital. The striking result, however, is that 54 the contrary is true. The government should subsidize the accumulation of human capital. To sum up this point, an initial stock of physical capital implies that it is optimal to tax physical capital, whereas an initial stock of human capi- tal calls for a subsidization of human capital. The intuition for why it is optimal to subsidize human capital may be the following. Labor taxation exerts a depressing effect on the accumulation of human capital. To counter this, a subsidy is helpful. One may view the above result in a different light and in- terpret the model as the steady state of an OLG model as in Nielsen and Sørensen (1997). Then the OLG interpretation of the present model is in line with Propositions 4.2 and 4.3 in chapter 4. The first result states that if the function H1(E) is isoelastic, education will remain undistorted. This case corresponds to setting δH = 1 and thereby implicitly assuming that the young individual does not inherit any stock of human capital from the old individual. Term #1 vanishes. Proposition 3 states that if the elasticity of H1(E) is increasing, which is the case when δH < 1, education will be subsidized relative to the first best. Also, the strength of the positive distortion depends on the cost resulting from the unavailability of lump-sum taxes, as captured by the Lagrange multiplier φ. Term #2 is not present in a steady state, because the initial endowment effect AL0 only occurs in the first period and not later on. 2.2.7.3 Taxation of Labor Proposition 2.5. Labor tax rates are given by 55 τL0 1− τL0 = − φ 1+φ ( υ0 + AL0 +AC0FZ0H0 + FZ0H0UC0C0C0 −UL0 ) (2.38) and τL1 1− τL1 = − φ 1+φ ( (1− η)υ1 − η+ UC1C1 −UL1 C1FZ1H1 ) (2.39) with υ0 = (L0 + E)UL0L0 UL0 and υ1 = L1UL1L1 UL1 denoting the reciprocals of the elasticities of nonleisure in periods 0 and 1 in Frisch’s sense.7 Proof. Combine the first-order conditions (2.27) and (2.29), and (2.28) and (2.30). τL0 depends on initial endowment effects and the indi- vidual’s preferences for consumption. τL1 is affected by the effect of human capital, which is captured by the elasticity η. The following assumption helps to gain further insight into (2.38) and (2.39). Assumption 2.3. 7 Lt is implicitly defined by (2.5). Differentiating this condition with respect to, say, ω0, holding λ0 constant, yields 1/υ0 = UL0/[(L0 + E)UL0L0 ]. See Cahuc and Zylbergerg (2004, p. 20) for further details. 56 1. Interpret the above model as the steady state of an OLG model. 2. The utility function U shall be linear in consumption. 3. Human capital fully depreciates (δH = 1), or the initial stock of human capital is zero (H0 = 0). The first assumption implies that the initial endowment effects AL0 and AC0 are not present. UCtCt = 0 follows from the second assumption, which implies that savings are not taxed in the second period: τK1 = 0. The third assumption implies that the elasticity of the function expressing hu- man capital in the second period equals the human capital production function’s elasticity: η = γ. Division of (2.39) by (2.38) then yields τL1 1−τL1 τL0 1−τL0 = (1− γ)υ1 − γ υ0 . (2.40) (2.40) is the analogue to equation 13 in Richter (2009), who terms it an extension of the inverse elasticity rule to cope with endogenous education.8 He, however, assumed a util- ity function of the form U = Z(C0,C1) − V(L0 + E) − V(L1), with the function Z being linear homogeneous. Because the utility function used here is time-separable in consumption, linear homogeneity means that utility is linear. To have positive tax rates on labor income when young and old, the numerator in (2.40) has to be positive: γ < 8 Another insignificant difference is that Richter (2009) uses exclusive tax rates, whereas inclusive tax rates are used here. 1− τ = 1 1+ τ ′ is the formula for converting from a tax-inclusive basis to a tax-exclusive basis (Atkinson and Stiglitz, 1980, p. 70). 57 υ1/(1+ υ1). This inequality emerged from the analysis of the second-order conditions; see appendix 2.A. It is a suf- ficient condition that must hold to have a well-behaved problem and moreover ensures that the tax rates are posi- tive. 2.3 conclusion This chapter has reassessed the models by Bovenberg and Jacobs (2005) and Richter (2009). Their papers and this chapter tackle the same set of questions, use different ap- proaches, but in the end come to similar conclusions. First, I demonstrated that the question of how to tax the return to physical capital is not affected by the accumulation of human capital. Taxing the return to physical capital in- vestments is not a means to yield efficient investments in human capital. Then I showed that an increasing elasticity of the function expressing the stock of human capital in the second period implies subsidizing the return to education. An increasing elasticity arises if the individual is endowed with an initial stock of human capital that does not fully depreciate. The existing literature on Ramsey models of optimal tax- ation (see Atkeson, Chari, and Kehoe (1999) among others) and this chapter both come to the conclusion that the ques- tion of whether to tax the return to capital or not depends on individual consumption preferences. In the second pe- riod this result is special, because one has to allow for an initial endowment effect, which is due to the initial stock of capital. The issue of capital taxation is independent of whether the model features human capital accumulation or 58 not. To sum up this point, taxing capital income in the sec- ond period is optimal, but for other reasons than achieving efficient investments in human capital. Further research could discuss the present model in an infinite-horizon setup, as it is done by Jones, Manuelli, and Rossi (1997). The major difference is that their human cap- ital production function exhibits constant returns to scale with respect to stock variables that enter as means of pro- duction. By this specification they model human capital very symmetrically to physical capital and show that the return to education should remain untaxed in a steady state. The natural question then arises what exactly is the difference between human and physical capital. This chap- ter works with a human capital production function that does not include the stock of human capital as a production factor. Then a model of optimal taxation could answer the question of how to tax the return to education if the human capital production function does not exhibit the restrictive properties as in Jones, Manuelli, and Rossi (1997). See also Ljungqvist and Sargent (2004, p. 534,) for a short discussion of this point. 59 2.a second-order conditions To study the conditions that must hold to have a well- behaved problem with an interior solution, the original problem (2.1) is written as one with only a single constraint. This is done by deriving the intertemporal budget constraint by substituting out the capital K1, and then replacing the stock of human capital H1 with the law of motion (2.2) for human capital. The Lagrangian therefore reads L = U(C0,L0 + E) +βU(C1,L1) + λ ( Rτ0K0 +ω0L0H0 + 1 Rτ1 ω1L1 ( G(E) + 1− δH ) −C0 − 1 Rτ1 C1 −ϕE ) . A sufficient condition for the solution to solve the con- strained maximization problem is that the bordered Hessian of the Lagrangian satisfies the condition that the last four leading principal minors alternate in sign, the sign of the first one being positive. Let ∇g denote the gradient of the constraint: ∇g = [ −1,ω0H0, 1 Rτ1 ω1L1G ′ −ϕ,− 1 Rτ1 , 1 Rτ1 ω1G ] . (2.41) The Hessian of the Lagrangian reads H =          UC0C0 0 0 0 0 0 UL0L0 UL0L0 0 0 0 UL0L0 UL0L0 + λ 1 Rτ1 ω1L1G ′′ 0 λ 1Rτ1 ω1G ′ 0 0 0 βUC1C1 0 0 0 λ 1Rτ1 ω1G ′ 0 βUL1L1          . 60 Then the bordered Hessian is bH =    0 ∇g ∇g ′ H    . (2.42) When deriving the Hessian, it was assumed that the cross derivatives of U are zero (UCL = 0) and that human capi- tal fully depreciates (δH = 1). Dropping these simplifying assumptions does not change the following results. Let Dk denote the kth leading principal minor of bH. Then straightforward but tedious calculations yield D3 = − (ω0H0) 2UC0C0 − (−1) 2UL0L0 > 0, D4 = λ 1 Rτ1 ω1L1G ′′(−UL0L0 −UC0C0(ω0H0) 2) < 0, D5 = − λ 1 Rτ1 ω1L1G ′′ ( UC0C0UL0L0 ( − 1 Rτ1 )2 +UL0L0βUC0C0(−1) 2 +UC0C0βUC1C1(ω0H0) 2 ) > 0, D6 = UC0C0βUC1C1UL0L0 ( 1 Rτ1 ω1G )2( −λ 1 Rτ1 ω1L1G ′′ ) − ( UL0L0βUC1C1(−1) 2 +UC0C0UL0L0 ( − 1 Rτ1 )2 +UC0C0βUC1C1(ω0H0) 2 ) × λ 1 Rτ1 ω1G ′′UL1 1 1− γ ( (1− γ)υ1 − γ ) ! < 0 with υ1 = L1UL1L1/UL1 and γ = G ′ G E. 61 The sign ofD6 must be negative. The minuend is negative. The first factor of the subtrahend is positive. Hence, the second factor must be positive: (1− γ)υ1 > γ⇔ γ < υ1 1+ υ1 . The requirement is that the concavity of the utility function, captured by υ1, has to be sufficiently large to compensate for the lack of concavity of the law of motion (2.2) for human capital, measured by γ. 62 3 TA X I N G H U M A N C A P I TA L : A G O O D I D E A 3.1 introduction “Is physical capital special?” Jones, Manuelli, and Rossi (1997) ask. Using the Ramsey approach (Ramsey, 1927), they add human capital to an optimal taxation model with physical capital similar to that of Chamley (1986) and Judd (1985). By modeling human capital almost symmetrically to physical capital they show that in a stationary state all taxes are zero. Chamely and Judd’s result is thus shown to extend to human capital. What drives this zero-tax result is that the human capital production function features constant returns to scale with respect to the stock of human capi- tal. Jones, Manuelli, and Rossi (1997) call this specification a zero-profit condition. As a consequence, human capital disappears as an object of taxation in a competitive equi- librium. But they acknowledge that if the human capital production function violates the assumption of constant re- turns to scale, the stationary-state labor tax will not be zero. Jones, Manuelli, and Rossi (1997) raise an intriguing ques- tion and provide useful insights into the nature of optimal taxation, but in the end, unfortunately, no answer is evident. The difference between physical and human capital still is not clear, because they have made it disappear by means of zero-profit conditions.1 1 See also Ljungqvist and Sargent (2004, pp. 534) for this line of argument. 63 This chapter takes up the issue of modeling human cap- ital almost symmetrically to physical capital. I drop the constant-returns-to-scale assumption. The human capital production function does not include the current stock of human capital, which therefore is not self-productive. It does not raise the productivity of human capital invest- ments, or interchangeably, education. The increasing and concave production function only includes the individual’s time devoted to education. Time spent on education cannot be substituted by physical goods.2 Instead, the individual has to pay for verifiable3 direct costs, e.g., tuition fees, that depend on the amount of education. The government may choose to subsidize this cost. It therefore has two instru- ments at its disposal to guide education. Labor taxes and the subsidy both affect the opportunity cost of education. The next periods’ labor tax rates affect the discounted stream of marginal earnings from education. I derive two results: The first one is not surprising but nonetheless important, as it helps to clarify the role of zero capital taxation when the model features human capital. The other is new and shows how to deal with profits com- ing from education. First, optimal taxation in the stationary state prescribes not taxing capital income, as Chamley (1986) and Judd (1985) show. The zero-capital-tax result holds de- spite the presence of human capital. Lucas (1990), Jones, Manuelli, and Rossi (1997) and Chari and Kehoe (1999) also derive this result. The education decision depends only on 2 Allowing for physical goods as an additional production factor does not affect the results obtained by Jones, Manuelli, and Rossi (1997), as Chari and Kehoe (1999) show. 3 Reis (2007, chapter 4) assumes that the government cannot distinguish between consumption and expenditures on education and finds that it is optimal to tax human capital. 64 how the labor tax and the education subsidy interact with each other. This relates to the second result, stating that in the optimum the marginal social return to education is larger than the marginal social cost. The so-called Education Efficiency Theorem (Richter, 2009), which states that the ed- ucation decision is undistorted given certain assumptions, does not hold. From the inequality between the marginal social return and the marginal social cost it follows that ed- ucation is effectively taxed, i.e., the private rate of return to education is smaller than the social rate of return. Turning to the underlying tax rates, it results that the cost of edu- cation is not fully tax-deductible, the labor income tax rate is higher than the rate of subsidization. As a consequence, the individual underinvests in human capital relative to the first best. The second result is striking. Since the individual is en- dowed with perfect foresight and therefore must be able to internalize the effects of its actions, one would have ex- pected to derive an equality between the private and social rates of return to education, and the Education Efficiency The- orem to hold - a result that Jones, Manuelli, and Rossi (1997), among others,4 also obtain. Their zero-tax results imply that all private and social rates of return from investments in physical and human capital are equal in the stationary state. The difference in results is due to how I model the ac- cumulation of human capital. The specification used gives rise to profits in equilibrium. Profits from education are not pure in the strict sense, because they still depend on raw 4 For further reference, see Lucas (1990), Bull (1993), Milesi-Ferretti and Roubini (1998), Chari and Kehoe (1999), Barbie and Hermeling (2006), and Richter (2009). 65 labor supply. The government taxes away part of the return to education, thereby accepting the distortion of education. To derive clear-cut results, the analysis is confined to an examination of the stationary state. In the stationary state, the individual’s decision variables remain constant. As usual, it is assumed that a unique stationary state exists and that the economy converges to it. It would be straight- forward to introduce exogenous growth. To allow for a set- ting in which the economy grows endogenously is however not possible. The reason for this limitation is the specifica- tion of the human capital production function. Lucas (1988) and Caballe and Santos (1993) provide a discussion of the existence and properties of a balanced growth path. They show that the human capital production function must fea- ture constant returns to scale with respect to the stock of human capital. 3.2 the model 3.2.1 Individual’s Problem The individual solves the following maximization problem: L = ∞∑ t=0 βt { u(ct, 1−nt − et) − λt ( ct + kt+1 + bt+1 + (1− τ e t)fet − (1− τnt )wtntht − R k tkt − R b tbt ) (3.1) − µt ( ht+1 − (1− δh)ht −G(et) )} (3.2) The individual’s utility function u is strictly increasing and concave in both arguments and continuously differen- 66 tiable everywhere. The Inada conditions apply to ensure interior solutions. In each period t the individual faces a consumption-labor-leisure choice. It consumes ct, which is not taxed,5 and devotes nt time units to work in the labor market and et time units to investment in human capital. The total time endowment is normalized to one, i.e., nt + et + `t = 1, where `t is the amount of leisure. The individual combines its raw labor supply nt with the current stock of human capital ht. The product zt ≡ ntht is called the effective labor supply,6 it earns the after-tax wage rate (1− τnt )wt where wt is the real wage rate. The individual must spend resources (1 − τet)f per time unit invested in human capital. One may think of f as tuition, books, and other related expenses. The government subsi- dizes this cost at rate τet . The individual lends capital kt+1 to the firm. The rate of return net of taxes and depreciation is Rkt+1 ≡ (1− τ k t )rt+1 + 1− δk, where rt is the real interest rate and δk is the rate at which capital depreciates. The individual may lend bt+1 to the government which offers a rate of return of Rbt+1 in the next period. In period 0, the individual earns income from capital Rk0k0 and government debt Rb0b0. (3.1) is the individual’s budget constraint in pe- riod t, which is associated with the Lagrange multipliers λt. Associated with the Lagrange multiplier µt, the law of motion (3.2) describes the accumulation of human cap- 5 Taxing consumption only complicates the analysis without yielding further insights in the present context. 6 This specification is a special case of Jones, Manuelli, and Rossi (1997), who use the more general function z =M(x,h,n) and assume that it exhibits constant returns to scale with respect to h and market goods x. Judd (1999) works out that this specification is not innocuous, as any deviation gives rise to positive taxation of human capital. This point Jones, Manuelli, and Rossi (1997, p. 103) acknowledge. 67 ital. Investments et enter the human capital production function G, which is strictly increasing and concave, i.e., G ′′ < 0 < G ′. The crucial assumption is that the current stock of human capital does not enter the production func- tion; it does not increase productivity. The output of G is added to the depreciated stock of human capital, the rate of depreciation being 0 < δh 6 1.7 Furthermore, the human capital production function G is assumed to be isoe- lastic, that is, γ ≡ G ′e/G < 1.8 Finally, β is the individual’s discount factor, which, for simplicity, stays constant over time. The first-order conditions are ∂u ∂ct ≡ uct = λt, (3.3) ∂u ∂`t ≡ u`t = (1− τ n t )wthtλt, (3.4) u`t + λt(1− τ e t)f = µtG ′(et), (3.5) λt+1β(1− τ n t+1)wt+1nt+1 + µt+1β(1− δh) = µt, (3.6) λt = βλt+1R k t+1, (3.7) λt = βλt+1R b t+1. (3.8) Combine (3.7) and (3.8) to derive Rbt+1 = R k t+1. (3.9) 7 δh = 1 means the individual cannot use the stock of human capital accumulated so far in the next period. 8 This is an assumption that features prominently in the literature. See Jacobs and Bovenberg (2010b) for a discussion and their footnote 3 for more references. 68 (3.9) is a familiar condition that states that there is arbi- trage-freeness between investments in physical capital and government bonds. Both investments promise the same rate of return in equilibrium. Human capital can be regarded as an asset, similar to physical capital, that yields a rate of return, which in equi- librium must be equal to the other assets’ rates of return. To see this, recursively eliminate µt+1 in (3.6), and use (3.5) and (3.7)9: Rkt+1 = ∞∑ i=0 qt+1+it+1 (1− τ n t+1+i)wt+1+int+1+iG ′(et)(1− δh)i (1− τet)f+ (1− τ n t )wtht . (3.10) with qt+1+it+1 = i∏ j=1 ( Rkt+1+j )−1 , which denotes the private period t+ 1 price of a unit of the consumption good in period t+ i+ 1.10The numerator in (3.10) summarizes the discounted sum of returns due to a marginal investment et, henceforth referred to as the marginal (private) return to education. The investment in 9 Then the transversality condition lim i→∞   i∏ j=0 ( Rkt+j )−1   (1− δh) i G ′(et) G ′(et+i) × [( 1− τnt+i ) wt+iht+i + ( 1− τht+i ) f ] = 0 also emerges, which holds as long as 0 < δh 6 1. 10 This implies qt+1t+1 = 1. 69 period t not only increases tomorrow’s stock of human capital and thereby the wage earned, but also the stock afterwards at the decreasing rate 1− δh.11 The denominator summarizes the marginal (private) cost of education in period t, comprising direct cost and foregone earnings. The optimality condition (3.10) reveals arbitrage-freeness between investments in human and physical capital. (3.10) also shows that the depreciation rate δh and the after-tax rate of return to physical capital investments Rk affect the discounted present value of a time unit et invested in human capital similarly. An increasing capital tax rate, which reduces Rk, and an increasing rate of depreciation both raise the marginal return to education (Davies and Whalley, 1991). For further reference, the stationary state version of (3.10) is12 β 1−β(1− δh) (1− τn)wnG ′ = (1− τn)wh+ (1− τe)f. (3.11) (3.11) can be interpreted in the same way as (3.10). The individual devotes time to education up to the point where the marginal cost equals the marginal return to education. One can also see that if the direct cost of education were 100% tax-deductible, i.e., τn = τe, the choice of education would be undistorted. Boskin (1975) was the first to state this insight. (3.11) also reveals that capital taxation does not affect the marginal return to education, because only the individual’s 11 To allow for δh = 1, 00 = 1 must hold. 12 Use 1 = βRk, which is the stationary state version of (3.7). 70 discount factor β matters. This means that only the labor tax rate τn and the rate of subsidization τe affect the wedge between the marginal return to and the marginal cost of education. 3.2.2 Firm’s problem The representative firm produces the single consumption good using a neoclassical constant-returns-to-scale produc- tion function. It maximizes profits F(kt,ntht) − rtkt −wtntht in capital kt and effective labor zt ≡ ntht, taking the capital rental rate rt and the wage rate wt as given . As a result, Fkt ≡ ∂F(kt, zt) ∂kt = rt, (3.12) Fzt ≡ ∂F(kt, zt) ∂zt = wt. (3.13) The constant-returns-to-scale production technology im- plies that the firm makes zero profit in equilibrium. 3.2.3 Government’s problem The government finances an exogenously given stream of government expenditures {gt}∞t=0. Its per-period budget con- straint is gt + Rbtbt = τ k t rtkt + τ n twthtnt − τ e tfet + bt+1. (3.14) 71 3.2.4 Competitive Equilibrium A competitive equilibrium consists of a feasible allocation {ct,nt, et,kt,ht,gt}∞t=0, a price system {wt, rt,Rbt } ∞ t=0, and a government policy {τnt , τ k t , τ e t ,bt,gt} ∞ t=0, such that, given the price system and the government policy, the allocation solves the individual’s and firm’s prob- lems, and the government balances its budget. C is the set of the competitive equilibria that result from different gov- ernment policies. Put formally: C = { {ct,nt, et,kt,ht,gt}∞t=0 : ∃{τnt , τ k t , τ e t ,bt,gt} ∞ t=0, {wt, rt,R b t } ∞ t=0 s.t. (3.3) − (3.8), (3.12) − (3.13), (3.1), (3.2) and (3.14) hold for all t = 0, 1, . . . , k0,b0 and h0 are given. } . 72 3.2.5 Social Planner’s Problem – First-Best Analysis The social planner maximizes the individual’s utility subject to the resource constraint and the law of motion for human capital. The Lagrangian reads L = ∞∑ t=0 βt { u(ct, 1−nt − et) + θt ( F(kt, zt) + (1− δk)kt − ct − kt+1 − fet − gt ) − µt ( ht+1 − (1− δh)ht −G(et) )} . The first-order conditions for ct, et, nt, ht+1, and kt+1 are uct = θt, (3.15) µtG ′(et) = u`t + θtf, (3.16) θtFztht = u`t , (3.17) θt+1βFzt+1nt+1 − µt +βµt+1(1− δh) = 0, (3.18) θt = θt+1β(Fkt + 1− δk). (3.19) Analogously to the individual’s problem, the following condition shows how the social planner optimally chooses education13: Fkt+1 + 1− δk = ∞∑ i=0 q˜t+1+it+1 Fzt+1+int+1+iG ′(et)(1− δh)i f+ Fztht . (3.20) 13 Recursively eliminate µt+1 in (3.18), and use (3.16) and (3.19). 73 with q˜t+1+it+1 = i∏ j=1 ( Fkt+1+j )−1 , which denotes the social period t+ 1 price of a unit of the consumption good in period t+ i+ 1. The numerator in (3.20) is the discounted sum of marginal returns to in- vestment et, henceforth called marginal (social) return to education. The investment in period t increases not only tomorrow’s stock of human capital and thereby the produc- tivity but also the stock afterwards at the decreasing rate 1− δh. The denominator captures the marginal (social) cost in period t, comprising the direct cost of education and the loss of labor income. The optimality condition (3.20) re- veals that the rates of return to physical and human capital accumulation are equal. For further reference, the stationary-state version of (3.20) reads β 1−β(1− δh) FznG ′ = Fzh+ f. (3.21) The efficiency condition (3.21) will serve as a benchmark when analyzing below how the education decision is af- fected by the use of distortionary taxation. The preceding discussion therefore suggests the following Definition 3.1. Education efficiency is achieved if the marginal social return to education equals the marginal social cost of edu- cation. In the first best, there is no wedge between the marginal social return to and the marginal social cost of education. 74 3.2.6 Ramsey Problem – Second-Best Analysis Linear taxes are chosen to finance a given stream of govern- ment expenditures. The choice of taxes should maximize social welfare subject to resource and budget constraints and taking the individual’s and firm’s competitive equi- librium behavior into account. Each government policy gives rise to a different competitive equilibrium. The Ram- sey problem is to choose the competitive equilibrium that yields the highest utility. To solve the problem, the primal approach (Lucas and Stokey (1983), Atkinson and Stiglitz (1980), Chari and Kehoe (1999)) is adopted. This approach is one way to take into account the compet- itive equilibrium behavior. Instead of choosing the optimal policy directly, which yields the optimal allocation and prices, one chooses the optimal allocation that is consistent with competitive equilibrium behavior and then solves for the government policy and prices that support this outcome. The key to solving this problem is to use the so-called im- plementability constraint that summarizes the individual’s competitive equilibrium behavior. In the present model, three conditions on the Ramsey problem must hold. The first one, the implementability constraint, is the individual’s budget constraint after having substituted for after-tax prices by means of the individual’s first-order conditions. Combining the per-period budget constraints (3.1) leads to the intertemporal budget constraint (using (3.8)): Rk0k0 + R b 0b0 + ∞∑ t=0   t∏ j=1 ( Rkj )−1   (1− τnt )wtntht 75 = ∞∑ t=0   t∏ j=1 ( Rkj )−1   ( ct + (1− τet)fet ) . (3.22) The transversality conditions lim t→∞   t∏ j=0 ( Rkj )−1  kt+1 = 0 (3.23) and lim t→∞   t∏ j=0 ( Rbj )−1  bt+1 = 0 (3.24) must hold. If (3.23) and (3.24) were positive, then the individual could find an alternative allocation yielding a higher utility by simply consuming more in finite time. The reverse cannot hold either, because some other individual has to be on the lending side and could increase utility for the reason just explained. Then, using the individual’s first-order conditions (3.3) and (3.4) and thereby substituting out (1 − τnt )wtht, the intertemporal budget constraint (3.22) can be written as W0 + ∞∑ t=0 βtu`tnt = ∞∑ t=0 βtuct ( ct + (1− τet)fet ) (3.25) with W0 ≡ uc0 ( Rk0k0+R b 0b0 ) , which is the value of the in- itial endowment of physical capital and government bonds. (3.25) is the first constraint in the planner’s problem. The first-order conditions for et and ht+1, (3.5) and (3.6), which yield (3.10) and determine the dynamic choice of ht+1, have not been used. Therefore, they give rise to a second constraint, which Jones, Manuelli, and Rossi (1997) 76 and Atkeson, Chari, and Kehoe (1999) call an Euler equation for the accumulation of human capital: βnt+1u`t+1h −1 t+1 +β(1− δh) u`t+1 + uct+1(1− τ e t+1)f G ′(et+1) = u`t + uct(1− τ e t)f G ′(et) . (3.26) (1− τet)f could also be eliminated in (3.25) using (3.26). But dealing with the resulting double sum is cumbersome, which is why it is more convenient to work with two imple- mentability constraints. In any case, either approach must yield the same solution. Pursuing the present way mixes the primal and the dual approach, as the planner has to optimize over the allocation and over the tax rate τet . Third, the economy’s resource constraint is F(kt, zt) + (1− δk)kt − ct − kt+1 − fet − gt = 0. (3.27) The set R consists of all allocations that satisfy the three constraints above and the law of motion (3.2) for human capital. Put formally, R = { {ct,nt, et,kt,ht,gt}∞t=0 : (3.2), (3.25), (3.26) and (3.27) hold for all t = 0, 1, . . . } . The Ramsey problem is to choose a member belonging to the set R that yields the highest utility. The key result to solving the Ramsey problem is the following Proposition 3.1 (see Chari and Kehoe (1999), Proposition 1). The competitive equilibrium allocations satisfy the resource con- straints and the implementability constraint. Furthermore, given 77 allocations that satisfy these constraints, one can construct poli- cies and prices that, together with the given allocations, constitute a competitive equilibrium. Put formally, C = R. Proof. See appendix 3.A. The Ramsey problem therefore reads L = ∞∑ t=0 βt { W(ct,nt, et, τet ,φ) + θt ( F(kt, zt) + (1− δk)kt − ct − kt+1 − fet − gt ) − µt ( ht+1 − (1− δh)ht −G(et) ) − ηt ( βnt+1u`t+1h −1 t+1 − u`t + uct(1− τ e t)f G ′(et) +β(1− δh) u`t+1 + uct+1(1− τ e t+1)f G ′(et+1) )} −φW0, with W(ct,nt, et, τet ,φ) = u(ct, 1−nt − et) +φ ( uct ( ct + (1− τet)fet ) − u`tnt ) defining the so-called pseudo-welfare function, which includes the implementability constraint and also depends on the endogenous Lagrange multiplier φ. Jones, Manuelli, and Rossi (1997) follow the same approach. But their and the present setup differ substantially. First, due to the spe- cial assumptions made regarding the human capital pro- duction function, they show that human capital does not appear in the implementability constraint. Second, when solving the Ramsey problem they neglect the Euler equa- tion for the accumulation of human capital (3.26). After having found the solution to this relaxed problem, they 78 show that this equation is satisfied anyway. Similarly, they derive a stationary-state arbitrage-freeness condition for human capital and a corresponding Ramsey problem’s con- dition. Because in their setup time devoted to education only gives rise to some cost in the form of forgone earnings and because the labor income tax is proportional, the tax cannot have an effect on education in a stationary state. Both the return and the cost are taxed at the same rate, and both are reduced in the same proportion.14 It is the imple- mentability constraint (3.26) that captures the transitional dynamics of the accumulation of human capital. Setting up the problem in a way that allows one to put this constraint aside and then to show that it is satisfied anyway does not, however, help to explore the special nature of human capital. As k0 is exogenous, τk0 works like a lump-sum tax. 15 To rule out this trivial form of taxation, it is common to assume τk0 = 0. 14 Even more obviously, this is the case in Chari and Kehoe (1999), too. 15 To see this point, maximize the Lagrangian over τk0 : ∂L ∂τk0 = φuc0Fk0k0 φ measures the costs of using distortionary taxation. Optimally, τk0 should be chosen such that all government expenditures could be financed by taxing away the return to the initial stock of physical capital and thereby abstaining from levying distorting taxes on capital and labor. The other three factors are positive. Therefore, φ > 0. It is then possible to increase τk0 until φ = 0 and the present problem coincides with the first-best problem. This renders the whole analysis uninteresting. See also Jones, Manuelli, and Rossi (1997, p. 111). 79 Under the assumption that a unique stationary state ex- ists,16 the first-order conditions for ct, et, nt, ht+1, kt+1 and τet , evaluated at the stationary state, are Wc − θ− η ( nuc`h −1 − δh u`c + ucc(1− τe)f G ′ ) = 0, (3.28) We − θf+ µG ′ − η ( −nu``h −1 − δh ( −u`` − uc`(1− τe)f ) G ′ − ( u` + uc(1− τe)f ) G ′′ G ′2 ) = 0, (3.29) Wn + θFzh − η ( h−1(−u``n+ u`) − δh −u`` − uc`(1− τe)f G ′ ) = 0, (3.30) µ = θ β 1−β(1− δh) Fzn + η β 1−β(1− δh) nu`h −2, (3.31) 1 = β(Fk + 1− δk), (3.32) η δh G ′ = −φe. (3.33) The first-order conditions (3.28)-(3.33), the resource con- straint (3.27), the implementability constraint (3.25), and the Euler equation (3.26) for the accumulation of human capital determine the Ramsey allocation {c, e,n,h,k, τe} along with the Lagrange multipliers θ, η, and φ.17 16 This is a common assumption frequently found in the literature. Judd (1999) uses a compactness assumption on the marginal social value of government wealth instead of the convergence assumption adopted here and shows that the average capital tax rate is zero for any long interval. 17 Chari, Christiano, and Kehoe (1994) explain how to compute φ. First, fix φ and solve for the entire allocation, using all first-order conditions 80 The following analysis is devoted to studying the tax rates τk, τn, and τe that implement the Ramsey allocation as a competitive equilibrium, R ⊆ C.18 Proposition 3.2. Capital income is not taxed in the stationary state, i.e., τk = 0. Proof. Combine (3.32) with (3.7) evaluated at the stationary state. This is the seminal result by Chamley (1986) and Judd (1985). Evidently, the private and social rates of return to capital investments are equal. The zero-capital-tax result is independent of whether the model features human capital or not. From this follows that there is no trade-off between efficiency in physical and human capital formation. Proposition 3.3. 1. Labor income is taxed in the stationary state if the human capital production function’s elasticity is sufficiently small. 2. The labor income tax rate is not higher than 100% if prefer- ences satisfy the following condition: − uccc uc + uc` uc ( (1− γ)n+ e ) < 1+ 1/φ (3.34) Proof. Combining (3.3), (3.4) and (3.28), (3.30) and rearranging yields 1− τn = 1+φ ( 1+ uccuc c− uc` uc ( (1− γ)n+ e )) 1+φ ( 1− γ− u``u` ( (1− γ)n+ e ) + uc`uc c ) . and resource constraints. Then, check whether this allocation satisfies the implementability constraint. If not, iterate on φ until the constraint holds. 18 See Proposition 3.1 for the central argument. 81 1. τn > 0 amounts to requiring γ < ( −u``u` + uc` uc ) (n+ e) + ( −uccuc + uc` u` ) c 1+ ( −u``u` + uc` uc ) n . 2. τn < 1 amounts to requiring the condition (3.34) to hold. Restrictions are imposed on the individual’s preferences and the properties of the human capital production function. Suppose the utility function reads u(c, `) = ln c+ κ ln `. In this special case, conditions 1 and 2 then reduce to γ < 1/(1 + n/`) and 1 < 1 + 1/φ. Condition 1 says that the elasticity parameter has to be below unity, as has been assumed above on page 68. Condition 2 is always satisfied as long as distortionary taxes are used. Proposition 3.4. If the human capital production function is isoelastic, the educa- tion decision is distorted. In the second best, there is underinvest- ment in human capital relative to the first best. Proof. Combine (3.26), (3.29), (3.30), (3.31), and (3.33) to obtain19 β 1−β(1− δh) FznG ′ = φ θ γu` + Fzh+ f. (3.35) Equation (3.35) states that the discounted flow of mar- ginal returns to education equals the marginal cost plus some distortion term, φ θ γu`. (3.36) 19 See appendix 3.B for the details. 82 The distortion term is positive, as each Lagrange mul- tiplier is positive. Therefore, the marginal social return, which is decreasing in e, is larger than the marginal social cost, which is constant in e. The individual is required to underinvest in human capital relative to the first best. Given that the education decision is distorted, the next question is what this means for the tax rates. Corrolary 3.1. In the stationary state, the direct cost of education is not fully tax-deductible, that is, τe < τn. Proof. Multiply (3.35) by 1− τn, and combine the result with (3.11) using (3.13): (1− τn) φ θ γu` + (1− τ n)(Fzh+ f) = (1− τn)Fzh+ (1− τe)f ⇔ (1− τn) φ θ γu` = (τ n − τe)f. All the Lagrange multipliers are positive. Given that τn < 1, the desired result follows. The preceding results allow one to study how the social and the private return to education are related to each other. Corrolary 3.2. 1. The private rate of return to capital investments is equal to the social rate of return. 2. The private rate of return to capital investments is equal to the private rate of return to education. 83 3. The private rate of return to education is smaller than the social rate of return. Education is effectively taxed. Proof. One has to show that (1− τk)r+ 1− δk = Fk + 1− δk = Fz(1−τn)nG ′ 1−β(1−δh) Fz(1− τn)h+ f(1− τe) < FznG ′ 1−β(1−δh) Fzh+ f . The first and second equalities follow from Proposition 3.2 and (3.11). (3.32) and Proposition 3.4 imply the inequal- ity. The marginal social return is taxed at a higher rate than the marginal social cost. The result is that this tax scheme negatively distorts education incentives, as Proposition 3.4 clarifies. To shed more light on the above results, consider the gov- ernment’s stationary-state budget constraint (3.14), which can be written as follows: g+ (RB − 1)b = τn(whn− fe) + (τn − τe)fe. The direct cost of education is taxed at the rate (τn − τe) as long as τn > τe. Suppose that the converse were true, and consider a marginal decrease of τe. Then τn has to decline as well if the government’s budget constraint is to continue to hold. τn 6 τe implies the private rate of return to education to be larger than the social rate of return. The considered tax reform has the effect that the marginal cost of education, consisting of the direct cost and forgone earnings, increases less than the marginal return. As a result, the 84 private rate of return to education increases. Ceteris paribus, the individual earns more income and hence consumption rises which increases utility. An efficiency gain would result, which is not possible, given that the planner maximizes efficiency. 3.3 conclusion and discussion This chapter explores the special nature of human capital compared to physical capital in an optimal taxation model. Capital income remains untaxed in the stationary state. The presence of human capital does not interfere with this result. This means taxing capital and human capital are two distinct issues and capital taxation is not a means to guide efficient education policy. This leaves labor taxation and the subsidization of the direct cost of education as the only instruments to set efficient education incentives. As the human capital production function includes time as the only production factor and not the current stock of human capital, the analysis calls for effective taxation of education, thereby partly extracting the ability rent. To achieve this end, the cost of education is not fully tax- deductible. As a consequence, the subsidy is insufficient in encouraging education and to offset the distortions caused by the tax on labor. Critical is the assumption that the cost of education is fully observable. This allows the government to use this piece of information to set an effective tax on education. Otherwise it has to resort to the labor tax alone to achieve this end, which would imply higher welfare costs. In reality it is not that easy to get exact data on the time spent on 85 education. Likewise it is not possible to precisely estimate the stock of human capital. 86 3.a proof of proposition 3.1 1. C ⊆ R: The statement is true because the implementability constraint is the intertemporal budget constraint after having substituted out prices using the individual’s first-order conditions. Derive (3.26) by combining (3.5) and (3.6) and substituting out prices again. Because the individual’s and government’s budget constraints are satisfied, the resource constraint is satisfied by Walras’s law. This proves the first inclusion. 2. R ⊆ C: The converse, that any allocation satisfying the imple- mentability and resource constraints satisfies competi- tive equilibrium behavior, is also true. This amounts to finding prices and a government policy, namely tax rates, such that the allocation that is in R is also in C. To derive Rbt+1 use (3.3) and (3.8). Obtain rt and wt from (3.12) and (3.13). (3.3) and (3.4) yield τnt . (3.3) and (3.7) determine τkt . τ e t is defined recursively by (3.5) and (3.6). By construction, the Ramsey allocation satisfies the individual’s budget constraint and the economy’s re- source constraint. By Walras’ law, the government’s budget constraint is satisfied as well. 87 3.b derivation of (3.35) (3.33) serves to eliminate η: η = −φ G ′e δh (3.37) Equalize (3.29) and (3.30), and plug in (3.37): − θf+ µG ′ −φ G ′e δh δh −u``G ′ − ( u` + uc(1− τe)f ) G ′′ G ′2 −φ G ′e δhh nu`` +φ ( uc(1− τe) + u` ) = θFzh+φ G ′e δhh (−u`n+ u`) +φ G ′e δh δh u`` G ′ +φu``n−φu` (3.38) (3.2) yields h = G+ (1− δh)h ⇔ 1/G = 1/(δhh). Using the constant elasticity γ of G and substituting for µ by means of (3.31), one can manipulate (3.38) as follows: θ β 1−β(1− δh) FznG ′ −φγ ( β 1−β(1− δh) G ′nu`h −1 − uc(1− τe)f ) = θ(Fzh+ f) (3.39) The stationary-state version of (3.26) reads β 1−β(1− δh) G ′nu`h −1 = u` + uc(1− τ e)f. (3.40) Plug (3.40) into (3.39) to finally derive (3.35): β 1−β(1− δh) FznG ′ = φ θ γu` + Fzh+ f (3.35) 88 4 E F F I C I E N T H U M A N C A P I TA L P O L I C Y W I T H O V E R L A P P I N G G E N E R AT I O N S A N D E N D O G E N O U S G R O W T H 4.1 introduction Education is a field in which policies of OECD countries exhibit remarkable differences. This is borne out by the data published in 2009 (OECD, Tables A8.2 and A8.4). While var- ious countries effectively subsidize education at the tertiary level, other countries effectively tax this activity. Such a finding does not only raise the question of which policy is superior, it also raises the question of whether and how an effective subsidization of education can be justified in terms of efficiency. This chapter studies this question in a frame- work of overlapping generations and endogenous growth. Two reasons of why it may be efficient to subsidize educa- tion are highlighted. The first one is well known from the literature. It is the potential need to internalize the positive effect that human capital investments of selfish individuals have on the productivity of descendent generations. Effi- cient internalization requires subsidizing investments up to the first best. This chapter stresses the second reason. This is the negative effect that distortionary taxation of labour has on education and growth. If the elasticity of the hu- man capital investment function is strictly increasing, it is 89 shown to be a second best policy to subsidize education even relative to the first best. The traditional approach to optimal taxation follows Ram- sey (1927) and takes the model of a representative taxpayer as a starting point. A critical feature of this literature is that the results characterizing optimal policy heavily depend on whether the representative taxpayer plans for finite or infi- nite periods. If the taxpayer’s planning horizon is infinite, the rationale for employing distortionary linear taxes and subsidies turns out to be weak. This point was originally made by Chamley (1986) and Judd (1985) with respect to capital taxes. It extends, however, to the model with endoge- nous education, as has been demonstrated by Bull (1993), Jones, Manuelli, and Rossi (1993, 1997), and Atkeson, Chari, and Kehoe (1999). The question of whether human or non- human capital is accumulated is largely irrelevant. In the long run neither accumulation should be distorted. The policy recommendations are less clear-cut if the tax- payer’s planning horizon is finite. In the finite case it is primarily a matter of marginal rates of intertemporal sub- stitution in consumption whether taxing saving is efficient or not. In particular, saving should be untaxed only if the taxpayer’s utility is weakly separable between consumption and labour and homothetic in consumption (Atkinson and Stiglitz (1972), Sandmo (1974)). By contrast, the design of efficient education policy is more a reflection of the specific properties of the earnings function. This function has to be weakly separable in qualified labour supply and education and the elasticity with respect to the latter has to be con- stant if it shall be second best not to distort the choice of education (Jacobs and Bovenberg (2010b); Bovenberg and 90 Jacobs (2005)). If weak separability holds and if the elasticity is strictly increasing, it is second best to subsidize educa- tion (Richter (2009)). If the planner trades off efficiency and equity and if education and qualified labour are comple- mentary, it is equally second best to subsidize education (Jacobs and Bovenberg, 2010b). It somewhat discredits the Ramsey approach that the suggested policy recommendations so critically depend on the taxpayer’s planning horizon. That is why the present chapter studies optimal taxation in a model with overlap- ping generations. Such a model stands between the static and dynamic Ramsey frameworks and it therefore promises less debatable policy recommendations. The broader objec- tive of the present study is to characterize optimal policies for education, labour, and saving in a dynamic framework with overlapping generations. The narrower objective is to rationalize the effective subsidization of endogenous edu- cation. Such objectives may justify putting aside various shortcomings often turned against similar studies. In partic- ular, we exclusively focus on efficiency and we stick to the representative taxpayer framework because one would not really be surprised to learn that subsidizing education can well be optimal when equity is traded off against efficiency. Furthermore, we rule out potential reasons of market fail- ure because they may help to justify market intervention but certainly not the subsidization of education relative to the first best. The model chosen is one with overlapping generations and endogenous growth. Individuals live for two periods. They decide on education, saving, and nonqualified labour in their youth. They supply qualified labour when old. The 91 productivity of qualified labour increases in the stock of human capital inherited from preceding generations, and it also increases in own educational investments. Individuals either may be perfect altruists with respect to descendent generations or may behave selfishly. The implications of self- ishness have been studied before by Wigger (2002, Sec. 3.4) and Docquier, Paddison, and Pestieau (2007) for a frame- work in which the government is not constrained in the use of policy instruments. It is shown that decentralizing the first best requires subsidizing education up to the first best. The present chapter goes beyond these earlier studies by endogenizing labour supply and by assuming that the gov- ernment can only employ linear policy instruments. Most remarkably, major results characterizing efficient static pol- icy extend to the dynamic framework. In particular, it is second best not to distort education if the human capital investment function is isoelastic in education. It is argued, however, that such constant elasticity has debatable impli- cations in a dynamic framework. It implies that the human capital stock accumulated by preceding generations melts down to zero if just one generation stops investing. More appealing is the assumption that the elasticity of the in- vestment function is increasing and that the human capital stock does not depreciate completely if just one genera- tion fails to invest. If this is the case, it is second best at balanced growth to subsidize education even relative to the first best. This means that the marginal social cost of human capital should exceed the marginal social return in the long-run second-best optimum. This is a striking result. Not surprising is the need to subsidize education relative to laissez faire. This is so because the intergenera- 92 tional externalities of human capital investments have to be internalized.1 A priori it is not obvious, however, why investments should even exceed the first-best. Subsidiz- ing education requires government revenue, which in the model has to be raised by distortionary taxes on labour and savings. With the intuition of Lipsey and Lancaster (1956 - 1957) in mind, one might hypothesize that it is second best to provide insufficient incentives for education if labour has to be taxed and if the level of comparison is the first best. The contrary, however, is true. The key assumption is the strictly increasing elasticity of the human capital investment function with respect to education. The effect is that it is second best to subsidize education in static analysis, and this effect is shown to extend to the dynamic framework. At balanced growth the need to subsidize increases in the derivative of the investment function’s elasticity and in two further factors. One factor is the Lagrange multiplier on the planner’s implementability constraint, and the other is the gap between the marginal return to capital and the rate of balanced growth. In other words, the more binding the non- availability of lump-sum taxes is and the more deficient the growth is, the more should human capital accumulation overshoot the first best. Assuming altruistic individuals changes some conclu- sions, but not all. Altruists internalize the positive effect that education has on descendents’ productivity. Hence the need for government intervention is reduced. However, the second source of inefficiency modelled in this chapter does not vanish. That second source is the need to employ 1 The need is highlighted by various earlier studies. An example is Rey and del Mar Racionero (2002). 93 distortionary taxes for financing government expenditures. The implications for second-best policy are shown to differ markedly between the first generation and all descendent generations. With respect to descendent generations the following results are obtained. The accumulation of human capital should not be distorted, and this result is obtained for arbitrary utility and human capital investment func- tions. The accumulation of physical capital should not be distorted if the taxpayer’s utility is weakly separable be- tween consumption and non-leisure and homothetic in con- sumption. Furthermore, qualified and nonqualified labour should be taxed uniformly across the life cycle when utility is compatible with balanced growth. Such results will not only give reason in Section 6 to qualify major results derived by Jones, Manuelli, and Rossi (1997), Atkeson, Chari, and Kehoe (1999), and Erosa and Gervais (2002). They strongly contrast with the results derived for the case of selfish indi- viduals. The results obtained for the first generation are less con- trasting. In particular, it is second best not to distort the first generation’s educational choice if the human capital investment function is isoelastic in education. If, however, this function fails to be isoelastic, the optimal education policy for the first generation depends on initial values. On neutralizing the effect of initialization by assuming bal- anced growth and assuming a strictly increasing elasticity of the human capital investment function, it turns out to be second best to subsidize education. The reason is the same encountered when individuals are selfish. Strictly increas- ing elasticity is the reason why it is second best to subsidize education in static analysis. This effect extends to the dy- 94 namic framework. The need to subsidize is the stronger the larger the derivative of the investment function’s elasticity is, the more binding the non-availability of lump-sum taxes is, and the more deficient growth is. The unifying bottom line for selfish and altruistic in- dividuals is as follows. Altruism well reduces the need to subsidize education relative to laissez faire, and altru- ism also implies that descendent generations should have non-distorted incentives to invest in human capital. The short-run policy recommendations for altruism, however, agree with the long-run recommendations for selfishness. Labour has to be taxed, and – given that the elasticity of the human capital investment function is strictly increasing – education should be subsidized relative to the first best. The chapter is structured as follows. Section 2 sets up the two-period overlapping-generations model with en- dogenous growth. The first-order conditions characterizing solutions of the planner’s first-best maximization are de- rived. In Section 3 the utility functions are determined that are compatible with balanced growth in consumption and with constant use of labour and leisure. Section 4 studies the planner’s problem when individuals behave selfishly and when no policy instruments but linear ones are avail- able. Section 5 clarifies the relation between effective and efficient subsidization. Section 6 studies the planner’s prob- lem for individuals who are altruistic towards descendent generations. Section 7 summarizes. 95 4.2 the model and the planner’s first-best problem Consider a sequence of overlapping generations with in- dividuals living for two periods. The index t refers to the generation and to the period in which the representative individual of generation t is young and in her life period 0. Lifetime utility is given by Ut ≡ U(C0t, ,C1t,L0t,L1t) with the arguments C0t, C1t, L0t, and L1t denoting consumption and non-leisure in the life periods i = 0, 1. Utility is strictly increasing in consumption, is strictly decreasing in non- leisure, and is strictly concave. Additional restrictions on preferences required if the economy is to exhibit steady state growth are discussed in Section 3. Non-leisure in the second life period, L1t, equals qualified labour supplied to the market in period t+ 1. By contrast, non-leisure in the first life period has to be divided between nonqualified labour supply L0t − Et and education Et. The effect of education is to increase human capital and labour productivity. Ht−1 is the stock of human capital determining the productivity in period t. It is built up by generation t− 1 and inherited by generation t. By spending time Et on education, generation t determines the stock of human capital Ht effective in the second life period. The human capital accumulation equation is G(Et)Ht−1 = Ht. (µtβ t) (4.1) µtβt is a Lagrange multiplier associated with the plan- ner’s problem we are about to set up. The investment func- tion Gt ≡ G(Et) is assumed to be non-negative and strictly 96 monotone increasing with elasticity η(E) ≡ EG ′/G smaller than one. The case of constant elasticity η plays a promi- nent role in static models of endogenous education (Jacobs and Bovenberg (2010b); Richter (2009)) and equally in what follows. A critical implication is G(0)=0 so that the stock of human capital built up by generation t-1 melts down to zero, Ht=0, if generation t does not spend positive time on education. If one assumes instead G(Et) ≡ G˜(Et) + 1− δH with δH < 1 and some function G˜(E) of constant elasticity η˜, then Ht = (1− δH)Ht−1 follows from Et = 0 so that some human capital is passed on to the next generation even if there are no new investments. In this case, the elasticity of the investment function, η(E) = [ 1− 1−δHG(E) ] η˜, is strictly increasing in E. To allow for both scenarios with constant and increasing elasticity of G(E) we assume η ′(E) > 0 in what follows. The functional specification (4.1) is standard in the en- dogenous growth literature. It can be traced back to Uzawa (1965), and it has been used since by Lucas (1988), Atkeson, Chari, and Kehoe (1999), and others. A key feature is that Ht is linear homogenous in Ht−1. A notable implication of (4.1) is that time spent on education (learning) is the only variable input in the production of human capital. In par- ticular, learning cannot be substituted by physical inputs or services supplied by instructors. There is however some cost of instruction which accrues in fixed proportion with education. For simplicity’s sake, it is modelled as a linear function of inherited human capital and time spent on edu- cation, fEtHt−1. It is suggestive to interpret the exogenous parameter f as tuition fee. 97 There is a second stock variable, Kt, to be interpreted as (nonhuman) capital built up by generation t in their first life period. It is not productive before the second life period, and it depreciates at the rate δK. Production F is linear homogenous in capital and effective labour. The resource constraint is Ft + (1− δK)Kt−1 = C0t +C1t−1 + fEtHt−1 +Kt +At (αtβ t) (4.2) with Ft ≡ F(Kt−1, (L0t − Et)Ht−1,L1t−1Ht−1). The variable At denotes exogenous government spending. Such spending may be of consumptive and/or productive use. As At is exogenous, we refrain from making it an explicit argument of the utility and/or production functions. When taking partial derivatives use is made of the following short forms: FKt ≡ ∂F ∂Kt−1 , FL0t ≡ ∂F ∂ ( (L0t − Et)Ht−1 ) , FL1t ≡ ∂F ∂(L1t−1Ht−1) . Qualified and nonqualified labour may be perfect substi- tutes in production, but they need not be. Human capital is obviously labour augmenting. Note that education in- curs two kinds of cost. There is the cost of forgone earnings, FL0tEtHt−1, and the cost of tuition, fEtHt−1. It is not only realism suggesting an explicit modelling of both costs. We shall argue below that the explicit differentiation is the key to understanding why various results derived in the fol- lowing sections deviate from related results derived in the literature. 98 The planner maximizes ∞∑ t=0 βtU(C0t,C1t,L0t,L1t) (4.3) in C0t, C1t, L0t, L1t, Et, Ht, and Kt (t = 0, 1, . . .) sub- ject to the human capital accumulation equation (4.1) and the resource constraint (4.2). The parameters K−1, H−1, L1−1 = L1,t=−1 are exogenously given. 0 < β < 1 is a dis- count factor. Assume that this maximization – like all others still to follow – is well behaved and that it has an interior solution for which all choice variables are strictly positive. We abstain from stating all the assumptions needed to guar- antee a well-behaved maximization with interior solutions. Identifying those assumptions must remain the object of independent research efforts. In the present chapter we just state those assumptions explicitly needed to derive meaningful first-order conditions of second-best policies. We study neither second-order conditions nor questions of existence. As argued in Richter (2009) and as will become clearer below, a well-behaved maximization requires a spec- ification of Ut = U(C0t,C1t,L0t,L1t) which is sufficiently concave to compensate for the lack of concavity of the hu- man capital accumulation equation (4.1). The first-order conditions of the planner’s maximization are as follows: UC0t = αt, UC1t = αt+1β, (4.4) FL0tHt−1UC0t = −UL0t, FL1t+1HtUC1t = −UL1t, (4.5) FKt+1 + 1− δK = UC0t/UC1t = UC0t/βUC0t+1, (4.6) µtG ′t = αt(f+ FL0t), (4.7) αt+1β [ FL1t+1L1t + FL0t+1 · (L0t+1 − Et+1) − fEt+1 ] = 99 µt −βGt+1µt+1. (4.8) The conditions (4.4) and (4.5) characterize efficient con- sumption and labour choices. The condition (4.6) character- izes efficient saving and efficient capital. The condition (4.7) characterizes the efficient choice of Et, and (4.8) is the con- dition characterizing the efficient choice of Ht. Solving (4.7) for µt and inserting into (4.8) yields, after some straight- forward manipulations, the condition characterizing the efficient accumulation of human capital, FL1t+1L1t + FL0t+1L0t+1 − ( FL0t+1 + f ) Et+1 = [FKt+1 + 1− δK] f+ FL0t G ′t −Gt+1 f+ FL0t+1 G ′t+1 . (4.9) For the sake of brevity we also speak of efficient education if (4.9) holds. The first term on the left-hand side, FL1t+1L1t, is the return to human capital accruing to generation t in the second life period, and the difference FL0t+1L0t+1 − (FL0t+1 + f)Et+1 is the return accruing to individuals of the next generation in their first life period. The factor f+ FL0t G ′t = ( f+ FL0t ) Ht−1 dEt dHt (4.10) is the marginal cost of human capital in period t, and f+FL0t+1 G ′t+1 is the marginal cost of human capital one period later. Hence the right-hand side of (4.9) captures the cost resulting from investing in period t instead of postponing the investment to the next period. By separating terms referring to generation t from terms referring to generation t+ 1, (4.9) can be written as 100 [FKt+1 + 1− δK] f+ FL0t G ′t − FL1t+1L1t = FL0t+1L0t+1 + (FL0t+1 + f)Et+1 [ 1 ηt+1 − 1 ] ≡ MEBt,t+1. (4.11) Because ηt+1 < 1 by assumption, MEBt,t+1 is positive. It is the marginal external benefit enjoyed by generation t+1 and generated by the human capital investment of generation t. This excess benefit has to be internalized by first-best policy when individuals are selfish. As a result of internal- ization, generation t’s cost, [FKt+1 + 1− δK] f+FL0t G ′t , exceeds generation t’s return to human capital, FL1t+1L1t. 4.3 balanced growth We speak of balanced growth if the non-leisure choices L0t = L0,L1t = L1, and Et = E are constant across time while consumption, output, and both types of capital all grow at the common gross rate G = G(E), so that we have Ht−1 = GtH−1, Kt−1 = GtK−1, Cit = GtCi0 ≡ GtCi. At balanced growth, FKt+1 = FK is constant in t. If an efficient allocation is to be compatible with balanced growth, then conditions (4.4) and (4.6) require the rates of substitution UC0(G tC0,GtC1,L0,L1) UC0(G t+1C0,Gt+1C1,L0,L1) , UC0(G tC0,GtC1,L0,L1) UC1(G tC0,GtC1,L0,L1) to be both constant in t. Taking total derivatives with respect to t and setting the total derivatives equal to zero implies constancy of [ GtC0 ·UC0C0(G tC0, ..) 101 +GtC1 ·UC1C0(G tC0, ..) ] /UC0(G tC0, ..) = [ GtC0 ·UC0C1(G tC0, ..) +GtC1 ·UC1C1(G tC0, ..) ] /UC1(G tC0, ..) ≡ d− 1 (4.12) in t. Upon substituting C˜i for GtCi and integrating in C˜i one obtains C0UC0 +C1UC1 = dU+ cX (4.13) where d, c are constants and where X is a function of L0, L1. The following two types of utility specifications satisfy this condition: (i) U(C0,C1,L0,L1) = V(C0,C1) ·Λ(L0,L1) −D(L0,L1) (4.14) where V(C0,C1) is homogeneous of degree d 6= 0; (ii) U(C0,C1,L0,L1) = [ a0 lnC0 + a1 lnC1 ] Λ(L0,L1) −D(L0,L1) (4.15) Utility functions of type (4.14) satisfy condition (4.13) when setting c ≡ d 6= 0, X ≡ D and utility functions of type (4.15) satisfy condition (4.13) when setting c ≡ a0+a1, d ≡ 0, X ≡ Λ. Note that both specifications imply that UCi is homogenous of degree d− 1 in consumption. Efficiency at balanced growth additionally requires 102 −ULi(G tC0, . . .) = FLiG t+iH−1UCi(G tC0, . . .) = FLiG td+1H−1(C0, . . .). In other words, −ULi must be homogenous of degree d in consumption. This is clearly fulfilled only if D ≡ constant in (4.14) and Λ ≡ constant in (4.15). W.l.o.g. this means (i) U(C0,C1,L0,L1) = V(C0,C1)Λ(L0,L1), (4.16) where V(C0,C1) is homogenous of degree d 6= 0; (ii) U(C0,C1,L0,L1) = a0 lnC0 + a1 lnC1 −D(L0,L1). (4.17) This may also be seen as follows. Start with equation (4.5) and take logs on both sides: logUC0t + log FL0tHt−1 = log(−UL0t) The LHS grows at the common growth rate G. Hence, the RHS must grow at this rate as well. Differentiate both sides w.r.t. time t: 1 UC0t ( GtC0UC0C0t +G tC1UC0C1t ) + FL0G tH−1 FL0G tH−1 = 1 −UL0t ( −UL0C0tG tC0 −UL0C1tG tC1 ) (4.18) The factor logG, which results from the differentiation w.r.t t, has already been canceled out. Furthermore FL0t = 103 FL0 for all t along the balanced growth path. The RHS is the elasticity of marginal utility of non-leisure time w.r.t. t, which will be denoted σ. From (4.12) follows that the LHS equals d− 1+ 1 = d. Thus, σ has to be constant along the balanced growth path with σ = d. When we calculate σ for the two candidate functions (4.14) and (4.15), then σ =    VC0tC0tΛL0 VΛL0−DL0 + VC1tC1tΛL0 VΛL0−DL0 != d, a0 C0t C0tΛL0 VΛL0−DL0 + a1 C1t C1tΛL0 VΛL0−DL0 != 0. From these two conditions follows DL0 = 0 in the non-log case and ΛL0 = 0 in the log-case. Without loss of generality we set D and Λ equal to one in the non-log and log case, respectively. An earlier characterization of utility functions compatible with growth in consumption and constancy in leisure is due to King, Plosser, and Rebelo (1988, 2002). These au- thors however restrict their study to dynamic equilibria and government policies in a Ramsey-type framework with exogenous growth. Furthermore, they work with utility functions U(C,L) which have only two arguments. (4.16) and (4.17) extend their findings. Assuming balanced growth and utility to be homoge- neous of degree d in consumption, we obtain UC0t = G(d−1)tUC00. Hence FK + 1 − δK = G 1−d/β by (4.6). Fur- thermore, the condition of transversality, βtUC0tKt → 0 for t → ∞, implies (βGd−1)tUC00G tK−1 → 0 for t → ∞, i.e., βGd < 1. As a result, the return to capital exceeds the growth rate: FK + 1− δK = G 1−d/β > G. (4.19) 104 The following analysis studies second-best policy with regard to education, to saving, and also to labour. The focal question, however, is whether it is second best to provide or not to provide efficient incentives for education. As we shall see, much depends on the elasticity of the elasticity of the investment function G(E) and on whether individuals are perfect altruists towards their children or not. In the altruistic model – also called the dynasty model – individuals are assumed to maximize (4.3). In the other case the representative individual is assumed to maximize own lifetime utility U(C0t,C1t,L0t,L1t) (4.20) subject to the own lifetime budget constraint. We study both scenarios, and we start by analyzing efficient taxation in the standard OLG framework with selfish individuals. The approach taken is called the primal approach in optimal taxation. 4.4 optimal taxation in the standard olg model with selfish individuals The selfish individual representing generation t is assumed to maximize (4.20) in the five variables C0t,C1t,L0t,L1t,Et, and savings St subject to the life-period budget constraints ω0t(L0t − Et)Ht−1 = C0t +ϕtEtHt−1 + St (λ0t) (4.21a) 105 and ω1tL1tG(Et)Ht−1 + Rt+1St = C1t. (λ1t) (4.21b) In this optimization Ht−1 is treated as an exogenous parameter. By assumption, any excess supply of savings, St −Kt, is invested in government bonds. ω0t is the wage rate of nonqualified labour, ω1t is the wage rate of qualified labour, ϕt is the tuition fee, and Rt+1 is the return earned on savings. All these prices and costs are after tax and subsidy. For each t there are six first-order conditions UC0t = λ0t, (4.22a) UC1t = λ1t, (4.22b) ω0tHt−1UC0t = −UL0t, (4.23a) ω1tG(Et)Ht−1UC1t = −UL1t, (4.23b) ω1tL1tG ′ tUC1t = ( ϕt +ω0t ) UC0t, (4.24a) Rt+1 = λ0t/λ1t. (4.24b) They are constraints in the planner’s optimal taxation problem we are about to set up. In the primal approach to optimal taxation these conditions are used to substitute for the four relative prices ω0t, ω1t, ϕt, Rt+1, and the two Lagrange multipliers λ0t, λ1t. After substituting, the lifetime budget constraint derived from (4.21a,b) can be written as 1∑ i=0 [ CitUCit + LitULit ] = ηtL1tUL1t. (λ˜tβ t) (4.25) 106 The condition (4.25) assumes the role of an implementabil- ity constraint in the planner’s second-best problem. Because −ηt L1tUL1t UC0t = (4.23),(4.24) ( ϕt +ω0t ) EtHt−1, (4.26) the right-hand side of (4.25) can be interpreted as the private cost of education. As it turns out, the marginal in- crease inHt is of particular significance when characterizing second-best policies. Let us call the marginal increase the private marginal cost of human capital. The formal definition is PMCHCt ≡ − d dHt [ ηt L1tUL1t UC0t ] = − L1tUL1t UC0t dEt dHt d dEt η(Et) = − L1tUL1t UC0t η ′ t G ′tHt−1 = − L1tUL1t UC0t 1 Ht Et ηt dηt dEt . (4.27) The private marginal cost is obviously increasing in the elasticity of the elasticity of G(Et). If the elasticity ηt = η(Et) is constant, PMCHCt = 0 results. If the elasticity is however strictly increasing, PMCHCt is positive. The planner maximizes the sum of discounted lifetime utilities (4.3) in C0t, C1t, L0t, L1t, Et, Ht, and Kt (t = 0, 1, . . .) subject to the implementability constraint (4.25), the human capital accumulation equation (4.1), and the resource con- straint (4.2). In a fully-fledged description of the planner’s maximization one would have to include the first-order con- ditions of profit maximization. However, these conditions can be used to substitute for the endogenous factor prices 107 before taxes and subsidies. Hence, they are not constrain- ing the planner. The solutions are second best in the sense that they have to fulfil the implementability constraint in addition to the first-best constraints (4.1) and (4.2). If lump- sum taxes were available, the planner could ignore (4.25). Inclusion of (4.25) in the set of constraints implies that the planner is restricted in the choice of policy instruments. The restriction is however not an arbitrary one. Quite to the contrary, implicit in the derivation of (4.25) is the assump- tion that the planner is not constrained in setting consumer prices ω0t, ω1t, ϕt, and Rt+1. This means in particular that labour income can be taxed at different rates over an in- dividual’s life cycle. If such differentiation is ruled out by assumption, the planner has to respect an additional con- straint, which may have strong implications for the design of optimal taxation. See Erosa and Gervais (2002) for a dis- cussion of this point in an OLG model without endogenous education. To solve the planner’s problem set Wt ≡ Ut + λ˜t { 1∑ i=0 [ CitUCit + LitULit ] − ηtL1tUL1t } . (4.28) The first-order conditions are as follows: ∂ ∂C0t , ∂ ∂L0t :WC0t = αt = − WL0t FL0tHt−1 , (4.29) ∂ ∂C1t , ∂ ∂L1t :WC1t = αt+1β = − WL1t FL1t+1Ht , (4.30) ∂ ∂Kt : αt+1β[FKt+1 + 1− δK] = αt, (4.31) ∂ ∂Et : µtG ′ tHt−1 = λ˜tη ′ tL1tUL1t +αt(f+ FL0t)Ht−1 ⇒ 108 µt αt = (4.27) f+ FL0t G ′t − λ˜t αt UC0tPMC HC t , (4.32) ∂ ∂Ht : αt+1β [ FL1t+1L1t + FL0t+1 · (L0t+1 − Et+1) − fEt+1 ] + µt+1βGt+1 = µt. (4.33) We wish to derive characterizations of second-best pol- icy with regard to saving, education, and labour. We start with saving. As has been shown by Atkinson and Stiglitz (1972), Sandmo (1974), Atkeson, Chari, and Kehoe (1999), and others, it is efficient not to distort saving if utility is weakly separable between consumption and non-leisure and is homothetic in consumption, U = U(V(C0,C1),L0,L1) with a linear homogeneous function V . The utility func- tions defined in (4.14) and (4.15) are examples of weakly separable and homothetic functions. Weak separability and homotheticity implies WCi UCi = 1+ λ˜ { 1+ 1∑ j=0 [ Cj UCjCi UCi + Lj ULjCi UCi ] − ηL1 UL1Ci UCi } = 1+ λ˜ { 1+ V UVV UV + 1∑ j=0 Lj UVLj UV − ηL1 UVL1 UV } = constant in i = 0, 1. (4.34) Relying on (4.29) – (4.31) and (4.34) this implies FKt+1 + 1− δK = αt αt+1β = WC0t WC1t = UC0t UC1t . (4.35) This has to be interpreted as saying that it is optimal from the planner’s perspective to equate the marginal rate of re- 109 turn to capital with the private marginal rate of substitution in consumption. Proposition 4.1. If behaviour is selfish and if utility is weakly separable between consumption and non-leisure and homothetic in consumption, it is second best not to distort saving. We turn next to education. We first prove that it is efficient not to distort human capital accumulation if the investment function G is isoelastic. We do so by relying on (4.31) – (4.33), which are the first-order conditions with respect to Kt, Et, and Ht. By making use of (4.31) and (4.32), (4.33) can be written as [ FL1t+1L1t + FL0t+1 · (L0t+1 − Et+1) − fEt+1 ] + [ f+ FL0t+1 G ′t+1 − λ˜t+1 αt+1 UC0t+1PMC HC t+1 ] Gt+1 = [ f+ FL0t G ′t − λ˜t αt UC0tPMC HC t ] [FKt+1 + 1− δK] . (4.36) Obviously, (4.36) equals (4.9) whenever PMCHCt+1 = PMC HC t = 0, (4.37) which is the case if η(Et) is constant. Proposition 4.2. Assume selfish behaviour. It is second best not to distort education if the human capital investment function G(E) is isoelastic. Proposition 4.2 is a dynamic version of the education efficiency proposition, well known from static tax analysis (Ja- cobs and Bovenberg (2010b); Bovenberg and Jacobs (2005)). An intuitive explanation is the following. The planner cares 110 about two objectives. One objective is to minimize the effi- ciency loss resulting from distorted choices of consumption and leisure. The other objective is to minimize losses in the rent income generated by education. In general, these two minimizations are not separable, so that the planner has to trade off. Separability is only ensured if the human capital investment function is isoelastic. If this is the case and if the set of policy instruments is sufficiently rich, it is efficient not to distort education and to minimize the effi- ciency loss resulting from distorted choices of consumption and leisure. According to Proposition 4.2 this result extends to the dynamic framework and it does not explicitly rely on the utility specifications (4.14) and (4.15). Things are different if the private marginal cost of human capital is positive. To study this case set ∆t ≡ λ˜t αt UC0tPMC HC t · (FKt+1 + 1− δK) − λ˜t+1 αt+1 UC0t+1PMC HC t+1 ·Gt+1. (4.38) With this definition (4.36) can be written as ∆t = f+ FL0t G ′t (FKt+1 + 1− δK) − f+ FL0t+1 G ′t+1 Gt+1 − FL1t+1L1t − [ FL0t+1 · (L0t+1 − Et+1) − fEt+1 ] . (4.39) Comparison of (4.39) and (4.9) reveals that ∆t is the effi- cient wedge between the social cost and the social benefit of investing in human capital in period t instead of postponing the investment by one period. A positive wedge stands for subsidizing relative to the first best. A priori the sign of ∆t 111 is indeterminate. This is different if (4.38) is evaluated at a balanced growth path. By definition, balanced growth means that the non-leisure choices L0t = L0,L1t = L1, and Et = E are constant in t while consumption, output, and both types of capital all grow at the common gross rate G = G(E), so that we have Ht−1 = GtH−1, Kt−1 = GtK−1, Cit = GtCi0 ≡ GtCi. At balanced growth FKt+1 = FK, Gt+1 = G in t. Because the utility functions are as spec- ified in (4.16) and (4.17), the other variables entering (4.38) take on the following values: (i) UC0t = G (d−1)tUC00 ≡ G (d−1)tUC0 . (ii) PMCHCt = (4.27) − L1tUL1t UC0t 1 G ′(Et)Ht−1 η ′(Et) = − L1UL10G dt UC00G (d−1)t 1 G ′(E)H−1Gt η ′(E) = − L1UL1 UC0 1 G ′H−1 η ′ = PMCHC0 ≡ PMC HC. Because UC0 , UL0 are homogeneous of degree d− 1 in consumption, WC0 , WL0 are likewise homogeneous of de- gree d− 1 in consumption. As a result, the growth factor Gt cancels out in equation (4.29): WC0t = − WL0t FL0tHt−1 . After cancelling out, the only variable carrying an index t in this equation is the Lagrange multiplier λ˜t. Hence (iii) λ˜t = λ˜, and a fortiori (iv) αt =WC0t = G (d−1)tWC00 ≡ G (d−1)tWC0 and UC0t αt = G(d−1)tUC0 G(d−1)tWC0 = UC0 WC0 . 112 Eventually, setting R ≡ FK + 1− δK, (4.38) can be written as ∆ = λ˜ UC0 WC0 · PMCHC · (R−G). (4.40) Interpret λ˜UC0/WC0 as the social cost associated with the implementability constraint. This factor is positive if the implementability constraint is binding, λ˜ > 0, which is the case if the non-availability of lump-sum taxes is a binding constraint.2 In this sense the factor measures the cost result- ing from the non-availability of lump-sum taxes. PMCHC is the private marginal cost of human capital, which is pos- itive by assumption and increasing in η ′. Finally, R−G is the growth gap, which by (4.19) must be positive as well. Hence ∆ is the product of three positive factors. Proposition 4.3. Assume selfish behaviour, and U to satisfy (4.16) or (4.17). At balanced growth it is second best to subsidize education relative to the first best if the private marginal cost of human capital, PMCHC, is positive. The strength of positive distortion increases in (i) the private marginal cost of human capital, (ii) the growth gap, and (iii) the cost resulting from the non-availability of lump- sum taxes. 2 We abstain from proving in detail that the Lagrange multiplier is posi- tive. Jones, Manuelli, and Rossi (1997, p. 109) do this for a maximization which comes close to the present one. The intuition is the following. Paying generation t some positive lump-sum income would show up on the right-hand side of (4.25). The Lagrange multiplier must be pos- itive if increasing such a lump-sum income can be shown to have a negative effect on the planner’s objective function. The effect is indeed negative, because such a lump-sum transfer must be paid at the ex- pense of government funds, which are generated by distortive taxes. Although the government budget constraint is not modelled explicitly, it has to be respected. This follows from Walras’s law. In summary, the non-availability of lump-sum taxes is the reason why λ˜ is positive. 113 This is a remarkable result, for reasons explained be- fore. It is rather evident, and has been noted before, that the laissez-faire level of education is inefficient from the first-best perspective. Without government intervention, selfish indi- viduals externalize the positive effect of own education on descendent generations’ welfare. Not so evident is the result that human capital accumulation should be distorted along balanced growth while capital accumulation should not be distorted, subject to appropriately chosen utility functions. The sign of the efficient distortion is even less obvious. Note that any revenue needed to subsidize the cost of tuition has to be raised by distortionary labour taxes. With the intuition of Lipsey and Lancaster (1956 - 1957) in mind, one could have hypothesized that it is second best to give negative incentives for human capital accumulation relative to the first best if labour has to be taxed. The contrary, however, is true. The key assumption is the strictly increasing elasticity of the human capital investment function with respect to education. If the elasticity is strictly increasing, the private marginal cost of human capital is positive. With a positive private marginal cost of human capital it is second best to subsidize education. This has been shown before by Richter (2009) to hold in static analysis, and it is shown here to extend to the dynamic framework. The need to subsidize in- creases in the factors listed in Proposition 4.3. In particular, it increases in the elasticity of the human capital investment function’s elasticity. We finally turn to the study of labour taxation. Of partic- ular interest is the efficient taxation of nonqualified labour relative to qualified labour. As the definition of Wt in (4.28) is structurally asymmetric in L0t and L1t, one may easily 114 conjecture that qualified and nonqualified labour should be taxed differently. To make a clear case for differentiated tax- ation and to obtain clear-cut results, we focus on balanced growth and specific utility functions. Thus we assume U ≡ 1∑ i=0 [ ai lnCi −Di(Li) ] . (4.41) In this particular case the first-order condition (4.29) im- plies: WL0 + FL0H−1WC0 = 0 ⇔ UL0 + FL0H−1UC0 = λ˜ [ L0D ′′ 0 (L0) +D ′ 0(L0) ] (4.42) Similarly, (4.30) implies WL1 + FL1GH−1WC1 = 0 ⇔ UL1 + FL1GH−1UC1 = λ˜(1− η) [ L1D ′′ 1 (L1) +D ′ 1(L1) ] . (4.43) Denote by νi ≡ LiULiLi/ULi > 0 the elasticity of marginal utility of leisure in life-period i, and define tax rates τi by setting (1− τ0)FL0H−1 = −UL0/UC0 , (1− τ1)FL1GH−1 ≡ −UL1/UC1 ⇔ (1− τi)FLi = ωi. Dividing (4.43) through by (4.42) gives us τ1/(1− τ1) τ0/(1− τ0) = (1− η) ν1 + 1 ν0 + 1 . (4.44) For η = 0, (4.44) is the familiar (inverse) elasticity rule. According to this rule, wage taxes τi should increase in νi. If utility were quasi-linear, the νi would be the inverse of 115 the wage elasticity of labour supply in life-period i. Hence taxes would have to vary inversely with the wage elasticities rendering the rule its name. The rule is extended by (4.44) to allow for endogenous education. The effect of education is to reduce the tax on qualified labour relative to the tax on nonqualified labour. The deviation from the elasticity rule increases in the elasticity of the human capital investment function, η. See Richter (2009), who derives a similar rule for the static framework. Proposition 4.4. Assume selfish behaviour, and U to satisfy (4.41). On a balanced growth path it is then second best to tax labour according to the elasticity rule (4.44). The effect of endogenous education is to reduce the tax on qualified labour relative to the tax on nonqualified labour. 4.5 efficient and effective subsidization of ed- ucation As mentioned in the introduction, OECD data suggest that various countries effectively subsidize tertiary education while others effectively tax tertiary education. Before sub- stantiating such a statement one has to clarify the under- lying notion of effective subsidization and its relation to efficient subsidization. In the recent publication of 2009 the OECD reports es- timates of the private and public net present values for individuals obtaining tertiary education as part of initial ed- ucation in 2005. In present notation the private net present value is 116 NPVpriv ≡ ω1L1GH−1UC1/UC0 − (ϕ+ω0)EH−1 = (20) (ϕ+ω0) [ G EG ′ − 1 ] EH−1 = 1− η η (ϕ+ω0)EH−1. For the sake of brevity, the time index t is dropped. The public net present value is the difference between the social and the private net present values where the social value NPVsoc ≡ FL1L1GH−1/[FK + 1− δK] − (f+ FL0)EH−1 captures only the return to education accruing to the investing generation. Denote by PRR ≡ NPVpriv (ϕ+ω0)EH−1 = 1− η η , SRR ≡ NPVsoc (f+ FL0)EH−1 = FL1L1G [FK + 1− δK](f+ FL0)E − 1 the private rate of return and the social rate of return, respec- tively. Our suggestion is to speak of effective subsidization only to the extent that the private rate exceeds the social rate. Hence denote by s ≡ PRR− SRR PRR (4.45) the effective rate of subsidization. The efficient value seff of this rate is determined by (1− η)seff = def 1− ηFL1L1G [FK + 1− δK](f+ FL0)E = [FK + 1− δK]f+ FL0) −G ′FL1L1 [FK + 1− δK](f+ FL0) = (10),(34) ∆+MEB (FK + 1− δK)(f+ FL0)/G ′ (4.46) 117 where (f+ FL0)/G ′ is the social marginal cost of human capital andMEB = FL0L0+(FL0 + f)E(1−η)/η the marginal external benefit as specified by (4.10) and (4.11). With ∆ and MEB, seff is positive as well. Equation (4.46) confirms the view that there are two reasons for effective subsidization of education. One is the need to internalize the intergenera- tional externality and the other is the need to compensate for distortionary labour taxation. Just for the sake of illus- tration we report the empirical values of s for men as they can be computed by means of the data published by OECD (2009, tables A8.2 and A8.4). Positive values for s are ob- tained in case of TUR (.47), POL (.34), ESP (.22), POR (.20), AUT (.19), CAN (.18), NOR (.10), ITA (.09), and HUN (.04). Negative values are obtained for SWE (-.03), KOR (-.05), DEN (-.05), FIN (-.06), CZE (-.14), USA (-.16), NZL (-.20), GER (-.20), IRL (-.20), FRA (-.32), BEL (-.32), and AUS (-.40). Such extreme differences in effective rates and even more the opposing signs clearly raise the question of which policy is more efficient. A convincing answer however requires a thorough empirical analysis which has to remain the object of future research. The numbers are only reported to illus- trate the empirical relevance of the theoretical investigation undertaken in this chapter. 4.6 optimal taxation in the olg model with altruistic individuals The perfectly altruistic individual is assumed to maximize U˜t ≡ U(C0t,C1t,L0t,L1t) + βU˜t+1, which by recursive sub- stitution amounts to maximizing the sum of discounted lifetime utilities (4.3) in C0t, C1t, L0t, L1t, Et, Ht, and Kt 118 (t = 0, 1, . . .). This objective is maximized subject to the hu- man capital accumulation constraint (4.1) and the dynasty’s budget constraint, ∞∑ t=0 [ pit+1ω1tL1tHt + pitω0t ( L0t − Et ) Ht−1 ] = ∞∑ t=0 [ pitC0t + pit+1C1t + pitϕtEtHt−1 + ( pit − Rt+1pit+1 ) Kt ] (λ). (4.47) The price and cost variables have the same meaning as before. The first-order conditions are (t = 0, 1, . . .) βtUC0t = λpit,β tUC1t = λpit+1, (4.48) ω0tHt−1UC0t = −UL0t,ω1tHtUC1t = −UL1t, (4.49) µtG ′ t = ( ϕt +ω0t ) UC0t,Rt+1 = pit/pit+1, (4.50) λpit+1 [ ω1tL1t +ω0t+1 ( L0t+1 − Et+1 ) −ϕt+1Et+1 ] = βtµt −βt+1Gt+1µt+1. (4.51) The last condition implies λ ∞∑ t=0 pit+1 [ ω1tL1t +ω0t+1 ( L0t+1 − Et+1 ) −ϕt+1Et+1 ] Ht = (4.50) ∞∑ t=0 [ βtµtHt −βt+1µt+1Ht+1 ] = µ0H0 = (4.50) ϕ0 +ω00 G ′0 UC00H0. (4.52) Multiplying the budget constraint (4.47) through by λ and using (4.48), (4.49), (4.50), and (4.52) to substitute for 119 λpit, λpit+1, ω0t, ω1t, and Rt+1 in (4.47) yields the imple- mentability constraint ∞∑ t=0 βt 1∑ i=0 CitUCit = B (λ˜) (4.53) with B ≡ { [ ω00 ( L00 − E0 ) −ϕ0E0 ] H−1 + ϕ0 +ω00 G ′0 H0 } UC00. Similarly, (4.48), (4.49) and (4.50) can be used to substitute for λpit+1, ω0t+1, ω1t, and µt in (4.51), which leaves us with (t = 0, 1, ...) − L1tUL1t −β [( L0t+1 − Et+1 ) UL0t+1 +ϕt+1Et+1UC0t+1Ht ] = (4.51) { µt −βGt+1µt+1 } Ht = µtHt −βµt+1Ht+1 = (4.50) [ ϕtUC0t −UL0t 1 Ht−1 ] Ht G ′t −β [ ϕt+1UC0t+1 −UL0t+1 1 Ht ] Ht+1 G ′t+1 . (γtβt) (4.54) Interpret (4.54) as the Euler equation for human capital ac- cumulation. The planner maximizes the sum of discounted lifetime utilities (4.3) in C0t, C1t, L0t, L1t, Et, Ht, Kt, and ϕt (t = 0, 1, ...) subject to the resource constraint (4.2), the ac- cumulation constraint (4.1), the implementability constraint (4.53), and the Euler equation (4.54). It is important to note that the cost of tuition ϕt+1 (t = 0, 1, . . .) only appears explicitly in equation (4.54). By contrast, the planner’s ob- jective function and the constraints (4.1), (4.2), and (4.53) are independent of ϕt+1. The equation (4.54) can therefore be treated as a relationship by which the “free” policy vari- 120 able ϕt+1 is determined. This solution procedure is feasible because the coefficient of ϕt+1 in (4.54) does not vanish. The coefficient equals βUC0t+1 [ Et+1Ht − Ht+1 G ′t+1 ] = βUC0t+1Et+1Ht ( 1− 1 ηt+1 ) < 0. Hence the planner’s problem is equivalent to the simpli- fied version in which (4.3) is maximized in C0t, C1t, L0t, L1t, Et, Ht, Kt (t = 0, 1, . . .), and ϕ0 subject to (4.1), (4.2), and (4.53). We first study those first-order conditions of the simpli- fied planner’s problem which are associated with variables which do not enter the implementability constraint (4.53) or which drop out when making particular assumptions. The optimization with respect to those variables is not affected by (4.53) and should therefore remain undistorted. Proposition 4.5. Assume altruistic behaviour. Then it is second best not to distort education for all generations except the first. Proposition 4.6. Assume altruistic behaviour and the utility function to be weakly separable between consumption and non-leisure and homothetic in consumption. Then it is second best not to distort the accumu- lation of capital for all generations except the first. Proposition 4.7. Assume altruistic behaviour and U to satisfy (4.16) or (4.17). Then it is second best to tax qualified and nonqualified labour uniformly. This holds for all generations except the first. The proof of Proposition 4.5 is rather straightforward. Just note that the variables Et, Ht, Kt (t > 0) do not enter 121 the implementability constraint. Taking partial derivatives of the Lagrange function with respect to these variables and substituting for the Lagrange multipliers µt, αt yields the efficiency condition (4.9) for t > 0. The proof of Proposition 4.6 parallels the one of Proposition 4.1 and is therefore skipped. The proof of Proposition 4.7 is as follows. Set Wt ≡ Ut + λ˜ 1∑ i=0 CitUCit. If V is homogeneous of degree d 6= 0, then WLit = (1+ λ˜d)ULit (i = 0, 1). Hence the social and the private marginal rates of intertemporal substitution in non-leisure are equal, WL1t WL0t = UL1t UL0t = (43) ω1tHtUC1t ω0tHt−1UC0t . (4.55) The equation (4.55) is equally obtained if V is homoge- neous of degree zero in the sense of (4.15) with D ≡ 0. Taking partial derivatives of the Lagrange function with respect to Kt, L0t, L1t, yields (4.31) andWL0t = −αtFL0tHt−1, WL1t = −αt+1βFL1t+1Ht (t > 0). Therefore, (4.55)⇔ αt+1βFL1t+1Ht αtFL0tHt−1 = ω1tHtUC1t ω0tHt−1UC0t ⇔ FL1t+1 FL0t = (4.31) [ FKt+1 + 1− δK ]ω1t ω0t UC1t UC0t . Define tax rates τit by setting 1− τ1t ≡ ω1t/FL1t+1, 1− τ0t ≡ ω0t/FL0t. Hence, (4.55)⇔ 1− τ0t 1− τ1t = [ FKt+1 + 1− δK ]UC1t UC0t . (4.56) The utility functions assumed to hold for Proposition 4.7 are weakly separable between consumption and non- 122 leisure and homothetic in consumption. Hence Proposition 4.6 applies and it is second best not to distort saving. As a result, the right-hand of (4.56) equals one and labour tax rates are independent of age. Proposition 4.6 is just what one would expect in view of the literature. Proposition 4.7 is less obvious, and it even allows us to qualify the main result of Erosa and Gervais (2002) stating that it is generally optimal to differentiate labour taxes across the individual life cycle. The intuitive ex- planation for this result is that labour supplied in the second life period differs from labour supplied in the first period. While Proposition 4.4 confirms the result of Erosa and Ger- vais (2002) on assuming selfish individuals, Proposition 4.7 does not. Obviously, in the present framework altruism removes the need to employ age-dependent labour taxes for descendent generations. Age-dependent labour taxes would then be used only as a correcting device if it were second best to distort saving. This becomes clearer when consider- ing utility functions which are additive separable between consumption and non-leisure, U = V(C0,C1) −D(L0,L1). In this case (4.56) would equally hold but the right-hand side of (4.56) would only equal one in the optimum if V were homothetic. This is a noteworthy qualification of Erosa and Gervais (2002). Above, it is derived from the equality of the social and private marginal rates of intertemporal substitution in non-leisure, (4.55). For this equality to hold we have to assume not only altruism, but also a sufficiently rich set of policy instruments. In particular, the planner must be able to choose ωit independently of ϕt. In other words, the planner must be able to optimize the taxation of labour separately from the subsidization of education. 123 The generality of Proposition 4.5 is striking. The propo- sition holds for arbitrary utility and human capital invest- ment functions, and it does not assume balanced growth. This generality is not only remarkable as such. It allows one to qualify related results by Jones, Manuelli, and Rossi (1997) and Atkeson, Chari, and Kehoe (1999). Those results suggest that human capital does not differ that strongly from physical capital to justify different tax policies. In fact, both kinds of accumulation processes should remain undis- torted along balanced growth. By contrast, Propositions 4.5 and 4.6 highlight strong differences. The case for leaving education undistorted is much stronger than the case of undistorted saving. The explanation for such deviating results is as follows. Jones, Manuelli, and Rossi (1997) and Atkeson, Chari, and Kehoe (1999) derive their results in the standard Ramsey model with an infinite planning horizon. At first sight, the proofs show strong parallels to the one given above. In each case, the planner is assumed to maximize the sum of discounted utilities subject to a resource constraint, an accu- mulation constraint of human capital, an implementability constraint, and an Euler equation for human capital accumu- lation. Equally, the proof runs in each case by first solving the problem obtained when discarding the Euler equation and by then showing that the Euler equation is fulfilled. Differences come in when solving the relaxed problem and when arguing why the Euler equation is fulfilled. In Jones, Manuelli, and Rossi (1997) and in Atkeson, Chari, and Ke- hoe (1999, Prop. 5) the Euler equation is one which is not distorted by prices. Hence only undistorted allocations solv- ing the relaxed problem are able to solve the non-relaxed 124 problem. The only solutions fulfilling such requirements are allocations that converge to a balanced growth path and that are obtained when all taxes are zero along such path. The present analysis is much less constraining. The solutions of the relaxed problem need not be undistorted allocations. The Euler equation is not really constraining the planner because it contains the free policy variable. Once more, this demonstrates the importance of modelling two different costs of education. Modelling the cost of foregone earnings only but not the cost of tuition - as Jones, Manuelli, and Rossi (1997) and Atkeson, Chari, and Kehoe (1999) do - has the effect that an instrument is missing allowing the planner to control education independently of labour. The generality of Proposition 4.5 strongly reminds one of the Production Efficiency Theorem of Diamond and Mir- rlees (1971). According to this theorem the allocation of intermediate goods should not be distorted in second best if no lump-sum income accrues to the private sector. This is just what holds in the present model. Investment in hu- man capital is modelled as an intermediate good in the sense that it does not affect the implementability constraint (4.53) for t > 0. Furthermore, the only lump-sum income modelled is income earned by the parent generation liv- ing in period 0. On setting pi0 = 1, this income equals FK0K−1+ FL10L1−1H−1+(1− δK)K−1. It does not show up in the dynasty’s budget constraint (4.47). It must therefore be income accruing to the government budget. The Production Efficiency Theorem is applicable, and Proposition 4.5 can be considered to be a corollary. The recommendation not to distort education is not easily translated into explicit tax and subsidy rates. The reason is 125 that private incentives are affected by a whole set of tax and subsidy rates, which all must be optimally set. Just inspect the altruist’s first-order condition (4.51) determining the optimal amount of human capital. After substituting for the Lagrange multipliers one obtains ω1tL1t +ω0t+1L0t+1 − ( ω0t+1 +ϕt+1 ) Et+1 = Rt+1 ϕt +ω0t G ′t −Gt+1 ϕt+1 +ω0t+1 G ′t+1 . (4.57) This condition reveals that the altruist’s incentive to invest in human capital is not only affected by taxes on own labour income and the subsidy paid to the own cost of tuition. It is additionally affected by the tax on savings, by the next generation’s tax on nonqualified labour, and finally by the subsidy paid to the next generation’s cost of tuition. More can be said only after making specific assumptions. Just for the sake of illustration, assume U to satisfy (4.16) or (4.17). Hence Propositions 4.6 and 4.7 apply, and it is optimal not to tax saving, Rt+1 = FKt+1 + 1− δK, and to tax labour independently of age, 1− τt ≡ ω1t/FL1t+1 = ω0t/FL0t (t > 0). Only if optimal wage taxes do neither differentiate across generations, τt = τ, can one infer that it is compatible with efficiency for the cost of tuition to be subsidized at the same rate as labour income is taxed, ϕ = (1 − τ)f. This follows immediately from comparing (4.57) with (4.9). If the mentioned assumptions do not hold, it is difficult to make definite statements about the efficient structural relationship between labour tax rates and education subsidy rates. The government has to finance the exogenous cash flow of government expenditures At (t > 0). If the amount of pure profit earned by the government is insufficient, 126 distortionary taxes have to be employed to balance the budget. In this case, the implementability constraint (4.53) is binding, and it cannot be ruled out that it is efficient to distort the choice of education of generation 0. This raises the question of how to design optimal human capital policy for generation 0. As we are going to learn, the answer comes close to what has been shown to be efficient in the world of selfish individuals. More precisely, generation 0’s education should not be distorted if the human capital investment function is isoelastic. If however the private marginal cost of human capital is positive, education should be positively distorted relative to the first best. To show this we maximize (4.3) subject to (4.1), (4.2), (4.53), and (4.54). Taking partial derivatives of the Lagrange function yields the following results after some tedious but straightforward manipulations have been made: ∂ ∂ϕ0 : γ0 = −λ˜(1− η0), (4.58) ∂ ∂ϕ1 : γ1 = γ0(1− η1), (4.59) ∂ ∂E0 : µ0 α0 = (4.58) f+ FL00 G ′0 − λ˜ α0 UC00 ϕ0 +ω00 G ′0 [ 1+ E0G ′′0 G ′0 ] , (4.60) ∂ ∂E1 : µ1 α1 = (4.58),(4.59) f+ FL01 G ′1 − λ˜ α1 UC01 ϕ1 +ω01 G ′1 [ 1− E0G ′0 G0 ] [ 1+ E1G ′′1 G ′1 ] . (4.61) The first-order condition with respect to K0 is the same as (4.31) for t = 0. By making use of (4.58)–(4.61) and (4.31) for t = 0 we end up with 127 ∂ ∂H0 : ∆0 = f+ FL00 G ′0 ( FK1 + 1− δK ) − f+ FL01 G ′1 G1 − FL11L10 − [ FL01 · ( L01 − E1 ) − fE1 ] , (4.62) where ∆0 ≡ λ˜ α0 UC00 · PMC HC 0 · ( FK1 + 1− δK ) − λ˜(1− η0) α1 UC01 · PMC HC 1 ·G1 (4.63) and (t = 0, 1) PMCHCt ≡ − L1tUL1t UC0t η ′ t G ′tHt−1 = (4.26) ϕt +ω0t G ′t Etη ′t ηt = ϕt +ω0t G ′t [ 1− EtG ′t Gt + EtG ′′t G ′t ] . (4.64) The variables ∆0 and PMCHCt are defined so that the par- allels with (4.38) and (4.27) show up. As PMCHCt vanishes for isoelastic G(Et), we obtain Proposition 4.8. Assume altruistic behaviour and the human capital investment function G to be isoelastic. Then it is second best not to distort the first generation’s educational choice. Proposition 4.8 is just the altruistic analogue to Propo- sition 4.2. It is a result that one could easily conjecture. Altruism goes beyond selfishness in internalizing efficiency effects. If it is second best not to distort education given that G is isoelastic and behaviour selfish, then it should all the more be second best not to distort education given that G is isoelastic and behaviour altruistic. Things are less straightforward if the private marginal cost of human capital is positive. Without making further 128 assumptions, it is difficult to sign ∆0. However, we are able to derive a direct analogue to Proposition 4.3. More pre- cisely, ∆0 can be shown to be positive if the growth path is balanced and if utility satisfies (4.16) or (4.17). The assump- tion of balanced growth has the effect of neutralizing the impact of initialization. The proof is only sketched. First note that ω0t = ω0 follows from (4.48). In a second step Gdt is shown to be a factor that cancels out of the constraint (4.54), so that ϕt and ϕt+1 are the only remaining variables in (4.54) carrying an index t. The equation can then be used to solve forϕt = ϕt+1 ≡ ϕ. This is a feasible procedure, as the coefficient of ϕ does not vanish. Just note that after dividing through by Gdt the coefficient equals βGdUC00 [ EH− GH G ′ ] +UC00 GH G ′ = UC00EH [ βGd + 1 η ( 1−βGd )] . The condition of transversality, βGd < 1, implies that the coefficient is positive. Plugging ϕ into (4.64) yields PMCHCt = PMC HC. Assume PMCHC > 0 and prove ∆0 = ∆ > 0 by inspecting (4.63) and by noting λ˜ α0 UC00 · ( FK1 + 1− δK ) > λ˜(1− η) α1 UC01 ·G ⇔ (4.31) UC00 > β(1− η)G d−1UC00 ·G ⇔ 1 > (1− η) ·βGd. The last inequality follows from η < 1 and, once more, from the condition of transversality. 129 Proposition 4.9. Assume altruistic behaviour, and U to satisfy (4.16) or (4.17). At balanced growth it is second best to subsidize the first generation’s educational choice relative to the first best if the private marginal cost of human capital, PMCHC, is positive. It would be nice if one could similarly characterize sec- ond-best policy with regard to the first generation’s choice of labour and saving. However, analogues to Propositions 1 and 4 seem not to hold. In particular, it seems that the first generation’s saving decision is systematically distorted. The reason is the factor UC00 entering the right-hand side of (4.53). This factor implies a lack of symmetry when taking partial derivatives of B with respect to Ci0 (i = 0, 1). As a result it is second best to distort saving. The parallelism between Propositions 4.9 and 4.3 allows us to tell a unifying story for selfish and altruistic individu- als. Altruism well reduces the need to subsidize education relative to laissez-faire. Altruism also implies that the second- best tax policy for descendent generations is more like the first-best policy. The accumulation of human capital should remain undistorted, and – if utility functions are well se- lected – labour taxes need not be differentiated across the individual life cycle. The short-run policy recommendations for altruism, however, parallel the long-run recommenda- tions for selfishness. Labour has to be taxed, and – given that the elasticity of the human capital investment function is strictly increasing – education should be subsidized rel- ative to the first best. Whether saving should be taxed is not a matter of selfishness or altruism. With regard to de- scendent generations it primarily depends on assumptions made with regard to the marginal rate of intertemporal 130 substitution in consumption. In any case, the recommenda- tion not to distort education is better founded in dynamic analysis than the recommendation not to distort saving. 4.7 summary The accumulation of human capital may suffer from all sorts of potential inefficiencies. Most of them have simply been assumed away in the present study. Such a procedure is, no doubt, debatable. Critical is the ignoring of possible causes of capital market or policy failure. Even more critical is the ignoring of individual heterogeneity and informa- tional asymmetry. Still, the procedure is defended with the objective of studying efficient taxation in Ramsey’s tradition. More precisely, this chapter aims at bridging the gap that separates the two strands of Ramsey tax analyses which exist for the finite and the infinite planning horizon. Our knowledge of efficient human capital policy in Ramsey’s tradition is largely shaped by incompatible results derived for the different horizons. The results derived for the infi- nite horizon suggest that education should not be distorted in the long run, just as saving should not be distorted in the long run. Hence it seems as if efficient policy does not differentiate between human and nonhuman capital. By way of contrast, the results in finite horizon strongly suggest differentiated policies. Whether education should be distorted or not appears to depend primarily on how education affects the individual’s earning potential. More precisely, only if the earnings function is weakly separable in qualified labour supply and education and if the elastic- ity with respect to the latter is constant, should the choice 131 of education be not distorted by second-best policy (Jacobs and Bovenberg, 2010b). By way of contrast, the question of whether saving should be distorted or not primarily has to be answered with regard to the taxpayer’s prefer- ences. More precisely, saving should not be taxed if the taxpayer’s utility is weakly separable between consumption and labour/non-leisure and homothetic in consumption (Atkinson and Stiglitz, 1972). The model filling the gap between finite and infinite Ramsey tax analyses is one with overlapping generations. The present chapter studies second-best policy for educa- tion, saving, and labour in such an overlapping-generations model with endogenous growth. There have been earlier attempts to do the same. In view of the present study, two attempts deserve to be cited more than others. These are by Atkeson, Chari, and Kehoe (1999) on one side and by Wig- ger (2002, Sec. 3.4) and Docquier, Paddison, and Pestieau (2007) on the other side. The most conspicuous differences to the present study are the following ones. The focus of the present study is on human capital accumulation, while the focus of Atkeson, Chari, and Kehoe (1999) is on nonhuman capital. Their paper contains extensions to both endoge- nous education and overlapping generations, but it fails to integrate the two. The work of Wigger (2002) and Doc- quier, Paddison, and Pestieau (2007) does integrate them. However, it does not allow for endogenous labour supply and second-best taxation. The authors assume the availabil- ity of non-distorting tax instruments, which the present study does not. In a sense, the present chapter starts where Atkeson, Chari, and Kehoe (1999)and where Wigger (2002) and Docquier, Paddison, and Pestieau (2007) stop. It goes 132 beyond Atkeson, Chari, and Kehoe (1999) by integrating endogenous education and overlapping generations, and it goes beyond Wigger (2002) and Docquier, Paddison, and Pestieau (2007) by endogenizing labour supply and by do- ing second-best tax analysis. The present chapter studies two possible reasons for al- locational inefficiency. One is the non-availability of non- distorting tax instruments. The other is individual selfish- ness. Taxpayers are assumed to externalize the positive effect that their human capital investments have on the pro- ductivity of descendent generations. As stressed by Wigger (2002) and by Docquier, Paddison, and Pestieau (2007), self- ishness is the source of an intergenerational externality. It gives reason to subsidize education relative to laissez-faire. Such subsidization, however, requires government revenues. In the framework studied by Wigger (2002) and by Docquier, Paddison, and Pestieau (2007) it is efficient to subsidize education up to the first-best level where marginal social costs equal marginal social returns. The result assumes the availability of non-distortionary tax instruments. The key assumption of the present study, however, is that no tax instruments are available that would allow the government to raise the revenue needed to subsidize education without creating distortions. As it turns out, it is still second best not to distort education if only the human capital investment function is isoelastic. This result can be considered to be the dynamic version of the education efficiency proposition known from static Ramsey analysis. It is, however, argued that an isoelastic investment func- tion has the unappealing implication that all human capital accumulated by past generations melts down to zero if only 133 one generation stops investing. If, by way of contrast, hu- man capital depreciates just by some fraction and if the investment function’s elasticity is strictly increasing, then investment incentives should overshoot the first best at bal- anced growth. In other words, it is efficient in the long run to combine positive tax wedges in the labour market with an effective subsidy wedge for education. The need to subsidize is shown to increase in (i) the private marginal cost of human capital, (ii) the cost resulting from the non- availability of lump-sum taxes, and (iii) the growth gap. Furthermore, it turns out to be efficient to tax labour such that qualified labour is less distorted than nonqualified labour. If taxpayers are altruists with respect to descendent gen- erations, one clear reason for government intervention does not apply. The effect that education has on descendent gen- erations’ productivity is internalized by altruists. The only remaining inefficiency modelled in this chapter is caused by the need to employ distortionary taxes for financing government expenditures. As it turns out, all generations except the first one should still be given non-distorted in- centives for accumulating human. Furthermore, if utility functions are well selected, saving should not be distorted and labour should be taxed uniformly across the individ- ual life cycle. These results contrast with results derived by Jones, Manuelli, and Rossi (1997), Atkeson, Chari, and Kehoe (1999), and Erosa and Gervais (2002). While Jones, Manuelli, and Rossi (1997) and Atkeson, Chari, and Kehoe (1999, Prop. 5) suggest that human capital does not differ that strongly from physical capital to justify different dy- namic tax policies, the present analysis highlights strong 134 differences. If altruism is assumed, the case for leaving education undistorted is much stronger than the case of undistorted saving. Our results equally allow us to qualify the main result of Erosa and Gervais (2002), who stress the need to employ age-dependent labour taxes in second best. In the present framework, however, altruism has the effect of implying equality of the social and private marginal rates of intertemporal substitution in non-leisure. The opti- mality of uniform labour taxation is an immediate though intriguing corollary to this equality. The results on non-distortionary taxation do not require removing all distortions. On the contrary, the labour supply of descendent generations will be distorted if the govern- ment has to finance exogenous government expenditures by relying on distortionary instruments. Nor do the results on non-distortionary taxation extend to the dynasty’s first generation, indexed by zero in the present chapter. A more precise characterization of optimal policy for generation 0 is difficult, as the specific features not only depend on the shape of the human capital investment function but also on initial values of key variables. As in the case with selfish individuals, it is efficient not to distort education if the investment function is isoelastic in education. If, how- ever, the elasticity is strictly increasing and if the impact of initialization is suppressed by assuming balanced growth, it is second best to subsidize education relative to the first best. The reason is the same as the one given before in the scenario with selfish individuals. A strictly increasing elasticity of the investment function has the effect that it is second best to subsidize education in static analysis, and this effect extends to the dynamic framework. At balanced 135 growth the need to subsidize increases in the derivative of the investment function’s elasticity, and it is the stronger, the more binding the non-availability of lump-sum taxes is and the more deficient growth is. The unifying bottom line for selfish and altruistic in- dividuals is as follows. Altruism well reduces the need to subsidize education relative to laissez faire, and altruism also implies that descendent generations should be given non- distorted incentives for accumulating human capital. The short-run policy recommendations for altruism, however, agree with the long-run recommendations for selfishness. Labour has to be taxed, and – given that the elasticity of the human capital investment function is strictly increasing – education should be subsidized relative to the first best. Whether saving should be taxed is not a matter of self- ishness or altruism. It primarily depends on assumptions made with regard to the marginal rate of intertemporal substitution in consumption. 136 5 E F F I C I E N T S U B S I D I Z AT I O N O F H U M A N C A P I TA L A C C U M U L AT I O N W I T H O V E R L A P P I N G G E N E R AT I O N S A N D E N D O G E N O U S G R O W T H : A N U M E R I C A L E X A M P L E 5.1 introduction Chapter 4 studies second best policies for education, saving, and labor in a two-period OLG model in which endoge- nous growth results from human capital accumulation. It identifies two effects that serve to justify the subsidization of education if individuals are selfish: (i) Internalize the external intergenerational effect and (ii) account for dis- tortionary labor taxation. Propositions 4.2 and 4.3 are the pivotal results: Proposition 4.2. Assume selfish behaviour. It is second best not to distort education if the human capital investment function G(E) is isoelastic. Proposition 4.3. Assume selfish behaviour, and U to satisfy balanced growth properties. At balanced growth it is second best to subsidize education relative to the first best if the private marginal cost of human capital, PMCHC, is positive. The strength of positive distortion increases in (i) the private marginal cost of human capital, (ii) the growth gap, and (iii) the cost resulting from the non-availability of lump-sum taxes. 137 This chapter numerically analyzes these propositions and studies the importance of the intergenerational external ef- fect and of the distortionary labor taxation effect as reasons to justify the subsidization of education. As it turns out, the case for subsidizing education to account for distor- tionary labor taxation is rather weak. The still dominant justification for subsidizing education is to internalize inter- generational externalities. This result is robust and holds for a wide range of parameter values. 5.2 restrictions on the utility and production functions To have a model that exhibits balanced growth, certain restrictions on the utility function and on the production functions must be imposed. 5.2.1 Utility Function Section 4.3 derives the balanced growth properties that the utility function must satisfy. The individual’s utility function used in the present analysis is as follows: Ut = U(C0t,C1t,L0t,Et,L1t) =    (C0t(1−L0t−Et)γ0) 1−ψ 1−ψ + ρ (C1t(1−L1t)γ1) 1−ψ 1−ψ , 0 < ψ 6= 1. lnC0t + γ ln(1− L0t − Et) +ρ(lnC1t + γ ln(1− L1t)), ψ = 1. (5.1) 138 ρ is the individual’s discount factor, which may equal the planner’s discount factor β. γ0 and γ1 are taste parameters for leisure. The individual’s time endowment is normalized to one, that is, the time constraints read 1 = L0+E+ leisure and 1 = L1 + leisure. 1/ψ is the individual’s intertemporal elasticity of substitution in consumption. (5.1) is a special case of conditions (4.16) and (4.17). It is straightforward to verify the balanced growth path proper- ties as explained in section 4.3. The utility function used here is time-separable. Without loss of generality it explicitly includes the time E spent on education when young. The whole analysis in chapter 4 can be done using the present specification of the utility function without affecting any results. It is used here to facilitate the calibration of how the individual spends his discretionary time on leisure, working and education. 5.2.2 Production Function The firm’s technology is given by the following Cobb- Douglas production function: Ft = F ( Kt−1,L0tHt−1,L1t−1Ht−1 ) = AKαt−1 ( L0tHt−1 + L1t−1Ht−1 )1−α. A is the total factor productivity, which is a scaling con- stant that can be arbitrarily set. Following Auerbach, Kot- likoff, and Skinner (1983, p. 87-88) and Erosa and Gervais (2002), the labor input is the sum of effective units of labor supply of the young and old individual. This specification 139 implies that a young and old individual’s marginal product of labor is equal, FL0t = FL1t = FLt. As one can see, along the balanced growth path at which the stock of physical capital increases at the same rate as the stock of human capital, the marginal products of capital and labor stay constant. 5.2.3 Human Capital Investment Function The accumulation of human capital is described by the following law of motion: G(Et)Ht−1 = Ht. The function expressing the stock Ht of human capital in the next period features constant returns to scale with respect to the current stock Ht−1 of human capital. This specification is another requirement for the economy to grow along a balanced-growth path (Lucas (1988), Caballe and Santos (1993)). The human capital investment function G(E) reads G(E) = DEη˜ + 1− δH, δH 6 1, where D > 0 is a shift parameter and δH is the rate at which human capital depreciates. Let η(E) denote the elasticity of the function G(E). Then: η(E) = [ 1− 1− δH G(E) ] η˜. As long as the stock of human capital does not fully depreciate, the elasticity η is increasing in E. 140 5.3 calibration To numerically analyze the model and to solve for the balanced-growth path allocation, it is necessary to trans- form the model with growth into one without growth. To do so, divide all growing variables by the current stock Ht−1 of human capital: Kˆt−1 = Kt−1 Ht−1 , Cˆ0t = C0t Ht−1 , Cˆ1t = C1t Ht−1 , Aˆt = At Ht−1 . (5.2) This procedure ensures that all hat variables are constant along the balanced growth path. The remaining variables L0t, L1t and Et remain constant. The normalized conditions of the selfish individual’s problem, evaluated along the balanced growth path, are as follows: (1− τ0)(1−α)AKˆ α(L0 + L1) −αL0 = Cˆ0 + (1− τ H)fE + (Kˆ + Bˆ)G(E), (5.3) (1− τ1)(1−α)AKˆ α(L0 + L1) −αL1 + [ (1− τK) ( αAKˆα−1(L0 + L1) 1−α ) + 1− δK ] × (Kˆ + Bˆ) = Cˆ1/G(E), (5.4) (1− τK)αAKˆα−1(L0 + L1) 1−α + 1− δK = ρ−1 ( Cˆ1 Cˆ0 )ψ( (1− L0 − E)γ0 (1− L1)γ1 )1−ψ , (5.5) (1− τ0)(1−α)AKˆ α(L0 + L1) −α = γ0 Cˆ0 1− L0 − E , (5.6) (1− τ1)(1−α)AKˆ α(L0 + L1) −α = γ1 Cˆ1/G(E) 1− L1 , (5.7) (1− τ1)(1−α)AKˆα(L0 + L1)−αL1Dη˜Eη˜−1 (1− τK)αAKˆα−1(L0 + L1)1−α + 1− δK 141 = (1− τH)f+ (1− τ0)(1−α)AKˆ α(L0 + L1) −α, (5.8) AKˆα ( (L0 + L1) )1−α + (1− δK)Kˆ = Cˆ0 + Cˆ1/G(E) + fE +G(E)Kˆ + Aˆ. (5.9) The equations are specializations of the equations of the general model in section 4.4. (5.3) and (5.4) are the indi- vidual’s budget constraints when young and old. (5.5) - (5.8) are the individual’s marginal conditions. (5.9) is the resource constraint. The government’s budget constraint is satisfied because of Walras’s law. The parameters of the model are chosen as follows. The utility function is logarithmic, i.e., the intertemporal elastic- ity of substitution is equal to one, ψ = 1, which is a value frequently used in the literature, for instance in a related analysis by Chari, Christiano, and Kehoe (1994). The annual time preference rate is ρ is 0.96, which corresponds to a quarterly value of 0.99 (= 0.961/4). Total factor productivity is equal to 8. Varying the pa- rameter A only has level effects and does not affect the qualitative results. The capital’s share α is set equal to 0.29, which is a value derived by Gomme and Rupert (2007). It is assumed that the stock of physical capital is completely depreciated after 30 years (De la Croix and Michel, 2002, p. 338). Tax rates on labor and capital income are uniformly set equal to 30%. Several methodologies have been put forward to estimate average tax rates on capital and labor income. See Gomme and Rupert (2007) for a discussion. Tax rates ranging from a low 17% to a high 30% on labor income and tax rates on capital income between 27% and 50% are reported by Mendoza, Razin, and Tesar (1994). Gomme and 142 Rupert (2007) use 22% and 29% as tax rates on capital and labor income. And finally, Bouzahzah, De la Croix, and Docquier (2002) set the tax rate on labor income equal to 29%. The present choice of tax rates therefore is not too far away from what the literature suggests. The level of government consumption Aˆ is set to equal 40% of output: Aˆ = 0.4×AKˆα ( (L0 + L1) )1−α. (5.10) This is a high number compared to a low 20%, which is often used in the literature (Lucas (1990), Chari, Christiano, and Kehoe (1994)). The perhaps unusual choice may be justified as follows. As the theory suggests, it is optimal to subsidize education to account for distortionary labor taxation. If the need for tax revenue is high, the negative effects of distortionary taxation can be expected to be large. The results characterizing the need to subsidize education, that will be presented and discussed below, therefore do not underestimate the negative effect of distortionary taxation. If at a high level of government consumption the case for subsidizing education is weak, then this is also true for lower levels. (5.10) implies that the level of government consumption grows at the common growth rate. This means that the level of government consumption is not fixed but only the share. If one fixes the level, government consumption drops to a negligible fraction of output as the economy grows along the balanced growth path. To avoid this unrealistic prop- erty, it is necessary to extend the setup of the model such that in equilibrium the share of government expenditures 143 remains constant along the balanced growth path. Section 5.4 provides the details. The choice of the two leisure parameters γ0 and γ1 en- sures that total non-leisure time is not larger than 0.5. More specifically, when young the individual spends a fraction of about 46% on working and the remainder of about 4% on education. In a related analysis, Bouzahzah, De la Croix, and Docquier (2002) numerically solve a six-periods-OLG model with endogenous labor, saving and education. They use data from time use surveys to calibrate their model. If I use their data and compute it to a two-period model, I end up with the numbers 13/30 and 1/15 for the fractions of time spent on working when young and old, and on education. One could also use data from the American Time Use Survey . First, one builds the age groups 15-44 years and 45- 74 years. Second, one computes time allocated to working and education. The last step involves relating these numbers to discretionary time, which is 24h minus time spent on sleeping and personal use. I used data from the waves 2003 to 2009 and ended up with the following numbers: L0 = 0.2620, E = 0.0530 and L1 = 0.2323. The present choice comes somewhat close to these num- bers. Unfortunately, it was not possible to precisely match these numbers. As it turned out, other parameters, in par- ticular the parameters of the human capital investment function, responded very sensitive to the choice of the time inputs, which then gave rise to meaningless results. Cali- brating the two leisure parameters in a way such that total non-leisure time is not larger than 50% of discretionary time allows for the greatest flexibility. The proper calibration of 144 time is one of the most difficult issues. Even the informa- tion provided Lucas (1990) how he has done the job is a bit blurry. The human capital investment function is specified as fol- lows. First, the elasticity parameter η˜ is fixed to 0.5. To my knowledge, nobody so far has tried to seriously estimate a human capital investment function that represents a period of 30 years. The choice of the parameter therefore cannot claim to match an observable characteristic of the economy. Bouzahzah, De la Croix, and Docquier (2002) use values in the range of 0.1 and 0.3 also without further justification. Lucas (1990) uses 0.8 by referring to Rosen (1976), who esti- mates a value of 0.76. Theory only requires the parameter to lie between zero and one. Other than that, I have no priors and choose a value that lies somewhat in-between. Given this choice and the amount of education the shift parameter D is chosen to match an annual growth rate of 1.8%, which is a value used by Bouzahzah, De la Croix, and Docquier (2002): DEη˜ + 1− δH = G(E) = (1+ 0.018) 30 (5.11) Finally, it is assumed that education expenses are fully subsidized. Initially, the government bears the burden of education related expenses. The fee parameter is one, f = 1. This choice implies that in equilibrium education expenses amount to about 1% of output. Because education expenses are multiplied by the current stock of human capital, the share of education expenses stays constant in the steady state. 145 Table 5.1: Initial equilibrium δH = 1 δH = 0.5 δH = 0 Annual growth rate 0.018 Debt/output −0.004 Gov. spending/output 0.400 Edu. expenses/output 0.013 0.009 0.005 Consumption/output 0.545 0.548 0.552 Investment/output 0.042 0.042 0.043 Young labor tax rate τL0 0.300 Old labor tax rate τL1 0.300 Capital tax rate τK 0.300 Subsidy rate τH 1 Annual interest rate 0.089 0.088 0.088 Young labor Supply L0 0.448 0.463 0.478 Old labor Supply L1 0.500 Education E 0.052 0.037 0.022 Because the theory suggests that for human capital depre- ciation rates lower than one education should be subsidized, the initial equilibrium is calibrated to three different choices of the human capital depreciation rate. This gives a set of parameters for three initial economies that share the follow- ing characteristics: Consumption, investment and education as a fraction of output amount to about 55%, 4% and 1%, respectively. The remaining 40% is spent on government consumption. The economies grow at the annual rate of 1.8%. Initial government’s assets are worth about 0.4% of output. The interest rate is about 9%. Tables 5.1 and 5.2 summarize the results of the calibration exercise. 146 Table 5.2: Initial choice of parameters δH = 1 δH = 0.5 δH = 0 Time preference rate ρ 0.960 Inter. elast. of subst. ψ 1.000 TFP A 8.000 Capital’s share α 0.290 Capital dep. rate δK 1.000 Education elast. η˜ 0.500 Fee parameter f 1.000 Shift parameter D 7.513 6.283 4.784 Young leisure par. γ0 1.178 1.145 1.113 Old leisure par. γ1 0.813 0.799 0.786 5.4 endogenous government consumption As Jones, Manuelli, and Rossi (1993) and Chari and Kehoe (1999, p. 1714) note, government consumption as a frac- tion of output converges to zero as the economy grows if the level of government consumption is held constant. For this reason the present analysis assumes that the share of government consumption is constant by setting At = aFt, where a denotes the share. This assumption ensures that the level of government consumption grows at the common growth rate, which is determined endogenously. As a re- sult, the level of government consumption is endogenous. This section serves to show how this assumption can be rationalized. There are several routes available to endogenize gov- ernment consumption. One way, which is taken by Jones, Manuelli, and Rossi (1993), is to include government con- sumption as a factor in the production function. Then the limiting tax on capital income may be positive. This ap- 147 proach will not be pursued here because it makes the comparability of the implied results to the ones derived in chapter 4 difficult. Another way is to include govern- ment consumption in the individual’s utility function as proposed by Chari and Kehoe (1999, p. 1714). In an infinite horizon model with a single individual it is then easy to see that the level of government consumption is optimally chosen in a way such that it grows at the common endoge- nously determined growth rate. This extension does not affect the results concerning the optimal choice of tax rates. This idea may be directly applied to the present overlap- ping generations model. Suppose the function V(At,At+1) = b A1−ψt 1−ψ + ρb A1−ψt+1 1−ψ is added to the utility function (5.1). b > 0 is a scal- ing constant. Then the Ramsey problem consists of two parts. First, as before, the planner chooses the optimal al- location given a stream of government consumption. Then he chooses the optimal sequence of government consump- tion, i.e., maximizes over the choice of At. The following first-order conditions emerge: bA−ψt = C −ψ 0t (1− L0t − Et) γ0(1−ψ)× ( 1+φ(1−ψ) ( 1− γ0 L0t + Et 1− L0t − Et )) ρbA−ψt = ρC −ψ 1t−1(1− L1t−1) γ1(1−ψ)× ( 1+φ(1−ψ) ( 1− (1− η)γ1 L1t−1 1− L1t−1 )) When one divides these equations by F−ψt , and evaluates them along the balanced growth path at which labor supply 148 is constant, one ends up with the rules that describe the optimal choice of government consumption: b ( At Ft )−ψ = ( C0t Ft )−ψ × ξ1 b ( At Ft )−ψ = ( C1t−1 Ft )−ψ × ξ2 ξ1 and ξ2 capture all remaining terms. Consumption and output grow at the same rate, and the RHS are constant. Hence, the LHS must be constant, which means that govern- ment consumption as a share of output must be constant along the balanced growth path. 5.5 ramsey problem The solution to the Ramsey problem is described by equa- tions (4.29) - (4.33) in section 4.4. The normalized specializa- tions of the equations, evaluated along the balanced growth path, are as follows: (1−α)AKˆα(L0 + L1) −α = γ0 Cˆ0 1− L0 − E ΩL0 ΩC0 , (5.12) (1−α)AKˆα(L0 + L1) −α = γ1 Cˆ1/G 1− L1 ΩL1 ΩC1 , (5.13) R = ρ−1 ( Cˆ1 Cˆ0 )ψ( (1− L0 − E)γ0 (1− L1)γ1 )1−ψ ΩC0 ΩC1 (5.14) = β−1G(E)ψ, (5.15) (1−α)AKˆα(L0 + L1) 1−α − fE = (R−G) f+ (1−α)AKˆα(L0 + L1)−α Dη˜Eη˜−1 − (R−G)φΩ−1C0PMC HC, (5.16) 149 PMCHC = ργ1 L1 1− L1 ( Cˆ1 Cˆ0 )−ψ Cˆ1/G× ( (1− L1)γ1 (1− L0 − E)γ0 )1−ψ E η η ′, (5.17) Cˆ1−ψ0 (1− L0) γ0(1−ψ) ( 1− γ0 L0 1− L0 − E ) + ρCˆ1−ψ1 (1− L1) γ1(1−ψ)× ( 1− (1− η)γ1 L1 1− L1 ) G1−ψ = 0, (5.18) AKˆα ( (L0 + L1) )1−α + (1− δK)Kˆ = Cˆ0 + Cˆ1/G+ fE+GKˆ + Aˆ, (5.19) R = αAKˆα−1(L0 + L1) 1−α + 1− δK, (5.20) G = DEη˜ + 1− δH. (5.21) TheΩ-terms result from the differentiation of the pseudo- welfare function W, see appendix 5.A. Equations (5.12) and (5.13) are the marginal conditions for the individual’s labor- leisure choices when young and old. (5.14) and (5.15) are the intra- and inter-generational Euler equations. (5.16) is the dynamic Euler equation for human capital. This equation and the term PMC defined by (5.17), which is the private marginal cost, will be further discussed below. (5.18) and (5.19) are the implementability and resource constraint, re- spectively. R and G are the rate of return to capital net of deprecation and the growth rate, respectively. This system of equations will be solved for the Ramsey allocation given the parameters as derived in section 5.3. To understand the results presented in Table 5.3, it is helpful to recapitulate the theoretical analysis in chapter 4. Recall (4.36) in section 4.4, which is the dynamic Euler equation for 150 human capital. This equation describes the optimal choice of education in the Ramsey equilibrium: [ FL1t+1L1t + FL0t+1L0t+1) − fEt+1 ] + [ f+ FL0t+1 G ′t+1 − φt+1 αt+1 UC0t+1PMC HC t+1 ] Gt+1 = [ f+ FL0t G ′t − φt αt UC0tPMC HC t ] [FKt+1 + 1− δK]. (5.22) The dynamic Euler equation is the corner stone of the analysis to explore how and why education is optimally subsidized in the Ramsey equilibrium. Because of (4.26) in section 4.4, −ηt L1tUL1t UC0t = ( ϕt +ω0t ) EtHt−1, (5.23) which is derived from the selfish individual’s optimality condition (5.8) for education, PMC denotes the private mar- ginal cost of human capital. It measures how the private cost of education changes as the stock Ht increases: PMCt = − d dHt ( ηt L1tUL1t UC0t ) = − L1tUL1t UC0t 1 Ht Et ηt dηt dEt (5.24) Essentially, PMC is a quantity that captures how the accumulation of human capital is modeled. If the elasticity of the elasticity η is zero, which is the case if human capital fully depreciates, PMC = 0. Otherwise, if dηt/dEt > 0, PMC > 0. The variable ∆t collects the terms related to PMC: ∆t ≡ φt αt UC0tPMC HC t ( FKt+1 + 1− δK ) − 151 φt+1 αt+1 UC0t+1PMC HC t+1Gt+1. (5.25) Then the dynamic Euler equation (5.22) can be further simplified by help of this notation: ∆t = Rt+1 f+ FL0t G ′t − FL1t+1L1t − FL0t+1(L0t+1 + Et+1) − (f+ FL0t+1)Et+1 ( 1 ηt+1 − 1 ) . (5.26) The interpretation of (5.26) is the same as on page 111. ∆t is a wedge between the social benefit and social cost of edu- cation. A positive wedge stands for subsidizing education relative to the first best. Note that the terms on the RHS of (5.26) relate to dif- ferent generations. Whereas the first line is the difference between the social marginal cost and the social marginal benefit of the individual born in period t, the second line is related to the individual born in period t+ 1. Therefore, the second line of (5.26) is the marginal external effect, which is denoted MEB: MEBt+1 = FL0t+1(L0t+1 + Et+1)+ (f+ FL0t+1)Et+1 ( 1 ηt+1 − 1 ) . (5.27) Then (5.26) can be written as: ∆t + MEBt+1 = Rt+1 f+ FL0t G ′t − FL1t+1L1t. (5.28) One can see that two quantities drive a wedge between the social marginal cost and social marginal benefit of edu- cation of the individual born in period t. ∆ is the tax wedge 152 and MEB is the intergenerational externality. Both quanti- ties serve as a justification for the effective subsidization of education. The question now is what is the size of both quantities. Section 4.5 explains the notion of effective subsidization of education. Effective subsidization refers to the extent to which the private rate of return exceeds the social rate of return to education from the perspective of the individual. The private rate of return is the private net present value to education that accrues to the individual over the private cost that has to be borne by him. Similarly, the social rate of return is the social net present value over the social cost. Hence, the effective rate of subsidization is s = PRR− SRR PRR . (5.29) One can show that in the Ramsey equilibrium, evaluated along the balanced growth path, the efficient rate of sub- sidization includes both the tax wedge and the marginal external effect: seff = ∆+ MEB R(f+w)(1− η)/G ′ , (5.30) where ∆ is the tax wedge, evaluated along the balanced growth path: ∆ = φ UC0 WC0 × PMCHC × (R−G). (5.31) φ is the Lagrange multiplier associated with the self- ish individual’s implementability constraint. It as a mea- sure of the non-availability of lump-sum taxes. The frac- tion UC0/WC0 equals one if the individual’s intertemporal 153 elasticity of substitution equals one. PMC is the private marginal cost of human capital as discussed before. The last term R−G is the growth gap, which necessarily must be positive. It again becomes clear that the question of how to optimally subsidize education depends on the tax wedge ∆ and the marginal external effect MEB. This closes the recapitulation of the theoretical analysis that was necessary to appreciate the following results. Table 5.3 presents the results for the three calibrated economies. The first part of the table is structured in the same way as Table 5.1 and summarizes the characteristics of the economy in the Ramsey equilibrium. The second part presents the results related to the distortion of the education decision. Clearly, the growth rate is significantly higher in the Ram- sey equilibrium than in the initial equilibrium. It increases to about 3.4%, which can be attributed to the fact that the individual spends more time on education. Government’s assets are worth 4.4% of output. Labor tax rates decrease slightly and are different for a young and old individual, see Proposition 4.4. As expected, capital income is not taxed, see Proposition 4.1. The subsidy rate increases from 100% to about 188%. To understand this optimal government policy, consider the government’s budget constraint: Aˆ+ τHfE+ RB = τL0wL0 + τ L 1wL1 +GB. (5.32) Note that B is negative. Then the government’s income consists of labor income taxes and the net yield to the asset B which is (R−G)B. Thus, the gap between the rate of re- turn to physical capital net of depreciation and the growth 154 Table 5.3: Results δH = 1 δH = 0.5 δH = 0 Annual growth rate 0.036 0.034 0.031 Debt/output −0.044 −0.044 −0.045 Gov. spend./output 0.400 0.400 0.400 Edu. exp./output 0.035 0.028 0.020 Consumption/output 0.479 0.485 0.493 Investment/output 0.087 0.087 0.087 Young labor tax rate τL0 0.258 0.258 0.253 Old labor tax rate τL1 0.211 0.233 0.252 Capital tax rate τK 0.000 0.000 0.000 Subsidy rate τH 1.722 1.871 2.060 Annual interest rate 0.079 0.077 0.073 Young labor supply L0 0.323 0.348 0.379 Old labor supply L1 0.659 0.646 0.632 Education E 0.152 0.128 0.098 Lagrange multiplier φ 0.184 0.182 0.177 Private marginal cost of human capital PMC 0 0.016 0.038 Return to capital R 9.770 9.186 8.347 Growth rate G 2.925 2.750 2.499 Growth gap R−G 6.845 6.436 5.848 Human capital dist. ∆ 0 0.019 0.039 Human capital dist. MEB 2.129 2.327 2.608 Efficient subsidy rate seff 1.012 0.895 0.792 Efficient subsidy rate without ∆ 1.012 0.887 0.780 rate, called the growth gap, is the interest rate that govern- ment bonds earn in equilibrium. The annualized growth gap amounts to about 10%. The total yield to government bonds is about 28% of output (= (R−G)×B) and therefore does not suffice to finance government consumption, which 155 is 40% of output. The government taxes labor income at rates of about 26% and 23% to finance this level of gov- ernment consumption. As a result, total taxes amount to more than 40% of output. Then the government uses the education subsidy to transfer government revenue back to the individual. The education policy can be described by the efficient rate of subsidization. See section 5 for the details. The ef- ficient rate of subsidization seff accounts for two effects. One is the distortionary taxation effect ∆ as defined by (5.31). The other is the marginal external effect MEB as de- fined by (5.27). If human capital fully depreciates, δH = 1, ∆ = 0. Thus, only the marginal external effect matters and the efficient subsidy rate is positive. The distortionary tax- ation effect ∆ is positive when the private marginal cost of education is positive which is the case when human capital does not fully depreciate. With δH = 0 the distor- tionary taxation effect ∆ slightly increases. The result is that the efficient rate of subsidization seff decreases from 1.012 to 0.792. But as one can see the share of seff that can be attributed to ∆ is small. If one did not take it into consid- eration, the efficient rate of subsidization is only slightly smaller. As a result, most importantly education is effec- tively subsidized to internalize intergenerational effects. To additionally account for the distortionary taxation effect only slightly increases the efficient rate of subsidization. Thus, the theoretical analysis identifies a quantity that also serves to justify the subsidization of education. But as the numerical analysis shows this quantity does not play a big role. 156 A sensitivity analysis shows that this result holds for lower shares of government consumption, for varying elas- ticity parameters η˜ between zero and one, for higher values of the intertemporal elasticity of substitution and for higher fee parameters. Still the marginal external is more dominant than the distortionary taxation effect. The distortionary taxation effect to rationalize the subsi- dization of education has been put forth by Trostel (1996) for the first time. He finds that subsidizing education by making the related cost tax-deductible substantially reduces the adverse effects of distortionary taxation on investing in education. More than 78% of the the cost should be tax- deductible. Beyond this rate, distortionary taxation effects cease to exist. But the analysis differs with regard to various aspects. He sets up a representative-agent model, in which an intergenerational externality does not exist. The human capital production function does not feature constant re- turns to scale with respect to the current stock of human capital and market goods necessary for production. This refers to the perhaps most important modelling aspect that subsidization refers to market goods, not time devoted to education as in the present context. 5.6 discussion and conclusion Chapter 4 studies two effects that serve to justify the subsi- dization of education if individuals are selfish. As is well- known, a subsidy is called for to internalize intergenera- tional external effects of education. The analysis highlighted the negative effects distortionary labor taxation has on ed- ucation. For this reason, education should be subsidized 157 relative to the first best if the elasticity of the human capi- tal investment function is increasing. The present chapter disentangles these two effects and evaluates their magni- tudes. The result is that the case for subsidizing education to account for distortionary labor taxation is rather weak. The still dominant justification for subsidizing education is to internalize intergenerational externalities. This result is robust and holds for a wide range of parameter values. How serious can this result considered to be? Caveats are in order because the timing of the model may be inappro- priate or the human capital investment function may be ill- specified. The model assumes that life may be divided into only two periods, each of which lasts for 30 years. Thus one might object that questions of lifelong learning are not mod- eled. To describe this process of lifelong learning one would have to extend the model to a, say, 60-periods overlapping generation model in the fashion of Auerbach, Kotlikoff, and Skinner (1983) or Erosa and Gervais (2002). Undoubtedly, this further complicates the analysis and more questions arise. As mentioned in the calibration section, it is difficult to imagine something like a human capital investment func- tion that describes the accumulation process of the stock of human capital over a period of 30 years. A first step would be to empirically derive the human capital invest- ment function. Most likely this will be done using annual data. Then the question arises how this information could be aggregated to derive a 30-years human capital invest- ment function. 158 Obviously, there is still a lot to do for econometricians that could try to identify the distortionary taxation effect and assess its significance. 159 5.a first-order derivatives of w The partial derivatives of the pseudo-welfare function W = U+φ ( C0UC0+C1UC1+(L0+E)UL0+(1−η)L1UL1 ) are as follows: WC0 = UC0 +φ ( UC0 +C0UC0C0 + L0UL0C0 + E0UL0C0 ) = UC0ΩC0 with ΩC0 = ( 1+φ(1−ψ) ( 1− γ0 L0 + E 1− L0 − E )) ; WC1 = UC1 +φ ( C1UC1C1 +UC1 + (1− η)UC1L1L1 ) = UC1ΩC1 with ΩC1 = ( 1+φ(1−ψ) ( 1− (1− η)γ1 L1 1− L1 )) ; WL0 = UL0 +φ ( C0UC0L0 + L0UL0L0 +UL0 + EUL0L0 ) = UL0ΩL0 with ΩL0 = ( 1+φ [ 1− ( γ0(1−ψ) − 1 ) L0 + E 1− L0 − E + 1−ψ ]) ; 160 WE = UL0+φ ( C0UC0L0+L0UL0L0+UL0+EUL0L0−η ′L1UL1 ) = WL0 −φη ′L1UL1; WL1 = UL1 +φ ( C1UC1L1 + (1− η)(L1UL1L1 +UL1) ) = UL1L1ΩL1 with ΩL1 = ( 1+φ [ (1−η) ( 1− ( γ1(1−ψ) − 1 ) L1 1− L1 ) +1−ψ ]) . 161 6 C O N C L U S I O N This dissertation provided an analysis of three models of optimal taxation of human capital in Ramsey’s tradition. Chapters 2 and 3 presented models with only a single in- dividual that lived for two periods or until infinity, respec- tively. Chapter 4 presented two overlapping generations model with a selfish or altruistic individual that lived for two periods. The models also differed with respect to how the accumulation of human capital was modelled. The first two models had a human capital production function in which the stock of human capital did not enter as a pro- duction factor. By contrast, the human capital production function in the overlapping generations models featured constant returns to scale with respect to the stock of human capital. The most important results of this dissertation are: • If the single individual only lives for two periods and is endowed with a given stock of human capital, it is second best to tax capital and to subsidize human capital. • If the single individual lives until infinity and the human capital production function does not include the current stock of human capital as a production factor, it is second best in the long run to not tax capital and to tax human capital. 162 • If the single individual is selfish and does not inter- nalize his education decisions, it is second best to effectively subsidize human capital for two reasons: (i) Internalize the external intergenerational effect and (ii) account for distortionary taxation. • A numerical analysis however shows that the justi- fication of subsidizing human capital to account for distortionary taxation is rather weak. The perhaps most interesting issue that further research could focus on is to extend the overlapping generations model of chapter 4 in two directions. First, Erosa and Ger- vais (2002) build a fairly general overlapping generations model of optimal taxation where the individual lives for more than two periods. The authors derive the optimal tax policy and relate their work to the influential paper by Auerbach, Kotlikoff, and Skinner (1983). These authors study the effects of switching from a proportional income tax to either a proportional tax on consumption or a pro- portional tax on labor income. Hence, the analysis focusses on the effect of the switch to a given tax system, which must not be necessarily the optimal one. This exactly is done by Erosa and Gervais (2002), who analyze the switch to the optimal tax system. But it is important to bear in mind that the productivity profile of the individual over the life cycle is given. Both Auerbach, Kotlikoff, and Skinner (1983) and Erosa and Gervais (2002) take the productivity profile from Welch (1979). The contribution then could be to further generalize the model of chapter 4 to a time horizon with, say, 55 years, or, put differently, to add endogenous education to the model by Erosa and Gervais (2002) and 163 thus to endogenize the productivity profile. Then a cali- bration and simulation exercise similar to that of chapter 5 can be executed to assess the relevance of subsidizing education to account for distortionary taxation in relation to internalizing the intergenerational externality. Second, the model studies an individual that is either selfish or altruistic à la Barro (1974) and Becker (1974). These are certainly two polar cases. One can think of other preferences that are somewhat in between where an indi- vidual does care about his descendants to a certain degree. Andreoni (1989) was the first to set up an overlapping generations model in which altruism is not “pure” but “im- pure”. The individual gets a “warm glow” from giving to his immediate descendant. The idea of deriving utility from leaving a bequest to descendants may be applied to the stock of human capital. Glomm and Ravikumar (1992) and Cremer and Pestieau (2006) take up this idea and provide a normative analysis of education policies such as public vs. private provision of education and taxation/subsidization of education, respectively. Which kind of policy advice can be drawn from the present research? The analysis clearly shows that thinking about education efficiency involves calculating the social and private benefits and costs of education over the entire life cycle. One then has to bear in mind that earnings when educated are usually higher than foregone earnings when being educated. Then, if the marginal tax rate is increasing in earnings, the tax treatment of future earnings and fore- gone earnings differ, which may translate into an inequality between the social and private rate of return to education. Moreover, the individual has to pay for direct cost that af- 164 fect the rate of return of education. The government may choose to tax or subsidize these costs to adjust the private rate of return such that it equals the social rate of return. To achieve education efficiency, a first step could be to make the direct cost of education fully tax-deductible from future earnings, which then means that direct costs and future earnings are taxed at the same rate. This issue has been and still is under scrutiny by the Federal Fiscal Court in Germany. As a general rule, under the provisions of section 12 (5) of the German Income Tax Act education related costs are considered privately induced and therefore may not be deductible as earnings-related or special expenses.1 The Federal Fiscal Court however ruled in 2009 that education related costs may be deductible as earnings-related expenses if the taxpayer has already completed vocational education before tertiary education was started. Section 12 (5) of the German Income Tax Code then does not apply (Federal Fiscal Court, Judgment, Ref. VI R 14/07, June 18, 2009). An appeal is currently pending with the Federal Fiscal Court concerning the question whether the aforementioned judgement also applies to taxpayers who have started studying right after having finished school (Federal Fiscal Court, Ref. VI R 7/10).2 1 The crucial difference between earnings-related and special expenses is the following: Earnings-related expenses (section 9 (1) of the German In- come Tax Act) are deducted from earnings and a possibly resulting loss may be carried forward to the following fiscal year under the provisions of section 10d (2) of the German Income Tax Act. Special expenses, e.g. education expenses (section 10 (1) number 7 of the German Income Tax Code) are deducted from positive overall earnings, which is the sum of all net earnings, up to 4,000 Euro. This difference, which is taxable income, cannot become negative. A possibly resulting loss may not be carried forward. Moreover, if overall earnings are negative, special expenses are not tax-deductible at all within the fiscal year in question. 2 The Fiscal Court of Hamburg, that has granted this appeal, however denied the applicability of this judgment (Fiscal Court of Hamburg, Judgment, Ref. 5 K 193/08, November 25, 2009). 165 The Advisory Board to the Federal Ministry of Finance (2010) even goes further than the aforementioned judgment of the Federal Fiscal Court and makes two far-reaching pro- posals to achieve education efficiency, which holds when the social and private rates of return to education are equal. This equality holds when the return and the direct and indi- rect costs of education are taxed at the same rate. First, the Advisory Board suggests to consider education expenses as anticipated earnings-related expenses and to allow for an interest-bearing carry forward. The effect is that direct costs and benefits of education are taxed at the same rate. Second, as has been said before, the government taxes the return to education at a higher rate than foregone earnings if the mar- ginal tax rate increases in taxable income. To compensate the taxpayer it is therefore necessary that the government furthermore bears an additional fraction of the foregone earnings amounting to the difference between the higher tax rate when educated and the lower tax rate when being educated. Based on the assumption that foregone earnings amount to 20,000 Euros per year, which implies a marginal tax rate of 29%, and that initial earnings when educated amount to 45,000 Euro, which implies a marginal tax rate of 41%, the government shall grant a subsidy worth 2,400 Euro (= (0.41− 0.29)× 20, 000) to students. This dissertation contributes to the discussion insofar as it sheds light on the circumstances under which education efficiency shall prevail in a second-best world. In particular, the analysis has demonstrated that there are not too im- plausible circumstances under which an inequality between the private and social rate of return to education is optimal. 166 B I B L I O G R A P H Y Advisory Board to the Federal Ministry of Finance (2010): “Beitragspflichtiger Hochschulzugang,” Federal Ministry of Finance. Ammermüller, A., and A. M. Weber (2005): “Educational Attainment and Returns to Education in Germany: An Analysis by Subject of Degree, Gender and Region,” ZEW Discussion Papers No. 05-17, ZEW - Zentrum für Eu- ropäische Wirtschaftsforschung / Center for European Economic Research. Anderberg, D. (2009): “Optimal policy and the risk prop- erties of human capital reconsidered,” Journal of Public Economics, 93(9-10), 1017–1026. Andersson, F., and K. A. Konrad (2003): “Human capi- tal investment and globalization in extortionary states,” Journal of Public Economics, 87(7-8), 1539–1555. Andreoni, J. (1989): “Giving with Impure Altruism: Appli- cations to Charity and Ricardian Equivalence,” Journal of Political Economy, 97(6), 1447–58. Atkeson, A., V. V. Chari, and P. J. Kehoe (1999): “Tax- ing capital income: a bad idea,” Federal Reserve Bank of Minneapolis Quarterly Review, 23(3), 3–17. Atkinson, A. B., and J. E. Stiglitz (1972): “The structure of indirect taxation and economic efficiency,” Journal of Public Economics, 1(1), 97–119. (1980): Lectures on Public Economics. McGraw-Hill Inc., London, United Kingdom and New York (NY), United States of America. ATUS (2003-2009): “Annual results,” http://www.bls.gov/ tus/. Auerbach, A. J., and L. J. Kotlikoff (1987): Dynamic Fis- cal Policy. Cambridge University Press, New York (NY), United States of America. Auerbach, A. J., L. J. Kotlikoff, and J. Skinner (1983): “The Efficiency Gains from Dynamic Tax Reform,” Inter- national Economic Review, 24(1), 81–100. 167 Barbie, M., and C. Hermeling (2006): “Optimal Taxation in a Simple Model of Human Capital Accumulation,” Economics Bulletin, 5(5), 1–8. Barro, R. J. (1974): “Are Government Bonds Net Wealth?,” Journal of Political Economy, 82(6), 1095–1117. (1991): “Economic Growth in a Cross Section of Countries,” The Quarterly Journal of Economics, 106(2), 407– 43. (1998): Determinants of Economic Growth: A Cross- Country Empirical Study, vol. 1 of MIT Press Books. The MIT Press, Cambridge (MA), United States of America. Barro, R. J., and J.-W. Lee (1994): “Sources of economic growth,” Carnegie-Rochester Conference Series on Public Pol- icy, 40(1), 1–46. Bassanini, A., and S. Scarpetta (2001): “The Driving Forces of Economic Growth: Panel Data Evidence for the OECD Countries,” OECD Economic Studies, 2001(2). Becker, G. S. (1974): “A Theory of Social Interactions,” Journal of Political Economy, 82(6), 1063–93. (1993): Human Capital: A Theoretical and Empirical Analysis with Special Reference to Education. The University of Chicago Press, Chicago (IL), United States of America, 3rd edn. Ben-Porath, Y. (1967): “The Production of Human Capital and the Life Cycle of Earnings,” The Journal of Political Economy, 75(4), 352–365. Benhabib, J., and M. M. Spiegel (1994): “The role of human capital in economic development evidence from aggre- gate cross-country data,” Journal of Monetary Economics, 34(2), 143–173. Blankenau, W. F., and N. B. Simpson (2004): “Public edu- cation expenditures and growth,” Journal of Development Economics, 73(2), 583–605. Boadway, R., N. Marceau, and M. Marchand (1996): “In- vestment in Education and the Time Inconsistency of Redistributive Tax Policy,” Economica, 63(250), 171–89. Boskin, M. J. (1975): “Notes on the Tax Treatment of Human Capital,” NBER Working Papers 0116, National Bureau of Economic Research. 168 Bouzahzah, M., D. De la Croix, and F. Docquier (2002): “Policy reforms and growth in computable OLG economies,” Journal of Economic Dynamics and Control, 26(12), 2093–2113. Bovenberg, A. L., and B. Jacobs (2005): “Redistribution and education subsidies are Siamese twins,” Journal of Public Economics, 89(11-12), 2005–2035. Bowman, M. J. (1964): “Schultz, Denison, and the Contri- bution of “Eds” to National Income Growth,” Journal of Political Economy, 72(5), 450–464. Bringhurst, R. (2002): The Elements of Typographic Style, Version 2.5. Hartley & Marks, Publishers, Point Roberts (WA), United States of America. Bull, N. (1993): “When all the optimal dynamic taxes are zero,” Working Paper Series / Economic Activity Section 137, Board of Governors of the Federal Reserve System (U.S.). Caballe, J., and M. S. Santos (1993): “On Endogenous Growth with Physical and Human Capital,” Journal of Political Economy, 101(6), 1042–67. Cahuc, P., and A. Zylbergerg (2004): Labor Economics. MIT, Cambridge (MA), United States of America. Cameron, S. V., and J. J. Heckman (2001): “The Dynamics of Educational Attainment for Black, Hispanic, and White Males,” Journal of Political Economy, 109(3), 455–499. Card, D. (1999): “The causal effect of education on earn- ings,” in Handbook of Labor Economics, ed. by O. Ashen- felter, and D. Card, vol. 3 of Handbook of Labor Economics, chap. 30, pp. 1801–1863. Elsevier. Carneiro, P., and J. J. Heckman (2002): “The Evidence on Credit Constraints in Post–secondary Schooling,” Eco- nomic Journal, 112(482), 705–734. CDU, C. u. F. (2009): “Wachstum. Bildung. Zusam- menhalt. Der Koalitionsvertrag zwischen CDU, CSU und FDP,” http://www.cdu.de/doc/pdfc/ 091026-koalitionsvertrag-cducsu-fdp.pdf, accessed December 15, 2010. Chamley, C. (1986): “Optimal Taxation of Capital Income in General Equilibrium with Infinite Lives,” Econometrica, 54(3), 607–22. 169 Chari, V. V., L. J. Christiano, and P. J. Kehoe (1994): “Optimal Fiscal Policy in a Business Cycle Model,” Journal of Political Economy, 102(4), 617–52. Chari, V. V., and P. J. Kehoe (1999): “Optimal fiscal and monetary policy,” in Handbook of Macroeconomics, ed. by J. B. Taylor, and M. Woodford, vol. 1 of Handbook of Macro- economics, chap. 26, pp. 1671–1745. Elsevier. Christiano, L. J. (2010): “Lecture Notes on Optimal Fiscal and Monetary Policy,” http://faculty.wcas. northwestern.edu/~lchrist/papers/optimal.pdf, accessed December 9, 2010. Ciriani, S. (2007): “Education policy in a general equi- librium model with heterogeneous agents,” Economics Bulletin, 9(1), 1–7. Cremer, H., and P. Pestieau (2006): “Intergenerational Transfer of Human Capital and Optimal Education Pol- icy,” Journal of Public Economic Theory, 8(4), 529–545. da Costa, C. E., and L. J. Maestri (2007): “The risk prop- erties of human capital and the design of government policies,” European Economic Review, 51(3), 695–713. Davies, J., and J. Whalley (1991): “Taxes and Capital For- mation: How Important is Human Capital?,” in National Saving and Economic Performance, NBER Chapters, pp. 163– 200. National Bureau of Economic Research, Inc. De la Croix, D., and P. Michel (2002): A Theory of Economic Growth, Cambridge Books. Cambridge University Press, New York (NY), United States of America. de la Fuente, A., and R. Doménech (2006): “Human Capi- tal in Growth Regressions: How Much Difference Does Data Quality Make?,” Journal of the European Economic Association, 4(1), 1–36. Denison, E. F. (1962): The Sources of Economic Growth in the United States and the Alternatives before us. Committee for Economic Development. (1967): Why Growth Rates Differ - Postwar Experi- ence in Nine Western Countries. The Brookings Institution, Washington D.C., United States of America. 170 Diamond, P. A. (2007): “Comment on Mikhail Golosov, Aleh Tsyvinski, Iván Werning, New Dynamic Public Finance: A Users Guide,” in NBER Macroeconomics Annual 2006, Vol- ume 21, ed. by K. R. Daron Acemoglu, and M. Woodford, NBER Chapters, chap. 5, pp. 365–379. National Bureau of Economic Research. Diamond, P. A., and J. A. Mirrlees (1971): “Optimal Tax- ation and Public Production I: Production Efficiency,” American Economic Review, 61(1), 8–27. Docquier, F., O. Paddison, and P. Pestieau (2007): “Opti- mal accumulation in an endogenous growth setting with human capital,” Journal of Economic Theory, 134(1), 361– 378. Dupor, B., L. Lochner, C. Taber, and M. B. Wittekind (1996): “Some Effects of Taxes on Schooling and Training,” American Economic Review, 86(2), 340–46. Ellwood, D., and T. Kane (2000): “Who is Getting a College Education?: Family Background and the Growing Gaps in Enrolment,” in Securing the Future: Investing in Children from Birth to College, ed. by S. Danzinger, and J. Waldfogel, chap. 10, pp. 283–324. Russel Sage Foundation, New York (NY), United States of America. Englander, S., and A. Gurney (1994): “Medium-term de- terminants of OECD productivity,” OECD Economic Stud- ies, 22, 49–109. Erosa, A., and M. Gervais (2002): “Optimal Taxation in Life-Cycle Economies,” Journal of Economic Theory, 105(2), 338–369. Fischer, S. (1980): “Dynamic inconsistency, cooperation and the benevolent dissembling government,” Journal of Economic Dynamics and Control, 2(1), 93–107. Gemmell, N. (1996): “Evaluating the Impacts of Human Capital Stocks and Accumulation on Economic Growth: Some New Evidence,” Oxford Bulletin of Economics and Statistics, 58(1), 9–28. Glomm, G., and B. Ravikumar (1992): “Public versus Pri- vate Investment in Human Capital Endogenous Growth and Income Inequality,” Journal of Political Economy, 100(4), 813–34. 171 Golosov, M., N. Kocherlakota, and A. Tsyvinski (2003): “Optimal Indirect and Capital Taxation,” Review of Eco- nomic Studies, 70(3), 569–587. Golosov, M., and A. Tsyvinski (2008): “optimal fiscal and monetary policy (without commitment),” in The New Palgrave Dictionary of Economics, ed. by S. N. Durlauf, and L. E. Blume. Palgrave Macmillan, Basingstoke, United Kingdom. Golosov, M., A. Tszvinski, and I. Werning (2007): “New Dynamic Public Finance: A User’s Guide,” in NBER Macroeconomics Annual 2006, Volume 21, ed. by K. R. Daron Acemoglu, and M. Woodford, NBER Chapters, chap. 5, pp. 317–388. National Bureau of Economic Re- search. Gomme, P., and P. Rupert (2007): “Theory, measurement and calibration of macroeconomic models,” Journal of Monetary Economics, 54(2), 460–497. Grochulski, B., and T. Piskorski (2010): “Risky human capital and deferred capital income taxation,” Journal of Economic Theory, 145(3), 908–943. Hall, R. E., and C. I. Jones (1999): “Why Do Some Coun- tries Produce So Much More Output Per Worker Than Others?,” The Quarterly Journal of Economics, 114(1), 83– 116. Hanushek, E. A., and D. Kim (1995): “Schooling, Labor Force Quality, and Economic Growth,” NBER Working Papers 5399, National Bureau of Economic Research, Inc. Harmon, C., H. Oosterbeek, and I. Walker (2003): “The Returns to Education: Microeconomics,” Journal of Eco- nomic Surveys, 17(2), 115–156. Heckman, J. J., L. Lochner, and C. Taber (1998): “Tax Pol- icy and Human-Capital Formation,” American Economic Review, 88(2), 293–97. (1999): “Human capital formation and general equi- librium treatment effects: a study of tax and tuition pol- icy,” Fiscal Studies, 20(1), 25–40. Jacobs, B. (2005): “Optimal Income Taxation with Endoge- nous Human Capital,” Journal of Public Economic Theory, 7(2), 295–315. 172 Jacobs, B., and A. L. Bovenberg (2010a): “Human Cap- ital and Optimal Positive Taxation of Capital Income,” International Tax and Public Finance, 17(5), 451–478. (2010b): “Optimal Taxation of Human Capital and the Earnings Function,” Journal of Public Economic Theory, forthcoming. Jacobs, B., D. Schindler, and H. Yank (2010): “Optimal Taxation of Risky Human Capital,” Scandinavian Journal of Economics, forthcoming. Jones, L. E., R. E. Manuelli, and P. E. Rossi (1993): “Opti- mal Taxation in Models of Endogenous Growth,” Journal of Political Economy, 101(3), 485–517. (1997): “On the Optimal Taxation of Capital Income,” Journal of Economic Theory, 73(1), 93–117. Jorgenson, D. W., and B. M. Fraumeni (1992): “Investment in Education and U.S. Economic Growth,” Scandinavian Journal of Economics, 94(0), S51–70. Judd, K. L. (1985): “Redistributive taxation in a simple per- fect foresight model,” Journal of Public Economics, 28(1), 59–83. (1999): “Optimal taxation and spending in general competitive growth models,” Journal of Public Economics, 71(1), 1–26. (2007): “Comment on Mikhail Golosov, Aleh Tsyvin- ski, Iván Werning, New Dynamic Public Finance: A Users Guide,” in NBER Macroeconomics Annual 2006, Volume 21, ed. by K. R. Daron Acemoglu, and M. Woodford, NBER Chapters, chap. 5, pp. 381–384. National Bureau of Eco- nomic Research. Judson, R. (1998): “Economic Growth and Investment in Education: How Allocation Matters,” Journal of Economic Growth, 3(4), 337–59. Kane, T. J. (1994): “College Entry by Blacks since 1970: The Role of College Costs, Family Background, and the Returns to Education,” Journal of Political Economy, 102(5), 878–911. King, R. G., C. I. Plosser, and S. T. Rebelo (1988): “Produc- tion, growth and business cycles : I. The basic neoclassical model,” Journal of Monetary Economics, 21(2-3), 195–232. 173 (2002): “Production, Growth and Business Cycles: Technical Appendix,” Computational Economics, 20(1-2), 87–116. King, R. G., and S. Rebelo (1990): “Public Policy and Eco- nomic Growth: Developing Neoclassical Implications,” Journal of Political Economy, 98(5), S126–50. Klenow, P., and A. Rodríguez-Clare (1997): “The Neoclas- sical Revival in Growth Economics: Has It Gone Too Far?,” in NBER Macroeconomics Annual 1997, Volume 12, NBER Chapters, pp. 73–114. National Bureau of Economic Re- search, Inc. Kocherlakota, N. R. (2005): “Zero Expected Wealth Taxes: A Mirrlees Approach to Dynamic Optimal Taxation,” Econometrica, 73(5), 1587–1621. Krebs, T. (2003): “Human Capital Risk And Economic Growth,” The Quarterly Journal of Economics, 118(2), 709– 744. Krueger, A. B., and M. Lindahl (2001): “Education for Growth: Why and for Whom?,” Journal of Economic Litera- ture, 39(4), 1101–1136. Kydland, F. E., and E. C. Prescott (1977): “Rules Rather Than Discretion: The Inconsistency of Optimal Plans,” Journal of Political Economy, 85(3), 473–91. Levine, R., and D. Renelt (1992): “A Sensitivity Analysis of Cross-Country Growth Regressions,” American Economic Review, 82(4), 942–63. Lipsey, R. G., and K. Lancaster (1956 - 1957): “The General Theory of Second Best,” The Review of Economic Studies, 24(1), 11–32. Ljungqvist, L., and T. J. Sargent (2004): Recursive Marcoeco- nomic Theory. MIT Press, Cambridge (MA), United States of America, 2nd edn. Lucas, R. E. J. (1988): “On the mechanics of economic de- velopment,” Journal of Monetary Economics, 22(1), 3–42. (1990): “Supply-Side Economics: An Analytical Re- view,” Oxford Economic Papers, 42(2), 293–316. Lucas, R. E. J., and N. L. Stokey (1983): “Optimal fiscal and monetary policy in an economy without capital,” Journal of Monetary Economics, 12(1), 55–93. 174 Mankiw, N. G., D. Romer, and D. N. Weil (1992): “A Contribution to the Empirics of Economic Growth,” The Quarterly Journal of Economics, 107(2), 407–437. Mendoza, E. G., A. Razin, and L. L. Tesar (1994): “Effective tax rates in macroeconomics: Cross-country estimates of tax rates on factor incomes and consumption,” Journal of Monetary Economics, 34(3), 297–323. Merkel, A. (2009): “Wachstum. Bildung. Zusam- menhalt. Deutschland zu neuer Stärke führen - Regierungserklärung von Bundeskanzlerin An- gela Merkel vor dem Deutschen Bundestag am 10.11.2009,” http://www.bundesregierung.de/ Content/DE/Publikation/Bestellservice/__Anlagen/ 2009-11-26-regierungserklaerung-bkin.html, ac- cessed December 15, 2010. Milesi-Ferretti, G. M., and N. Roubini (1998): “On the taxation of human and physical capital in models of endogenous growth,” Journal of Public Economics, 70(2), 237–254. Mincer, J. A. (1974): Schooling, Experience, and Earnings. Columbia University Press, New York (NY), United States of America. Mirrlees, J. A. (1971): “An Exploration in the Theory of Optimum Income Taxation,” Review of Economic Studies, 38(114), 175–208. (1986): “The theory of optimal taxation,” in Hand- book of Mathematical Economics, ed. by K. J. Arrow, and M. Intriligator, vol. 3 of Handbook of Mathematical Eco- nomics, chap. 24, pp. 1197–1249. Elsevier. Murphy, K. M., A. Shleifer, and R. W. Vishny (1991): “The Allocation of Talent: Implications for Growth,” The Quarterly Journal of Economics, 106(2), 503–30. Myles, G. D. (2007): “Economic Growth and the Role of Taxation,” Report prepared for the OECD, http://www. people.ex.ac.uk/gdmyles/papers/pdfs/OECDfin.pdf, accessed October 25, 2010. (2009a): “Economic Growth and the Role of Taxation-Theory,” OECD Economics Department Work- ing Papers 713, OECD, Economics Department. 175 (2009b): “Economic Growth and the Role of Taxa- tion - Aggregate Data,” OECD Economics Department Working Papers 714, OECD, Economics Department. (2009c): “Economic Growth and the Role of Taxa- tion - Disaggregate Data,” OECD Economics Department Working Papers 715, OECD, Economics Department. Nielsen, S. B., and P. B. Sørensen (1997): “On the optimal- ity of the Nordic system of dual income taxation,” Journal of Public Economics, 63(3), 311–329. Obama, B. (2009): “The President addresses a Joint Session of Congress, February 24, 2009,” http://www.whitehouse.gov/the_press_office/ Remarks-of-President-Barack-Obama-ADDRESS-to- Joint-Session-of-Congress, accessed December 15, 2010. OECD (2009): Education at a Glance. OECD Publications, Paris, France. (2010): Education at a Glance. OECD Publications, Paris, France. Pecorino, P. (1993): “Tax structure and growth in a model with human capital,” Journal of Public Economics, 52(2), 251–271. Psacharopoulos, G. (1973): Returns to Education: An Inter- national Comparison. Elsevier, Jossey Bass, San Francisco (CA), United States of America. (1994): “Returns to investment in education: A global update,” World Development, 22(9), 1325–1343. Ramsey, F. P. (1927): “A Contribution to the Theory of Taxa- tion,” The Economic Journal, 37(145), 47–61. Reis, C. (2007): “Essays on Optimal Taxation,” Ph.D. thesis, Massachusetts Institute of Technology, Department of Economics. Rey, E. D., and M. del Mar Racionero (2002): “Optimal educational choice and redistribution when parental edu- cation matters,” Oxford Economic Papers, 54(3), 435–448. Richter, W. F. (2009): “Taxing Education in Ramsey’s Tra- dition,” Journal of Public Economics, 93(11-12), 1254–1260. 176 Ritter, F. E., and L. J. Schooler (2001): “The learning curve,” in International Encyclopedia of the Social and Be- havioral Sciences, ed. by P. B. W. Kintch, N. Smelser, pp. 8602–8605. Pergamon. Rosen, S. (1976): “A Theory of Life Earnings,” Journal of Political Economy, 84(4), S45–S67. Sandmo, A. (1974): “A Note on the Structure of Optimal Taxation,” American Economic Review, 64(4), 701–06. Schultz, T. W. (1961a): “Investment in Human Capital,” American Economic Review, 51(1), 1–17. (1961b): “Education and Economic Growth,” in Social Forces Influencing American Education, ed. by N. B. Henry, and R. W. Tyler, chap. 3, pp. 46–88. The University of Chicago Press, Chicago (IL), United States of America. Sianesi, B., and J. V. Reenen (2003): “The Returns to Educa- tion: Macroeconomics,” Journal of Economic Surveys, 17(2), 157–200. Solow, R. M. (1957): “Technical Change and the Aggre- gate Production Function,” The Review of Economics and Statistics, 39(3), 312–320. Taber, C. (2002): “Tax Reform and Human Capital Accumu- lation: Evidence from an Empirical General Equilibrium Model of Skill Formation,” Advances in Economic Analysis & Policy, 2(1). Topel, R. (1999): “Labor markets and economic growth,” in Handbook of Labor Economics, ed. by O. Ashenfelter, and D. Card, vol. 3 of Handbook of Labor Economics, chap. 44, pp. 2943–2984. Elsevier. Trostel, P. A. (1993): “The Effect of Taxation on Human Capital,” Journal of Political Economy, 101(2), 327–50. (1996): “Should Education Be Subsidized?,” Public Finance Review, 24(1), 3–24. Uzawa, H. (1965): “Optimum Technical Change in An Ag- gregative Model of Economic Growth,” International Eco- nomic Review, 6(1), 18–31. Welch, F. (1979): “Effects of Cohort Size on Earnings: The Baby Boom Babies’ Financial Bust,” Journal of Political Economy, 87(5), S65–S97. 177 Wigger, B. U. (2002): Public pensions and economic growth. Springer-Verlag, Berlin, Germany. Wigger, B. U., and R. K. von Weizsäcker (2001): “Risk, Re- sources, and Education: Public Versus Private Financing of Higher Education,” IMF Staff Papers, 48(3), 6. Willis, R. J. (1986): “Wage determinants: A survey and reinterpretation of human capital earnings functions,” in Handbook of Labor Economics, ed. by O. Ashenfelter, and R. Layard, vol. 1 of Handbook of Labor Economics, chap. 10, pp. 525–602. Elsevier. Yuen, C.-W. (1991): “Taxation, Human Capital Accumula- tion, and Economic Growth,” Ph.D. thesis, University of Chicago, Department of Economics. 178 colophon This thesis was typeset with LATEX 2ε using Hermann Zapf’s Palatino and Euler type faces (Type 1 PostScript fonts URW Palladio L and FPL were used). The listings are typeset in Bera Mono, originally developed by Bitstream, Inc. as “Bit- stream Vera”. (Type 1 PostScript fonts were made available by Malte Rosenau and Ulrich Dirr.) The typographic style was inspired by Bringhurst’s genius as presented in The Ele- ments of Typographic Style (Bringhurst, 2002). It is available for LATEX via CTAN as “classicthesis”. note: The custom size of the textblock was calculated using the directions given by Mr. Bringhurst (pages 26– 29 and 175/176). 10 pt Palatino needs 133.21 pt for the string “abcdefghijklmnopqrstuvwxyz”. This yields a good line length between 24–26 pc (288–312 pt). Using a “double square textblock” with a 1:2 ratio this results in a textblock of 312:624 pt (which includes the headline in this design). A good alternative would be the “golden section textblock” with a ratio of 1:1.62, here 312:505.44 pt. For comparison, DIV9 of the typearea package results in a line length of 389 pt (32.4 pc), which is by far too long. However, this information will only be of interest for hardcore pseudo-typographers like me. To make your own calculations, use the following com- mands and look up the corresponding lengths in the book: \settowidth{\abcd}{abcdefghijklmnopqrstuvwxyz} \the\abcd\ % prints the value of the length Please see the file classicthesis.sty for some precalcu- lated values for Palatino and Minion. 159.85193pt Final Version as of May 8, 2011 at 10:44. D E C L A R AT I O N Ich versichere, dass ich diese Dissertation selbstständig ver- fasst habe. Bei der Erstellung der Arbeit habe ich mich ausschließlich der angegebenen Hilfsmittel bedient. Die Dissertation ist nicht bereits Gegenstand eines erfolgreich abgeschlossenen Promotions- oder sonstigen Prüfungsver- fahrens gewesen. Dortmund, 28.01.2011 Christoph Braun