上册答案(伍德里奇计量经济导论第三版课后答案)

上册答案(伍德里奇计量经济导论第三版课后答案)CHAPTER 1 SOLUTIONS TO PROBLEMS 1.1 (i) Ideally, we could randomly assign students to classes of different sizes. That is, each student is assigned a different class size without regard to any student characteristics such as ability and family background. ...

CHAPTER 1 SOLUTIONS TO PROBLEMS 1.1 (i) Ideally, we could randomly assign students to classes of different sizes. That is, each student is assigned a different class size without regard to any student characteristics such as ability and family background. For reasons we will see in Chapter 2, we would like substantial variation in class sizes (subject, of course, to ethical considerations and resource constraints). (ii) A negative correlation means that larger class size is associated with lower performance. We might find a negative correlation because larger class size actually hurts performance. However, with observational data, there are other reasons we might find a negative relationship. For example, children from more affluent families might be more likely to attend schools with smaller class sizes, and affluent children generally score better on standardized tests. Another possibility is that, within a school, a principal might assign the better students to smaller classes. Or, some parents might insist their children are in the smaller classes, and these same parents tend to be more involved in their children’s education. (iii) Given the potential for confounding factors – some of which are listed in (ii) – finding a negative correlation would not be strong evidence that smaller class sizes actually lead to better performance. Some way of controlling for the confounding factors is needed, and this is the subject of multiple regression analysis. 1.2 (i) Here is one way to pose the question: If two firms, say A and B, are identical in all respects except that firm A supplies job training one hour per worker more than firm B, by how much would firm A’s output differ from firm B’s? (ii) Firms are likely to choose job training depending on the characteristics of workers. Some observed characteristics are years of schooling, years in the workforce, and experience in a particular job. Firms might even discriminate based on age, gender, or race. Perhaps firms choose to offer training to more or less able workers, where “ability” might be difficult to quantify but where a manager has some idea about the relative abilities of different employees. Moreover, different kinds of workers might be attracted to firms that offer more job training on average, and this might not be evident to employers. (iii) The amount of capital and technology available to workers would also affect output. So, two firms with exactly the same kinds of employees would generally have different outputs if they use different amounts of capital or technology. The quality of managers would also have an effect. (iv) No, unless the amount of training is randomly assigned. The many factors listed in parts (ii) and (iii) can contribute to finding a positive correlation between output and training even if job training does not improve worker productivity. 1.3 It does not make sense to pose the question in terms of causality. Economists would assume that students choose a mix of studying and working (and other activities, such as attending class, leisure, and sleeping) based on rational behavior, such as maximizing utility subject to the constraint that there are only 168 hours in a week. We can then use statistical methods to measure the association between studying and working, including regression analysis that we cover starting in Chapter 2. But we would not be claiming that one variable “causes” the other. They are both choice variables of the student. CHAPTER 2 SOLUTIONS TO PROBLEMS 2.1 (i) Income, age, and family background (such as number of siblings) are just a few possibilities. It seems that each of these could be correlated with years of education. (Income and education are probably positively correlated; age and education may be negatively correlated because women in more recent cohorts have, on average, more education; and number of siblings and education are probably negatively correlated.) (ii) Not if the factors we listed in part (i) are correlated with educ. Because we would like to hold these factors fixed, they are part of the error term. But if u is correlated with educ then E(u|educ) ( 0, and so SLR.4 fails. 2.2 In the equation y = (0 + (1x + u, add and subtract (0 from the right hand side to get y = ((0 + (0) + (1x + (u ( (0). Call the new error e = u ( (0, so that E(e) = 0. The new intercept is (0 + (0, but the slope is still (1. 2.3 (i) Let yi = GPAi, xi = ACTi, and n = 8. Then = 25.875, = 3.2125, (xi – )(yi – ) = 5.8125, and (xi – )2 = 56.875. From equation (2.9), we obtain the slope as = 5.8125/56.875 .1022, rounded to four places after the decimal. From (2.17), = – EMBED Equation.DSMT4 3.2125 – (.1022)25.875 .5681. So we can write = .5681 + .1022 ACT n = 8. The intercept does not have a useful interpretation because ACT is not close to zero for the population of interest. If ACT is 5 points higher, increases by .1022(5) = .511. (ii) The fitted values and residuals — rounded to four decimal places — are given along with the observation number i and GPA in the following table: i GPA 1 2.8 2.7143 .0857 2 3.4 3.0209 .3791 3 3.0 3.2253 –.2253 4 3.5 3.3275 .1725 5 3.6 3.5319 .0681 6 3.0 3.1231 –.1231 7 2.7 3.1231 –.4231 8 3.7 3.6341 .0659 You can verify that the residuals, as reported in the table, sum to (.0002, which is pretty close to zero given the inherent rounding error. (iii) When ACT = 20, = .5681 + .1022(20) 2.61. (iv) The sum of squared residuals, , is about .4347 (rounded to four decimal places), and the total sum of squares, (yi – )2, is about 1.0288. So the R-squared from the regression is R2 = 1 – SSR/SST 1 – (.4347/1.0288) .577. Therefore, about 57.7% of the variation in GPA is explained by ACT in this small sample of students. 2.4 (i) When cigs = 0, predicted birth weight is 119.77 ounces. When cigs = 20, = 109.49. This is about an 8.6% drop. (ii) Not necessarily. There are many other factors that can affect birth weight, particularly overall health of the mother and quality of prenatal care. These could be correlated with cigarette smoking during birth. Also, something such as caffeine consumption can affect birth weight, and might also be correlated with cigarette smoking. (iii) If we want a predicted bwght of 125, then cigs = (125 – 119.77)/( –.524) –10.18, or about –10 cigarettes! This is nonsense, of course, and it shows what happens when we are trying to predict something as complicated as birth weight with only a single explanatory variable. The largest predicted birth weight is necessarily 119.77. Yet almost 700 of the births in the sample had a birth weight higher than 119.77. (iv) 1,176 out of 1,388 women did not smoke while pregnant, or about 84.7%. Because we are using only cigs to explain birth weight, we have only one predicted birth weight at cigs = 0. The predicted birth weight is necessarily roughly in the middle of the observed birth weights at cigs = 0, and so we will under predict high birth rates. 2.5 (i) The intercept implies that when inc = 0, cons is predicted to be negative $124.84. This, of course, cannot be true, and reflects that fact that this consumption function might be a poor predictor of consumption at very low-income levels. On the other hand, on an annual basis, $124.84 is not so far from zero. (ii) Just plug 30,000 into the equation: = –124.84 + .853(30,000) = 25,465.16 dollars. (iii) The MPC and the APC are shown in the following graph. Even though the intercept is negative, the smallest APC in the sample is positive. The graph starts at an annual income level of $1,000 (in 1970 dollars). 2.6 (i) Yes. If living closer to an incinerator depresses housing prices, then being farther away increases housing prices. (ii) If the city chose to locate the incinerator in an area away from more expensive neighborhoods, then log(dist) is positively correlated with housing quality. This would violate SLR.4, and OLS estimation is biased. (iii) Size of the house, number of bathrooms, size of the lot, age of the home, and quality of the neighborhood (including school quality), are just a handful of factors. As mentioned in part (ii), these could certainly be correlated with dist [and log(dist)]. 2.7 (i) When we condition on inc in computing an expectation, becomes a constant. So E(u|inc) = E( EMBED Equation.DSMT4 e|inc) = EMBED Equation.DSMT4 E(e|inc) = EMBED Equation.DSMT4 0 because E(e|inc) = E(e) = 0. (ii) Again, when we condition on inc in computing a variance, becomes a constant. So Var(u|inc) = Var( EMBED Equation.DSMT4 e|inc) = ( )2Var(e|inc) = inc because Var(e|inc) = . (iii) Families with low incomes do not have much discretion about spending; typically, a low-income family must spend on food, clothing, housing, and other necessities. Higher income people have more discretion, and some might choose more consumption while others more saving. This discretion suggests wider variability in saving among higher income families. 2.8 (i) From equation (2.66), = / . Plugging in yi = (0 + (1xi + ui gives = / . After standard algebra, the numerator can be written as . Putting this over the denominator shows we can write as = (0 / + (1 + / . Conditional on the xi, we have E( ) = (0 / + (1 because E(ui) = 0 for all i. Therefore, the bias in is given by the first term in this equation. This bias is obviously zero when (0 = 0. It is also zero when = 0, which is the same as = 0. In the latter case, regression through the origin is identical to regression with an intercept. (ii) From the last expression for in part (i) we have, conditional on the xi, Var( ) = Var = EMBED Equation.DSMT4 = EMBED Equation.DSMT4 = / . (iii) From (2.57), Var( ) = 2/ . From the hint, ( , and so Var( ) ( Var( ). A more direct way to see this is to write = , which is less than unless = 0. (iv) For a given sample size, the bias in increases as increases (holding the sum of the fixed). But as increases, the variance of increases relative to Var( ). The bias in is also small when is small. Therefore, whether we prefer or on a mean squared error basis depends on the sizes of , , and n (in addition to the size of ). 2.9 (i) We follow the hint, noting that = (the sample average of is c1 times the sample average of yi) and = . When we regress c1yi on c2xi (including an intercept) we use equation (2.19) to obtain the slope: From (2.17), we obtain the intercept as = (c1 ) – (c2 ) = (c1 ) – [(c1/c2) ](c2 ) = c1( – EMBED Equation.DSMT4 ) = c1 ) because the intercept from regressing yi on xi is ( – EMBED Equation.DSMT4 ). (ii) We use the same approach from part (i) along with the fact that = c1 + and = c2 + . Therefore, = (c1 + yi) – (c1 + ) = yi – and (c2 + xi) – = xi – . So c1 and c2 entirely drop out of the slope formula for the regression of (c1 + yi) on (c2 + xi), and = . The intercept is = – EMBED Equation.DSMT4 = (c1 + ) – (c2 + ) = ( ) + c1 – c2 = + c1 – c2 , which is what we wanted to show. (iii) We can simply apply part (ii) because . In other words, replace c1 with log(c1), yi with log(yi), and set c2 = 0. (iv) Again, we can apply part (ii) with c1 = 0 and replacing c2 with log(c2) and xi with log(xi). If are the original intercept and slope, then and . 2.10 (i) This derivation is essentially done in equation (2.52), once is brought inside the summation (which is valid because does not depend on i). Then, just define . (ii) Because we show that the latter is zero. But, from part (i), EMBED Equation.DSMT4 Because the are pairwise uncorrelated (they are independent), (because ). Therefore, (iii) The formula for the OLS intercept is and, plugging in gives (iv) Because are uncorrelated, , which is what we wanted to show. (v) Using the hint and substitution gives 2.11 (i) We would want to randomly assign the number of hours in the preparation course so that hours is independent of other factors that affect performance on the SAT. Then, we would collect information on SAT score for each student in the experiment, yielding a data set , where n is the number of students we can afford to have in the study. From equation (2.7), we should try to get as much variation in as is feasible. (ii) Here are three factors: innate ability, family income, and general health on the day of the exam. If we think students with higher native intelligence think they do not need to prepare for the SAT, then ability and hours will be negatively correlated. Family income would probably be positively correlated with hours, because higher income families can more easily afford preparation courses. Ruling out chronic health problems, health on the day of the exam should be roughly uncorrelated with hours spent in a preparation course. (iii) If preparation courses are effective, should be positive: other factors equal, an increase in hours should increase sat. (iv) The intercept, , has a useful interpretation in this example: because E(u) = 0, is the average SAT score for students in the population with hours = 0. CHAPTER 3 SOLUTIONS TO PROBLEMS 3.1 (i) hsperc is defined so that the smaller it is, the lower the student’s standing in high school. Everything else equal, the worse the student’s standing in high school, the lower is his/her expected college GPA. (ii) Just plug these values into the equation: = 1.392 ( .0135(20) + .00148(1050) = 2.676. (iii) The difference between A and B is simply 140 times the coefficient on sat, because hsperc is the same for both students. So A is predicted to have a score .00148(140) .207 higher. (iv) With hsperc fixed, = .00148(sat. Now, we want to find (sat such that = .5, so .5 = .00148((sat) or (sat = .5/(.00148) 338. Perhaps not surprisingly, a large ceteris paribus difference in SAT score – almost two and one-half standard deviations – is needed to obtain a predicted difference in college GPA or a half a point. 3.2 (i) Yes. Because of budget constraints, it makes sense that, the more siblings there are in a family, the less education any one child in the family has. To find the increase in the number of siblings that reduces predicted education by one year, we solve 1 = .094((sibs), so (sibs = 1/.094 10.6. (ii) Holding sibs and feduc fixed, one more year of mother’s education implies .131 years more of predicted education. So if a mother has four more years of education, her son is predicted to have about a half a year (.524) more years of education. (iii) Since the number of siblings is the same, but meduc and feduc are both different, the coefficients on meduc and feduc both need to be accounted for. The predicted difference in education between B and A is .131(4) + .210(4) = 1.364. 3.3 (i) If adults trade off sleep for work, more work implies less sleep (other things equal), so < 0. (ii) The signs of and are not obvious, at least to me. One could argue that more educated people like to get more out of life, and so, other things equal, they sleep less ( < 0). The relationship between sleeping and age is more complicated than this model suggests, and economists are not in the best position to judge such things. (iii) Since totwrk is in minutes, we must convert five hours into minutes: (totwrk = 5(60) = 300. Then sleep is predicted to fall by .148(300) = 44.4 minutes. For a week, 45 minutes less sleep is not an overwhelming change. (iv) More education implies less predicted time sleeping, but the effect is quite small. If we assume the difference between college and high school is four years, the college graduate sleeps about 45 minutes less per week, other things equal. (v) Not surprisingly, the three explanatory variables explain only about 11.3% of the variation in sleep. One important factor in the error term is general health. Another is marital status, and whether the person has children. Health (however we measure that), marital status, and number and ages of children would generally be correlated with totwrk. (For example, less healthy people would tend to work less.) 3.4 (i) A larger rank for a law school means that the school has less prestige; this lowers starting salaries. For example, a rank of 100 means there are 99 schools thought to be better. (ii) > 0, > 0. Both LSAT and GPA are measures of the quality of the entering class. No matter where better students attend law school, we expect them to earn more, on average. , > 0. The number of volumes in the law library and the tuition cost are both measures of the school quality. (Cost is less obvious than library volumes, but should reflect quality of the faculty, physical plant, and so on.) (iii) This is just the coefficient on GPA, multiplied by 100: 24.8%. (iv) This is an elasticity: a one percent increase in library volumes implies a .095% increase in predicted median starting salary, other things equal. (v) It is definitely better to attend a law school with a lower rank. If law school A has a ranking 20 less than law school B, the predicted difference in starting salary is 100(.0033)(20) = 6.6% higher for law school A. 3.5 (i) No. By definition, study + sleep + work + leisure = 168. Therefore, if we change study, we must change at least one of the other categories so that the sum is still 168. (ii) From part (i), we can write, say, study as a perfect linear function of the other independent variables: study = 168 ( sleep ( work ( leisure. This holds for every observation, so MLR.3 violated. (iii) Simply drop one of the independent variables, say leisure: GPA = + study + sleep + work + u. Now, for example, is interpreted as the change in GPA when study increases by one hour, where sleep, work, and u are all held fixed. If we are holding sleep and work fixed but increasing study by one hour, then we must be reducing leisure by one hour. The other slope parameters have a similar interpretation. 3.6 Conditioning on the outcomes of the explanatory variables, we have = E( + ) = E( ) + E( ) = (1 + (2 = . 3.7 Only (ii), omitting an important variable, can cause bias, and this is true only when the omitted variable is correlated with the included explanatory variables. The homoskedasticity assumption, MLR.5, played no role in showing that the OLS estimators are unbiased. (Homoskedasticity was used to obtain the usual variance formulas for the .) Further, the degree of collinearity between the explanatory variables in the sample, even if it is reflected in a correlation as high as .95, does not affect the Gauss-Markov assumptions. Only if there is a perfect linear relationship among two or more explanatory variables is MLR.3 violated. 3.8 We can use Table 3.2. By definition, > 0, and by assumption, Corr(x1,x2) < 0. Therefore, there is a negative bias in : E( ) < . This means that, on average across different random samples, the simple regression estimator underestimates the effect of the training program. It is even possible that E( ) is negative even though > 0. 3.9 (i) < 0 because more pollution can be expected to lower housing values; note that is the elasticity of price with respect to nox. is probably positive because rooms roughly measures the size of a house. (However, it does not allow us to distinguish homes where each room is large from homes where each room is small.) (ii) If we assume that rooms increases with quality of the home, then log(nox) and rooms are negatively correlated when poorer neighborhoods have more pollution, something that is often true. We can use Table 3.2 to determine the direction of the bias. If > 0 and Corr(x1,x2) <

                    本文档为【上册答案(伍德里奇计量经济导论第三版课后答案)】，请使用软件OFFICE或WPS软件打开。作品中的文字与图均可以修改和编辑，
                    图片更改请在作品中右键图片并更换，文字修改请直接点击文字进行修改，也可以新增和删除文档中的内容。 
 该文档来自用户分享，如有侵权行为请发邮件ishare@vip.sina.com联系网站客服，我们会及时删除。

                    [版权声明] 本站所有资料为用户分享产生，若发现您的权利被侵害，请联系客服邮件isharekefu@iask.cn，我们尽快处理。

                    本作品所展示的图片、画像、字体、音乐的版权可能需版权方额外授权，请谨慎使用。

                    网站提供的党政主题相关内容(国旗、国徽、党徽..)目的在于配合国家政策宣传，仅限个人学习分享使用，禁止用于任何广告和商用目的。
                

下载需要：免费已有0 人下载

立即下载

上册答案(伍德里奇计量经济导论第三版课后答案)

你可能还喜欢