## Get All Weeks Linear Regression for Business Statistics Coursera Quiz Answers

### Week 01: Linear Regression for Business Statistics Coursera Quiz Answers

#### Quiz 1: Practice Quiz

Q1. In a regression, the variable of interest is also known as which of the following? Mark all that apply.

[expand title=View Answer]

dependent variable

response variable

[/expand]

Q2. Which one of the following linear equations best represents an explanatory relationship in which hours worked in a year and the number of employees can be used to explain changes in yearly production volume?

[expand title=View Answer] Production Volume = β0 + β1Hours Worked + β2Employees[/expand]

#### Quiz 2: Practice Quiz

Q1. Which of the following statements regarding regression are true? Mark all that apply.

[expand title=View Answer]

Multiple regression uses more than one explanatory variable

Simple regression uses only one explanatory variable

[/expand]

Q2. Now that we have developed our model, we will estimate it using software. Let’s continue the example from the previous lesson, in which our regression equation is Production Volume = β0 + β1Hours Worked + β2Employees.

What is the value of the coefficient β2, rounded to two decimal places?

[expand title=View Answer]142.26[/expand]

#### Quiz 3: Practice Quiz

Q1. Continue with the same example from the previous lesson. The regression equation is Production Volume = β0 + β1Hours Worked + β2Employees with the following estimates:

What is the value of the Y variable (rounded to two decimal points) when all X variables are zero?

[expand title=View Answer] 501.21 [/expand]

Q2. Notice that the value calculated in the previous question is β0 in the regression equation. Does the interpretation of β0 have managerial significance?

[expand title=View Answer] Yes [/expand]

#### Quiz 4: Practice Quiz

Q1. Continue with the same example from the previous lesson. The regression equation is Production Volume = β0 + β1Hours Worked + β2Employees with the following estimates:

A manager wants to estimate the production volume for various numbers of employees and hours worked. Using the regression output, what is the best estimate for production volume if there are 4000 hours worked and 300 employees during a year?

[expand title=View Answer] 52170 [/expand]

Q2. In our regression model, assume a base case of 4000 hours worked and 300 employees for the year.

The manager has the opportunity to change the number of employees and hours worked for the year. Which of the following changes leads to the greatest predicted production volume?

[expand title=View Answer] Hire 20 additional employees and keep total hours the same[/expand]

#### Quiz 5: Practice Quiz

Q1. Which of the following statements is true?

[expand title=View Answer] The true relationship between two variables can usually be determined from regression [/expand]

Q2. An R-square value of 1 indicates which of the following? Mark all that apply.

[expand title=View Answer]

The residuals are zero

The predicted y-values equal the actual values

[/expand]
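The link between zero residuals and an R-square of 1 can be checked with a short calculation. The sketch below uses made-up numbers (the lists `actual`, `perfect`, and `imperfect` are illustrative assumptions, not course data) and computes R-square as 1 minus the ratio of the residual sum of squares to the total sum of squares.

```python
def r_squared(actual, predicted):
    """Return R-square = 1 - SSE/SST for paired actual and predicted values."""
    mean_y = sum(actual) / len(actual)
    sse = sum((y - p) ** 2 for y, p in zip(actual, predicted))  # residual sum of squares
    sst = sum((y - mean_y) ** 2 for y in actual)                # total sum of squares
    return 1 - sse / sst

# Hypothetical data: in the first case every prediction equals the actual value.
actual = [10.0, 12.0, 15.0, 19.0]
perfect = list(actual)
imperfect = [11.0, 12.0, 14.0, 19.0]

r2_perfect = r_squared(actual, perfect)      # all residuals are zero
r2_imperfect = r_squared(actual, imperfect)  # nonzero residuals pull R-square below 1

print(r2_perfect, round(r2_imperfect, 4))
```

When every residual is zero the residual sum of squares vanishes, so the ratio is zero and R-square is exactly 1.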

#### Quiz 6: Practice Quiz

Q1. The residuals from a regression follow a ___ distribution centered around ___.

[expand title=View Answer]normal; 0 [/expand]

Q2. The expression (b0 – β0)/Sb0 follows a t distribution with n-k-1 degrees of freedom. What is Sb0?

[expand title=View Answer]the standard error of b0 [/expand]

#### Quiz 7: Regression Analysis: An Introduction

Q1. Download Grocery Store Sales, which provides data in the following categories: Sales per Square Foot, Size of Store (in Square Feet), Advertising Dollars (in thousands), and Number of Products Offered in Store, from a sample size of 70 grocery stores.

We want to see how changes in our independent variables affect Sales per Square Foot.

Please run one multiple regression including all independent variables to estimate the coefficients for each of our independent variables.

What is the coefficient for the Size of the Store? Please round to three decimal places.

[expand title=View Answer] The numeric value is not given here. Run the multiple regression on the Grocery Store Sales data and read the coefficient for Size of Store from the output.[/expand]

Q2. What is the coefficient for Advertising Dollars, rounded to three decimal places?

[expand title=View Answer] The numeric value is not given here. Read the coefficient for Advertising Dollars from the same regression output. [/expand]

Q3. Based on the sign of the coefficient for the Number of Products in Store, how will changes in the Number of Products likely increase or decrease the Sales per Square Foot?

[expand title=View Answer]As the Number of Products increases, the Sales per Square Foot will increase. [/expand]

Q4. What is the Sales per Square Foot if all of our X variables are zero (in $) ? Please round to one decimal place.

[expand title=View Answer] If all independent variables are zero, the Sales per Square Foot equals the intercept (β0). The numeric value is not given here; read the intercept from the regression output.[/expand]

Q5. What would be the expected Sales per Square Foot if the Size of Store was 60,000 square feet, they spent $70,000 in Advertising Dollars, and offered 30,000 products (in $) ? Please round to two decimal places.

[expand title=View Answer]The numeric value is not given here. Substitute the stated values into the estimated regression equation to obtain the predicted Sales per Square Foot. Because Advertising Dollars is recorded in thousands, $70,000 is entered as 70. [/expand]

Q6. R square helps explain the goodness of fit of the model. What is the R square for this regression model? Round to two decimal places

[expand title=View Answer] R-square is the proportion of the variance in the dependent variable that is explained by the independent variables in the model. The numeric value is not given here; read it from the regression output. [/expand]

Q7. How might one improve the goodness of fit for this model? Select all that apply.

[expand title=View Answer]

Consider that the relationship between the independent and dependent variables may not be linear.

Remove some of the sample data at random.

Include additional variables.

[/expand]

Q8. What are some assumptions made about errors in a regression equation?

[expand title=View Answer]Errors are normally distributed with a mean of zero. [/expand]

Q9. What is the residual degrees of freedom for the regression model?

[expand title=View Answer]The residual degrees of freedom equal n – k – 1, where n is the sample size and k is the number of independent variables. With n = 70 stores and k = 3 independent variables, the residual degrees of freedom are 70 – 3 – 1 = 66.[/expand]

Q10. In utilizing notations, what are the primary differences in a regression model between b and β?

[expand title=View Answer]

1. The value of b is normally distributed around the actual value of β.

2. The true value of β is never known.

[/expand]

### Week 2 Quiz Answers

#### Quiz 1: Practice Quiz

Q1. From the video, the estimated coefficient produced from the regression for promotional expenditures is 1802.61 with a standard error of 392.85. However, the manager believes that the true value is 2000. To test this claim, we decide to run a hypothesis test. Which of the following is the correct calculation for the t-statistic?

[expand title=View Answer] (1802.61 – 2000) / 392.85[/expand]

Q2. Now that we have the t-statistic, we then calculate the value for t-cutoff. From the video, the t-cutoff is +/- 2.086. Do we reject the null hypothesis?

[expand title=View Answer] We do not reject the null hypothesis because the t-statistic lies outside the rejection region [/expand]
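The decision can be reproduced from the figures quoted in the two questions above. This is a minimal sketch in plain Python; the variable names are illustrative, and the interval check assumes the ±2.086 cutoff is the two-tail 5% cutoff, so the interval is a 95% confidence interval.

```python
# Values quoted in the questions above.
estimate = 1802.61      # estimated coefficient for promotional expenditures
std_error = 392.85      # standard error of that coefficient
claimed_value = 2000.0  # value stated in the null hypothesis
t_cutoff = 2.086        # two-tail cutoff quoted from the video

t_stat = (estimate - claimed_value) / std_error

# Reject only if the t-statistic falls beyond the cutoff in either tail.
reject_null = abs(t_stat) > t_cutoff

# Equivalent check: is the claimed value inside estimate ± cutoff × standard error?
ci_lower = estimate - t_cutoff * std_error
ci_upper = estimate + t_cutoff * std_error
claim_inside_ci = ci_lower <= claimed_value <= ci_upper

print(f"t = {t_stat:.4f}, reject = {reject_null}")
print(f"CI = [{ci_lower:.2f}, {ci_upper:.2f}], claim inside = {claim_inside_ci}")
```

The t-statistic is about -0.50, well inside ±2.086, and the claimed value of 2000 sits inside the interval. Both checks lead to the same conclusion.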

#### Quiz 2: Practice Quiz

Q1. In a one-tail test, the rejection region contains a probability of ___. In a two-tail test, each rejection region contains a probability of ___.

[expand title=View Answer] α; α/2[/expand]

Q2. We will continue with the hypothesis test on the coefficient for promotional expenditures. The estimated coefficient is 1802.61 with a standard error of 392.85, and the claim is that the true value is 2000. The residual degrees of freedom obtained in the regression output is 20.

What is the p-value for this hypothesis test?

[expand title=View Answer]0.62 [/expand]
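For readers without a spreadsheet at hand, the p-value can be approximated using only the Python standard library. The sketch below integrates the Student's t density numerically with Simpson's rule. The helper names `t_pdf` and `two_tail_p_value` are made up for this illustration, and a statistics package or a spreadsheet two-tail t function would normally be used instead.

```python
import math

def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    coeff = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coeff * (1 + x * x / df) ** (-(df + 1) / 2)

def two_tail_p_value(t_stat, df, steps=2000):
    """Two-tail p-value: 1 minus the area between -|t| and +|t| (Simpson's rule)."""
    upper = abs(t_stat)
    if upper == 0:
        return 1.0
    h = upper / steps  # steps must be even for Simpson's rule
    total = t_pdf(0.0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    central_area = 2 * (h / 3) * total  # the density is symmetric around zero
    return 1 - central_area

t_stat = (1802.61 - 2000) / 392.85
p_value = two_tail_p_value(t_stat, df=20)
print(round(p_value, 2))
```

Because the p-value is far larger than a typical α of 0.05, the test does not reject the claim that the true coefficient is 2000.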

#### Quiz 3: Practice Quiz

Q1. Review the video again to find the 95% confidence interval for the coefficient for promotional expenditures. What can we conclude about the claim that the true value of the coefficient is 2000?

[expand title=View Answer]Because 2000 lies inside the 95% confidence interval (1802.61 ± 2.086 × 392.85, roughly 983 to 2622), we cannot reject the null hypothesis. [/expand]

Q2. The p-value provided by Excel for each coefficient corresponds to the hypothesis test as to whether each coefficient is zero. Suppose the p-value for a coefficient is greater than our α value. What can we conclude about the estimated coefficient?

[expand title=View Answer] It is not significant because we cannot reject the claim that the true value is zero. [/expand]

#### Quiz 4: Practice Quiz

Q1. Refer to the regression output from the video lesson. The coefficient for the annual income is 0.4891. What is an appropriate interpretation for this value? Mark all that apply.

[expand title=View Answer]

1. For every dollar increase in income, the home price increases by 0.4891 dollars, all other variables remaining the same.

2. For every 1000 dollar increase in income, the home price increases by 489.1 dollars, all other variables remaining the same.

[/expand]

Q2. What does the estimated value of 0.4891 tell us about the true value of the coefficient?

[expand title=View Answer] The true value could be greater than or less than 0.4891.[/expand]

#### Quiz 5: Practice Quiz

Q1. True or false: the R-square value indicates the proportion of total sum of squares explained by the regression.

[expand title=View Answer] True[/expand]

Q2. When explanatory variables are added to a regression, the R-square value ___ increases whereas the adjusted R-square value ___ increases.

[expand title=View Answer] always; sometimes[/expand]

#### Quiz 6: Practice Quiz

Q1. Which of the following could be appropriate categorical variables?

[expand title=View Answer]

profession

eye color

gender

[/expand]

Q2. A categorical variable that has five different categories requires ___ dummy variables.

[expand title=View Answer] A categorical variable with five different categories requires four dummy variables. [/expand]
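The k – 1 rule can be seen by encoding a small column by hand. The sketch below is illustrative only: the function `dummy_encode`, the region names, and the choice of reference category are assumptions made for the example.

```python
def dummy_encode(values, reference):
    """Encode a categorical column as k-1 dummy columns, omitting the reference."""
    categories = sorted(set(values))
    dummy_names = [c for c in categories if c != reference]
    rows = [{name: int(v == name) for name in dummy_names} for v in values]
    return dummy_names, rows

# Hypothetical column with five distinct categories.
regions = ["North", "South", "East", "West", "Central", "East", "North"]
names, rows = dummy_encode(regions, reference="Central")

print(names)    # four dummy columns for five categories
print(rows[4])  # the reference category is the row of all zeros
```

The omitted category needs no column of its own because it is identified by all dummies being zero; adding a fifth dummy would make the columns perfectly collinear with the intercept.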

### Week 3 Quiz Answers

#### Quiz 1: Practice Quiz

Q1. Refer to the example shown in the video; the region is represented by two separate dummy variables, REGA and REGB, such that region C is the reference category. Which combination of values for REGA and REGB are valid? Select all that apply.

[expand title=View Answer]

REGA = 0; REGB = 0

REGA = 1; REGB = 0

REGA = 0; REGB = 1

[/expand]

Q2. Continue with the same example. To denote that delivery is made to region C, what should the values of REGA and REGB be?

[expand title=View Answer] REGA = 0; REGB = 0[/expand]

#### Quiz 2: Practice Quiz

Q1. Refer to the regression from the video. Which of the following regions can be used as the reference category? Select all that apply.

[expand title=View Answer]

Region A

Region B

Region C

[/expand]

Q2. Suppose we choose region A as the reference category. We run the regression and obtain the following equation:

Minutes = β0 + β1REGB + β2REGC + β3Parcels + β4TruckAge.

What does β2 represent?

[expand title=View Answer]The difference between the fixed time to deliver to Region C versus the fixed time to deliver to Region A [/expand]

#### Quiz 3: Practice Quiz

Q1. Refer to the regression from the video with the following estimated equation:

Minutes = -34.76 + 107.71*REGA + 1.21*REGB + 9.92*Parcels + 3.68*TruckAge.

Approximately how long does it take to deliver 50 parcels to Region A using a truck that is 5 years old? Round your answer to the lowest integer.

[expand title=View Answer]587 [/expand]
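The figure follows directly from substituting the values into the estimated equation. A minimal sketch, where the function name `predicted_minutes` is an illustrative choice and the coefficients are the ones quoted in the question:

```python
import math

def predicted_minutes(rega, regb, parcels, truck_age):
    """Estimated delivery time from the regression equation quoted above."""
    return -34.76 + 107.71 * rega + 1.21 * regb + 9.92 * parcels + 3.68 * truck_age

# Region A delivery: REGA = 1, REGB = 0, 50 parcels, 5-year-old truck.
minutes = predicted_minutes(rega=1, regb=0, parcels=50, truck_age=5)

print(round(minutes, 2))    # unrounded prediction
print(math.floor(minutes))  # rounded down, as the question asks
```

The unrounded prediction is about 587.35 minutes, which rounds down to 587.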

Q2. Suppose the truck driver is on a tight schedule and wants to reduce the time of delivery by at least 100 minutes. Which of the following changes made to the delivery in question 1 would accomplish this goal? Select all that apply.

[expand title=View Answer]

Deliver 30 parcels instead of 50 parcels to region A

Deliver the same number of parcels to Region C instead of Region A

[/expand]
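Each proposed change can be checked against the 100-minute target by recomputing the prediction. The sketch below reuses the equation quoted in Question 1; the scenario labels and the function name are illustrative.

```python
def predicted_minutes(rega, regb, parcels, truck_age):
    """Estimated delivery time from the regression equation in Question 1."""
    return -34.76 + 107.71 * rega + 1.21 * regb + 9.92 * parcels + 3.68 * truck_age

# Base case: 50 parcels to Region A with a 5-year-old truck.
base = predicted_minutes(1, 0, 50, 5)

scenarios = {
    "30 parcels instead of 50": predicted_minutes(1, 0, 30, 5),
    "Region C instead of Region A": predicted_minutes(0, 0, 50, 5),
    "brand new truck": predicted_minutes(1, 0, 50, 0),
}

savings = {name: base - minutes for name, minutes in scenarios.items()}
for name, saved in savings.items():
    print(f"{name}: saves {saved:.2f} minutes, meets target = {saved >= 100}")
```

Cutting 20 parcels saves about 198 minutes and switching to Region C saves about 108 minutes, whereas a brand new truck saves only about 18 minutes.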

#### Quiz 4: Practice Quiz

Q1. Refer to the video lesson. When the first regression using CoolSize is changed to the second regression using RefSize, why must the column for CoolSize be moved to the far right?

[expand title=View Answer] CoolSize is moved to the end strictly for aesthetic purposes.[/expand]

Q2. Refer to the regression using FreezeSize and RefSize. When interpreting a unit increase in the coefficient for FreezeSize, we assume that all other variables remain the same. What does this imply about the change in CoolSize (hint: CoolSize is not a variable in this regression, but can be derived from the values of FreezeSize and RefSize)?

[expand title=View Answer]The change in CoolSize cannot be inferred [/expand]

#### Quiz 5: Practice Quiz

Q1. Refer to the regressions on refrigerator price. How many dollars does the price of the refrigerator increase by when the freezer size increases by 1 cubic foot and the cooler size remains the same?

[expand title=View Answer] 76.50[/expand]

Q2. How many dollars does the price of the refrigerator increase by when the freezer size increases by 1 cubic foot and the cooler size decreases by 1 cubic foot?

[expand title=View Answer]137.38 [/expand]

#### Quiz 6: Practice Quiz

Q1. Which of the following is true regarding a regression with a high level of multicollinearity? Select all that apply.

[expand title=View Answer]

1. The regression cannot be used to interpret the impact of coefficients accurately

2. The regression might still be able to predict the dependent variable accurately

[/expand]

Q2. Which of the following pairs of explanatory variables likely has the highest amount of correlation?

[expand title=View Answer] length of right foot and length of left foot of a person [/expand]

### Week 4 Quiz Answers

#### Quiz 1: Practice Quiz

Q1. Please select all that ‘Mean-centering of variables’ does for a regression model.

[expand title=View Answer]

It allows the intercept to be interpreted more meaningfully.

[/expand]

Q2. One could center variables at a value other than the mean. True or False?

[expand title=View Answer] True [/expand]

Q3. Mean-centering the Y variable helps in the interpretation of the intercept in the regression model. True or False?

[expand title=View Answer]False [/expand]
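The effect of mean-centering an X variable can be seen on a tiny example. The data below are invented for illustration and `fit_line` is a hand-written least-squares helper for one explanatory variable, not a function from the course.

```python
def fit_line(xs, ys):
    """Least-squares intercept and slope for a single explanatory variable."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope

# Hypothetical data.
xs = [2.0, 4.0, 6.0, 8.0]
ys = [5.0, 9.0, 12.0, 18.0]
mean_x = sum(xs) / len(xs)
centered_xs = [x - mean_x for x in xs]

b0_raw, b1_raw = fit_line(xs, ys)
b0_centered, b1_centered = fit_line(centered_xs, ys)

print(b0_raw, b1_raw)            # intercept is the prediction at x = 0
print(b0_centered, b1_centered)  # intercept is the prediction at the average x
```

The slope is the same in both fits, so the fitted values do not change. Only the intercept moves: after centering it equals the predicted Y at the average X, which here is the mean of Y.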

#### Quiz 2: Practice Quiz

Q1. Please choose all that apply.

[expand title=View Answer]

1. The formula for the confidence interval for a predicted value uses the ‘standard error’ of regression produced below the adjusted R-squared.

2. The confidence interval for the predicted value is a way of incorporating uncertainty in our prediction.

[/expand]

Q2. What is the correct formula for the margin of error for constructing a 95% confidence interval for the predicted value?

[expand title=View Answer] |T.INV(0.025,residual df)|*std error of regression [/expand]

#### Quiz 3: Practice Quiz

Q1. Which of the following are true in regard to interaction variables? Please select all that apply.

[expand title=View Answer]

1. Interaction variables are created by multiplying the variables.

2. Interaction variables allow you to study the impact of one variable at different levels of another variable.

[/expand]

Q2. Following is a regression equation developed using salary data for employees at a company:

Salary = β0 + β1Male + β2Age + β3Male*Age

Salary is measured in dollars. Age is measured in years and Male is a dummy variable representing the categorical variable Gender.

What is the interpretation of β2 ? Please mark the most appropriate answer.

[expand title=View Answer] It is the change in salary with each additional year of age for a female employee (Male = 0). For a male employee the corresponding change is β2 + β3. [/expand]

#### Quiz 4: Practice Quiz

Q1. When creating an interaction variable, one of the variables has to be a dummy variable. Is this statement True or False?

[expand title=View Answer] False [/expand]

Q2. Following is a regression equation relating salary to gender and years of experience.

Salary = β0 + β1Male + β2Years_of_Experience + β3Male*Years_of_Experience

Salary is measured in dollars. Years_of_Experience is measured in years and Male is a dummy variable representing the categorical variable Gender.

What is the interpretation of β3? Please mark the most appropriate answer.

[expand title=View Answer] It is the ‘extra’ change in salary with each additional year of experience for a male employee as compared to a female employee. [/expand]
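A quick numeric check makes the role of the interaction coefficient concrete. The coefficient values below are invented purely for illustration; only the form of the equation comes from the question.

```python
def salary(male, years, b0, b1, b2, b3):
    """Salary = b0 + b1*Male + b2*Years + b3*Male*Years."""
    return b0 + b1 * male + b2 * years + b3 * male * years

# Hypothetical coefficients, chosen only to illustrate the interpretation.
coeffs = dict(b0=40000.0, b1=2000.0, b2=1500.0, b3=300.0)

# Change in predicted salary for one more year of experience, by gender.
female_slope = salary(0, 11, **coeffs) - salary(0, 10, **coeffs)
male_slope = salary(1, 11, **coeffs) - salary(1, 10, **coeffs)
extra_for_male = male_slope - female_slope

print(female_slope, male_slope, extra_for_male)
```

With these numbers an extra year is worth β2 = 1500 for a female employee and β2 + β3 = 1800 for a male employee, so the difference between the two slopes is exactly β3.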

#### Quiz 5: Practice Quiz

Q1. Please select all statements that apply.

[expand title=View Answer]

Transforming variables in a regression may improve the R-square of the model.

Natural log transformation is a common transformation used in regression.

There are transformations other than the natural log that can be used in the regression.

Transforming variables in a regression may improve the linearity of the model.

[/expand]

Q2. In the following regression model, what is the correct interpretation of β1?

LN(Y) = β0 + β1X1 + β2X2

Please select all that apply.

[expand title=View Answer]

For every unit increase in X1, the natural log of the Y variable increases by β1 units, all other variables are kept at the same level.

[/expand]
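The interpretation of a coefficient when only Y is logged can be verified numerically. In this sketch the coefficients and the X values are hypothetical, and the second explanatory variable is dropped for brevity since it is held constant anyway.

```python
import math

# Hypothetical coefficients for LN(Y) = b0 + b1*X1 (other variables held fixed).
b0, b1 = 1.0, 0.05

def y(x1):
    """Y implied by the log-linear model."""
    return math.exp(b0 + b1 * x1)

# Increase X1 by one unit, from 10 to 11.
ln_change = math.log(y(11)) - math.log(y(10))  # change in LN(Y)
pct_change = (y(11) / y(10) - 1) * 100         # percentage change in Y itself

print(round(ln_change, 4), round(pct_change, 3))
```

A one-unit increase in X1 raises LN(Y) by exactly β1 units. In terms of Y itself this is a change of 100 × (exp(β1) – 1) percent, which is close to 100 × β1 percent when β1 is small.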

Q3. In the following regression model, what is the correct interpretation of β2?

LN(Y) = β0 + β1ln(X1) + β2ln(X2)

Please select all that apply.

[expand title=View Answer]

For every 1% increase in X2, the Y variable increases by β2%, all other variables kept at the same level.

[/expand]

#### Quiz 6: Practice Quiz

Q1. Which of the following is the right function to calculate the natural log in Excel?

[expand title=View Answer] =LN( ) [/expand]

Q2. The coefficients in a log-log model can directly be interpreted as:

[expand title=View Answer] Elasticities [/expand]
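The elasticity reading of a log-log coefficient can be checked with a short calculation. The coefficients and the starting value of X2 below are hypothetical, and the other explanatory variable is omitted because it is held constant.

```python
import math

# Hypothetical coefficients for LN(Y) = b0 + b2*LN(X2) (other variables held fixed).
b0, b2 = 2.0, 0.8

def y(x2):
    """Y implied by the log-log model."""
    return math.exp(b0 + b2 * math.log(x2))

# Increase X2 by 1%, from 100 to 101.
pct_change_y = (y(101.0) / y(100.0) - 1) * 100

print(round(pct_change_y, 3))
```

A 1% increase in X2 changes Y by very nearly β2 percent (about 0.8% here), which is why the coefficient is read as an elasticity.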

Q3. Which of the following are reasons to take a natural log transformation of variables in a regression model?

Select all that apply.

[expand title=View Answer] To interpret the beta coefficients directly as elasticities or growth rates. [/expand]

#### Quiz 7: Regression Analysis: Various Extensions

Q1. Data for Questions 1 through 5 are contained in the file realestate.xlsx. Please download this file.

The data contains information about apartment prices and characteristics for a sought-after area in a large metropolitan city in the USA. The data include sale price (PRICE) in $, floor area (SQFT) in square feet, number of bedrooms (BED), number of bathrooms (BATH), number of floors in the building (FLOORS), and distance from a centrally located city park (DIST) in meters.

You need to establish a relationship between PRICE and these other characteristics. Specifically, estimate the following regression model,

LN(PRICE) = β0 + β1LN(SQFT) + β2BED + β3BATH + β4FLOORS + β5DIST

Notice that in the regression you need to take a log transformation of the PRICE and SQFT variables. Report the estimated value of β4, and round the answer to four decimal digits.

[expand title=View Answer] 0.0001[/expand]

Q2. How do you interpret the coefficient estimate of β1 ?

[expand title=View Answer] When the size of the apartment increases by 1%, then the Price increases by 1.013%, all other variables remaining at the same level. [/expand]

Q3. What is the impact of an additional Bathroom on apartment price?

[expand title=View Answer] All other variables being held constant, an additional Bathroom raises the apartment price by approximately 2.93% (the coefficient on BATH is about 0.0293, and PRICE enters the model in logs).[/expand]

Q4. Using the estimated regression model, predict the price in dollars of an apartment that is 1000 sqft in size, has 2 Bedrooms, 2 Bathrooms, is in a building with 8 Floors, and is 1.2 Km from the City Park. Round your answer to a whole number, and input the answer without any “$” or “,” sign.

[expand title=View Answer] 440032 [/expand]

Q5. Calculate a 95% confidence interval for your predicted price from Question 4.

Report the lower limit of the confidence interval (in dollars), and round your answer to a whole number. Input the answer without any “$” or “,” sign.

[expand title=View Answer] 313916 [/expand]

Q6. Data for Questions 6 through 11 is contained in the file Majors.xlsx. Please download this file.

The data contains information about the starting salary of a sample of 50 undergraduate students at a Business school. The data consists of the starting salary (SALARY) in dollars and the field of study of the student (MAJOR), which is either ‘Finance’ or ‘International Business’. Finally, the variable UGPA is the undergraduate Grade Point Average of the student.

Estimate a regression model linking starting salary to the field of study and UGPA as follows,

SALARY = β0 + β1IB + β2UGPA

In the above regression, IB is a dummy variable that takes a value =1 when the MAJOR is IB, otherwise, it takes a value 0.

Report the estimated value of β1, and round the answer to a whole number.

[expand title=View Answer] 11495[/expand]

Q7. Now, mean-center the UGPA variable. That is, subtract the mean value of UGPA from all the data points. Denote this mean-centered variable as [UGPA].

Run a regression as follows,

SALARY = β0 + β1IB + β2[UGPA]

Round the estimated value of β0 to a whole number and interpret it. Please mark all that apply.

[expand title=View Answer]

1. $60,630 is the salary of a FINANCE Major with a UGPA equal to the average UGPA observed in the data.

2. $60,630 is the value of the Y variable when all X variables are zero.

[/expand]

Q8. Based on the regression carried out in Question 7, how much less salary (in dollars) does an IB Major get as compared to a FINANCE Major, when they have the same UGPAs? Round your answer to a whole number. Input the answer without any “$” or “,” sign.

[expand title=View Answer] 10412 [/expand]

Q9. There is a belief among students that a higher UGPA is more important in terms of impacting the starting salary for IB undergraduates as compared to FINANCE undergraduates.

You can empirically check for this belief by introducing an interaction variable in your regression model constructed in Question 7 and then checking the estimated coefficient for that variable.

[expand title=View Answer] IB and [UGPA][/expand]

Q10. Introduce an interaction effect in your data and estimate the model. Report the estimate of the coefficient on the interaction variable. Please round your answer to a whole number.

[expand title=View Answer]1215 [/expand]

Q11. How do you interpret the coefficient on the interaction effect?

[expand title=View Answer]The coefficient is the differential impact of UGPA on the starting salary of FINANCE majors as compared to IB majors.[/expand]
