Python and Statistics for Financial Analysis Coursera Quiz Answers

All Weeks Python and Statistics for Financial Analysis Coursera Quiz Answers

Python and Statistics for Financial Analysis Coursera Quiz Answers

Week 1: Python and Statistics for Financial Analysis

Q1. Which of the following library has DataFrame object?

  • Pandas
  • Numpy
  • Matplotlib
  • Statsmodels

Q2. Which of the following is the correct way to import a library, eg Pandas?

1.pandas import
1.include
1.pandas
1.import pandas as pd

Q3. What is the method of DataFrame object to import a csv file?

  • import_csv()
  • from_csv()
  • read_csv()
  • csv()

Q4. Which of the following attributes of a DataFrame return a list of column names of this
DataFrame?

  • columns
  • shape
  • dtype
  • column

Q5. Which of the following can slice ‘Close’ from ‘2015-01-01’ to ‘2016-12-31’ from data,
which is a DataFrame object?

1.data.loc[‘2015-01-01’:’2016-12-31’, ‘Close’]
1.data.iloc[‘2015-01-01’:’2016-12-31’, ‘Close’]

Q6. What is the method of DataFrame to plot a line chart?

  • scatter()
  • plot()
  • plot_graph()
  • axhline()

Q7. Suppose you have a DataFrame – data, which
contains columns ‘Open’, ‘High’, ‘Low’, ‘Close’, ‘Adj Close’ and ‘Volume’.

What
does data[[‘Open’, ‘Low’]] return?

  • All
    columns of data except ‘Open’ and ‘High’
  • No results are shown
  • Columns ‘Open’ and ‘Low’
  • The first
    row of data which contains only columns ‘Open’ and ‘High’

Q8. Suppose you have a DataFrame ms , which contains the daily data of ‘Open’, ‘High’, ‘Low’, ‘Close’, ‘Adj
Close’ and ‘Volume’ of Microsoft’s stock.

Which of
the following syntax calculates the Price difference, (ie ‘Close’ of tomorrow –
‘Close’ of today)?

1. ms[‘Close’].shift(1) – ms[‘Close’].shift(1)
1.ms[‘Close’].shift(-1) – ms[‘Close’].shift(-1)
1.ms[‘Close’].shift(1) – ms[‘Close’]
1.ms[‘Close’].shift(-1) – ms[‘Close’]

Q9. Suppose you have a DataFrame – ms , which contains the daily data of ‘Open’, ‘High’, ‘Low’, ‘Close’, ‘Adj
Close’ and ‘Volumn’ of Microsoft’s stock.

What is
the method of DataFrame to calculate the 60 days moving average?

  • rolling().mean(60)
  • moving_average(60)
  • rolling(60).mean()
  • rolling(60).median()

Q10. Which of the following idea(s) is/are correct to the simple trading strategy that we introduced in the lecture video?

  • Use longer
    moving average as slow signal and shorter moving average as fast signal
  • We short
    one share of stocks if fast signal is larger than slow signal
  • If fast
    signal is larger than slow signal, this indicates an upward trend at the
    current moment

Week 2: Python and Statistics for Financial Analysis

Q1. Roll two dice and X is the sum of faces values. If we roll them 5 times and get 2,3,4,5,6

Which of the following is/are true about X?

  • The mean of X is 4.
  • X can only take values 2,3,4,5,6
  • X is a
    random variable

Q2. Roll two dice and X is the sum of faces values. If we roll them 5 times and get 2,3,4,5,6

What do we know about X?

  • The dice is fair.
  • Range of X is 6-2=4
  • The most likely value of X is 6
  • We have 5 observations of X

Q3. Roll two dice and X is the sum of faces values. If we roll them 5 times and get 2,3,4,5,6

X is a __ random variable.

  • discrete
  • continuous
  • None of
    the above

Q4. Why do we
use relative frequency instead of frequency?

  • Relative
    frequency is easier to compute
  • Frequency
    cannot show the number of appearance of outcomes
  • Relative frequency can be used to compare the
    ratio of values between difference collections with difference number of values
  • Relative
    frequency is easier to compute when the number of observations increases

Q5. What can
we say about relative frequency when we have large number of trials?

  • Relative frequency becomes approximately the
    distribution of the corresponding random variable
  • The
    relative frequency of each possible outcome will be the same
  • The relative
    frequency stays constant after a very large number of trials, eg. n=10000
  • None of the above

Q6. What is the notion of “95% Value at Risk” ?

  • 95% Value
    at Risk is 95% quantile
  • 95% VaR
    measures how much you can lose at most
  • 95% VaR
    measures how much you can win at most
  • 95% VaR measures the amount of investment you can
    lose, at the worst 5% scenario

Q7. In the lecture video, we mentioned the calculation of continuous random variable is based on the probability density function.

Given a
probability density function, f(x) = 1/100, what is the probability
P(10<X<20), where X~Uniform[0, 100]?

  • f(20) –
    f(10)
  • f(10)
  • f(20)
  • (20- 10) * 1/100

Q8. What
methods should we use to get the
cdf and pdf of normal distribution?

  • norm.cdf() and norm.pdf() from scipy.stats
  • cdf() and
    pdf() form numpy
  • cdf() and
    pdf() from pandas
  • norm.cdf()
    and norm.pdf() from statsmodels

Q9. Which additional library should we import when we want to calculate log daily return specifically?

  • Pandas
  • Numpy
  • Statsmodels
  • Matplotlib

Q10. What
is the distribution of stock returns suggested by Fama and French in general?

  • A perfect normal distribution
  • Close to normal distribution but with fat tail
  • Arbitrary distribution
  • Left-skewed distribution

Week 3: Python and Statistics for Financial Analysis

Q1. What is true
about sample and population?

  • Population
    can always be directly observed
  • Parameters
    from population is always the same as statistics from sample
  • Sample is a subset of population which is
    randomly draw from population
  • The size
    of population is always finite

Q2. You have a
DataFrame called ‘data’ which has only one column ‘population’.

data = pd.DataFrame()

data[‘population’] = [47, 48, 85, 20, 19, 13, 72, 16, 50, 60]
How to draw sample with sample size =5, from a
‘population’ with replacement?

(Hint: You can modify the code illustrated in the Jupyter Notebook “Population and Sample” after Lecture 3.1)

1.data[‘population’].sample(5, replace=False)
1.data[‘population’].sample(10)
1.data[‘population’].sample(5)
1.data[‘population’].sample(5, replace=True)

Q3. Why is the
degrees of freedom n-1 in sample variance?

  • The degrees of freedom in sample variance is
    constrained by the sample mean
  • None of
    the above
  • The extreme value in the sample is removed for fair analysis
  • Only n-1
    values in the sample is useful

Q4. What does Central Limit Theorem tell you about the distribution of sample mean?

  • The
    distribution of sample mean follows normal distribution only if the population
    distribution is normal
  • The
    distribution of sample mean follows normal distribution with any sample size
    only if the population distribution is normal
  • The distribution of sample mean follows normal
    distribution with very large sample size follows normal distribution regardless
    of the population distribution
  • The
    distribution of sample mean with large sample size follows chi-square
    distribution regardless of the population distribution

Q5. Suppose we have 3 independent normal random variables X1, X2 and X3:

What is the distribution of X1 + X2 + X3?

  • Remains the same even X1, X2 and X3 are added up
  • Mean and variance of X1, X2 and X3 are added up
  • Mean remains unchanged; variances are added up.
  • Mean remains unchanged; variance takes 3 square root.

Q6. Why do we
need to standardize sample mean when making inference?

  • Sample
    mean becomes normally distributed after standardization
  • Sample
    mean becomes population mean after standardization
  • The standardized distribution of sample mean follows N(0,1) which is easier to make inference
  • None of the above

Q7. What can a 95%
confidence interval of daily return of an investment tell you?

  • With 95% chance
    your daily return falls into this interval
  • With 95% chance this interval will cover the mean of
    daily return
  • With 5% chance your
    daily return falls into this interval
  • None of the above

Q8. Check the
Juypter notebook of 3.3 Sample and Inference. What is the confidence interval of this exercise?

  • [0.000015603,
    0.001656]
  • [-0.000015603, 0.001656]
  • [-0.0001690,
    0.001471]
  • [0.0001690,
    0.001471]

Q9. When do you reject a null hypothesis with alternative hypothesis μ>0 with significance level α?

  • p value is larger than α
  • p value is smaller than α
  • z < z_(1-α)
  • z > z_(1-α)

Q10. When doing analysis of stock return, you notice that with 95% confidence interval, the upper bound and lower bound are negative.

Base on this data, what can you tell about this stock?

  • There is 95% chance of which the mean return of this stock is negative
  • We must lose money by investing in this stock
  • There is only 5% chance of which the mean return of this stock is negative

Week 4: Python and Statistics for Financial Analysis

Q1. Why do you use
coefficient of correlation, instead of covariance, when calculating the association between two random variables?

  • None of the above
  • Covariance is not
    suitable to use when the underlying distribution is not normal
  • Covariance cannot
    address nonlinear relationship but coefficient of correlation can address
    nonlinear relationship
  • Covariance can be affected by the variance of
    individual variables, but coefficient of correlation is rescaled by variance of
    both variables

Q2. What is the range
and interpretation of coefficient of correlation?

  • From 0 to 1, 0
    means perfect negative linear relationship and 1 means perfect positive linear
    relationship
  • From 0 to 100, 0
    means perfect positive linear relationship and 100 means perfect negative
    linear relationship
  • From 0 to 100, 0
    means perfect negative linear relationship and 100 means perfect positive
    linear relationship
  • From -1 to 1, -1 means perfect negative linear
    relationship and 1 means perfect positive linear relationship

Q3. Refer to the https://www.coursera.org/learn/python-statistics-financial-analysis/notebook/F0Luf/simple-linear-regression-model

Is LSTAT a significant predictor of MEDV at significance level 0.05?

  • Yes, because the coefficient b_1 is not zero
  • Yes, because the p value of b_1 is larger than 0.05
  • No, because the coefficient b_1 is negative
  • Yes, because the p value of b_1 is smaller than 0.05

Q4. To evaluate the performance of linear regression model, we refer to the summary of “model” as seen in https://www.coursera.org/learn/python-statistics-financial-analysis/notebook/F0Luf/simple-linear-regression-model

What is the percentage of variation explained by the model?

  • 0.95
  • 0.54
  • 0.46
  • 0.829

Q5. How to check if a
linear regression model violates the independence assumption?

  • Draw residual
    versus predictor plot
  • Draw scatter plot
    of predictor versus target
  • Durbin Watson test
  • QQ plot

Q6. If any of the
assumptions of linear regression model are violated, we cannot use this model
to make prediction.

  • True
  • False

Q7. Check the Jupyter Notebook 4.4- Build the trading model by yourself!

We have a variable ‘formula’ which store the names of predictors
and target. How should you modify this ‘formula’ if you want to drop the
predictor ‘daxi’?

1.formula = ‘spy~aord+cac40+nikkei+dji+daxi’
1.formula = ‘spy~aord+cac40+nikkei+dji-daxi’
1.formula = ‘spy~aord+cac40+nikkei+dji’
1.formula = ‘spy~ -daxi’

Q8. Check the Jupyter Notebook 4.4- Build the trading model by yourself!

What is the most
significance predictor for ‘SPY’?

  • nikkei
  • dji
  • arod
  • cac40

Q9. What does it mean
if you have a strategy with maximum drawdown of 3%?

  • During the
    trading period, the minimum you lose is 3%
  • During the
    trading period, the maximum gain from the previous peak of your portfolio value
    is 3%
  • During the trading period, the maximum lose of
    your portfolio is 3%
  • During the trading period, the maximum drop from the
    previous peak of your portfolio value is 3%

Q10. How can you check
the consistency of your trading strategy?

  • Check if the
    return of your strategy is positive using all historical data you have
  • Define some
    metric for evaluating your strategy, eg Sharpe Ratio, maximum drawdown, and
    check if your strategy can generate positive return using all historical data
    you have
  • Define some metric for evaluating your strategy, eg
    Sharpe Ratio, maximum drawdown. Then split your data into train set and test
    set and check if your strategy can generate positive return using both train
    set and test set
  • There is no way to to check the consistency

Get All Course Quiz Answers of Entrepreneurship Specialization

Entrepreneurship 1: Developing the Opportunity Quiz Answers

Entrepreneurship 2: Launching your Start-Up Quiz Answers

Entrepreneurship 3: Growth Strategies Coursera Quiz Answers

Entrepreneurship 4: Financing and Profitability Quiz Answers

Share your love

Newsletter Updates

Enter your email address below and subscribe to our newsletter

Leave a Reply

Your email address will not be published. Required fields are marked *