MFIN7034 Machine Learning and Artificial Intelligence in Finance

MFIN7034

Problem Set 1 – Factor Model

Version: 2024/02/20

Due Date: 2024/03/05

Feel free to ask about algorithm-related problems. Please learn to debug your code using Google, GitHub, Stackoverflow, and the package manual.

In this  assignment, you will run regressions to understand how  the  factor model explains asset return and construct an investment strategy based on the factor model.

1. Datasets

Monthly Stock Returns:Monthly Stock File (upenn.edu)

Fama-French 5 Factors:Kenneth RFrench - Data Library (dartmouth.edu)

Hou, Xue, Zhang q-Factor:Factors (global-q.org)

Pastor Stambaugh Liquidity Factor:Robert Stambaugh's Home Page (upenn.edu)

Please first understand how these factors are constructed so that you will know what they represent.

2. (Navie) Factor Regression

For each stock, each month, use the data from 60 months ago to the previous month (it is okay if the sample has less than 60 months as long as it can be estimated in OLS), and run regression (“rolling-window regression”):

Ri,t  = a + ∑ βi,j Fj,t  + εi,t

. Ri,t : Stock  i’s return at month  t.

. Fj,t : Factor j’s return at month  t. Please use all factors provided (FF5,HXZ5, PS). However, both FF5 and HXZ5 contain market returns. Therefore, please only use market return in FF5 (“Mkt-RF”) and drop market return in HXZ5 (“R_MKT”).

. a: Constant term of the regression.

. βi,j : Coefficients of factor j  on stock  i.

For each month, you can estimate the  a  and  βi,j. Therefore, iterating the process, you will have a time series of a  and  βi,j   for stock  i. Plot the time series of “Mkt-RF and a ” and explain whatever you find for stock PERMNO=10324.

Rolling window means that, for example, for  a   and β in 201212, you use data from 200801 to 201212 to estimate.

200801

200802

200803

200804

201212

201301

201302

201303

a1, β1

a2, β2

a3, β3

a4, β4

Then plot the time series of  a  and  β .

3. Fama-MacBeth Regression

If you do not know this regression, refer to this wiki: FamaMacBeth regression - Wikipedia. Furthermore, you may search for materials on the Internet, which is relatively abundant.

By running Fama-MacBeth Regression, you can estimate the risk premium of each factor overtime. You will obtain time series of yt,j    for factor  j  (notation in the wiki) by iterating rolling-window regressions. Plot the time series of “Mkt-RF”’s risk premium and explain whatever you find.

4. LASSO Regression

In running naïve factor regression, you can replace the estimation method (OLS) with LASSO (set  a  = 0.001). Also, replace the factors with a 0- to 6 month lagged version. We are still using the previous 60 months (including the current month's) data to do the regression. For example, suppose we simply do CAPM by LASSO with lagged factors. It will be like this:

Ri,t  = a + β0 Mktt−0 + β1 Mktt−1 + β2 Mktt−2 + ⋯ + β6 Mktt−6 + εi,t

The candidate factors are those provided (FF5, HXZ5, and PS). Then for each stock each month, you will have a group of factors that LASSO chooses. Suppose there is 60% of stocks choose  Mktt−2    factor (2-monthly lagged market return,  β2  ≠ 0). That means this factor has good explanatory power to 60% of stocks in this month. Please show the top 5 prevalent factors for each month. Given the limitation of computing power, please simply calculate the results for YYYYMM={200501, 200601, …, 202001}.

You will produce a table like (“_l3” represents factor with a 3-month lag.):

YYYYMM

1st

2nd

3rd

4th

5th

200501

Mkt-RF_l0

AggLiq_l0

AggLiq_l3

LIV_Q_l0

LIV_Q_l1

200601

202001

You can also try various  a  to see how the results will be affected.

5. Mean-Variance Portfolio

Recall lecture note “Lec 4 Multi-Factor Model” page 12. For every year starting 2005, estimate the mean and variance of each stock. Mean and variance is calculated by factor model using data in the previous five years, which will be covered in Lecture 5. Please be reminded that choosing appropriate factors is important in calculating variance.

After compiling the mean and variance, calculate the weight to construct the most efficient mean-variance portfolio (assuming  Y  = 1  and no transaction costs). Please note that you can have long or short stocks in this practice. You will hold this portfolio for the following twelve months and then adjust your portfolio based on the calculation of the next iteration (i.e., rebalancing your portfolio annually). Finally, plot the cumulative returns othis investment strategy and calculate the annualized Sharpe ratio and maximum drawdown.

6. Deliverable

The final report should contain results generated by your program. It would help if you properly visualize them and provide interpretations of the results, for example, explaining why the coefficient of a factor change overtime.

You should submit a ZIP file containing ONE code file (.py or  .ipynb) and ONE analysis report (.pdf).

发表评论

电子邮件地址不会被公开。 必填项已用*标注