作者 | 徐罡 |
姓名汉语拼音 | Xu Gang |
学号 | 2020000005053 |
培养单位 | 兰州财经大学 |
电话 | 15911076680 |
电子邮件 | xg6680@outlook.com |
入学年份 | 2020-9 |
学位类别 | 专业硕士 |
培养级别 | 硕士研究生 |
一级学科名称 | 金融 |
学科代码 | 0251 |
授予学位 | 金融硕士 |
第一导师姓名 | 杨世峰 |
第一导师姓名汉语拼音 | Yang Shifeng |
第一导师单位 | 兰州财经大学 |
第一导师职称 | 教授 |
题名 | 基于GA-XGBoost的量化投资策略研究 |
英文题名 | Research on Quantitative Investment Strategy Based on Genetic Algorithm and Extreme Gradient Boosting |
关键词 | 量化投资 机器学习 XGBoost 遗传算法 |
外文关键词 | Genetic Algorithm ; Machine learning ; Quantitative investment |
摘要 | 随着我国A股市场的不断完善与发展,传统的投资策略如基本面研究、技术面研究等已经很难使投资者在市场上获得超额收益。近年来,量化投资的概念进入人们视野,量化投资基金的整体规模逐渐增大,市场中量化交易的占比也越来越高。在未来这种新型投资方式将成为市场上主流的交易方式,同时也会成为金融机构争夺客户资源的主要工具。因此,研究量化投资策略对于我国量化投资行业的发展格外重要。 本文将遗传算法引入XGBoost模型的超参数调优过程,提高了模型的调参效率和预测准确率,在对策略进行历史数据回测后,获得了超过沪深300指数的收益。在选股模型的构建过程中,本文采用设置标签的方式对股票进行分类,设置得分评价方案对预测结果进行排序,同时使用滚动训练的方式来保证模型训练时使用到的是最新数据。在对遗传算法的设计上,使用F1-score作为算法的适应度函数。 通过因子重要性分析,本文发现在所使用的31个技术指标之中,波动率类指标如归一化的波动幅度均值(NATR)的分类表现最强,其次是重叠类指标如双指数移动平均线(DEMA)、考夫曼的自适应移动平均线(KAMA),周期类指标、成交量类指标等其他指标的分类效果不明显。 |
英文摘要 | With the continuous improvement and development of China's A-share market, traditional investment strategies such as fundamental research and technical research have made it difficult for investors to obtain excess returns in the market. In recent years, the overall scale of quantitative investment funds has gradually increased, and the proportion of quantitative trading in the market has also increased. In the future, quantitative investment this new investment method will become the mainstream trading method in the market, and become the main tool for financial institutions to compete for customer resources. Therefore, the study of new quantitative investment strategies is particularly important for the development of quantitative investment in China. In this paper, for the first time in the field of financial quantification, the genetic algorithm is introduced into the hyperparameter tuning process of the XGBoost model, the stock selection model based on the XGBoost algorithm is optimized, and the return exceeding the CSI 300 index is finally obtained through backtesting of historical data. Compared with the traditional quantitative stock selection model, our model can significantly improve the computational efficiency. At the same time, due to the selection of suitable parameters, the prediction accuracy of the model and the return rate of backtesting of historical data have been improved to varying degrees. This paper also describes the construction process of the stock selection model in detail, and explains the setting of various aspects of the model. We use the method of setting labels to classify stocks, set score evaluation scheme to sort the prediction results, and select the rolling training method to ensure that the data used in model training is relatively new. In the design of genetic algorithm, F1-score is used as the fitness function of the algorithm. Through factor importance analysis, we found that among the 31 technical indicators used, volatility indicators such as the normalized mean volatility (NATR) performed the strongest in the classification task, followed by overlapping indicators such as double-exponential moving average (DEMA), Kaufman's adaptive moving average (KAMA), and other indicators such as cycle indicators, volume indicators have less obvious classification effect. |
学位类型 | 硕士 |
答辩日期 | 2023-06-03 |
学位授予地点 | 甘肃省兰州市 |
研究方向 | 金融投资与理财实务 |
语种 | 中文 |
论文总页数 | 59 |
插图总数 | 8 |
插表总数 | 10 |
参考文献总数 | 42 |
馆藏号 | 0005116 |
保密级别 | 公开 |
中图分类号 | F83/541 |
文献类型 | 学位论文 |
条目标识符 | http://ir.lzufe.edu.cn/handle/39EH0E1M/33882 |
专题 | 金融学院 |
推荐引用方式 GB/T 7714 | 徐罡. 基于GA-XGBoost的量化投资策略研究[D]. 甘肃省兰州市. 兰州财经大学,2023. |
条目包含的文件 | 下载所有文件 | |||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | ||
2020000005053.pdf(1485KB) | 学位论文 | 开放获取 | CC BY-NC-SA | 浏览 下载 |
个性服务 |
查看访问统计 |
谷歌学术 |
谷歌学术中相似的文章 |
[徐罡]的文章 |
百度学术 |
百度学术中相似的文章 |
[徐罡]的文章 |
必应学术 |
必应学术中相似的文章 |
[徐罡]的文章 |
相关权益政策 |
暂无数据 |
收藏/分享 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论