疾病费用预测的建模分析

引用本文

张菁芳, 李佳承, 任家顺. 疾病费用预测的建模分析[J]. 重庆大学学报, 2016, 39(2): 99-106. DOI: 10.11835/j.issn.1000-582X.2016.02.013. 复制到剪切板

ZHANG Jingfang, LI Jiacheng, REN Jiashun. Modeling analysis on the prediction of the cost of diseases[J]. Journal of Chongqing University, 2016, 39(2): 99-106. DOI: 10.11835/j.issn.1000-582X.2016.02.013. 复制到剪切板

基金项目

重庆市科技计划项目资助(cstc2013jccxA10012)。

作者简介

张菁芳(1979-),女,博士,主要从事社会医学等研究,(E-mail)556222@qq.com。

文章历史

收稿日期：2015-11-23

Contents Abstract Full text Figures/Tables PDF

疾病费用预测的建模分析

张菁芳^a, 李佳承^b, 任家顺^a

a.中国人民解放军第三军医大学第二附属医院, 重庆 400037;
b.中国人民解放军第三军医大学学员旅十一营, 重庆 400038

收稿日期：2015-11-23

基金项目：重庆市科技计划项目资助(cstc2013jccxA10012)。

作者简介：张菁芳(1979-),女,博士,主要从事社会医学等研究,(E-mail)556222@qq.com。

摘要：根据重庆市某三级甲等医院2012年1月至2014年12月5种常见疾病(糖尿病、甲状腺功能亢进、顺产、肠息肉、脑梗死)的月人均治疗费用数据,采用BP神经网络模型、广义回归神经网络模型、灰色GM(1,1)模型以及非线性回归模型,分别预测2015年1月至8月5种疾病的月人均治疗费用的变化情况,并与真实费用数据进行对比,判断4种模型预测的准确程度。结果表明:BP神经网络模型、广义回归神经网络模型、灰色GM(1,1)模型、非线性回归模型预测5种疾病的可决系数R²最小分别为0.278、0.565、0.048和0.097,最大分别为0.826、0.901、0.600和0.747;与2015年1月至8月的真实费用数据比较,4种模型预测的相对误差最小分别为9.845%、3.507%、5.897%和3.642%,最大分别为15.450%、13.940%、30.518%和17.204%。其中广义回归神经网络在疾病费用的预测结果相对于其他模型更准确。

关键词: BP神经网络广义回归神经网络灰色GM(1,1)预测非线性回归分析数学建模

Modeling analysis on the prediction of the cost of diseases

ZHANG Jingfang^a, LI Jiacheng^b, REN Jiashun^a

a. Xinqiao Hospital, Third Military Medical University, Chongqing 400038, P. R. China;
b. Battalion 11, College of Medicine, Third Military Medical University, Chongqing 400038, P. R. China

Supported by Chongqing Key Techologies R & D Program(cstc2013jccxA10012).

Abstract: On the basis of the per capita treatment cost per month of five common diseases, i.e. diabetes, intestinal polyp, hyperthyroidism, eutocia and cerebral infarction, in a 3-A-grade hospital in Chongqing from January, 2012 to December, 2014, we used BP neural network, generalized regression neural network (GRNN), grey system GM (1, 1) and non-linear regression analysis to predict the change of per capita treatment costs per month of these five diseases from January 2015 to August 2015. And the accuracy of these four models was judged by comparing the prediction results with real data. The results show that the minimum coefficients of determination (R²) of the four models are 0.278, 0.565, 0.048 and 0.097, respectively, while their maximum coefficients of determination(R²) are 0.826, 0.901, 0.600 and 0.747, respectively. The minimum prediction errors of the four models are 9.845%, 3.507%, 5.897% and 3.642%, respectively, while their maximum prediction errors are 15.450%, 13.940%, 30.518% and 17.204%, respectively. Compared with the other three models, the GRNN model can predict the cost of diseases more accurately.

Key words: BP neural network GRNN grey system GM(1,1) non-linear regression analysis mathematical modeling

医疗质量的提升和医疗改革的顺利进行其前提是足够的医疗费用，医疗费用预算的关键在于疾病费用预测标准制定得是否科学、合理^{[1, 2]}。因此，能够根据现有搜集得到的医疗费用数据，借助合理有效的数学模型预测并制定医疗费用的预算方案是促进医改顺利进行的关键。根据前期调研结果，社会物价的波动、医疗成本的改变均能引发医疗费用的改变。准确的疾病费用预测对医疗政策的制定能提供有益的参考。尽管有不少医疗费用的相关研究^{[3, 4, 5]}，但仍存在定性研究多，缺乏用数学方法进行精确研究等问题，笔者拟通过运用BP神经网络模型、广义回归神经网络模型、灰色GM(1，1)模型以及非线性回归模型对医疗费用进行预测，并对4种模型的预测效果进行比较分析，以期获得预测精度高、方法科学的数学模型，方便后续研究的顺利进行。

1 资料与方法 1.1 临床资料

本研究选用的是重庆市某三级甲等医院在2012年1月至2015年8月之间5类疾病每月治疗病例数和人均治疗费用的统计数据，其中5类疾病包括糖尿病9 208例、甲状腺功能亢进1 888例、顺产2 997例、肠息肉1 911例、脑梗死6 731例(见表 1)。所选出每月人均疾病治疗费用数据样本共230个，构成矩阵 A =(a_ij)_44×5，数据矩阵的维度为44×5，其中以月份为时间序列，共44个月的人均疾病费用。

表 1 2012年1月至2015年8月5类疾病每月治疗病例数和人均治疗费用(万元) Table 1 The number of patients and mean cost from Jan.2012 to Aug.2015

月份	糖尿病		甲状腺机能亢进		顺产		肠息肉		脑梗死
月份	例数	治疗费用	例数	治疗费用	例数	治疗费用	例数	治疗费用	例数	治疗费用
2012-01	115	1.375	39	0.676	46	0.408	15	0.821	97	2.330
2012-02	176	1.426	46	0.759	35	0.371	49	0.868	107	2.650
2012-03	254	1.470	48	0.904	62	0.391	40	0.914	120	2.424
2012-04	220	1.521	54	0.791	55	0.412	39	0.812	115	2.115
2012-05	212	1.739	58	0.757	59	0.427	40	0.812	123	2.459
2012-06	202	1.511	54	0.979	58	0.426	41	0.970	142	2.713
2012-07	241	1.517	53	0.899	62	0.385	40	0.779	122	3.212
2012-08	185	1.488	60	0.652	73	0.369	47	0.985	134	3.006
2012-09	206	1.280	55	0.541	59	0.424	34	0.853	152	2.403
2012-10	137	1.328	32	0.502	62	0.399	40	0.790	81	3.236
2012-11	172	1.756	63	0.915	62	0.471	54	0.979	143	3.486
2012-12	235	1.961	46	0.984	59	0.444	46	1.166	156	3.620
2013-01	183	1.961	42	0.931	49	0.470	37	0.811	145	4.316
2013-02	114	1.566	20	0.945	34	0.508	24	0.831	94	2.971
2013-03	261	1.730	2	1.145	43	0.491	47	0.947	121	3.155
2013-04	228	1.842	1	0.484	44	0.493	57	1.177	134	2.562
2013-05	226	1.568	28	1.000	47	0.469	60	0.930	148	2.811
2013-06	185	1.576	43	0.782	40	0.498	38	0.921	171	3.260
2013-07	206	1.659	57	1.013	39	0.486	33	0.911	182	3.352
2013-08	183	1.675	64	0.924	59	0.499	45	0.897	150	4.016
2013-09	228	1.604	59	0.933	50	0.523	49	0.998	146	3.260
2013-10	204	1.649	51	1.102	49	0.457	39	0.867	140	3.660
2013-11	245	1.783	2	1.883	42	0.508	43	1.017	162	4.526
2013-12	228	1.820	45	1.034	51	0.482	32	1.064	183	4.611
2014-01	208	1.661	63	0.968	41	0.534	40	0.976	204	4.490
2014-02	181	1.641	2	0.569	59	0.514	37	1.085	127	4.096
2014-03	253	1.790	54	1.063	71	0.517	53	1.167	173	4.883
2014-04	251	1.857	47	1.022	72	0.543	44	1.148	180	3.787
2014-05	246	1.794	66	1.013	56	0.502	64	1.019	177	4.485
2014-06	252	1.665	53	1.032	49	0.496	56	1.068	165	4.199
2014-07	252	1.768	1	0.287	71	0.534	45	1.107	203	4.239
2014-08	209	1.450	62	0.891	76	0.501	48	0.902	190	3.684
2014-09	215	1.434	62	0.939	77	0.550	56	0.900	220	3.414
2014-10	199	1.537	46	1.013	84	0.469	48	0.898	166	3.824
2014-11	250	1.509	37	0.842	101	0.479	61	1.096	155	4.051
2014-12	256	1.729	44	0.890	127	0.500	30	1.162	188	4.715
2015-01	204	1.671	46	0.865	134	0.504	51	1.136	204	3.647
2015-02	156	1.860	40	0.917	132	0.544	31	1.253	152	3.587
2015-03	231	1.789	45	0.875	101	0.521	55	1.073	208	4.215
2015-04	243	1.725	39	0.940	70	0.503	53	1.078	169	4.661
2015-05	222	1.593	50	0.762	121	0.485	50	1.077	168	4.519
2015-06	216	1.454	46	0.701	116	0.480	38	1.201	150	3.307
2015-07	222	1.617	39	0.753	128	0.506	38	1.044	173	3.127
2015-08	96	1.563	24	0.660	72	0.499	24	1.049	91	3.295
总计	9208	71.909	1888	38.536	2997	20.991	1911	43.559	6731	156.379

表 1 2012年1月至2015年8月5类疾病每月治疗病例数和人均治疗费用(万元) Table 1 The number of patients and mean cost from Jan.2012 to Aug.2015

1.2 方法

根据大量学者研究^{[6, 7, 8, 9, 10]}提示，BP神经网络模型、广义回归神经网络、灰色GM(1，1)预测模型以及非线性回归模型是目前应用最为广泛的预测模型，其应用范围涉及天文预测、人口预测、财政预测等方面，但其在临床疾病费用预测方面的应用相对较少。笔者选择以上4种模型对5种临床常见疾病的治疗费用进行预测，并且为判断预测模型的准确程度，本研究选用可决系数R²^[11]作为统计学拟合程度评价指标，其统计学意义在于反映预测模型预测结果与样本观测值之间的偏离程度。R²越大，则预测偏离程度越小，反之，预测偏离程度越大。因此，研究以2012年1月至2014年12月的疾病治疗费用为基础，预测2015年1月至8 月内的疾病治疗费用，并根据预测模型的可决系数R²和2015年1月至8月的真实值和预测值之间的相对误差E判断预测模型的准确程度。

1.2.1 神经网络模型

神经网络模型是按照生物神经元的结构和工作原理构建一个人工神经网络，其中人工神经元作为一种简单的处理器，可以将输入的信号进行加权求和。

本研究采用了两种神经网络模型：BP神经网络模型和广义回归神经网络模型(generalized regression neural network，GRNN)。两种模型的结构大致相同，通常由输入层、输出层及若干隐含层构成，每层有若干个神经元，上层神经元与下层神经元通过权重进行连接，层与层之间都是所有神经元互联构成复杂的连接网络(图 1)。BP神经网络是一种多层神经网络，采用误差反向传播的学习算法进行权值和阈值调整，而广义回归神经网络建立在非参数回归分析的基础上，以样本数据为后验条件进行参数估计，根据最大概率原则进行网络输出^[6]。关于两种神经网络模型的数学原理，详见参考文献[7, 8, 9]。

图 1 神经网络模型模式图 Fig. 1 Neural network model

1.2.2 灰色GM(1,1)预测模型

灰色GM(1,1)模型是灰色系统理论中应用最广泛的灰色动态预测模型，也称单序列一阶线性动态预测模型。该模型是将时间序列 X⁽⁰⁾={X⁽⁰⁾(1),X⁽⁰⁾(2),…,X⁽⁰⁾(n)}累加生成新序列X⁽¹⁾(t)，t为时间序列，如t=1，即表示2012年1月；t=12，即表示2012年12月。X⁽¹⁾(t)且满足微分方程：

$ \frac{{d{X^{\left( 1 \right)}}}}{{dt}} + \alpha {X^{\left( 1 \right)}} = \mu , $

(1)

式中：参数α，μ可以通过最小二乘法求得。得到累加预测值X⁽¹⁾(t+1)，满足：

$ {X^{\left( 1 \right)}}\left( {t + 1} \right) = \left( {{X^{\left( 0 \right)}}\left( 1 \right) - \frac{\mu }{\alpha }} \right){e^{ - \alpha t}} + \frac{\mu }{\alpha }, $

(2)

最后，将累加的预测值还原得到目标预测值X⁽⁰⁾(t+1)。详细原理参考文献[10]。

1.2.3 非线性回归模型

根据选用数据的分布情况，本研究采用二次函数进行线性拟合，即根据n组原始数据点(x,y)的分布，寻找函数$\hat y = A{x^2} + bx + c$使得$\sum\limits_{i = 1}^n {{{\left( {\hat y - y} \right)}^2}} $最小，则在函数${\hat y}$为该组数据拟合函数，则可以根据拟合函数${\hat y}$对随后的疾病费用进行预测^[11]。

2 结果与分析 2.1 预测模型

本研究使用2012年1月至2014年12月的疾病人月均治疗费用数据，分别采用BP神经网络模型、广义回归神经网络模型、灰色GM(1,1)模型以及非线性回归模型，构建了5种疾病的月人均治疗费用模型。其中，为确保BP神经网络模型输出结果的准确性，隐含层神经元间的传递函数选用log-sigmoid型函数logsig，输出层神经元的传递函数选用纯线性函数purelin。广义回归神经网络模型的基函数选用的是格林(Green)函数^{[13, 14, 15]}。灰色GM(1,1)模型中累加函数方程以及非线性回归模型^{[16, 17]}的模型方程如下所示。

1) 糖尿病预测模型。

灰色GM(1,1)模型：X₁⁽¹⁾(t+1)=[X₁⁽¹⁾(t+1)+725.026]e^0.002t-725.026；

非线性回归模型：y₁=-7.558×10^-4t²+0.033t+1.367。

2) 甲状腺功能亢进预测模型。

灰色GM(1,1)模型：X₂⁽¹⁾(t+1)=[X₂⁽¹⁾(t+1)+174.480]e^0.005t-174.480；

非线性回归模型：y₂=-6.136×10^-4t²+0.028t+0.652。

3) 顺产预测模型。

灰色GM(1,1)模型：X₃⁽¹⁾(t+1)=[X₃⁽¹⁾(t+1)+52.406]e^0.008t-52.406；

非线性回归模型：y₃=-1.775×10^-4t²+0.010t+0.358。

4) 肠息肉预测模型。

灰色GM(1,1)模型：X₄⁽¹⁾(t+1)=[X₄⁽¹⁾(t+1)+137.045]e^0.006t-137.045；

非线性回归模型：y₄=-1.094×10^-4t²+0.010t+0.821。

5) 脑梗死预测模型。

灰色GM(1,1)模型：X₅⁽¹⁾(t+1)=[X₅⁽¹⁾(t+1)+175.625]e^0.015t-175.625；

非线性回归模型：y₅=-1.403×10^-3t²+0.108t+2.127。

4种预测模型在5种疾病治疗费用预测中的可决系数R²见表 2，可见5种疾病广义回归网络模型的可决系数R² 均高于其他模型。

表 2 5种疾病四种预测模型的可决系数R² Table 2 Coefficients of determination of four models for five diseases

2.2 预测结果的验证

分别对5种疾病2015年1月至8月的月人均住院费用进行预测，并与实际值进行比较，表 2至表 6给出2015年1月至8月的预测误差E。对糖尿病，广义回归神经网络模型和灰色GM(1,1)模型的相对误差较小；对甲状腺机能亢进、顺产，广义回归神经网络模型和非线性回归模型的相对误差较小；对肠息肉，广义回归神经网络模型和灰色GM(1,1)模型、非线性回归模型的相对误差较小；对甲状腺机能亢进，非线性回归模型和广义回归神经网络模型的相对误差较小；对脑梗死，广义回归神经网络模型和BP神经网络模型的相对误差较小。广义回归神经网络模型在疾病费用的预测方面效果较好。

表 3 4种模型关于糖尿病治疗费用预测结果的验证(万元) Table 3 The results of four models to predict the cost of diabetes

月份	真实值	BP神经网络模型		广义回归神经网络模型		灰色GM(1,1)模型		非线性回归模型
月份	真实值	预测值	相对误差/%	预测值	相对误差/%	预测值	相对误差/%	预测值	相对误差/%
2015-01	1.671	1.524	8.793	1.704	1.953	1.701	1.762	1.536	8.096
2015-02	1.860	1.524	18.061	1.677	9.845	1.704	8.377	1.512	18.731
2015-03	1.789	1.524	14.820	1.839	2.771	1.708	4.536	1.486	16.942
2015-04	1.725	1.524	11.675	1.797	4.160	1.712	0.762	1.459	15.422
2015-05	1.593	1.522	4.461	1.733	8.779	1.716	7.710	1.430	10.201
2015-06	1.454	1.515	4.208	1.608	10.605	1.719	18.254	1.400	3.703
2015-07	1.617	1.492	7.677	1.470	9.052	1.723	6.587	1.368	15.354
2015-08	1.563	1.421	9.068	1.599	2.292	1.727	10.502	1.335	14.561
平均相对误差/%			9.845±4.822		6.182±3.717		7.311±5.518		12.876±5.077

表 3 4种模型关于糖尿病治疗费用预测结果的验证(万元) Table 3 The results of four models to predict the cost of diabetes

表 4 4种模型关于甲状腺机能亢进治疗费用预测结果的验证(万元) Table 4 The result of four models to predict the cost of hyperthyroidism

月份	真实值	BP神经网络模型		广义回归神经网络模型		灰色GM(1,1)模型		非线性回归模型
月份	真实值	预测值	相对误差/%	预测值	相对误差/%	预测值	相对误差/%	预测值	相对误差/%
2015-01	0.865	0.924	6.797	0.885	2.362	0.975	12.733	0.841	2.736
2015-02	0.917	0.922	0.555	0.868	5.362	0.980	6.855	0.823	10.226
2015-03	0.875	0.915	4.552	0.911	4.077	0.984	12.452	0.804	8.188
2015-04	0.940	0.901	4.047	0.880	6.335	0.989	5.277	0.783	16.649
2015-05	0.762	0.889	16.734	0.932	22.363	0.994	30.424	0.761	0.098
2015-06	0.701	0.885	26.210	0.782	11.578	0.998	42.462	0.738	5.311
2015-07	0.753	0.883	17.303	0.708	5.906	1.003	33.237	0.714	5.204
2015-08	0.660	0.883	33.849	0.747	13.280	1.008	52.806	0.688	4.331
平均相对误差/%			13.756±11.844		8.908±6.565		24.531±17.727		6.593±5.108

表 4 4种模型关于甲状腺机能亢进治疗费用预测结果的验证(万元) Table 4 The result of four models to predict the cost of hyperthyroidism

表 5 4种模型关于顺产治疗费用预测结果的验证(万元) Table 5 The result of four models to predict the cost of eutocia

月份	真实值	BP神经网络模型		广义回归神经网络模型		灰色GM(1,1)模型		非线性回归模型
月份	真实值	预测值	相对误差/%	预测值	相对误差/%	预测值	相对误差/%	预测值	相对误差/%
2015-01	0.504	0.446	11.462	0.497	1.242	0.542	7.626	0.500	0.836
2015-02	0.544	0.446	18.101	0.503	7.568	0.546	0.361	0.497	8.787
2015-03	0.521	0.446	14.413	0.540	3.604	0.551	5.695	0.493	5.317
2015-04	0.503	0.446	11.306	0.524	4.132	0.555	10.386	0.490	2.608
2015-05	0.485	0.446	8.113	0.505	4.045	0.559	15.252	0.486	0.071
2015-06	0.480	0.446	7.161	0.487	1.465	0.564	17.357	0.481	0.201
2015-07	0.506	0.446	11.938	0.481	5.022	0.568	12.188	0.477	5.887
2015-08	0.499	0.446	10.556	0.503	0.976	0.573	14.840	0.472	5.427
平均相对误差/%			11.631±3.451		3.507±2.242		10.463±5.675		3.642±3.186

表 5 4种模型关于顺产治疗费用预测结果的验证(万元) Table 5 The result of four models to predict the cost of eutocia

表 6 4种模型关于肠息肉治疗费用预测结果的验证(万元) Table 6 The result of four models to predict the cost of intestinal polyp

月份	真实值	BP神经网络模型		广义回归神经网络模型		灰色GM(1,1)模型		非线性回归模型
月份	真实值	预测值	相对误差/%	预测值	相对误差/%	预测值	相对误差/%	预测值	相对误差/%
2015-01	1.136	1.011	11.044	1.154	1.567	1.080	4.940	1.053	7.305
2015-02	1.253	1.009	19.457	1.139	9.093	1.087	13.249	1.055	15.767
2015-03	1.073	1.004	6.473	1.239	15.502	1.093	1.906	1.057	1.493
2015-04	1.078	0.993	7.823	1.093	1.427	1.100	2.097	1.059	1.768
2015-05	1.077	0.985	8.499	1.078	0.101	1.107	2.836	1.060	1.540
2015-06	1.201	0.982	18.213	1.077	10.319	1.114	7.205	1.061	11.605
2015-07	1.044	0.981	5.990	1.187	13.689	1.121	7.429	1.062	1.794
2015-08	1.049	0.981	6.523	1.061	1.122	1.128	7.515	1.063	1.316
平均相对误差/%			10.503±5.393		6.603±6.255		5.897±3.806		5.324±5.639

表 6 4种模型关于肠息肉治疗费用预测结果的验证(万元) Table 6 The result of four models to predict the cost of intestinal polyp

根据表 2~7中的数据，可以得出以下结论：①根据可决系数R²的大小关系，4种模型拟合程度排序分别为：广义回归神经网络模型(R_min²=0.565，R_max²=0.901)、BP神经网络模型(R_min²=0.278，R_max²=0.826)、非线性回归模型(R_min²=0.097，R_max²=0.747)、灰色GM(1,1)模型(R_min²=0.048，R_max²=0.600)。②根据预测相对误差E的大小关系，4种模型预测结果可靠性排序分别为：广义回归神经网络模型(E_min=3.507%，E_max=13.940%)、非线性回归模型(E_min=3.642%，E_max=17.204%)、BP神经网络模型(E_min=9.845%，E_max=15.450%)、灰色GM(1,1)模型(E_min=5.897%，E_max=30.518%)。

表 7 4种模型关于脑梗死治疗费用预测结果的验证(万元) Table 7 The result of four models to predict the cost of cerebral infarction

月份	真实值	BP神经网络模型		广义回归神经网络模型		灰色GM(1,1)模型		非线性回归模型
月份	真实值	预测值	相对误差/%	预测值	相对误差/%	预测值	相对误差/%	预测值	相对误差/%
2015-01	3.647	4.029	10.484	4.638	27.183	4.590	25.860	4.216	15.599
2015-02	3.587	4.029	12.316	3.767	4.994	4.660	29.893	4.219	17.603
2015-03	4.215	4.029	4.411	3.598	14.642	4.731	12.232	4.219	0.097
2015-04	4.661	4.029	13.550	4.144	11.091	4.803	3.048	4.217	9.524
2015-05	4.519	4.029	10.853	4.608	1.960	4.876	7.894	4.211	6.806
2015-06	3.307	4.027	21.760	4.534	37.087	4.950	49.675	4.203	27.101
2015-07	3.127	4.021	28.596	3.446	10.209	5.025	60.714	4.192	34.082
2015-08	3.295	4.008	21.629	3.152	4.352	5.102	54.829	4.179	26.820
平均相对误差/%			15.450±7.855		13.940±12.243		30.518±22.337		17.204±11.574

表 7 4种模型关于脑梗死治疗费用预测结果的验证(万元) Table 7 The result of four models to predict the cost of cerebral infarction

3 结论

1) 广义回归神经网络模型预测效果更接近临床数据，并且预测结果较其他模型更加稳定，是4种模型中最适宜应用于临床疾病费用预测工作的数学模型。

2) BP神经网络模型的预测效果较广义回归神经网络模型略差一点，主要原因可能在于其缺乏选择学习速率的方式以及确定隐含层神经元数目的有效方法，这些不足在文献中都有较为详尽的记录。

3) 非线性回归模型的预测效果波动较大，主要原因在于该模型对于数据的要求较高，在数据波动较大，特别是在临床病情复杂，单一病种相对少见的情况下，应用受到较大限制。

4) 灰色GM(1,1)模型的预测效果是4种模型中最差的，主要原因在于灰色GM(1,1)模型是基于第一个初始值X⁽⁰⁾(1)，并采用外推法进行拟合预测的过程，按照最小二乘法的原理也必将经过第一个数据点，因此，初始点的选取会显著影响预测的效果和精度。

5) 对于文中提到的以时间序列为依据的预测模型，广义回归神经网络模型可以进行的自适应、自学习以及容错功能，克服了其他3种模型的缺点，可以不受数据模型参数、结构以及动态特性的影响，对于数据的要求也不高，是可以应用于临床疾病治疗费用预测的重要模型。

参考文献

[1]	Tacettin Ornek. Clinical factors affecting the direct cost of patients hospitalized with acute exacerbation of chronic obstructive pulmonary disease[J]. International Journal of Medical Sciences,2012,9(4):285-290.(1)
[2]	Joel Segal. Cost of illness studies[M]. RTI International Center of Excellence in Health Promotion Economics,2006:2.(1)
[3]	Kappelman M D, Rifas-Shiman S L, Porter C Q, et al. Direct Health Care Costs of Crohn's Disease and Ulcerative Colitis in US Children and Adults[J]. Gastroenterology,2008,135:1907-1913.(1)
[4]	Usa C, Petcharat P, Nathorn C, et al. Factors affecting health-care costs and hospitalizations among diabetic patients in Thai public hospitals.[J]. Value in Health,2008,11(s1):69-74.,(1)
[5]	Mehta S, Moore R D, Graham N M. Potential factors affecting adherence with HIV therapy[J]. Aids,1997,11(14):1665-1670.(1)
[6]	陈明.MATLAB神经网络原理与实例精解[M].北京:清华大学出版社,2013. CHEN Ming. MATLAB neural network principle and example[M]. Beijing:Tsinghua university press,2013.(in Chinese)(2)
[7]	Asoodeh M, Shadizadeh S R, Zargar G. The estimation of stoneley wave velocity from conventional well log data:using an integration of artificial neural networks[J]. Energy Sources Part A:Recovery Utilization and Environmental Effects,2015,37(3):309-317.(2)
[8]	Zhang J, Tan Z, Li C. A novel hybrid forecasting method using GRNN combined with wavelet transform and a GARCH model[J]. Energy Sources Part B:Economics Planning and Policy,2015,10(4):418-426.(2
[9]	Liu X Y, Peng H Q, Bai Y, et al. Tourism flows prediction based on an improved grey GM(1,1) model[J]. Procedia-Social and Behavioral Sciences,2014,138:767-775.(2)
[10]	Khataee A, Vahid B, Behjati B, et al. Kinetic modeling of a triarylmethane dye decolorizeation by photoelectron-Fenton processs in a recirculateing system; Nonlinear regression analysis[J]. Chemical Engineering Research and Design,2013,92(2):362-367.(2)
[11]	张良均,曹晶,蒋世忠.神经网络实用教程[M].北京:机械工业出版社,2009:22-31. ZHANG Liangjun, CAO Jin, JIANG Shizhong. Neural network practical tutorial[M]. Beijing:China Machine Press,2009:22-31.(in Chinese)(2)
[12]	颜虹.医学统计学[M].2版.北京:人民卫生出版社,2010:215. YAN Hong. Medlical statistics[M]. 2th ed. Beijing:People's Medical Publishing House, 2010:215. (in Chinese)
[13]	张德丰.MATLAB神经网络编程[M].北京:化学工业出版社,2011. ZHANG Defeng. MATLAB neural network programming[M]. Beijing:Chemical Industry Press,2011.(in Chinese)(1)
[14]	陈芳,楼文高.基于广义回归神经网络的蔬菜市场日价格预测[J].浙江农业学报,2015,27(7):1253-1258. CHEN Fang, LOU Wengao. Vegetable market day price forecasting based on generalized regression neural network[J]. Acta Agriculturae Zhejiangensis,2015,27(7):1253-1258.(in Chinese)(1)
[15]	Liu B, Zhao L, Zhai Z J, et al. Optimum model of GM(1,1) and its suitable range[J]. Journal of Nanjing University of Aeronautics and Astronautics,2003,35(4):451-454.(1)
[16]	陈永胜.基于MATLAB和SPSS的非线性回归分析[J].牡丹江大学学报,2009,18(5):101-103. CHEN Yongsheng. Nonlinear regression analysis based on MATLAB and SPSS[J]. Journal of Mudanjiang University,2009,18(5):101-104.(in Chinese)(1)
[17]	杨华龙,刘金霞,郑斌.灰色预测GM(1,1)模型的改进及应用[J].数学的实践与认识,2011,41(23):39-46. YANG Hualong, LIU Jinxia, ZHENG Bin. Improvement and application of grey prediction GM (1,1) model[J]. Mathematics in Practice and Theory,2011,41(23):39-46.(in Chinese)(1)

本文献在全文中的定位：

... 用，医疗费用预算的关键在于疾病费用预测标准制定得是否科学、合理^{[1, 2]}。因此，能够根据现有搜集得到的 ...

本文献在全文中的定位：

... 用预测标准制定得是否科学、合理^{[1, 2]}。因此，能够根据现有搜集得到的医疗费用数据，借助合理有效的数学 ...

本文献在全文中的定位：

... 对医疗政策的制定能提供有益的参考。尽管有不少医疗费用的相关研究^{[3, 4, 5]} ...

本文献在全文中的定位：

... 尽管有不少医疗费用的相关研究^{[3, 4, 5]}，但仍存在定性研究多，缺乏用数 ...

本文献在全文中的定位：

... ^{[3, 4, 5]}，但仍存在定性研究多，缺乏用数学方法进行精确研究等问题，笔者拟 ...

本文献在全文中的定位：

... 根据大量学者研究^{[6, 7, 8,} ...

... 以样本数据为后验条件进行参数估计，根据最大概率原则进行网络输出^[6]。关于两种神经网络模型的数学原理，详见参考文献 ...

本文献在全文中的定位：

... 根据大量学者研究^{[6, 7, 8, 9,} ...

... 关于两种神经网络模型的数学原理，详见参考文献[7, 8, 9]。 ...

本文献在全文中的定位：

... ^{[6, 7, 8, 9, 10]} ...

... 经网络模型的数学原理，详见参考文献[7, 8, 9]。 ...

本文献在全文中的定位：

... ^{, 7, 8, 9, 10]}提示，BP神经网络模型、广义回归 ...

... 献[7, 8, 9]。 ...

本文献在全文中的定位：

... ^{8, 9, 10]}提示，BP神经网络模型、广义回归神经网络、灰色GM(1，1)预测模型以及 ...

... 预测值还原得到目标预测值X⁽⁰⁾(t+1)。详细原理参考文献[10]。 ...

本文献在全文中的定位：

... 并且为判断预测模型的准确程度，本研究选用可决系数R²^[11]作为统计学拟合程度评价指标，其统计学意义在于反映预测模型预测结 ...

... 数据拟合函数，则可以根据拟合函数${\hat y}$对随后的疾病费用进行预测^[11]。 ...

本文献在全文中的定位：

... 性函数purelin。广义回归神经网络模型的基函数选用的是格林(Green)函数^{[13, 14, 15]} ...

本文献在全文中的定位：

... 的基函数选用的是格林(Green)函数^{[13, 14, 15]}。灰色GM(1,1)模型中累加函数方程 ...

本文献在全文中的定位：

... ^{[13, 14, 15]}。灰色GM(1,1)模型中累加函数方程以及非线性回归模型 ...

本文献在全文中的定位：

... 灰色GM(1,1)模型中累加函数方程以及非线性回归模型^{[16, 17]}的模型方程如下所示。 ...

本文献在全文中的定位：

... 累加函数方程以及非线性回归模型^{[16, 17]}的模型方程如下所示。 ...