Table 1 Aggregated simulation results describing the effect of different types of data-generating processes

From: Decision trees in epidemiological research

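| True model | Type | MSE (mean) | MSE (SD) | Terminal nodes (mean) | Terminal nodes (20th percentile) | Terminal nodes (80th percentile) |
|---|---|---|---|---|---|---|
| Tree | CART | 1.26 | 0.151 | 7.01 | 6 | 8 |
| Tree | Pruned CART | 1.22 | 0.137 | 4.27 | 3 | 5 |
| Tree | Pruned CART (1-SE) | 1.25 | 0.139 | 3.31 | 3 | 4 |
| Tree | CTree | 1.27 | 0.154 | 3.72 | 3 | 4 |
| Tree | Linear regression | 2.04 | 0.179 | | | |
| Regression | CART | 4.12 | 0.413 | 15.24 | 14 | 16 |
| Regression | Pruned CART | 4.19 | 0.442 | 13.97 | 12 | 16 |
| Regression | Pruned CART (1-SE) | 4.55 | 0.509 | 8.66 | 6 | 11 |
| Regression | CTree | 4.14 | 0.409 | 13.96 | 13 | 15 |
| Regression | Linear regression | 1.03 | 0.093 | | | |
| Hybrid | CART | 1.39 | 0.138 | 13.1 | 11 | 15 |
| Hybrid | Pruned CART | 1.37 | 0.131 | 5.96 | 3 | 9 |
| Hybrid | Pruned CART (1-SE) | 1.39 | 0.133 | 2.69 | 2 | 3 |
| Hybrid | CTree | 1.34 | 0.126 | 5.42 | 4 | 6 |
| Hybrid | Linear regression | 1.17 | 0.106 | | | |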
  1. The data-generating processes (true models) are a tree structure, a regression model, and a hybrid model that combines the two structures
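
As a rough illustration of how one row of summary statistics like those in Table 1 could be aggregated from repeated simulation runs, the sketch below computes the mean and SD of the MSE together with the mean, 20th and 80th percentiles of the terminal-node counts. It is only a sketch under stated assumptions: the function name `summarize`, the input lists, and the choice of the "inclusive" percentile method are hypothetical, and the example numbers are invented for illustration rather than taken from the study.

```python
import statistics


def summarize(mse_values, node_counts):
    """Aggregate per-replicate results into one summary row (hypothetical helper)."""
    # quantiles(n=5) returns four cut points: the 20th, 40th, 60th and 80th percentiles.
    # The "inclusive" method is an assumption; the source does not state which definition was used.
    cuts = statistics.quantiles(node_counts, n=5, method="inclusive")
    return {
        "mse_mean": statistics.mean(mse_values),
        "mse_sd": statistics.stdev(mse_values),  # sample standard deviation
        "nodes_mean": statistics.mean(node_counts),
        "nodes_p20": cuts[0],
        "nodes_p80": cuts[3],
    }


# Invented values for illustration only; these are not results from the study.
example_mse = [1.0, 1.2, 1.4, 1.1, 1.3]
example_nodes = [3, 4, 4, 5, 6]

row = summarize(example_mse, example_nodes)
print(row)
```

In a real simulation, `mse_values` and `node_counts` would hold one entry per simulated data set for a given combination of true model and fitting method, and the function would be called once per table row. Linear regression has no terminal nodes, so only the MSE summaries would apply to it.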