Figure from:Bi-GRU Urgent Classification for MOOC Discussion Forums Based on BERT

FIGURE 1. Bi-GRU with BERT urgent classification model.

FIGURE 2. Document classification model formed by incorporating BERT with one additional output layer. Figure adapted from Devlin et al. [26].

FIGURE 9. The PR curves of RNN in three experiments using data sets A, B, and C where AUC values were equal to 0.741, 0.734, and 0.657, respectively.

FIGURE 10. The PR curves of CNN in three experiments using data sets A, B, and C where AUC values were equal to 0.754, 0.761, and 0.734, respectively.

FIGURE 11. The PR curves of FASTTEXT in three experiments using data sets A, B, and C where AUC values were equal to 0.751, 0.728, and 0.683, respectively.

FIGURE 12. The PR curves of LSTM in three experiments using data sets A, B, and C where AUC values were equal to 0.799, 0.797 and 0.759, respectively.

FIGURE 13. The PR curves of GRU with BERT in three experiments using data sets A, B, and C where AUC values were equal to 0.822, 0.836, and 0.792, respectively.