Latest  Advances  in  Automated  Essay  Scoring:  A  Survey  of  Machine Learning and Deep Learning Methods

Khawar Iqbal Malik

doi:10.32350/umt-air.42.02

Khawar Iqbal Malik Riphah International University, Lahore

DOI: https://doi.org/10.32350/umt-air.42.02

Keywords: automated essay scoring (AES, BILSTM, grading and feedback, human raters, natural language processing (NLP), rubric

Abstract

Abstract Views: 0

The current study aimsto assess the reliability of automated essay scoring (AES) through thecomparison of the mean scoresassigned by an AES tool in the context of a growing educational institution with a rising student population.A survey was conducted to test the reliability and validity of the E-Grading device,as well as to evaluate the use of holistic scores generated by both human and computer scoring,as a better solution for AES systems. While previous research found no significant mean score differences between human and AES scoring, this paper does not confirm these findings. In recent years, several algorithms have been proposed for AESand comparative studies have been conducted to evaluate the effectiveness of these algorithms. Instead, it reviews and examines earlier concepts and techniques applied in AES.

Downloads

Download data is not yet available.

References

B. Bohnet, R. McDonald, G. Simoes, D. Andor, E. Pitler, and J. Maynez, "Morphosyntactic tagging with a meta-BiLSTM model over context sensitive token encodings," 2018. [Online]. Available: https://arxiv.org/abs/1805. 08237.

D. Boulanger and V. Kumar, "Deep learning in automated essay scoring," in Proc. Intell. Tutor. Syst. 14th Int. Conf., ITS 2018, Montreal, QC, Canada, June 2018, pp. 294–299.

C. T. Lim, C. H. Bong, W. S. Wong, and N. K. Lee, "A comprehensive review of automated essay scoring (AES) research and development," Pertanika J. Sci. Technol., vol. 29, no. 3, pp. 1875–1899, 2021, doi: https://doi.org/10.47836/pjst.29.3.27.

H. Elfaik and E. H. NfaouI, "Deep contextualized embeddings for sentiment analysis of Arabic Book's reviews," Proc. Comput. Sci., vol. 215, pp. 973–982, 2022, doi: https://doi.org/10.1016/j.procs.2022.12.100.

M. A. Hussein, H. Hassan, and M. J. P. C. S. Nassef, "Automated language essay scoring systems: A literature review," Peer. J. Comput. Sci., vol. 5, 2019, Art. no. 208, doi: http://doi.org/10.7717/peerj-cs.208.

J. Z. Sukkarieh and J. Blackmore, "c-rater: Automatic content scoring for short constructed responses," in Proc. 22nd Flairs Conf., 2009, pp. 290–295.

M. Mahana, M. Johns, and A. Apte. Automated essay grading using machine learning. Stanford University, 2012. [Online]. Available: https://cs229.stanford.edu/proj2012/MahanaJohnsApte-AutomatedEssay GradingUsingMachineLearning.pdf

K. Sriwanna, "Text classification for subjective scoring using K-nearest neighbors," in Int. Conf. Digit. Arts Media Technol., 2018, pp. 139–142, doi: https://doi.org/10.1109/ICDAMT .2018.8 376511.

I. G. Ndukwe, B. K. Daniel, and C. E. Amadi, "A machine learning grading system using chatbots," in Artif. Intell. Edu. 20th Int. Conf., AIED 2019, Chicago, IL, USA, June, 2019, pp. 365–368.

S. Bonthu, S. Rama Sree, and M. K. Prasad, "Automated short answer grading using deep learning: A survey," in Mach. Learn. Knowledge Extract. 5th IFIP TC 5, TC 12, WG 8.4, WG 8.9, WG 12.9 Int. Cross-Domain Conf., Virtual Event, August, 2021, pp. 61–78.

A. Sakhapara, D. Pawade, B. Chaudhari, R. Gada, A. Mishra, and S. Bhanushali, "Subjective answer grader system based on machine learning," in Soft Comput. Signal Proc. Proc. ICSCSP 2018, 2019, pp. 347–355.

M. S. Devi and H. Mittal, "Machine learning techniques with ontology for subjective answer evaluation," 2016. [Online]. Available: https:// doi.org/10.48550/arXiv.1605.02442.

A. Kumar, A. Kharadi, D. Singh, and M. Kumari, "Subjective answer evaluation system." Available: https://www.academia.edu/download/73054812/IJCST_V9I5P10.pdf.

H. Henderi, Henderi Henderi, and W. Winarno, "Text mining an automatic short Answer Grading (ASAG), comparison of three methods of cosine similarity, Jaccard similarity and Dice's coefficient," J. Appl. Data Sci., vol. 2, no. 2, pp. 45–54, 2021, doi: https://doi.org/10.47738/jads.v2i2.31.

A. M. B. Omran and M. J. Ab Aziz, "Automatic essay grading system for short answers in English language," J. Comput. Sci., vol. 9, no. 10, pp. 1369–1382, 2013, doi: https://doi.org/ 10.3844/jcssp.2013.1369.1382.

A. Amalia, D. Gunawan, Y. Fithri, and I. Aulia, "Automated Bahasa Indonesia essay evaluation with latent semantic analysis," J. Phy. Conf. Ser., vol. 1235, no. 1, 2019, Art. no. 012100, doi: https://doi.org/10.1088/1742-6596/1235/1/012100.

M. Chen and X. Li, "Relevance-based automated essay scoring via hierarchical recurrent model," in Int. Conf. Asian Lang. Process., 2018, pp. 378–383

E. F. Al-Shalabi, "An automated system for essay scoring of online exams in Arabic based on stemming techniques and Levenshtein edit operations," 2016. [Online]. Available: https://doi.org/10.48550/arXiv.1611.02815.

B.-J. Yi, D.-G. Lee, and H.-C. Rim, "The effects of feature optimization on high-dimensional essay data," Math. Prob. Eng., vol. 2015, 2015, Art. no. 21642, doi: https://doi.org/10.1155/ 2015/421642.

T. B. Adji, Z. Abidin, and H. A. Nugroho, "System of negative Indonesian website detection using TF-IDF and Vector Space Model," presented at the 2014 Int. Conf. Elect. Eng. Comput. Sci., Kuta, Bali, Indonesia, Nov. 24–25, 2014, pp. 174–178.

W. Zhu and Y. Sun, "Automated essay scoring system using multi-model machine learning," Comput. Sci. Info., vol. 10, no. 12, pp. 109–117, doi: https://doi.org/10.5121/csit.2020.101211.

A. Lukic and V. J. R. U. Acuna, "Automated essay scoring," 2012. [Online]. Available: http://www. alenlukic.com/assets/docs/aes_report.pdf.

K. Taghipour and H. T. Ng, "A neural approach to automated essay scoring," in Proc. 2016 Conf. Empir. Meth. Nat. Lang. Proc., Austin, Texas, USA, Austin, Texas, Nov. 1–5, 2016, pp. 1882–1891.

F. Dong, Y. Zhang, and J. Yang, "Attention-based recurrent convolutional neural network for automatic essay scoring," in Proc. 21st Conf. Comput. Nat. lang. Lear., Vancouver, Canada, 2017, pp. 153–162.

A. Wiratmo and C. Fatichah, "Indonesian short essay scoring using transfer learning dependency tree LSTM," Int. J. Intell. Eng. Syst., vol. 13, no. 2, Jan. 2020, doi: https:// doi.org/10.22266/ijies2020.0430.27.