Machine Learning for Knowledge Construction in a MOOC Discussion Forum
Yassine Benjelloun Touimi1, Abdelladim Hadioui2, Nour-eddine El Faddouli3, Samir Bennani4

1Yassine benjelloun Touimi*, Department of Computer Science, Mohammadia School of Engineering, University Mohammed V, Morocco.
2Abdeladim Hadioui, Department of Computer Science, Mohammadia School of Engineering, University Mohammed V, Morocco.
3Nourredine Faddouli, Department of Computer Science, Mohammadia School of Engineering, University Mohammed V, Morocco.
4Samir Bennani, Professor and head, Lab in Mohammadia School of Engineering, and Vice President in the University Mohammad V Morocco.
Manuscript received on December 15, 2019. | Revised Manuscript received on December 24, 2019. | Manuscript published on January 10, 2020. | PP: 2933-2942 | Volume-9 Issue-3, January 2020. | Retrieval Number: C8899019320/2020©BEIESP | DOI: 10.35940/ijitee.C8899.019320
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: discussion forums are spreadly employed as learning tools in online courses, particularly in the Massive open online course (MOOC). Learners share opinions, express needs, and seek tutoring, and participate in discussions in the online forum. However, learner’s workstation generates massive information due to the number of MOOC participants, making it difficult to identify relevant information that can help and answer questions during the MOOC. Identifying and extracting knowledge from a MOOC discussion forum requires learner’s engagement in a collaborative and informative learning environment that enables knowledge exchange and information sharing. In this article we offer a new approach to explore forums, interactions and collaboration of learners online, in a knowledge building process, by an extraction framework and presentation of knowledge based on the characteristics of the text written in the learners’ messages during the training. Our proposal consists in combining the pretreatment of the natural language by the TF-IDF metric, and the embedding of the words by Word2Vec, and then we will use the machine learning algorithm SVM for a semantic classification according to the analysis interactions model. Thus, we will apply the transformations and pretreatments on the messages posted in the forums by the participants in the MOOC, then the Word2Vec to represent each word as a vector, which will be concatenated to the features of the context TF-IDF. These vectors will form the data input of our Learning SVM machine algorithm, which aims to establish semantic relationships between concepts. The knowledge is then expressed as ontology for a representation of knowledge and an enrichment of our model. 
Keywords: MOOC, Forum, Interactive Analysis Model, TF-IDF, Word2Vec, Ontology, Support Vector Machine.
Scope of the Article: Machine Learning