AN APACHE SPARK-BASED PLATFORM FOR PREDICTING THE PERFORMANCE OF UNDERGRADUATE STUDENTS

  • 31/03/2022
Introduction

Educational data mining (EDM) now plays an important role in higher education institutions. Many algorithms have been employed to predict students' GPA in the next semester's courses. The results can be used to identify, at an early stage, students at risk of dropping out, or to help students choose elective courses that suit them. Machine learning methods are the most widely used; however, their accuracy can vary from dataset to dataset. More importantly, the performance of a prediction model can be affected by the characteristics of the dataset to which it is applied. In this paper, we build a distributed platform on Spark to predict missing grades of elective courses for undergraduate students. The paper compares several methods based on the combination of collaborative filtering and matrix factorization (namely Alternating Least Squares). We evaluate the performance of these algorithms using a dataset provided by Ho Chi Minh City University of Technology (HCMUT). The dataset consists of information about undergraduate students from 2006 to 2017. Given the characteristics of our dataset, the paper highlights that Alternating Least Squares with a non-negative constraint achieves better results than the other methods compared. © 2019 IEEE.
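To make the idea of predicting missing grades with non-negative matrix factorization concrete, the following is a minimal single-machine sketch in plain Python. It is not the paper's Spark implementation: it alternates between student and course factors and solves each non-negative least-squares subproblem by coordinate descent with clipping at zero, which is a simplification chosen for brevity. The function names (`nonneg_als`, `predict`, `loss`), the hyperparameters, and the toy grades on an assumed 0 to 10 scale are all hypothetical. In Spark MLlib, the comparable setting would be the `nonnegative` parameter of the `ALS` estimator.

```python
import random


def nonneg_als(grades, n_students, n_courses, rank=2, reg=0.1, iters=20, seed=0):
    """Factorize observed grades into non-negative student and course factors.

    grades: dict mapping (student_index, course_index) -> observed grade.
    Returns (U, V) with one length-`rank` row per student and per course.
    """
    rng = random.Random(seed)
    U = [[rng.random() for _ in range(rank)] for _ in range(n_students)]
    V = [[rng.random() for _ in range(rank)] for _ in range(n_courses)]

    # Index the observed entries by student and by course.
    by_student, by_course = {}, {}
    for (s, c), g in grades.items():
        by_student.setdefault(s, []).append((c, g))
        by_course.setdefault(c, []).append((s, g))

    def update(rows, others, observed):
        # Hold `others` fixed and update each coordinate of each row in turn.
        for i, row in enumerate(rows):
            obs = observed.get(i, [])
            for f in range(rank):
                num, den = 0.0, reg
                for j, g in obs:
                    o = others[j]
                    # Prediction without the contribution of factor f.
                    partial = sum(row[k] * o[k] for k in range(rank) if k != f)
                    num += (g - partial) * o[f]
                    den += o[f] * o[f]
                # Closed-form coordinate minimizer, clipped to stay non-negative.
                row[f] = max(0.0, num / den)

    for _ in range(iters):
        update(U, V, by_student)  # students given courses
        update(V, U, by_course)   # courses given students
    return U, V


def predict(U, V, s, c):
    """Predicted grade of student s in course c."""
    return sum(u * v for u, v in zip(U[s], V[c]))


def loss(grades, U, V, reg):
    """Regularized squared error over the observed grades."""
    err = sum((g - predict(U, V, s, c)) ** 2 for (s, c), g in grades.items())
    return err + reg * sum(x * x for row in U + V for x in row)


# Hypothetical toy data: 4 students, 3 courses, some grades missing.
grades = {
    (0, 0): 8.0, (0, 1): 7.5, (0, 2): 9.0,
    (1, 0): 5.0, (1, 1): 4.5,
    (2, 1): 6.0, (2, 2): 7.0,
    (3, 0): 6.5, (3, 1): 6.0,
}
U, V = nonneg_als(grades, n_students=4, n_courses=3)
print("predicted grade for student 3 in course 2:", predict(U, V, 3, 2))
```

Because every factor is kept at or above zero, each predicted grade is a sum of non-negative terms and can never be negative, which is one plausible reason a non-negative constraint suits grade data.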
