Scientific Journal

Applied Aspects of Information Technology

Since 2015, the Ministry of Education and Training of Vietnam has abolished university entrance exams and instead advocates using candidates' high school graduation exam results for university admission. In 2015 and 2016 the Mathematics exam used essay questions. Since 2017, the Ministry has applied a multiple-choice format to Mathematics in the high school graduation exam. Opinions are divided about the impact of this form of examination and admission on the quality of university students. In particular, the switch from essay to multiple-choice exams led the Vietnam Mathematical Association as a whole to send recommendations at that time calling for the essay format to be retained for mathematics. The purpose of this article is to analyze and evaluate the effects of relevant factors on university students' academic performance in advanced mathematics, and to offer solutions for optimizing the university entrance exam. The data set was provided by the Training Management Department and the Training Quality Control and Testing Laboratory of the University of Finance – Marketing. It includes high school graduation math test scores, learning process scores (scores assessed directly by instructors), and end-of-course test scores in advanced mathematics for 2834 students in courses from 2015 to 2019. Linear and non-linear regression machine learning models were used to solve the tasks set in this article. The data were analyzed to reveal the advantages and disadvantages of the change in university enrollment made by the Vietnamese Ministry of Education and Training. Tools from Python libraries were used effectively in solving these problems. Based on building and examining the models, suggestions and solutions are offered for problems in enrollment and in assuring the quality of incoming students.
Specifically, when entrance exams are prepared, multiple-choice questions should make up no more than 61-66 % of the exam questions.
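The kind of linear regression described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the data are synthetic (the real data set of 2834 students is not reproduced), the 0-10 score scale, the variable names, the coefficients used to generate the data, and the choice of NumPy least squares are all hypothetical, and the non-linear models mentioned in the abstract are not shown.

```python
import numpy as np

# Synthetic stand-in data; NOT the article's data set.
rng = np.random.default_rng(0)
n_students = 200
exam_score = rng.uniform(4.0, 10.0, n_students)     # hypothetical graduation math score
process_score = rng.uniform(4.0, 10.0, n_students)  # hypothetical instructor-assessed score
noise = rng.normal(0.0, 0.5, n_students)
# Hypothetical relationship used only to generate example targets.
final_score = 1.0 + 0.3 * exam_score + 0.5 * process_score + noise


def fit_ols(features, target):
    """Return [intercept, coef_1, ...] minimising the sum of squared errors."""
    design = np.column_stack([np.ones(len(target)), features])
    coefs, *_ = np.linalg.lstsq(design, target, rcond=None)
    return coefs


def predict(coefs, features):
    """Apply fitted coefficients to a feature matrix."""
    design = np.column_stack([np.ones(features.shape[0]), features])
    return design @ coefs


def r_squared(target, predicted):
    """Coefficient of determination for a model with an intercept."""
    ss_res = np.sum((target - predicted) ** 2)
    ss_tot = np.sum((target - target.mean()) ** 2)
    return 1.0 - ss_res / ss_tot


features = np.column_stack([exam_score, process_score])
coefs = fit_ols(features, final_score)
predicted = predict(coefs, features)
mse = float(np.mean((final_score - predicted) ** 2))
r2 = float(r_squared(final_score, predicted))

print("intercept and coefficients:", coefs)
print("MSE:", mse)
print("R^2:", r2)
```

In a sketch like this, the fitted coefficients indicate how strongly each input score is associated with the final course score, while the mean squared error and the coefficient of determination summarize how well the model fits.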
[ © Odessa National Polytechnic University, 2018.]