Next Article in Journal
Approximate Methods for Calculating Singular and Hypersingular Integrals with Rapidly Oscillating Kernels
Next Article in Special Issue
Lexicon-Enhanced Multi-Task Convolutional Neural Network for Emotion Distribution Learning
Previous Article in Journal
Some Notes for Two Generalized Trigonometric Families of Distributions
Previous Article in Special Issue
Cubical Homology-Based Machine Learning: An Application in Image Classification
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Towards Predictive Vietnamese Human Resource Migration by Machine Learning: A Case Study in Northeast Asian Countries

1
Faculty of Architecture, ThuDauMot University, ThuDauMot 820000, Vietnam
2
Department of Civil Engineering, National Kaohsiung University of Science and Technology, Kaohsiung 807618, Taiwan
3
Department of International Business, National Kaohsiung University of Science and Technology, Kaohsiung 82445, Taiwan
4
Department of Business Administration, Southern Taiwan University of Science and Technology, Tainan 71005, Taiwan
5
Shenghua Marketing Co., Ltd., Kaohsiung 814026, Taiwan
6
Department of Computer Science, Hue University of Education, Hue University, Hue 49118, Vietnam
7
Department of Industrial Engineering and Management, National Kaohsiung University of Science and Technology, Kaohsiung 807618, Taiwan
*
Authors to whom correspondence should be addressed.
Axioms 2022, 11(4), 151; https://doi.org/10.3390/axioms11040151
Submission received: 16 February 2022 / Revised: 16 March 2022 / Accepted: 18 March 2022 / Published: 24 March 2022
(This article belongs to the Special Issue Various Deep Learning Algorithms in Computational Intelligence)

Abstract

:
Labor exports are currently considered among the most important foreign economic sectors, implying that they contribute to a country’s economic development and serve as a strategic solution for employment creation. Therefore, with the support of data collected between 1992 and 2020, this paper proposes that labor exports contribute significantly to Vietnam’s socio-economic development. This study also aims to employ the Backpropagation Neural Network (BPNN), k-Nearest Neighbor (kNN), and Random Forest Regression (RFR) models to analyze labor migration forecasting in Taiwan, Korea, and Japan. The study results indicate that the BPNN model was able to achieve the highest accuracy regarding the actual labor exports. In terms of these accuracy metrics, this study will aid the Vietnamese government in establishing new legislation for Vietnamese migrant workers in order to improve the nation’s economic development.

1. Introduction

Human resources play a pivotal role in the prosperity of a country, and global development in general [1,2]. Globalization has resulted in people having more opportunities to work in many different countries [3,4]. With the rapid development of the economy in developed countries, labor resources are needed to meet requirements of the economy’s growth rate [5,6]. In addition, some developed countries face the dilemma of an aging population structure [7,8]. Therefore, there is an urgent need for foreign human resources to make up for the shortage of domestic human resources [9]. In terms of solving these difficulties, many policies have been implemented with the aim of attracting human resources from overseas. Therefore, many countries have developed labor export development plans to create jobs, thus increasing income for each individual and contributing to the growth of the economy in terms of Gross Domestic Product (GDP) [10,11].
Currently, many countries in the Asia-Pacific region participate in labor exports, with the Philippines, Indonesia, Thailand, and China being direct competitors to Vietnam in the international labor market, as they share similarities [12]. The experiences of these countries in managing, operating, and develo** labor export activities can serve as valuable lessons for Vietnam. It is estimated that around 20 million Southeast Asians work outside of their home countries, with approximately half of those being in the Middle East [13]. Additionally, inheriting the results and experiences gained through the era of international labor cooperation with socialist nations from 1980 to 1990, the Party and the State of Vietnam changed its level of awareness and promoted labor export activities [14]. On 9 November 1991, the government issued Decree No. 370/HDBT, which clearly stated: “Putting migrant workers to working abroad for a definite time is a method to generate jobs, produce more incomes for workers and boost foreign currency revenue for the country, resulting in the strengthening of economic, cultural, scientific and technical collaboration relations between Vietnam and other countries that use labor on the principle of equality, mutual benefit, respect for each other’s laws and national traditions” [15]. In 2018, 500,000 Vietnamese contract labor migrants worked abroad, according to the latest statistics [16]. Taiwan, Korea, and Japan are still three important labor export markets for Vietnam, accounting for more than 70% of the total number of laborers working abroad in 2008 and 2009, and are three markets that continue to have a high demand for Vietnamese workers [17]. Additionally, several studies on labor exports in East and Southeast Asia have reported workers migrating from Vietnam to Singapore, South Korea, Taiwan, and Japan [18,19].
When it comes to Backpropagation Neural Network (BPNN), k-Nearest Neighbor (kNN), and Random Forest Regression (RFR) models, many studies have been conducted by applying these models for prediction [20,21,22,23,24,25,26]. Chen, Lai [27] used BPNN and ANN algorithms to forecast that the majority of international visitors to Taiwan would be from the main markets of Hong Kong, Japan, and Macau from January 1971 to August 2009. Likewise, Vihikan, Putra [28] employed BPNN to predict in the arrival of international tourists in Bali to assist the government in constructing the nation’s tourism strategies. Wang, Wu [29] showed that the monthly inward tourism flow to China between 2000 and 2013 could be forecasted by deploying an Enhanced Backpropagation Neural Network. In addition, Lin, Malyscheff [30] used ANN models to predict engineering students’ future retention, consistently achieving an overall prediction accuracy of around 70% or higher. Moreover, the research used RFR and Support Vector Regression (SVR) models to estimate global foreign tourist arrivals, achieving prediction accuracies of 99.4 percent using SVR and 84.7 percent using RFR [31]. Additionally, the research used the kNN technique to investigate the unique nature of campground management, testing several forecasting approaches in order to determine which was best suited to the unique behavior of cam** tourists and the unique nature of campsites [32].
This paper presents a forecast of Vietnamese labor migration using the kNN, RFR, and BPNN models. The data collected in this study include 29 observations from 1992 to 2020. With the support of the collected data, the paper analyzes the role of labor exports in Vietnam’s socio-economic development and compares this export labor with that of other Asian countries. After that, the three algorithms are compared on the basis of the results of their statistical accuracy indicators. More importantly, this research highlights the possible future contexts of labor migration between Vietnam and East Asia, including Korea, Republic of China (Taiwan), and Japan. Some assume that the migration corridor extending between Vietnam and other East Asian nations has progressed to the point where labor migration is only one element in a more varied movement pattern. Nevertheless, there will continue to be significant demand for Vietnamese workers to work in Japan, Korea, and Taiwan for the foreseeable future. More importantly, this research could assist the government of Vietnam in enacting new regulations for Vietnamese migrant workers in order to boost the socio-economic situation. Consequently, the Vietnamese government could determine a different approach for our country’s labor export activities in the context of integration.
The following is a description of the paper’s structure. The first section of the paper is an introduction. The BPNN, kNN, and RFR models’ approach is introduced in Section 2. The study results and discussions are described in Section 3 and Section 4. Finally, the conclusions are presented in Section 5.

2. Materials and Methods

The process of the following experimental stages in this study is summarized in Figure 1. Firstly, the database is preprocessed and tested by statistical methods, and it is also divided into training and testing sets. Secondly, the BPNN, RFR, and kNN models are used to learn the training samples and obtain the optimal network parameters. Finally, the three models’ performances are compared using metrics from the accuracy measurement indicators as Mean Square Error (MSE), Root-Mean-Square Error (RMSE), Mean Absolute Error (MAE), Correlation Coefficient (R), and Correlation of Determination (R2) in the possible result stage. At the same time, seeking the most suitable prediction model for the study if the RMSE, MAE, and MSE gain the lowest values, and R and R2 approach the highest values; on the other hand, the experiments are adjusted by component parameters of models or training data size to look for the optimal accuracy measurement indicators.

2.1. Database

This paper uses a database containing Vietnam labor immigration to Korea, Japan, and Taiwan from 1992 to 2020 obtained by the Department of Overseas Labor (DOLAB). The Python 3.9 software is deployed for data analysis. The data in Figure 2 show that the number of Vietnamese workers migrating to Northeast Asian Countries has witnessed a fluctuation from 1992 to 2020. The total of Vietnamese laborers who immigrated to Northeast Asian Countries was 1,373,712 people for 29 years, in which Japan, Taiwan, and Korea occupied about 811,138,367,967, and 194,607 people, respectively. At the same time, more than 100,000 employees per yearly migrated to the region during the period of 2015–2019. However, the total number of Vietnamese workers in 2020 going to work in Japan, South Korea, and Taiwan is 76,355 persons, implying a decrease in the number of 55% compared with that in 2019 and a decline in total labor exports of nearly 56% compared with the data in 2018. Moreover, the highest total number of Vietnamese workers working in Taiwan, Japan, and South Korea was 68,737 employees in 2018, then 82,703 employees in 2019, and 18,141 in 2008.
The expected statistical results as the Mean, Min, and Max values, St Dev, Skew, and Kurt are also pointed out in Table 1 to quantitatively analyze the yearly labor export characteristics to interpret their significance further. The mean is the crucial popular measure of central trend, and the indicator may be used with continuous data. The Minimum and Maximum values indicate the margin of the time series. The Standard Deviation is used to measure the degree of data dispersion. Finally, Skewness and Kurtosis are deployed to estimate whether the distribution of the sample is normal or not. The statistics result in Table 1 indicate as below: The Mean and Standard Deviation of the labor exports to North-East Asian countries were 25,422 persons and 28,962 persons in Japan, 18,786 persons and 12,689 persons in Taiwan, 4132 persons and 6711 persons in Korea, respectively. The skewness for a normal distribution is zero, and any symmetric data should have a skewness near zero. Negative values for the skewness indicate skewed left data, and positive values for the skewness indicate skewed right data [33,34]. Skewness coefficients were low for the data sets. This approach is appropriate for modeling because a high skewness coefficient has a considerable negative effect on ANN performance [35]. Hence, the values of Kurt and Skew of the data fluctuating from −8.87 to 2.64 could be accepted for prediction through these models. The labor exports to the region using Augmented Dickey-Fuller (ADF) test also point out that the database is separated two parts such as the data of labor exports to Korea (p-value = 0.02, test statistic = −3.16) has a unit root, this data is stationary, and the data of labor exports to Japan (p-value = 0.62, test statistic = −1.32) and Taiwan (p-value = 0.99, Test Statistic = 2.22) do not have a unit root, these data are non-stationary. Furthermore, the time series data are stationary; they can be easily modeled with higher accuracy than their non-stationary counterparts [36].
The input data patterns of the three nations were randomly selected with two sections. The first section was used for the training phase, and it contained roughly 70% of the total data. The remaining 30% of the labor export data was contained in the second component, which was used for the test phase.

2.2. Backpropagation Neural Network (BPNN)

BNPP can learn and store various data, akin to the human brain. Therefore, a single hidden layer is represented in Figure 3. Furthermore, BPNN has three layers: an input layer, at least one hidden layer, and an output layer. Weights connect adjacent layers, always scattered among 0 and 1. Although several heuristic techniques have been deployed by many researchers [37,38], there is no systematic theory for verifying the number of input nodes and hidden layer nodes. Experiments or trial and error based on the test data’s least mean square error are the most fitted methods for determining the appropriate indicators of input and hidden nodes. The following Equations (1) and (2) indicate the relationships of McCulloch–Pitts (M–P) neurons in hidden layer and the output layer [39].
b n = f ( i = 1 d v i n x i + γ n )
y j = f ( n = 1 m w n j b n + p j )
where b n is the result of n t h hidden neuron, x i is the i t h valuable input out of d inputs, y j is the j t h output value, m is the total number of hidden neurons, f is the activation function of the rectified linear unit (ReLU), v i n and w n j are the weight terms, γ n and p j are the bias terms.
Figure 3 demonstrates the BPNN structure below, showing 29 input nodes from x1 to xd (which d is also the 29th year) for the input layer; the 29 neurons for the output layer are also the values of the labor exports. At the same time, one hidden layer contains neurons from b1 to bm (m is the 200th node). Each neuron of the hidden layer and the output layer have in charge of Weight and Bias, such as v i n , γ n and w n j ,   p j to correspond for neuron b n and neuron y j , respectively. Each hidden layer neuron takes the output from input layer neurons and converts these values with a weighted linear sum into the output layer. The output layer obtains the values from the hidden layer. Furthermore, the activation function for the hidden layer is ReLU function. Adam’s method is stochastic optimizations to the solver of weight optimization. The training method for the models is regression.

2.3. K-Nearest Neighbor (kNN)

In machine learning, kNN is one of the most fundamental supervised learning algorithms. The kNN model can be used for regression as well as classification. Moreover, because the observed data has a numeric label to forecast, kNN is employed as a predictive method with a multi-input multi-output (MIMO) strategy for forecasting [32]. kNN begins by initialing the number of k. Hence, the distance between the training and testing instances is calculated. Then, the prediction is made by calculating the test data’s average k nearest distance neighbor. Subsequently, the Euclidean Distance is deployed as witnessed in Equation (3).
d ( x , y ) = i = 1 k ( x y ) 2
where d symbolizes the distance among data points, x represents the testing data, y is the training data.

2.4. Random Forest Regression (RFR)

Random forest is a regression method for classifying or forecasting the numerical value of a variable that incorporates the outputs of various decision tree algorithms [40,41,42]. When an (x) input vector containing the values of the various evidentiary features analyzed for a certain training area is received by a random forest. RFR creates a total of K regression trees and then averages the output. The random forest regression forecaster as shown in Equation (4) when K grows the { T ( x ) } 1 K tree [43].
f ^ K r f ( x ) = 1 K k = 1 K T ( x )
To avoid different trees correlations, RFR promotes tree variety by allowing them to grow from diverse training data subsets obtained through a method known as bagging [43]. Bagging is a method for generating training data that randomly resample the original dataset with replacement data. Therefore, certain data may be used many times during training, whereas others may never be used. Hence, better stability is gained, as it becomes more resistant when faced with minor deviations in input data while also increasing forecast accuracy [40].

2.5. Performance Metrics

Estimating results are based on calculating and comparing the actual values to the forecasted values. These metrics of the accuracy measurement parameters include the Mean Square Error (MSE), Root-Mean-Square Error (RMSE), Mean Absolute Error (MAE), Correlation Coefficient (R), and Correlation of Determination (R2). Furthermore, the error metrics are defined as follows [44,45,46]:
M S E = t = 1 n ( x t x t ) 2 n
M A E = t = 1 n | x t x t | n
R M S E = t = 1 n ( x t x t ) 2 n
R 2 = 1 t = 1 n ( x t x t   ) 2 t = 1 n ( x t 1 n t = 1 n x t ) 2
R = t = 1 n ( x t x ¯ ) ( x t x ¯ ) t = 1 n ( x t x ¯ ) 2 t = 1 n ( x t x ¯ ) 2
where x t ,   x t are the observed and estimated values in the period time t, and n is the number of the observed values in the testing data. x ¯ ,   x ¯ are mean of the observed and estimated value. The R 2   and   R should be approaching 1 to indicate strong model performance, and the MSE, MAE, and RMSE should be as close to zero as possible.

3. Results Analysis

Three models were established to predict labor exports in three countries (Taiwan, Japan, and Korea). For comparing the models based on the estimating performance of labor exports, simulating experiments have pointed out the optimum models with the main indicators in Table 2.
Line graphs in Figure 4 illustrate the RFR (red lines), kNN (grey lines), and BPNN (green lines) algorithms using testing data that were compared with the actual line graphs (blue lines). As far as the predicted models of labor immigration in the three nations are concerned, the BPNN lines are the nearest ones compared with the real data in Taiwan, Korea, and Japan. As a result, the grey lines show that kNN algorithms achieve the second-most accurate level in Taiwan and Japan, whereas kNN only achieves the lowest level in Korea in a selection of random years from 1994 to 2016. Therefore, the accuracy parameters for the labor immigration forecast are calculated (as shown in Table 3) to assess the three forecasting models’ accuracy levels. The outperform of the simulation in Table 3 indicates the estimation accuracy using kNN, RFR, and BPNN algorithms has a significant reliability.
Additionally, when deploying kNN, RFR, and BPNN models in three nations with the accuracy parameters (MAPE, MAE, RMSE, R-squared, and NSE), the BPNN algorithm has the greatest accuracy indicators in Taiwan, Japan, and Korea, whereas the RFR method has the lowest. Furthermore, BPNN is the best machine learning algorithm in Taiwan, Japan, and Korea with B P N N T a i w a n   B P N N J a p a n ,   and   B P N N   K o r e a achieved higher estimation accuracy (with M A P E J a p a n = 0.006, M A P E T a i w a n = 0.006, and M A P E   K o r e a = 0.07) than RFR (with M A P E J a p a n = 0.145, M A P E T a i w a n = 0.137, and M A P E K o r e a = 0.051) and kNN (with M A P E J a p a n = 0.073, M A P E T a i w a n   = 0.066, and M A P E   K o r e a = 0.051). Similarly, based on MAE, RMSE, and NSE indicators, the BPNN algorithm in Taiwan, Japan, and Korea acquired greater accuracy parameters. In terms of forecast models for Taiwan, Japan, and Korea, the parameters of BPNN achieve a similar level of accuracy; however, B P N N   K o r e a earns the highest one compared with others (with M A E K o r e a = 57, and R M S E K o r e a = 67 ) ,   R F R   K o r e a gained the second-highest one (with M A E K o r e a = 488, and R M S E K o r e a = 1002 ) , and k N N   K o r e a is the lowest one (with M A E K o r e a = 638, and R M S E K o r e a = 1570 ). In addition, the scatter plots and three models in Figure 5 calculate the simulation results. As indicated, the dispersion of points around the diagonal of the BPNN model earns the highest accuracy parameters in three countries; the kNN machine learning algorithm is the second-lowest, and the RFR algorithm achieves the lowest accurate level.
The Taylor diagrams examine the performance of estimated and real values using the standard deviation and correlation used in evaluating the models. Moreover, the Taylor chart shows the standard deviation and correlation between the actual and anticipated datasets for the models and general consistency between observed and estimated values when the correlation value approaches 1, as shown in Figure 6. This study can be considered for the kNN, RFR, and BPNN algorithms in Japan, which have SD (testing phase) = 20,649, 20,463, and 19,847, respectively, and are near to the actual data (with SD = 28,962), resulting in the three models achieving the same accurate level in predicting; meanwhile, the BPNN model’s standard deviations in Japan, and Korea implied that the predicted values of BPNN model are nearest with the real values (with S D J a p a n = 11,089, and S D K o r e a = 3637). Hence, the smaller the standard deviation value, the stronger the relationship. The Taylor plot indicates that kNN, RFR, and BPNN algorithms are the most accurate and optimal forecasting.

4. Discussions

Regarding labor exports of Asian countries, Indonesia was the second-largest sending country of labor migrants in Asia after the Philippines, with a long history of immigration and emigration. The Philippines, for example, sent about one million employees abroad in 2005, Indonesia about 400,000, and Bangladesh and Sri Lanka each more than 200,000; Meanwhile, Vietnam only deployed 70,000 to 80,000 people in the same year. According to the most recent published estimates, there are around 500,000 Vietnamese contract workers globally [47]. According to the International Organization for Migration, Indonesian labor migrants significantly raised from 517,619 to 696,746 between 1996 and 2007, with the top five destinations being Saudi Arabia, Malaysia, Taiwan, Singapore, and Hong Kong [48]. Furthermore, although Japan’s openness approves highly qualified migrants through open policies, highly skilled migrants in the targeted nations are still relatively small. In 2010, 198,000 highly skilled migrants in Japan accounted for barely 9% of the 2.1 million migrants [49]. This export labor also compared with other Asian countries pointed out that with amount more than 100,000 employees per yearly were migrated to Japan, Taiwan, and Korea, Vietnam has third ranking of Asia during the period of 2015–2019.
Furthermore, foreign worker exports help improve employees’ lives, but they also substantially impact the country’s economy. The number of remittances from overseas to Vietnam remitted to the country is expected to be between three and four billion dollars per year. Hence, this is encouraging news for the Vietnamese economy [50].
Human resource immigration has played a crucial role in Vietnam’s socio-economic development. Firstly, it can improve income and transform workers’ perceptions. Many workers, for instance, have returned to Vietnam to establish small and medium-sized businesses, contributing to the eradication of large-scale enterprises and using practical experiences gained from many areas of the world in Vietnam. Secondly, labor migration can help to alleviate poverty and advance Vietnam’s socio-economic growth. Then, labor migration provides a substantial source of foreign currency while lowering investment costs to alleviate the issue of domestic employment. Finally, labor exports are also a mechanism for transferring an innovative technology from other nations, assisting in training a quality workforce, and strengthening international cooperation links between Vietnam and other countries in the Northeast Asian Countries. Therefore, forecasting labor exports is significantly important for Vietnam’s labor exporting. The forecast results will assist labor export policymakers in assessing and considering foreign countries’ potential and labor demand. Simultaneously, governments can develop vocational training programs and acquire human resources that fit those countries’ needs.
This paper used kNN, BPNN, and RFR models to analyze and estimate Vietnamese human resources working for the Northeast Asian Countries. The different influential factors and parameters have been described in the simulations. The following key findings are as three models of BPNN, RFR, and kNN indicating forecast results were fairy accuracy for three countries Taiwan, Korea, and Japan. Regarding the error index, the BPNN model predicting Vietnamese labor immigration to Northeast Asian Countries obtained the best R2, MAPE, RMSE, and MAE value. At the same time, the RFR and kNN models showed better forecasting performances. The points indicate that BPNN was the highlight compared with two RFR and kNN algorithms. In addition, the forecasting errors in the case of the models increased if the testing data increased.
The model was demonstrated excellently by a training accuracy of 89.98% and validation accuracy of 84.05%. In terms of estimating tourism demand in Japan, Chen, Lai [27] used a novel forecasting model based on empirical mode decomposition (EMD) and a BPNN, which revealed that the MAPE, RMSE of the proposed EMD-BPNN algorithm are, respectively, 0.958%, 1443, implying that these output values are higher than this study parameter. Moreover, Mishra et al. (2021) [31] study results witnessed that for predicting international tourists with tiny datasets, the random forest regression works well (with an R-square of 0.847), pointing out higher values than this study outputs in terms of R-square indicators.
In addition, the method using three models was the most classical one; however, the models were deployed for the time series data give performance comparable to deep learning models (as Long short-term memory (LSTM) models), but faster training speed and less resource-intensive (as the computer’s memory). For instance, Wang et al. (2020) [29] deployed BPNN and LSTM models to forecast the monthly inbound tourism to mainland China (2000–2013), the monthly tourist arrival Turkey from different countries (2000–2011), and the monthly inbound tourism to The United States of America (June 2006–May 2018). The research result showed that R of BPNN model with 0.999 to approximate the R of LMST model with 0.998.
Although this study used annual data to anticipate labor force migration, the results showed that three models in three specific countries produced reliable results. As a result, it can predict labor migration in other countries around the globe.

5. Conclusions

Labor exports have been among of the most important tasks in Vietnam’s socio-economic development over the past three decades. Labor exports will help increase income and improve skills for workers. In addition, labor exports also contribute to reducing unemployment in the country. Hence, the study aimed to examine the adjustment in the Vietnamese labor migration to Northeast Asian Countries. Based on annual data from 1992 to 2020, this study focused on the role of labor exports on Vietnam’s socio-economic factors and compared them with the movement of human resources to other Asian countries. Afterward, the study implemented three machine learning models for estimation. At the same time, the result indicated that three models earned high accurate results for predicting labor migration, in which the BPNN model showed a more accurate level than other algorithms. Hence, the BPNN model can be effectively applied for labor migration prediction. In the context of integration, forecasting human resource export variation may find a separate direction for Northeast Asian Countries’ human resource activities. Although the MAPE, MAE, and RMSE parameters of three models also attained the high level of accuracy which were compared with the other studies’ parameters values, the BPNN model earned the highest accuracy of the actual labor exports with M A P E J a p a n , _ B P N N = 0.06, M A E J a p a n , _ B P N N = 191, R M S E J a p a n _   B P N N = 225, M A P E K o r e a , _ B P N N = 0.006, M A E K o r e a _   B P N N = 71, R M S E K o r e a , _ B P N N = 102, M A P E T a i w a n _   B P N N = 0.007, M A E T a i w a n _ B P N N = 57, R M S E T a i w a n _ B P N N = 67, respectively. In addition, this study proved that machine learning models play a key role in the decision-making progress for conducting an effect of labor exports. Using Augmented Dickey-Fuller (ADF) test indicates that stationary data will attain more forecast results’ accurate parameters compared with non-stationary data. The study results support stakeholders in establishing projects related to constructing and training high-quality human resources migration, increasing the laborers’ surplus value. Moreover, human resource migration problems help governments enforce suitable policies to appeal to labor from overseas. Another possible future work is to deeply analyze the GDP contribution of human resource immigration for Vietnam’s economy.

Author Contributions

Conceptualization, N.H.G.; Data curation, T.-T.N.; Formal analysis, L.A.P.; Funding acquisition, N.H.G.; Investigation, T.-T.N.; Methodology, L.A.P.; Project administration, C.C.T.; Software, N.H.G.; Validation, C.C.T.; Writing—original draft, N.H.G., T.-T.N. and C.C.T.; Writing—review and editing, C.C.T. and T.-T.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors appreciate the support from the National Kaohsiung University of Science and Technology, Taiwan; Southern Taiwan University of Science and Technology, Taiwan; Shenghua Marketing Co., Ltd., Taiwan; ThuDauMot University, Vietnam; and Hue University, Vietnam.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Crouch, G.I.; Ritchie, J.B. Tourism, Competitiveness, and Societal Prosperity. J. Bus. Res. 1999, 44, 137–152. [Google Scholar]
  2. Rockström, J.; Williams, J.; Daily, G.; Noble, A.; Matthews, N.; Gordon, L.; Wetterstrand, H.; Declerck, F.; Shah, M.; Steduto, P.; et al. Sustainable Intensification of Agriculture for Human Prosperity and Global Sustainability. Ambio 2017, 46, 4–17. [Google Scholar] [PubMed] [Green Version]
  3. Iversen, T.; Cusack, T.R. The Causes of Welfare State Expansion: Deindustrialization or Globalization? World Politics 2000, 52, 313–349. [Google Scholar]
  4. Friedman, B.A. Globalization Implications for Human Resource Management Roles. Empl. Responsib. Rights J. 2007, 19, 157–171. [Google Scholar]
  5. Awan, A.G. Relationship between Environment and Sustainable Economic Development: A Theoretical Approach to Environmental Problems. Int. J. Asian Soc. Sci. 2013, 3, 741–761. [Google Scholar]
  6. Litvinenko, V.S. Digital Economy as a Factor in the Technological Development of the Mineral Sector. Nat. Resour. Res. 2020, 29, 1521–1541. [Google Scholar]
  7. Ince Yenilmez, M. Economic and Social Consequences of Population Aging the Dilemmas and Opportunities in the Twenty-First Century. Appl. Res. Qual. Life 2015, 10, 735–752. [Google Scholar]
  8. Yu, M. Study on the Development Dilemma and Breakthrough of the Traditional Sports of Ethnic Minority in the Northwest of Guangxi under the Background of Urbanization. In Proceedings of the 2017 International Conference on Innovations in Economic Management and Social Science (IEMSS 2017), Hangzhou, China, 15–16 April 2017; Atlantis Press: Paris, France, 2017. [Google Scholar]
  9. Hughes, J.C.; Rog, E. Talent Management: A Strategy for Improving Employee Recruitment, Retention and Engagement within Hospitality Organizations. Int. J. Contemp. Hosp. Manag. 2008, 20, 743–757. [Google Scholar]
  10. Goldsmith, P.D.; Gunjal, K.; Ndarishikanye, B. Rural–Urban Migration and Agricultural Productivity: The Case of Senegal. Agric. Econ. 2004, 31, 33–45. [Google Scholar]
  11. Kapstein, E.B. Workers and the World Economy. Foreign Aff. 1996, 75, 16. [Google Scholar]
  12. Chaponnière, J.-R.; Cling, J.-P.; Zhou, B. Vietnam following in China’s footsteps: The third wave of emerging Asian economies. South. Engines Glob. Growth 2010, 114–140. [Google Scholar]
  13. Hugo, G. International labour migration and migration policies in Southeast Asia. Asian J. Soc. Sci. 2012, 40, 392–418. [Google Scholar]
  14. Tuấn, T.M. Labor export management in some countries and practice in Vietnam. J. Econ. Dev. 2019, 52–60. [Google Scholar]
  15. Nguyen, T. Vietnam and Its Diaspora: An Evolving Relationship. In Emigration and Diaspora Policies in the Age of Mobility; Springer: Berlin/Heidelberg, Germany, 2017; pp. 239–255. [Google Scholar]
  16. Hoang, L.A. Debt and (un) freedoms: The case of transnational labour migration from Vietnam. Geoforum 2020, 116, 33–41. [Google Scholar]
  17. Bélanger, D.; Ueno, K.; Hong, K.T.; Ochiai, E. From foreign trainees to unauthorized workers: Vietnamese migrant workers in Japan. Asian Pac. Migr. J. 2011, 20, 31–53. [Google Scholar]
  18. Ishizuka, F. International labor migration in Vietnam and the impact of receiving countries’ policies. In Institute of Develo** Economies; Japan External Trade Organization: Chiba, Japan, 2013. [Google Scholar]
  19. Ho, T.T.N. Gender and Transnational Migration: Vietnamese Women Workers and Negotiation of Self-Identity in Multicultural Taiwan; Young Scholar Workshop; European Research Center on Contemporary: Tuebingen, Germany, 2012. [Google Scholar]
  20. Chen, T.; Yin, X.; Peng, L.; Rong, J.; Yang, J.; Cong, G. Monitoring and Recognizing Enterprise Public Opinion from High-Risk Users Based on User Portrait and Random Forest Algorithm. Axioms 2021, 10, 106. [Google Scholar]
  21. Hu, X.; Wang, J.; Wang, L.; Yu, K. K-Nearest Neighbor Estimation of Functional Nonparametric Regression Model under Na Samples. Axioms 2022, 11, 102. [Google Scholar]
  22. Gaxiola, F.; Melin, P.; Valdez, F.; Castro, J.R.; Manzo-Martínez, A. Pso with Dynamic Adaptation of Parameters for Optimization in Neural Networks with Interval Type-2 Fuzzy Numbers Weights. Axioms 2019, 8, 14. [Google Scholar]
  23. Shi, C.; Zhuang, X. A Study Concerning Soft Computing Approaches for Stock Price Forecasting. Axioms 2019, 8, 116. [Google Scholar]
  24. Juraev, D.A.; Noeiaghdam, S. Modern Problems of Mathematical Physics and Their Applications. Axioms 2022, 11, 45. [Google Scholar]
  25. Too, J.; Abdullah, A.R.; Saad, N.M. Hybrid Binary Particle Swarm Optimization Differential Evolution-Based Feature Selection for Emg Signals Classification. Axioms 2019, 8, 79. [Google Scholar]
  26. Bretti, G.J.A. Differential Models, Numerical Simulations and Applications. Axioms 2021, 10, 260. [Google Scholar]
  27. Chen, C.-F.; Lai, M.-C.; Yeh, C.-C. Forecasting tourism demand based on empirical mode decomposition and neural network. Knowl.-Based Syst. 2012, 26, 281–287. [Google Scholar]
  28. Vihikan, W.O.; Putra, I.K.G.D.; Dharmaadi, I.P.A. Foreign Tourist Arrivals Forecasting Using Recurrent Neural Network Backpropagation through Time. TELKOMNIKA (Telecommun. Comput. Electron. Control) 2017, 15, 1257–1264. [Google Scholar]
  29. Wang, L.; Wu, B.; Zhu, Q.; Zeng, Y.-R. Forecasting Monthly Tourism Demand Using Enhanced Backpropagation Neural Network. Neural Process. Lett. 2020, 52, 2607–2636. [Google Scholar]
  30. Imbrie, P.; Lin, J.J.-J.; Malyscheff, A. Artificial Intelligence Methods to Forecast Engineering Students’ Retention Based on Cognitive and Non Cognitive Factors. In Proceedings of the 2008 Annual Conference & Exposition, Pittsburgh, PA, USA, 22–25 June 2008. [Google Scholar]
  31. Mishra, R.K.; Urolagin, S.; Jothi, J.; Nawaz, N.; Haywantee, R. Machine Learning based Forecasting Systems for Worldwide International Tourists Arrival. Int. J. Adv. Comput. Sci. Appl. 2021, 12, 55–64. [Google Scholar]
  32. Rice, W.L.; Park, S.Y.; Pan, B.; Newman, P. Forecasting campground demand in US national parks. Ann. Tour. Res. 2019, 75, 424–438. [Google Scholar]
  33. Sahu, S.K.; Dey, D.K.; Branco, M.D. A new class of multivariate skew distributions with applications to bayesian regression models. Can. J. Stat. 2003, 31, 129–150. [Google Scholar]
  34. Brys, G.; Hubert, M.; Struyf, A. A Robust Measure of Skewness. J. Comput. Graph. Stat. 2004, 13, 996–1017. [Google Scholar]
  35. Olyaie, E.; Abyaneh, H.Z.; Mehr, A.D. A comparative analysis among computational intelligence techniques for dissolved oxygen prediction in Delaware River. Geosci. Front. 2017, 8, 517–527. [Google Scholar]
  36. Aiello, M.; Yang, Y.; Zou, Y.; Zhang, L.J. Artificial Intelligence and Mobile Services-Aims 2018; Springer: Berlin/Heidelberg, Germany, 2018. [Google Scholar]
  37. Nawi, N.M.; Khan, A.; Rehman, M.Z. A new back-propagation neural network optimized with cuckoo search algorithm. In Proceedings of the International Conference on Computational Science and Its Applications, Ho Chi Minh City, Vietnam, 24–27 June 2013; Springer: Berlin/Heidelberg, Germany, 2013; pp. 413–426. [Google Scholar]
  38. Wang, L.; Zeng, Y.; Chen, T. Back propagation neural network with adaptive differential evolution algorithm for time series forecasting. Expert Syst. Appl. 2015, 42, 855–863. [Google Scholar]
  39. Zhao, H.; Liu, W.; Guan, H.; Fu, C. Analysis of Diaphragm Wall Deflection Induced by Excavation Based on Machine Learning. Math. Probl. Eng. 2021, 2021, 6664409. [Google Scholar]
  40. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar]
  41. Guo, L.; Chehata, N.; Mallet, C.; Boukir, S. Relevance of airborne lidar and multispectral image data for urban scene classification using Random Forests. ISPRS J. Photogramm. Remote Sens. 2011, 66, 56–66. [Google Scholar]
  42. Rodriguez-Galiano, V.F.; Ghimire, B.; Rogan, J.; Chica-Olmo, M.; Rigol-Sanchez, J.P. An assessment of the effectiveness of a random forest classifier for land-cover classification. ISPRS J. Photogramm. Remote Sens. 2012, 67, 93–104. [Google Scholar]
  43. Rodriguez-Galiano, V.; Sanchez-Castillo, M.; Chica-Olmo, M.; Chica-Rivas, M.J.O.G.R. Machine learning predictive models for mineral prospectivity: An evaluation of neural networks, random forest, regression trees and support vector machines. Ore Geol. Rev. 2015, 71, 804–818. [Google Scholar]
  44. Kardani, N.; Zhou, A.; Nazem, M.; Shen, S.L. Estimation of bearing capacity of piles in cohesionless soil using optimised machine learning approaches. Geotech. Geol. Eng. 2020, 38, 2271–2291. [Google Scholar]
  45. Touzani, S.; Granderson, J.; Fernandes, S. Gradient boosting machine for modeling the energy consumption of commercial buildings. Energy Build. 2018, 158, 1533–1543. [Google Scholar]
  46. Yang, J.-H.; Yang, M.-S. Engineering. A control chart pattern recognition system using a statistical correlation coefficient method. Comput. Ind. Eng. 2005, 48, 205–221. [Google Scholar]
  47. Bélanger, D. Labor migration and trafficking among Vietnamese migrants in Asia. Ann. Am. Acad. Political Soc. Sci. 2014, 653, 87–106. [Google Scholar]
  48. Deng, J.B.; Wahyuni, H.I.; Yulianto, V.I. Labor migration from Southeast Asia to Taiwan: Issues, public responses and future development. Asian Educ. Dev. Stud. 2020, 10, 69–81. [Google Scholar]
  49. Oishi, N. The limits of immigration policies: The challenges of highly skilled migration in Japan. Am. Behav. Sci. 2012, 56, 1080–1100. [Google Scholar]
  50. Nguyen, P.H. Remittances and competitiveness: A case study of Vietnam. J. Econ. Bus. Manag. 2017, 5, 79–83. [Google Scholar]
Figure 1. The process of the experimental stages in this study.
Figure 1. The process of the experimental stages in this study.
Axioms 11 00151 g001
Figure 2. The heatmap of annual labor exports to Japan, Taiwan, and Korea.
Figure 2. The heatmap of annual labor exports to Japan, Taiwan, and Korea.
Axioms 11 00151 g002
Figure 3. Backpropagation neural network structure.
Figure 3. Backpropagation neural network structure.
Axioms 11 00151 g003
Figure 4. Vietnam labor immigration prediction for (a) Japan, (b) Korea, and (c) Taiwan. (a) labor immigration prediction for Japan; (b) labor immigration prediction for Korea; (c) labor immigration prediction for Taiwan.
Figure 4. Vietnam labor immigration prediction for (a) Japan, (b) Korea, and (c) Taiwan. (a) labor immigration prediction for Japan; (b) labor immigration prediction for Korea; (c) labor immigration prediction for Taiwan.
Axioms 11 00151 g004
Figure 5. The best performance parameters for prediction: (a) labor exports to Japan, (b) labor exports to Korea, (c) labor exports to Taiwan (unit: people).
Figure 5. The best performance parameters for prediction: (a) labor exports to Japan, (b) labor exports to Korea, (c) labor exports to Taiwan (unit: people).
Axioms 11 00151 g005
Figure 6. Taylor diagram representing the best performance of BPNN, RFR, kNN models of (a) labor exports to Japan, (b) labor exports to Taiwan, (c) labor exports to Korea.
Figure 6. Taylor diagram representing the best performance of BPNN, RFR, kNN models of (a) labor exports to Japan, (b) labor exports to Taiwan, (c) labor exports to Korea.
Axioms 11 00151 g006
Table 1. Descriptive statistics and formal hypothesis test of labor exports to Northeast Asian Countries from 1992 to 2020.
Table 1. Descriptive statistics and formal hypothesis test of labor exports to Northeast Asian Countries from 1992 to 2020.
ItemJapanTaiwanKorea
Mean25,42218,7864132
SD28,96212,6896711
Min515216
Max82,70368,73718,141
Kurt−0.872.641.05
Skew0.531.890.91
Results of Augmented Dickey-Fuller Test:
Test Statistic−1.322.22−3.16
p-value0.620.990.02
Critical Value (1%)−3.70−3.83−3.69
Critical Value (5%)−2.98−3.03−2.97
Critical Value (10%)−2.63−2.65−2.62
Table 2. The optimum component for three models.
Table 2. The optimum component for three models.
kNNRFRBPNN
ItemConfigurationItemConfigurationItemConfiguration
leaf_size40n_estimators30Number of inputs29
metriceuclideanmax_depthNoneNumber of hidden layers1
n_jobs2max_features1Hidden layer sizes200
n_neighbors3min_samples_leaf1Number of outputs29
p2min_samples_split2activationReLU
bootstrapFalseoptimizeadam
Table 3. Accuracy parameters for labor immigration prediction.
Table 3. Accuracy parameters for labor immigration prediction.
ParametersJapanTaiwanKorea
kNNRFRBPNNkNNRFRBPNNkNNRFRBPNN
MAPE0.0730.1450.0060.0660.1370.0060.0510.0510.007
MAE16572976191102324927163848857
RMSE19193858225189049171021570100267
R-squared0.9910.9820.9990.9930.8030.9990.8140.9370.999
NSE0.9910.9640.9990.9780.8530.9990.8870.9540.999
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Giang, N.H.; Nguyen, T.-T.; Tay, C.C.; Phuong, L.A.; Dang, T.-T. Towards Predictive Vietnamese Human Resource Migration by Machine Learning: A Case Study in Northeast Asian Countries. Axioms 2022, 11, 151. https://doi.org/10.3390/axioms11040151

AMA Style

Giang NH, Nguyen T-T, Tay CC, Phuong LA, Dang T-T. Towards Predictive Vietnamese Human Resource Migration by Machine Learning: A Case Study in Northeast Asian Countries. Axioms. 2022; 11(4):151. https://doi.org/10.3390/axioms11040151

Chicago/Turabian Style

Giang, Nguyen Hong, Tien-Thinh Nguyen, Chac Cau Tay, Le Anh Phuong, and Thanh-Tuan Dang. 2022. "Towards Predictive Vietnamese Human Resource Migration by Machine Learning: A Case Study in Northeast Asian Countries" Axioms 11, no. 4: 151. https://doi.org/10.3390/axioms11040151

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop