Artificial Neural Network (ANN) Classification: Titanic Passenger Safety

. Scientific and technological innovation has always been the main driver of economic growth and social progress. The rapid development of technology and advances in the internet have made it possible to disseminate information and interact more easily. With the rapid development of technology, a lot of information is shared every second, resulting in big data in terms of different, complex variables. ANN is the result of work in the computer field that is inspired by the capabilities of the human brain which consists of biological neural networks. In recent years, the use of artificial neural networks (ANN) has increased. The research carried out aims to analyze the survival capabilities of Titanic passengers who experienced an accident while sailing and sank. This research uses initial data of 1309 observations with 14 variables. From the research results, 2 hidden variables are the most accurate with an accuracy of 80.5%, compared to the number of hidden variables of 3 (79%) and 4 (79%). So it can be concluded that the number of hidden variables with the same number of hidden screens does not have a significant difference in accuracy.


INTRODUCTION
Scientific and technological innovation has always been a key driver of economic growth and social progress.With the advent of the digital era, continuous breakthroughs in information technologies such as artificial intelligence (AI), blockchain, and big data have brought many new opportunities for patent system improvement (Custers, 2022).Meanwhile, digital technology and related innovations have many characteristics that differ from those of the past, posing further challenges to the innovation system.How to effectively utilize these new technologies to solve or mitigate the problems of legacy systems, and avoid the associated difficulties, has become an important issue for scholars and policymakers in the digital era (Ouyang et al., 2023).In 2011, McKinsey & Company believed that big data could independently mine and analyze information with the help of computers and produce effective information (Cui et al., 2022).Big data has 5 characteristics, including: volume, speed, variety, truth and value (Geerts & O'Leary, 2022).Big data analytics (BDA) has become increasingly available in recent years.Companies can collect and analyze big data with low-cost service packages (Article et al., 2020).One Examines the impact of the use of big data analytics on corporate social performance in corporate social responsibility initiatives, Identifying the importance of a company's ability to provide data-based insights through big data analytics capabilities (Choi & Park, 2022).
ANN is the result of work in the computer field which is inspired by the capabilities of the human brain which consists of biological neural networks (Management & Homes, 2019).Through a processing unit better known as a neuron (Nikoo et al., 2023), where ANN imitates and is trained like the way the human brain works in finding the function value of the goal (Chaki & Biswas, 2023), thus getting better prediction results compared to with conventional models (Betiku et al., 2015).In recent years, the use of artificial neural networks (ANN) has increased (Esfe et al., 2022), this is because ANN modeling is considered a reliable model for non-linear data, where complex relationships between dependent and independent can be found ( Teke et al., 2023).Novelty in this study, the use of ANN to analyze Titanic accident data is a new approach in understanding the factors that influence passenger survival in emergency situations.The emphasis on hidden variables in the ANN model and the finding that a certain number of hidden variables did not make a significant difference in accuracy, provide new insights regarding the optimal configuration of the ANN model for ship accident cases.The relationship between research and the use of big data in analyzing and understanding social phenomena such as ship accidents, which increases understanding of the potential of big data and its analysis in the context of public security and safety.
The application of ANN in this study as a model that is considered able to overcome the complexity of non-linear data, shows the relevance and progress in using machine learning techniques to understand and predict events involving many variables and factors.Research purposes is analyzing the survival capabilities of Titanic passengers who experienced an accident and sank using an Artificial Neural Networks (ANN) approach.Assessing the accuracy of the ANN model in predicting passenger survival capabilities based on initial data including 14 variables.

METHOD
The research carried out aims to analyze the survival capabilities of Titanic passengers who experienced an accident while sailing and sank.This research uses initial data of 1309 observations with 14 variables.Where the safety status of passengers after the drowning tragedy is the dependent variable and the other thirteen variables are independent variables.The variables used in this research can be seen as in Table 1 below:

RESULT AND DISCUSSION
In this research, researchers will create an ANN model to classify whether Titanic passengers were safe or not according to the variables used and compare the accuracy level of the ANN model with different numbers of hidden screens.
The following are the results of the safety classification of Titanic passengers based on the ANN algorithm: 1. Data selection The initial data taken initially had 14 variables and 1309 observations.However, there were several observations whose data were missing (blank) so data selection was carried out both from the number of variables and the number of observations so that complete data was obtained (nothing was missing).The selection results obtained 5 variables with a final number of observations of 1045 observations 2. Building a model using the ANN method (single screen) The ANN method consists of multiple hidden screens and single hidden layers.In this research a single hidden screen will be used with many variables in the hidden screen of 2,3,4 variables.The purpose of the hidden screen is to connect the dependent variable with several independent variables.The process of a single hidden screen ANN with 2, 3, and 4 hidden variables can be seen in Figure 1.
Figure 1.Classification of survivors using the ANN method k=2,3,4 In Figure 1 we can see that each independent variable is connected to all the hidden screen variables 2,3,4 then these hidden variables are connected back to the dependent variable which is the final goal value of the ANN process.

Evaluate the ANN model with test data
After modeling with ANN, the number of variables in the hidden screen has been set to 2 variables, 3 variables and 4 variables.The final score obtained from the classification is as shown in table 3.

Analyze the accuracy of the ANN model
After carrying out ANN modeling, we will compare the accuracy of each method used to find the ANN model with the maximum accuracy.The accuracy values of each method can be seen in Table 4.

CONCLUSION
The ANN method is used to classify with adequate accuracy in determining data classification.This research examines a comparative analysis of the accuracy of the ANN algorithm for classifying the safety of Titanic passengers with a number of variables on a single hidden screen.From the research results, 2 hidden variables are the most accurate with an accuracy of 80.5%, compared to the number of hidden variables of 3 (79%) and 4 (79%).So it can be concluded that the number of hidden variables with the same number of hidden screens does not have a significant difference in accuracy.

Table 2 .
Data variables after selection.In Table2there are 5 variables used, including: safe, pclass, sex, age, and dry.Where the safe variable is the dependent variable while the pclass, sex, age and dry variables are independent variables.In building this model the researcher will use training data of 836 observations (80%) and test data of 209 observations (20%).

Table 3 .
Comparison of the final ANN prediction results with test data values

Table 4 .
Accuracy value of the ANN method with variables 2,3 and 4.