Social Opinion Network Analytics in Community Based Customer Churn Prediction

.


Introduction
In today's business world, the rapid growth in the use of social media has made SNA an improved parameter with enhanced predictive power for customer prediction and retention models. While churn prediction is the ability of a customer to stops contract with one company in order to become a customer of another, Social Network (SN) is a set of people or groups of people with interactions between them and/or some pattern of contacts [1]. Consequently, SNA is a set of research procedures for identifying group of people who shared common structures in systems based on the existing relationship among actors. SNA consists of several related works such as alpha users within the communities, community detection, hub member and non-hub members' identification [2] among others.
In churn prediction, whenever a customer stops the contract with one company to become a customer of a competitor, the customer is considered as lost or churn. Significantly, all churn acts do not happen suddenly, oftentimes, it is a process that is nurtured consistently in a conducive environment until desired remedy is no longer satisfactory to a customer. Thus, with endless reasons for customer churn, sometimes, customers may not announce their decision to quit a service provider, however, intrinsic expression about an organization's brand and its community impact are factors, which require timely analysis in order to predict the churn class of a customer [3]. Although, conventional approaches for churn prediction as largely focused on various learning models. These processes treat individual customer's independently through the feature-based modelling techniques. However, it lacked power to analyse the role of social ties between individuals while determining the propensity to churn.
In digitised social community, social tie is necessary for effective decision support to a customer and to the community members of the respective customer. While [4] developed a fuzzified support vector approach to identify the bias in text, it also reclassified opinions into different degrees of positive and negative class through fuzzy membership extension on a support vector model so as to accommodate inherent ambiguity in spoken words. In addition, several other existing churn classification models via social network analysis directly mine opinions into strict positive and negative categories, the obtained values through this approach tentatively defines the churn category of a customer with positive opinion as non-churner, with negative opinion equivalent to churner. These approaches do not consider the network structure through which other members of the community can be classified. Thus, the power of communal influence via relative affinity is neglected while targeted decision support for customer acquisition and retention becomes more difficult for organizations to manage. With this, organisations spend five-six times to acquire a new customer than retaining an existing one. Therefore, this research developed a CSIG for community-based churn classification via direct node to node analysis of relative strength and influence capacity of an opinionated user to nonopinionated member in a community. The CSIG is a network structure of customer with direct and indirect links. Here, the relationship affinity through degree and closeness centralities uses customer's network strength to identify patterns and impact of an opinionate customer on the immediate nonopinionated community members. It also measures the communication throughput and influence among community members. Thus, the probability of influencing a non-opinionate customer by an opinionated one, alongside the connection strength between the dual as members of the same SN community are obtained. The essence is to maximize the potency in community members' freedom of expression while as members remained connected in short chains of intermediate friends; over a period, the evolution and analytics of SN continues to spread rapidly. Also, as new friendships is constantly formed through random encounters, more patterns for churn decision support are obtained. Therefore, by introducing mutual friends, or further association through common interests and opinion the CSIG application becomes more robust for consumers' behavioural management. Hence, in Section 2, an overview of churn prediction strategies in CRM with SNA influence in communitybased churn prediction is discussed. The developed CSIG is presented in Section 3 while sample experiment and result evaluations are discussed in Section 4. Later, proposed future work is presented in Section 5 before the conclusion is documented in Section 6.

Related Works in SNA and Churn Prediction Strategies in CRM
For many decades, understanding why customers terminate relationships has been a focus of marketing research [5]. The strategic approach has been through customer relationship management (CRM), which targets the development of profitable, long-term relationships with customers and stakeholders [6]. Basically, CRM are operational or analytical driven. While operational CRM deals with collecting customer data through a range of touch points such as contact management system, the analytical tools are adopted to analyse customer data that are stored in databases for the purpose of profiling, segmentations and also pattern generation from customer behaviours in order to predict the churn class of a customer [7].
In more recent times, effective usage of CRM has drifted towards customer centric, targeted marketing instead of the long existing product centric, and mass marketing strategies. However, utilizing this comes with prominent issues due to the competitive nature of service-based industries; such as telecommunication, banking, and online gaming in churn prediction [8]. With the introduction of social networking as a new era of content creation where individuals can easily share experiences and information with other users [9], the development as also enabled customers and businesses to collaborate on the internet. Thus, by using social media, consumers can create content and offer valuable advice to others. When people share their reviews, information, advice, warnings, tips and/or any kind of issues that are interesting to their "connection", the information becomes a helpful source, which may influence consumer's decision-making to a significant degree [10]. This is achieved through the impact of social network analysis.in churn prediction. Therefore, with groups of people who do not share or that shares similar interests, backgrounds and activities but essentially bound to build social networks via the social media, such as microblogs e.g., Twitter has provided interactive and cheaper way for user to share ideas, exchange information and stay connected with people [11]. This group of people share and upload contents with images, texts and videos. They communicate with each other, and the power of influence is demonstrated among member. Thus, a SN structure is constituted by a set of 'nodes' connected with different 'ties' [12]. The cardinality of nodes expands dynamically, especially on the web, as new nodes and profiles are created continuously, while further populating the social web [13]. Hence, network thinking has contributed a number of important insights about social power while the different network analysis approach emphasize that power is inherently relational. When there are common set of connections, the social scientists call these connections 'network' [14].
In addition, a SN consist of a set of individuals and some form of social relationship that ties the individuals together [15]. However, SNA is the analysis of social communication through network and graph theory [16]. It models and visualize social networks in order to detect social structures. In telecommunication, among others, SNA has been explored through customer data and Call Detail Data (CDR). For instance, [17] used detailed CDR of mobile operator to build a connection graph between subscribers; based on calls made between members while [18] predicted customer churn through interpersonal influence using SNA. Just as churn prediction model was modelled for multiplayers in online games via the use of the spreading activation [19]. Reference [20], also developed a churn model in telecommunication sector as a spread of influence by also applying the spreading activation method. Among others, [21] used social groups to predict customer churn in mobile networks. However, to effectively manage SNA community with dynamic capacities to expand, in Section 4, the developed CSIG is discussed. Here, pattern of customer's influence and impact of each opinionated customer in SNA towards churn management of customers using a newly defined quadruple churn categories against existing binary churn prediction model are discussed.

Developed Churn Service Information Graph
One distinct attribute of a social community is that not every member exhibits the same traits. For instance, some members' freely expressed opinion frequently, while some are passive. Despite these differences, by words of expression, community members can be influenced to a significant degree. Thus, in recent times, a community of many customers with relative association have become more universal through the power of the social media. Now, families, friends, colleagues, etc. communicate more often on the social media due to several factors like work, location, and physical distancing. This factors among others, inhibits physical social communication among friends and loved once. Consequently, as people edit, like, follow, link, reply, retweet, make comment, tag, review, and update information on their Twitter account, they form collections of connections. Thus, CSIG uses the analyses of people's opinion through "word of mouth" and the influence of such on other people's decision in SN community and among overlapping communities. Here, the SN influence is based on various degree of centrality with special emphasis on users' opinion. By this, influence degree is not only about centrality measurement of the most influential person in the network; meaning, a person is not only considered to be the most influential based on connected nodes in the network but also by direct/indirect impact of users' opinion analysis though Degree Centrality, and Closeness Centrality.
Here, the degree centrality was used to ascertain the total number of targeted or dependent nodes the expressed opinion of the source node can influence. Likewise, closeness centrality measures how fast can the source node expressed opinion reach the dependent or targeted node in the network. By these two centrality measures, a value is obtained to showcase the connection of the source node to other well connected dependent node in the network. Therefore, through the community of online social networker, we define and represent two distinct nodes as following:

Definition 1: An opinionated (active user) node as a social user who has expressed, is expressing or may express opinion(s) about a product or brand of an organization over a period. Definition 2: A non-opinionated (passive user) node as a social user who does not express, is not willing to express and/or may not express opinion(s) about a product or brand of an organization over a period.
The definition 1 and 2 is further illustrated in the Fig. 1. PU are passive users and AU are the Active users. The bi-directional arrow heads, signifies an existence of two-way communication between nodes while the directed edges indicated that one node is following the other.
By analyzing the social network graph using algorithm listing 1, the communication weight between nodes, and degree of influence between nodes on the social network graph is generated. Here, if an influential node in a community is a churner, the probability that dependent nodes around such can also churn becomes significant and is given necessary consideration. Hence, the CSIG defines the churn class of other members of the community around an active node for churn classification.
The objective is to measure the relative degree of affinity between the passive nodes and the active nodes in the social community using the algorithm in listing 1 for effective decision support in churn management. Calculate the D c for each member of U 8.
Calculate the C f for each set of A c to U 9. Multiply C w to C f for every U in C From algorithm listing 1, churn classes were computed for every member of the community where relative scores were obtained for each node. While the scores are not static, it changes based on node strength and affiliation over a period t i . The dynamic scores are a result of the timing event, which is largely dependent on how active a community is on subject (opinions) that pertains to the business under consideration alongside the introduction of new members. To this end, in algorithm listing 2, the CSIG classifier algorithm is presented. P c = churner 6.
End If By these algorithms, the churn classification of non-opinionated customers around opinioned once is predominantly based on pattern analysis and influence of node through customer's communication weight score, which exist between nodes on a social network. The algorithm listing 1, measures the total amount of direct links an opinionated node has with the other nodes around it on the network while algorithm listing 2, classifies each customer according to the score obtained via the analysis. The essence of the new classification threshold (Premium Customer, Inertia Customer, Potential Churner, and Churner) is triggered on the fact that to a certain degree every customer is a potential churner. Thus, identifying a customer as churner or non-churner leads to churn misclassification with unreliable decision support system for churn management. In section four, same experiments based of twitter feeds and network structure is presented. There, to evaluate the CSIG model, we presented a sample of customer across different churn class and the following results as obtained are presented;

Experiments and Evaluation
The experiment is an analysis of 32201 opinioned Twitter users with different network structure and connections. The clustered network represented users following a telecoms organization with at least an opinion that contains the organization's keyword, product brand, services, promotions and hashtags. The opinions of the users are mined with each belonging to one of the four-classed churn category as earlier defined. The opinion class represented a churn category wherein every member of the community fit into for real time churn analysis. Thus, this opinion mining class are further represented as follows.
Thus, the evaluation report for each of the customer in Table 1 is presented in Tables 2-5 respectively.     By this impact ratio of churn classification with the sampled customer records, a new cluster of churn class were achieved among non-opinionated members of the community. By this, organizations can effectively target subsets of a community cluster with recommendations to different categories of premium customer, inertia customer, potential churner and churner.

Future Work
While the current research work is based on direct node to node connection in a community, in the nearest future, we hope to focus on overlapping communities and its impact on churn prediction in community-based churn management. Thereby, evaluating the achieved result with the CSIG model presented here with others.

Conclusion
With the continuous growth in digital marketing and it influence on customer relationship management, here, the research approached churn management from a social network opinion mining perspective. Although, existing approach performs SNA on customer data records, here customers' opinion is processed to determine their churn class. Later, the churn classes of community members around active customers are mined for a more effective targeted decision support. The algorithm, sample experiments and evaluations are presented earlier showed that churn classification via opinion mining is no longer restricted to opinionated customers but every community member on a social network.
Funding Statement: The authors received no specific funding for this study.

Conflicts of Interest:
The authors declare that they have no conflicts of interest to report regarding the present study.