The importance to understand customer dropout and the diversity of employed algorithms requires an understanding of trends and existing problems to create a ground base of knowledge. For the development of the systematic literature review was adopted, the methodology applied by Kitchenham & Charters (2007) developed in three stages: Plan, Conduct and Report, as described in the following figure:
Research goals is to understand What is the current state of machine learning research studies to predict dropout in contractual settings? Based in this were identified five research questions to determine the main aspects related to the customer dropout with contractual settings:
RQ1. What studies have been published?
RQ2. Which algorithms have been used to predict the dropout?
RQ3. What are the more relevant features related to predicting customer dropout?
RQ4. When the dropout occurs?
RQ5. What is the accuracy of the machine learning algorithms to predict dropout?
Consider the identification of type of machine learning algorithms according business area. This allows to identify which algorithms are used in each business area to address dropout.
All searches where limited to the time interval between 2000 and 2020. Limited to articles in english.
("research papers about dropout with contractual settings" OR "membership") AND ("machine learning to predict dropout" OR "churn") AND ("investigations founds") AND ("not applicable for this research")
((“customer dropout”) OR (“customer churn”) AND “machine learning” AND (“contractual” OR “membership”))
ALL ( customer AND ( ( dropout ) OR ( "churn" ) ) AND "machine learning" AND ( "contractual" OR "membership" ) ) AND PUBYEAR > 1999 AND PUBYEAR < 2021 AND ( LIMIT-TO ( DOCTYPE , "ar" ) OR LIMIT-TO ( DOCTYPE , "cp" ) ) AND ( LIMIT-TO ( LANGUAGE , "English" ) )
Results = 210
ALL ( customer AND ( ( dropout ) OR ( "churn" ) ) AND "machine learning" ) AND PUBYEAR > 1999 AND PUBYEAR < 2021 AND ( LIMIT-TO ( DOCTYPE , "ar" ) OR LIMIT-TO ( DOCTYPE , "cp" ) ) AND ( LIMIT-TO ( LANGUAGE , "English" ) )
Results = 2000
Scopus Studies selection (bibtex)
(((“customer") AND ("dropout" OR “churn”)) AND “machine learning” AND (“contractual” OR “membership”))
results:20
Search: 21_08_2020 (("customer dropout"or "customer churn") and "machine learning" and (contractual* or membership)) https://link.springer.com/search?query=%28%28%22customer+dropout%22or+%22customer+churn%22%29+and+%22machine+learning%22+and+%28contract*+or+membership%29%29+&date-facet-mode=between&facet-start-year=2000&previous-start-year=1997&facet-end-year=2021&previous-end-year=2021
results:79
Search: 21_08_2020 (((“customer") AND ("dropout" OR “churn”)) AND “machine learning” AND (“contractual” OR “membership”))
Results:126
Search: 21_08_2020 (("customer dropout"or "customer churn") and "machine learning" and (contractual or membership)) https://apps.webofknowledge.com/Search.do?product=UA&SID=E5YD2nJO9BseQ2L2G2V&search_mode=GeneralSearch&prID=d77c3de4-5ff9-4e31-a913-3dc06f30f206
TS=(customer AND ( dropout OR churn) AND "machine learning" ) Timespan: 2000-2020. Databases: WOS, CCC, DIIDW, KJD, MEDLINE, RSCI, SCIELO. Search language=Auto results:110
TS=(customer AND ( dropout OR churn) AND "machine learning" AND (contractual or membership)) Databases= WOS, CCC, DIIDW, KJD, MEDLINE, RSCI, SCIELO Timespan=2000-2020 Search language=Auto
results:6
Full Query Sintax:
"query": { (“customer dropout”) OR (“customer churn”) AND “machine learning” AND (“contractual” OR “membership”) }
"filter": { Publication Date: (01/01/2000 TO 12/31/2018), ACM Content: DL, NOT VirtualContent: true }
results: 8
Full Query Sintax: "query": { AllField:((“customer dropout”) OR (“customer churn”) AND “machine learning”) } "filter": { Publication Date: (01/01/2000 TO 12/31/2020), ACM Content: DL, NOT VirtualContent: true } results: 86
Articles identified in the initial dataset
Source | Articles |
---|---|
Scopus | 210 |
IEEE | 20 |
Springer | 79 |
Science Direct | 126 |
WoS | 6 |
ACM | 8 |
Total | 449 |
Science Direct incomplete items where 8, Scopus 11 and IEEE 1. Remaining 429. 16 duplicates removed 424. ASReview (van de Schoot et al., 2020) filtering process:
- Selected randomly some articles and are identified as relevant and irrelevant, at least 5 each. ASReview suggests when to stop;
- ASReview orders the publications in such a way that you see the most relevant publications first
- Stopping criterium could be stopping after n presented abstracts were labeled irrelevant, or if your time is up. You can use the chart in the statistics panel to follow your progress"
Data processing is available in a R Script, quantitative analysis and some text mining in the selected articles