Develompent of Unsupervised Telecom Customers Clustering Model Using Customer Detail Records (CDR) the Case of Ethiotelecom
dc.contributor.advisor | Kifle, Mesfin (PhD) | |
dc.contributor.author | Abebaw, Banchalem | |
dc.date.accessioned | 2021-11-29T09:12:00Z | |
dc.date.accessioned | 2023-11-29T04:06:38Z | |
dc.date.available | 2021-11-29T09:12:00Z | |
dc.date.available | 2023-11-29T04:06:38Z | |
dc.date.issued | 2021-04-07 | |
dc.description.abstract | Almost in every discipline, people are using phones that generate detail records containing information’s about phone usage, such as, connected time of the call, the identities of sources, the identities of destinations, the duration of each call, the amount billed etc. named as (customer detail records) CDR. It’s difficult for someone to scan through all the data and establish the relative decision for the business because it is time. As a result, there is a growing interest towards better solutions for finding, organizing and analyzing these CDR data in telecommunication companies. The effective ways of organizing telecom data form later decision making and business management efficient, less complicated, friendly and low-cost. Customer detail record (CDR) clustering is one of the common methods of managing customers in business using their usage behavior in network. This study proposes a model that cluster telecommunication customers using detail records (CDR) of the user. During the clustering process, all the CDR records pass through pre-processing stages to prepare data for processing. Then transformation of preprocessed data was done by through scaling algorithm. The scaled features were extracted from the sampled CDR record. Finally, customers are clustered based on the usage behavior using the most popular K-means algorithm that is based on the cosine similarity of the weighted features. Ethio-telecom customer detail records were used for experimentations. The clustering results are evaluated to find optimal cluster size using elbow and silhouette methods. The result shows the value of silhouette coefficient is greater when cluster size is four, with this we clustered the sampled data into four classes. When we see the number of customer’s distribution in each class from hundred thousand sampled data, in the first class 19.959 % of the sample was grouped. In the second class 0.603% of the sample was grouped together. In the 3rd class there are 4.383% of the sample with related usage characteristics. In the 4th class there are 75.054% customers with related usage characteristics. The four groups are considered as customers with low usage history of the service, common usage history of the service, customers with very good usage history and customers with very midst usage history. The ultimate goal of any business would be to have as many customers up there in the service category. This can be further used in decision making to have many numbers of customers to our needed group i.e., a group with very good usage behavior of services. | en_US |
dc.identifier.uri | http://etd.aau.edu.et/handle/123456789/29026 | |
dc.language.iso | en | en_US |
dc.publisher | Addis Ababa University | en_US |
dc.subject | CDR Clustering | en_US |
dc.subject | Customer Data Analysis | en_US |
dc.subject | Feature Scaling | en_US |
dc.subject | K-means Clustering | en_US |
dc.title | Develompent of Unsupervised Telecom Customers Clustering Model Using Customer Detail Records (CDR) the Case of Ethiotelecom | en_US |
dc.type | Thesis | en_US |