Shan, Shan (2022) A machine learning approach to solve the complete cold start problem in recommendation systems: building the P2P energy trading recommendation system with theory of consumption values. Doctoral thesis, Northumbria University.
|
Text (Doctoral Thesis)
shan.shan_phd(10030367).pdf - Submitted Version Download (2MB) | Preview |
Abstract
Recommendation Systems (RSs) belong to the subclass of information filtering systems, which is to determine any given user’s “rating” of particular items. However, Cold Start Recommendations (CSR) pose significant challenges due to insufficient historical ratings of information. CSR is divided into Incomplete Cold Start Recommendation (ICSR) and Complete Cold Start Recommendation (CCSR) in historical ratings or user information; the former formulates recommendations to new users and new items, while the latter contributes to new systems. Various approaches deal with cold start challenges, including collaborative filtering, content-based fliting, the hybrid approach, increasing the diversity of data sources, context-awareness, and user profile construction. These approaches are commonly applied in the ICSR, though they are not as evident in the CCSR.
Few previous studies have focused on solutions to CCSR; solving CCSR is significantly important due to the popularity of the recommendation system, the increasing new system, the improved recommendation performance, and the capability of reducing user input. The Peer to Peer (P2P) energy trading recommendation system (ETRS) is an example of a new system facing the complete cold start challenge. This research compares the advantages and drawbacks of different strategies to solve CSR and selects user profile construction to solve CCSR in P2P ETRS.
First, a mixed-method data collection process was used, including semi-structured expert interviews, user-generated content analysis, and a survey to generate insights contributing to an efficient P2P energy trading user profile dataset. Second, a P2P ETRS is proposed by comparing the Decision Tree classifier and Ensemble classifier under different users’ cultural backgrounds. Different cultural backgrounds may have different viewpoints on P2P energy trading. Thus, the system is evaluated with a large body of real-world data acquired from two case-study regions, one is the western dataset, and one is the Chinese dataset. The proposed recommendation system achieves high performance by delivering personalised recommendations. In addition, previous research applied to constructing the user profile dataset, demographic information, and user personalities, neglecting the motivations behind the product selections under different cultural backgrounds. Thus, this research analysed the driving motivations of P2P energy trading with the theory of consumption values under different cultural backgrounds.
This research has two significant contributions. First, this study identified the user profile construction strategy with a machine learning approach to solve the CCSR problem in the P2P ETRS. This strategy can effectively build the original user profile dataset, reduce the user input, and evaluate the recommended model. In addition, the research adopted ensemble learning algorithms to improve the performance of the recommended model. In terms of system performance, the accuracy of the Decision Tree (DT) model achieves an 86.1% for the Chinese dataset and a 68.5% for the western data set. In addition, ensemble learning algorithms can improve the classification of both datasets, as CatBoost performed best in the ensemble model, with an accuracy rate of 94.3% on the Chinese dataset and 79.8% on the western dataset.
Second, this research created a bridge between machine learning and the traditional social science theory. A framework for a P2P energy trading recommendation system within the theory of consumption values was proposed to understand the collaborative consumption behaviour. Previous studies of consumption value theory had applied to collaborative consumption, including Airbnb and Uber. Unlike the previous research, P2P energy trading paid more attention to the feeling of uncertainty on the emotional value rather than enjoyment; That is to say, consumers believe the uncertainty of the supply of using renewable energy is more important than the consideration of enjoyment of the experience. Most notably, the essential driving values in the two datasets are different, although income can impact P2P product selection in both datasets. The feeling of uncertainty played the most crucial role in the Chinese dataset, while the western dataset focused on sustainability.
Item Type: | Thesis (Doctoral) |
---|---|
Uncontrolled Keywords: | user-generated content analysis, user profile construction, consumption behaviour, decision tree classifier, ensemble classifier |
Subjects: | G400 Computer Science G700 Artificial Intelligence |
Department: | Faculties > Engineering and Environment > Computer and Information Sciences University Services > Graduate School > Doctor of Philosophy |
Depositing User: | John Coen |
Date Deposited: | 19 Aug 2022 08:07 |
Last Modified: | 19 Aug 2022 08:15 |
URI: | http://nrl.northumbria.ac.uk/id/eprint/49920 |
Downloads
Downloads per month over past year