<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1 20151215//EN" "http://jats.nlm.nih.gov/publishing/1.1/JATS-journalpublishing1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xml:lang="en" article-type="research-article" dtd-version="1.1">
<front>
<journal-meta>
<journal-id journal-id-type="pmc">CMC</journal-id>
<journal-id journal-id-type="nlm-ta">CMC</journal-id>
<journal-id journal-id-type="publisher-id">CMC</journal-id>
<journal-title-group>
<journal-title>Computers, Materials &#x0026; Continua</journal-title>
</journal-title-group>
<issn pub-type="epub">1546-2226</issn>
<issn pub-type="ppub">1546-2218</issn>
<publisher>
<publisher-name>Tech Science Press</publisher-name>
<publisher-loc>USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">37553</article-id>
<article-id pub-id-type="doi">10.32604/cmc.2023.037553</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Prediction of NFT Sale Price Fluctuations on OpenSea Using Machine Learning Approaches</article-title>
<alt-title alt-title-type="left-running-head">Prediction of NFT Sale Price Fluctuations on OpenSea Using Machine Learning Approaches</alt-title>
<alt-title alt-title-type="right-running-head">Prediction of NFT Sale Price Fluctuations on OpenSea Using Machine Learning Approaches</alt-title>
</title-group>
<contrib-group>
<contrib id="author-1" contrib-type="author">
<name name-style="western"><surname>Wang</surname><given-names>Zixiong</given-names></name></contrib>
<contrib id="author-2" contrib-type="author">
<name name-style="western"><surname>Chen</surname><given-names>Qiuying</given-names></name></contrib>
<contrib id="author-3" contrib-type="author" corresp="yes">
<name name-style="western"><surname>Lee</surname><given-names>Sang-Joon</given-names></name><email>s-lee@chonnam.ac.kr</email></contrib>
<aff><institution>Interdisciplinary Program of Digital Future Convergence Service, Chonnam National University</institution>, <addr-line>Gwangju, 61186, <country>Korea</country></addr-line></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>&#x002A;</label>Corresponding Author: Sang-Joon Lee. Email: <email>s-lee@chonnam.ac.kr</email></corresp>
</author-notes>
<pub-date date-type="collection" publication-format="electronic">
<year>2023</year></pub-date>
<pub-date date-type="pub" publication-format="electronic"><day>27</day><month>3</month><year>2023</year></pub-date>
<volume>75</volume>
<issue>2</issue>
<fpage>2443</fpage>
<lpage>2459</lpage>
<history>
<date date-type="received">
<day>08</day><month>11</month><year>2022</year>
</date>
<date date-type="accepted">
<day>29</day><month>12</month><year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>&#x00A9; 2023 Wang, Chen and Lee</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Wang, Chen and Lee</copyright-holder>
<license xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>This work is licensed under a <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International License</ext-link>, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<self-uri content-type="pdf" xlink:href="TSP_CMC_37553.pdf"></self-uri>
<abstract>
<p>The rapid expansion of the non-fungible token (NFT) market has attracted many investors. However, studies on the NFT price fluctuations have been relatively limited. To date, the machine learning approach has not been used to demonstrate a specific error in NFT sale price fluctuation prediction. The aim of this study was to develop a prediction model for NFT price fluctuations using the NFT trading information obtained from OpenSea, the world&#x2019;s largest NFT marketplace. We used Python programs to collect data and summarized them as: NFT information, collection information, and related account information. AdaBoost and Random Forest (RF) algorithms were employed to predict the sale price and price fluctuation of NFTs using regression and classification models, respectively. We found that the NFT related account information, especially the number of favorites and activity status of creators, confer a good predictive power to both the models. AdaBoost in the regression model had more accurate predictions, the root mean square error (RMSE) in predicting NFT sale price was 0.047. In predicting NFT sale price fluctuations, RF performed better, which the area under the curve (AUC) reached 0.956. We suggest that investors should pay more attention to the information of NFT creators. We anticipate that these prediction models will reduce the number of investment failures for the investors.</p>
</abstract>
<kwd-group kwd-group-type="author">
<kwd>NFT</kwd>
<kwd>sale price fluctuation</kwd>
<kwd>OpenSea</kwd>
<kwd>AdaBoost</kwd>
<kwd>Random forest</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1">
<label>1</label>
<title>Introduction</title>
<p>The non-fungible token (NFT) market has seen some standouts in early 2021. According to statistics reported by White et al., NFT sales show a significant increase in two years &#x2212;41% from 2019 to 2020 and a staggering 3,857% from 2020 to 2021 [<xref ref-type="bibr" rid="ref-1">1</xref>]. By the end of May 2022, the trading volume of these new markets for digital assets grows to approximately $800 million [<xref ref-type="bibr" rid="ref-2">2</xref>]. This trade growth matched the significant increase in Google NFT trends [<xref ref-type="bibr" rid="ref-3">3</xref>]. An NFT is a right to a digital property recorded on the blockchain, which can be any digital asset: an image, a video, a song, or virtual land, etc. [<xref ref-type="bibr" rid="ref-4">4</xref>]. Although transacted through cryptocurrencies, NFTs have distinct characteristics from cryptocurrencies that must be kept in mind when trying to understand them. The primary purpose of cryptocurrencies is to serve as currencies [<xref ref-type="bibr" rid="ref-5">5</xref>]. Unlike cryptocurrencies, which are equivalent, undifferentiable, and replaceable coins, NFTs are considered to be pure digital properties that can&#x2019;t be traded like for like. Therefore, NFTs are irreplaceable and unique [<xref ref-type="bibr" rid="ref-6">6</xref>]. An NFT can provide indisputable answers to questions such as who created this NFT, who owns it, and which of the many copies is the original [<xref ref-type="bibr" rid="ref-7">7</xref>]. There are numerous trading markets for various types of NFTs, with OpenSea and Axie Marketplace being the largest markets, and both of them have accumulated approximately 2.1 million traders [<xref ref-type="bibr" rid="ref-8">8</xref>]. However, while the public attention on NFTs has exploded, our understanding of their overall structures and market evolution is still inadequate. Nadini et al. built and studied traders and NFT networks and found that most traders are specialized [<xref ref-type="bibr" rid="ref-9">9</xref>]. Furthermore, according to a report by NonFungible Corporation, more and more speculative buyers&#x2019; investments start to fail as the year progressed [<xref ref-type="bibr" rid="ref-10">10</xref>]. Therefore, it is crucial to help investors predict the fluctuations of NFT prices to reduce investment failures.</p>
<p>Thus far, studies on NFT prices have been relatively limited. Most commonly, authors attempt to examine trends in some specific collection [<xref ref-type="bibr" rid="ref-11">11</xref>] or link the price of a specific sought-after NFT to other financial assets [<xref ref-type="bibr" rid="ref-12">12</xref>,<xref ref-type="bibr" rid="ref-13">13</xref>]. However, these famous NFTs are limited, which implies that most investors cannot benefit from these specific NFTs. Investors need to consider the thousands of common NFTs that have not received much attention. Therefore, our paper presents a suitable method for the investors to use publicly available data to predict the NFT sale prices and their fluctuations.</p>
<p>In the field of financial investment, researchers have proposed many prediction methods, among which time series models are commonly employed in the study of price prediction. For instance, Mallqui et al. demonstrated that the blockchain information of Bitcoin is an important predictor of Bitcoin price fluctuations [<xref ref-type="bibr" rid="ref-14">14</xref>]. Kim et al. reported that in addition to macro economy factors, the Ethereum blockchain information is strongly associated to the Ethereum price [<xref ref-type="bibr" rid="ref-15">15</xref>]. However, unlike stocks or currencies, there is no daily price for NFTs, because the number and time of transactions are different for each NFT, making it difficult to use time series models for NFT sale-price prediction. Therefore, in this study, we employed machine learning regression models to predict the sale prices of NFTs, screened out the NFTs with multiple trading records, used classification models to predict the fluctuations of NFT sale prices.</p>
<p>The specific purposes of this study are as follows: First, we introduced the machine learning approach for predicting the sale prices as well as the rise or fall in the prices of NFTs. Second, we analyzed and identified the features that aid in predicting the fluctuations of NFT sale prices with a high accuracy. Third, we developed realistic implications and recommendations for investors.</p>
<p>The organization of the rest of the paper is as follows: Section 2 reviews the related works. Section 3 presents our research design and dataset overview. In Section 4, we evaluate the prediction accuracy of each model and identify the variables that result in a highly accurate NFT price prediction. Section 5 discusses the theoretical and practical implications. In Section 6, we summarize the study results and discuss the limitations.</p>
</sec>
<sec id="s2">
<label>2</label>
<title>Related Works</title>
<sec id="s2_1">
<label>2.1</label>
<title>OpenSea</title>
<p>OpenSea is the largest marketplace for NFT trading in the world, and according to statistics, it has amassed over $32B in trading volume and over 2.1M traders [<xref ref-type="bibr" rid="ref-8">8</xref>]. NFTs on OpenSea are mainly based on four blockchains, namely Ethereum, Klaytn, Polygon, and Solana, and thus far, more than 95% of the transactions are performed using the Ethereum blockchain. Further, there are many kinds of NFTs on OpenSea, including images, audio and video. The NFT collections can be divided into nine categories: Art, Domain Names, Collectible, Music, Sports, Photography, Trading Cards, Virtual Worlds, and Utility. Each NFT usually has three kinds of related pages, and the information on these pages can be summarized as: NFT information, collection information, and related account information. These three website pages of the NFTs are presented in <xref ref-type="fig" rid="fig-1">Fig. 1</xref> to illustrate OpenSea.</p>
<fig id="fig-1">
<label>Figure 1</label>
<caption>
<title>NFT related pages (for example, CryptoPunk #1)</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_37553-fig-1.tif"/>
</fig>
</sec>
<sec id="s2_2">
<label>2.2</label>
<title>Fluctuations in NFT Prices</title>
<p>The existing studies on NFT price fluctuations are primarily focused on specific sought-after collections. Dowling found that the pricing sequence of virtual lands in Decentraland is characterized by inefficiencies and appreciation in value [<xref ref-type="bibr" rid="ref-4">4</xref>]. In another study, Dowling studied Cryptopunks, Decentraland, and Axie Infinity and found a limited volatility transmission effect between the pricing of these NFTs and Bitcoin prices [<xref ref-type="bibr" rid="ref-12">12</xref>]. From other perspectives, Aharon and Demir used a time-varying parametric vector autoregressive approach to analyze the correlation between the NFT market and financial assets such as stocks [<xref ref-type="bibr" rid="ref-13">13</xref>]. However, we cannot use the sales trends of these well-known collections as a basis for studying the NFT market. That&#x2019;s because these sought-after collections, like Decentraland virtual lands and Cryptopunks, aren&#x2019;t representative of the general market.</p>
<p>To date, no research has used machine learning approach to predict NFT sales price fluctuations, and no reports are available on the demonstration of a specific NFT sale-price prediction error, except that of Nadini et al., who studied the predictability of NFT sales using a linear regression model. Subsequently, they used the AdaBoost algorithm to predict whether an NFT will be traded multiple times. However, they did not report the prediction error and used variables that are difficult to access by most investors, such as centrality of buyers and sellers in the network of NFT transactions, and so on [<xref ref-type="bibr" rid="ref-9">9</xref>].</p>
<p>We performed an empirical study with thousands of common NFTs as subjects to generalize the results. We attempted to predict NFT sale prices and fluctuations accurately by using publicly available and easily accessible data as independent variables of our machine learning models. We expect that this study will help the investors understand the NFT sale prices and their fluctuations more quickly and make appropriate investments.</p>
</sec>
<sec id="s2_3">
<label>2.3</label>
<title>Machine Learning</title>
<p>Machine learning can be divided into supervised learning, unsupervised learning, and reinforcement learning [<xref ref-type="bibr" rid="ref-16">16</xref>], and the machine learning algorithms are usually used for clustering, classification, and regression. Supervised learning is the prediction based on the input data, resulting in common decision rules [<xref ref-type="bibr" rid="ref-17">17</xref>]. Machine learning has been used in many areas, including demand forecasting, price prediction, and illness diagnosis [<xref ref-type="bibr" rid="ref-15">15</xref>,<xref ref-type="bibr" rid="ref-18">18</xref>,<xref ref-type="bibr" rid="ref-19">19</xref>].</p>
<sec id="s2_3_1">
<label>2.3.1</label>
<title>Price Prediction with Machine Learning</title>
<p>To date, price prediction has been attempted by various researchers using a multitude of approaches. Weng et al. used random forests (RF), neural networks, support vectors, and boosted regression trees to form an expert system to predict stock prices. They found that features collected online improved the accuracy of predictions [<xref ref-type="bibr" rid="ref-20">20</xref>]. Antipov et al. found that the RF algorithm has an advantage over other methods in terms of accuracy when evaluating the price of residential apartments [<xref ref-type="bibr" rid="ref-21">21</xref>]. &#x010C;eh et al. compared the prediction accuracy of the RF algorithm and hedonic model when predicting apartment transaction prices. Their results also showed that RF produced significantly better prediction results [<xref ref-type="bibr" rid="ref-22">22</xref>].</p>
<p>In the field of cryptocurrency, the area most closely related to NFTs, Jang et al. determined the blockchain information and macroeconomic indicators are important to predicting Bitcoin&#x2019;s price [<xref ref-type="bibr" rid="ref-23">23</xref>]. Abraham et al. collected Google trend and Twitter data to predict cryptocurrencies prices, and they found that tweet volume is an important predictor of Bitcoin and Ethereum price fluctuations [<xref ref-type="bibr" rid="ref-24">24</xref>]. Valencia et al. compared RF, neural networks, and support vector machines in predicting cryptocurrencies prices. They also reported that Twitter data can be used to predict the price of cryptocurrencies and found that neural networks have higher accuracy [<xref ref-type="bibr" rid="ref-25">25</xref>].</p>
</sec>
<sec id="s2_3_2">
<label>2.3.2</label>
<title>AdaBoost</title>
<p>Adaptive Boosting (AdaBoost) was developed by Yoav Freund and Robert Schapire. It solved several optimization problems associated with boosting algorithms. This classification meta-algorithm uses the usual principles of boosting by building linear compositions of assumptions and executing the design through iterations of weak learning algorithms based on probability profiles computed using the results of prior iterations [<xref ref-type="bibr" rid="ref-26">26</xref>]. This method allows the learner to concentrate on difficult-to-learn examples [<xref ref-type="bibr" rid="ref-27">27</xref>]. <xref ref-type="fig" rid="fig-2">Fig. 2</xref> illustrates the modeling process of AdaBoost.</p>
<fig id="fig-2">
<label>Figure 2</label>
<caption>
<title>Modeling process of AdaBoost</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_37553-fig-2.tif"/>
</fig>
</sec>
<sec id="s2_3_3">
<label>2.3.3</label>
<title>Random Forest</title>
<p>RF is an ensemble learning algorithm which was developed by Breiman. It incorporates the properties of many decision tree algorithms to classify or predict the values of the exploited variables [<xref ref-type="bibr" rid="ref-28">28</xref>]. RF is a new training set generated by repeated random selection of k samples from the raw training N with substitution, and then k classification trees are created based on the bootstrap set to form a random forest [<xref ref-type="bibr" rid="ref-29">29</xref>]. By assembling multiple weak classifiers, the results are polled or averaged so that the overall model results have high accuracy and a generalization performance [<xref ref-type="bibr" rid="ref-30">30</xref>]. <xref ref-type="fig" rid="fig-3">Fig. 3</xref> illustrates the modeling process of RF.</p>
<fig id="fig-3">
<label>Figure 3</label>
<caption>
<title>Modeling process of RF</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_37553-fig-3.tif"/>
</fig>
</sec>
</sec>
</sec>
<sec id="s3">
<label>3</label>
<title>Research Design</title>
<p>In this section, we present a method to predict the NFT sale prices and price fluctuations. <xref ref-type="fig" rid="fig-4">Fig. 4</xref> shows the flow of the presented method. In this study, we implemented supervised learning, which is commonly used to analyze the training data and result in an extrapolated function that can be used to map new instances [<xref ref-type="bibr" rid="ref-21">21</xref>,<xref ref-type="bibr" rid="ref-22">22</xref>].</p>
<fig id="fig-4">
<label>Figure 4</label>
<caption>
<title>Flowchart of the method</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_37553-fig-4.tif"/>
</fig>
<sec id="s3_1">
<label>3.1</label>
<title>Data Collection</title>
<p>Due to the changes in OpenSea policies, the application programming interface provided by the website has some limitations. Therefore, instead of using a web crawler to collect the data, we employed an automated testing tool to simulate the behavior of the people visiting the webpages (by using a Python program) and to collect the visited Hyper Text Markup Language links. Finally, XPath analysis was performed to collect data and obtain key information. Our publicly available Python program can be visited using the following uniform resource locator: <ext-link ext-link-type="uri" xlink:href="https://github.com/wzx333222111/ZIXIONG-WANG">https://github.com/wzx333222111/ZIXIONG-WANG</ext-link>. We collected 15,000 pieces of NFT information with sale records on the activity page between July 4 and July 7, 2022. We selected the ether chain NFTs for the data collection because of two reasons: (1) to unify the unit of value, and (2) because ether chain is the blockchain used by most NFTs. Various variables expected to be related to the target variables are shown in <xref ref-type="table" rid="table-1">Table 1</xref>.</p>
<table-wrap id="table-1">
<label>Table 1</label>
<caption>
<title>Attributes of the dataset</title>
</caption>
<table frame="hsides" >
<colgroup>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
</colgroup>
<thead>
<tr>
<th>Category</th>
<th>Attribute</th>
<th>Description</th>
<th>Type</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="5">NFT information</td>
<td>Favorited</td>
<td>Numbers of this NFT favorited by user</td>
<td>Numerical</td>
</tr>
<tr>
<td>View</td>
<td>Numbers of this NFT viewed by user</td>
<td>Numerical</td>
</tr>
<tr>
<td>All time avg. price</td>
<td>Numbers of an NFT&#x2019;s all-time avg. price</td>
<td>Numerical</td>
</tr>
<tr>
<td>Listings</td>
<td>Whether there are listings to this NFT</td>
<td>Categorical</td>
</tr>
<tr>
<td>Offers</td>
<td>Whether there are offers to this NFT</td>
<td>Categorical</td>
</tr>
<tr>
<td rowspan="6">NFT collection information</td>
<td>Items</td>
<td>Numbers of NFTs in this collection</td>
<td>Numerical</td>
</tr>
<tr>
<td>Owners</td>
<td>Numbers of owners of this collection</td>
<td>Numerical</td>
</tr>
<tr>
<td>Floor price</td>
<td>Numbers of the floor price of this collection</td>
<td>Numerical</td>
</tr>
<tr>
<td>All time avg. price</td>
<td>Numbers of all time avg. price</td>
<td>Numerical</td>
</tr>
<tr>
<td>All time volume</td>
<td>Numbers of all time volume</td>
<td>Numerical</td>
</tr>
<tr>
<td>Verify status</td>
<td>Whether the collection is a verified collection</td>
<td>Categorical</td>
</tr>
<tr>
<td rowspan="6">Creator information</td>
<td>Collected</td>
<td>Numbers of NFTs collected by user</td>
<td>Numerical</td>
</tr>
<tr>
<td>Favorited</td>
<td>Numbers of NFTs favorited by user</td>
<td>Numerical</td>
</tr>
<tr>
<td>Identification status</td>
<td>Whether the user is a verified account</td>
<td>Categorical</td>
</tr>
<tr>
<td>Created status</td>
<td>Whether the user has created an NFT</td>
<td>Categorical</td>
</tr>
<tr>
<td>Activity status</td>
<td>Whether the user has transfer or sale an NFT</td>
<td>Categorical</td>
</tr>
<tr>
<td>Offers received</td>
<td>Whether the user has received an offer</td>
<td>Categorical</td>
</tr>
<tr>
<td rowspan="5">Owner information</td>
<td>Collected</td>
<td>Numbers of NFTs collected by user</td>
<td>Numerical</td>
</tr>
<tr>
<td>Favorited</td>
<td>Numbers of NFTs favorited by user</td>
<td>Numerical</td>
</tr>
<tr>
<td>Created status</td>
<td>Whether the user has created an NFT</td>
<td>Categorical</td>
</tr>
<tr>
<td>Activity status</td>
<td>Whether the user has transfer or sale an NFT</td>
<td>Categorical</td>
</tr>
<tr>
<td>Offers received</td>
<td>Whether the user has received an offer</td>
<td>Categorical</td>
</tr>
<tr>
<td rowspan="2">Target variables</td>
<td>Sales price</td>
<td>Numbers of sale price of an NFT</td>
<td>Numerical</td>
</tr>
<tr>
<td>Fluctuation</td>
<td>Fluctuation of NFT sale price<break/>&#x2212;1 for rise<break/>&#x2212;0 for fall</td>
<td>Categorical</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s3_2">
<label>3.2</label>
<title>Data Preprocessing and Ensemble Modeling</title>
<p>Next, we preprocessed the collected data, i.e., performed reduction, cleaning, transformation, and normalization of the collected data to improve the prediction accuracy of the model.</p>
<p>Among the 15,000 collected data, we finally selected 8586 datapoints and split them into a training set (7634 data) and a testing set (952 data) for predicting NFT sale prices. The standard deviation and mean values of the variables are presented in <xref ref-type="table" rid="table-2">Table 2</xref>. In addition, we filtered the NFTs with multiple sale price records, finally selected 6136 datapoints, and divided them into training (5358 data) and testing sets (778 data) to predict the fluctuations of the NFT sale prices.</p>
<table-wrap id="table-2">
<label>Table 2</label>
<caption>
<title>Descriptive statistics of the variables</title>
</caption>
<table frame="hsides">
<colgroup>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
</colgroup>
<thead>
<tr>
<th>Category</th>
<th>Attribute</th>
<th>Mean</th>
<th>S.D.</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="4">NFT information</td>
<td>Favorited</td>
<td>7.514</td>
<td>190.956</td>
</tr>
<tr>
<td>View</td>
<td>61.223</td>
<td>27.317</td>
</tr>
<tr>
<td>All time avg. price</td>
<td>0.515</td>
<td>1.149</td>
</tr>
<tr>
<td>Sale price</td>
<td>0.519</td>
<td>1.182</td>
</tr>
<tr>
<td rowspan="5">NFT collection information</td>
<td>Items</td>
<td>10871.583</td>
<td>17941.408</td>
</tr>
<tr>
<td>Owners</td>
<td>25923.708</td>
<td>86507.56</td>
</tr>
<tr>
<td>Floor price</td>
<td>0.419</td>
<td>1.149</td>
</tr>
<tr>
<td>All time avg. price</td>
<td>0.769</td>
<td>1.962</td>
</tr>
<tr>
<td>All time volume</td>
<td>12804.826</td>
<td>41944.275</td>
</tr>
<tr>
<td rowspan="2">Creator account information</td>
<td>Collected</td>
<td>133.375</td>
<td>1653.986</td>
</tr>
<tr>
<td>Favorited</td>
<td>3.616</td>
<td>50.883</td>
</tr>
<tr>
<td rowspan="2">Owner account information</td>
<td>Collected</td>
<td>761.711</td>
<td>11920.806</td>
</tr>
<tr>
<td>Favorited</td>
<td>62.160</td>
<td>328.5221</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Then, we used regressor and classifier to develop predictive models. We used Orange (Ver. 3.32.0) as the data analysis tool, which integrates many machine learning algorithms, such as RF, Tree, and Gradient Boosting. However, after comparing the preliminary prediction results of these algorithms available in Orange, we selected the RF and AdaBoost algorithms for this study. The schematic of this tool&#x2019;s operation is depicted in <xref ref-type="fig" rid="fig-5">Fig. 5</xref>. To improve the reliability of the regression model, we used K-fold cross-validation, which was performed 10 times. Cross-validation is a method in which one model validation technique, among a large number of similar ones, is used to assess how the results of statistical analyses can be generalized to independent datasets [<xref ref-type="bibr" rid="ref-31">31</xref>,<xref ref-type="bibr" rid="ref-32">32</xref>]. In prediction problems, the model is usually provided a known dataset for training (training set) and an unknown dataset to test the model (testing set) [<xref ref-type="bibr" rid="ref-33">33</xref>]. The purpose of cross-validation is to reduce selection bias and overfitting problems [<xref ref-type="bibr" rid="ref-34">34</xref>].</p>
<fig id="fig-5">
<label>Figure 5</label>
<caption>
<title>Orange operation schematic</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_37553-fig-5.tif"/>
</fig>
</sec>
<sec id="s3_3">
<label>3.3</label>
<title>Predictor Phases for Regression Model</title>
<p>We developed four phases to estimate the predictive performance of NFT information, collection information and related account information in the regression model. <xref ref-type="table" rid="table-3">Table 3</xref> shows the variables included in each phase. In Phase 1, we input NFT information as predictors. We expect that using only NFT information as predictors yields high prediction accuracy because these variables provide the most basic information about NFT. The related account information and collection information are added to the variables considered in Phase 2 and Phase 3, respectively. We expect that the addition of these variables should lead to an improvement in prediction accuracy, albeit with a much smaller overall contribution than that of the variables in the first phase. Phase 4 includes all the three types of information, and we expect to get the best prediction results in this phase.</p>
<table-wrap id="table-3">
<label>Table 3</label>
<caption>
<title>Variables included in each phase</title>
</caption>
<table frame="hsides">
<colgroup>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
</colgroup>
<thead>
<tr>
<th></th>
<th>Phase 1</th>
<th>Phase 2</th>
<th>Phase 3</th>
<th>Phase 4</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td>NFT information</td>
<td>&#x221A;</td>
<td>&#x221A;</td>
<td>&#x221A;</td>
<td>&#x221A;</td>
</tr>
<tr>
<td>Account information</td>
<td></td>
<td>&#x221A;</td>
<td></td>
<td>&#x221A;</td>
</tr>
<tr>
<td>Collection information</td>
<td></td>
<td></td>
<td>&#x221A;</td>
<td>&#x221A;</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
</sec>
<sec id="s4">
<label>4</label>
<title>Empirical Results</title>
<sec id="s4_1">
<label>4.1</label>
<title>Regression Model Predictive Result Analysis</title>
<p>In this study, we used root mean square error (RMSE), mean absolute error (MAE) and R-square (R<sup>2</sup>) to evaluate the performance of the regression models [<xref ref-type="bibr" rid="ref-15">15</xref>,<xref ref-type="bibr" rid="ref-24">24</xref>,<xref ref-type="bibr" rid="ref-35">35</xref>].</p>
<p>We performed a step-by-step analysis from Phase 1 to 4. Phase 1 includes only the NFT information variables, and RF performs better in this phase of the prediction (RMSE &#x003D; 0.119, MAE &#x003D; 0.042, and R<sup>2</sup> &#x003D; 0.990) (see <xref ref-type="table" rid="table-4">Table 4</xref>). In Phase 2 with account information, we found that AdaBoost achieved better prediction results, and RMSE, MAE and R<sup>2</sup> (RMSE &#x003D; 0.047, MAE &#x003D; 0.007, and R<sup>2</sup> &#x003D; 0.998) (see <xref ref-type="table" rid="table-5">Table 5</xref>) showed significant improvements over their Phase 1 counterparts. The prediction results of Phase 3, which combined NFT and collection information, were slightly more accurate than those of Phase 1 (RMSE &#x003D; 0.075, MAE &#x003D; 0.009, and R<sup>2</sup> &#x003D; 0.996) (see <xref ref-type="table" rid="table-6">Table 6</xref>). However, compared with Phase 2, the prediction error in this phase became larger. Finally, we input all the variables into the prediction model and observed that the results of Phase 4 were better than those of Phase 1 and 3, whereas the RMSE value was slightly larger than those of Phase 2 (RMSE &#x003D; 0.053, MAE &#x003D; 0.007, and R<sup>2</sup> &#x003D; 0.998) (see <xref ref-type="table" rid="table-7">Table 7</xref>). These results show that the relevant account information contains information directly related to the NFT sale price.</p>
<table-wrap id="table-4">
<label>Table 4</label>
<caption>
<title>Results of data analysis for phase 1</title>
</caption>
<table frame="hsides">
<colgroup>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
</colgroup>
<thead>
<tr>
<th>Input</th>
<th>Algorithm</th>
<th>RMSE</th>
<th>MAE</th>
<th>R<sup>2</sup></th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="4">NFT information (number of inputs: 5)</td>
<td>AdaBoost training</td>
<td>0.179</td>
<td>0.051</td>
<td>0.977</td>
</tr>
<tr>
<td>AdaBoost testing</td>
<td>0.135</td>
<td>0.015</td>
<td>0.987</td>
</tr>
<tr>
<td>RF training</td>
<td>0.197</td>
<td>0.074</td>
<td>0.972</td>
</tr>
<tr>
<td>RF testing</td>
<td>0.119</td>
<td>0.042</td>
<td>0.990</td>
</tr>
</tbody>
</table>
</table-wrap><table-wrap id="table-5">
<label>Table 5</label>
<caption>
<title>Results of data analysis for phase 2</title>
</caption>
<table frame="hsides">
<colgroup>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
</colgroup>
<thead>
<tr>
<th>Input</th>
<th>Algorithm</th>
<th>RMSE</th>
<th>MAE</th>
<th>R<sup>2</sup></th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td>NFT information (number of inputs: 5)</td>
<td>AdaBoost training</td>
<td>0.116</td>
<td>0.032</td>
<td>0.990</td>
</tr>
<tr>
<td>Account information (number of inputs: 11)</td>
<td>AdaBoost testing</td>
<td><bold>0.047</bold></td>
<td><bold>0.007</bold></td>
<td><bold>0.998</bold></td>
</tr>
<tr>
<td></td>
<td>RF training</td>
<td>0.225</td>
<td>0.093</td>
<td>0.964</td>
</tr>
<tr>
<td></td>
<td>RF testing</td>
<td>0.186</td>
<td>0.063</td>
<td>0.976</td>
</tr>
</tbody>
</table>
</table-wrap><table-wrap id="table-6">
<label>Table 6</label>
<caption>
<title>Results of data analysis for phase 3</title>
</caption>
<table frame="hsides">
<colgroup>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
</colgroup>
<thead>
<tr>
<th>Input</th>
<th>Algorithm</th>
<th>RMSE</th>
<th>MAE</th>
<th>R<sup>2</sup></th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td>NFT information (number of inputs: 5)</td>
<td>AdaBoost training</td>
<td>0.099</td>
<td>0.024</td>
<td>0.993</td>
</tr>
<tr>
<td>Collection information (number of inputs: 6)</td>
<td>AdaBoost testing</td>
<td>0.075</td>
<td>0.009</td>
<td>0.996</td>
</tr>
<tr>
<td></td>
<td>RF training</td>
<td>0.133</td>
<td>0.048</td>
<td>0.987</td>
</tr>
<tr>
<td></td>
<td>RF testing</td>
<td>0.077</td>
<td>0.027</td>
<td>0.996</td>
</tr>
</tbody>
</table>
</table-wrap><table-wrap id="table-7">
<label>Table 7</label>
<caption>
<title>Results of data analysis for phase 4</title>
</caption>
<table frame="hsides">
<colgroup>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
</colgroup>
<thead>
<tr>
<th>Input</th>
<th>Algorithm</th>
<th>RMSE</th>
<th>MAE</th>
<th>R<sup>2</sup></th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td>NFT information (number of inputs: 10)</td>
<td>AdaBoost training</td>
<td>0.072</td>
<td>0.020</td>
<td>0.996</td>
</tr>
<tr>
<td>Account information (number of inputs: 14)</td>
<td>AdaBoost testing</td>
<td>0.053</td>
<td>0.007</td>
<td>0.998</td>
</tr>
<tr>
<td>Collection information (number of inputs: 5)</td>
<td>RF training</td>
<td>0.179</td>
<td>0.066</td>
<td>0.977</td>
</tr>
<tr>
<td></td>
<td>RF testing</td>
<td>0.103</td>
<td>0.039</td>
<td>0.993</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s4_2">
<label>4.2</label>
<title>Evaluation of Classification Model</title>
<p>In the classification model, we evaluated the model using F1-score, precision, and recall [<xref ref-type="bibr" rid="ref-36">36</xref>]. We also performed the receiver operating curve (ROC) analysis. The ROC is established by graphing the relationship between the true positive and the false positive rates under different settings of the threshold. The false positives can also be called as false alarm probabilities [<xref ref-type="bibr" rid="ref-37">37</xref>]. The results of the training set are shown in <xref ref-type="table" rid="table-8">Table 8</xref>. The RF classification model exhibited a peak area under the curve (AUC) value of 0.945, whereas the AdaBoost model showed a peak AUC value of 0.853. In addition, considering the precision, recall, and F1 scores, the RF algorithm outperformed the AdaBoost algorithm in determining the fluctuations of NFT sale prices. The ROC analysis results are shown in <xref ref-type="table" rid="table-9">Table 9</xref>.</p>
<table-wrap id="table-8">
<label>Table 8</label>
<caption>
<title>Training set results of classification model</title>
</caption>
<table frame="hsides">
<colgroup>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
</colgroup>
<thead>
<tr>
<th>Algorithm</th>
<th>AUC</th>
<th>F1</th>
<th>Precision</th>
<th>Recall</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td>AdaBoost</td>
<td>0.853</td>
<td>0.854</td>
<td>0.853</td>
<td>0.855</td>
</tr>
<tr>
<td>RF</td>
<td>0.945</td>
<td>0.883</td>
<td>0.882</td>
<td>0.884</td>
</tr>
</tbody>
</table>
</table-wrap><table-wrap id="table-9">
<label>Table 9</label>
<caption>
<title>ROC of each model</title>
</caption>
<table frame="hsides">
<colgroup>
<col align="left"/>
<col align="left"/>
</colgroup>
<thead>
<tr>
<th>Random forest (AUC: 0.945)</th>
<th>AdaBoost (AUC: 0.853)</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td><inline-graphic xlink:href="CMC_37553-inline-1.tif"/></td>
<td><inline-graphic xlink:href="CMC_37553-inline-2.tif"/></td>
</tr>
</tbody>
</table>
</table-wrap>
<p>For further evaluation, we selected 778 datapoints as the testing set, and again predicted the fluctuations of NFT sale prices using the RF model. <xref ref-type="table" rid="table-10">Table 10</xref> shows results of the testing set. Among the 778 datapoints in the testing set, 461 and 317 points exhibited actual sale price fall and rise, respectively. The RF classification model correctly predicted the price volatility for 700 samples and incorrectly predicted price rise and fall for 36 and 42 samples, respectively. The results of the testing set are shown in <xref ref-type="table" rid="table-11">Table 11</xref>.</p>
<table-wrap id="table-10">
<label>Table 10</label>
<caption>
<title>Confusion matrix for RF</title>
</caption>
<table frame="hsides">
<colgroup>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
</colgroup>
<thead>
<tr>
<th></th>
<th></th>
<th align="center" colspan="2">Predicted</th>
<th>&#x2211;</th>
</tr>
<tr>
<th></th>
<th></th>
<th>0</th>
<th>1</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="2">Actual</td>
<td>0</td>
<td>419</td>
<td>42</td>
<td>461</td>
</tr>
<tr>
<td>1</td>
<td>36</td>
<td>281</td>
<td>317</td>
</tr>
<tr>
<td>&#x2211;</td>
<td></td>
<td>455</td>
<td>323</td>
<td>778</td>
</tr>
</tbody>
</table>
</table-wrap><table-wrap id="table-11">
<label>Table 11</label>
<caption>
<title>Prediction performance of RF</title>
</caption>
<table frame="hsides">
<colgroup>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
</colgroup>
<thead>
<tr>
<th>Algorithm</th>
<th>AUC</th>
<th>F1</th>
<th>Precision</th>
<th>Recall</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td>RF</td>
<td>0.956</td>
<td>0.878</td>
<td>0.870</td>
<td>0.886</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s4_3">
<label>4.3</label>
<title>Evaluation of Variables for Classification Model</title>
<p>After conducting a learning process to determine the fluctuations of NFT sale prices, we further analyzed the variables collected from OpenSea to identify the ones that contained more information for determining the fluctuations of NFT sale prices based on the RF model. <xref ref-type="fig" rid="fig-6">Fig. 6</xref> shows the contribution levels of the variables that affected the predictions. NFT all time avg. price (0.025) was the most influential variable for predicting price rise or fall, followed by the number of NFT favorites (0.022), number of creator&#x2019;s favorites (0.020), creator&#x2019;s activity status (0.020), and number of owner&#x2019;s collection (0.011). Among them, NFT favorite means that other users browsed this NFT and clicked the favorite button, which to some extent reflects the recognition and liking of other users for this NFT. Creator&#x2019;s favorites indicate those NFTs for which the creator of the NFT clicked the favorite button. Creator&#x2019;s activity indicates whether there has been recent NFT transfers or transaction activities. Both these variables reflect the activity level of the NFT creator. The owner&#x2019;s collection represents the number of NFTs owned by the owner.</p>
<fig id="fig-6">
<label>Figure 6</label>
<caption>
<title>Predicted contributions of the variables</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_37553-fig-6.tif"/>
</fig>
</sec>
</sec>
<sec id="s5">
<label>5</label>
<title>Implications and Discussion</title>
<sec id="s5_1">
<label>5.1</label>
<title>Theoretical Implications</title>
<p>The results of this study have important theoretical implications. First, we determined whether publicly available information on OpenSea helps to predict the price fluctuations of NFTs. According to previous studies, specific famous NFT prices are associated with cryptocurrencies and other financial assets [<xref ref-type="bibr" rid="ref-12">12</xref>,<xref ref-type="bibr" rid="ref-13">13</xref>]. However, these studies were performed without considering the thousands of ordinary NFTs sold publicly in the NFT market. We identified a significant correlation between the sale price fluctuations of these ordinary NFTs and the information publicly available on OpenSea, and herein, suggest that this information can be used to predict the fluctuations of NFT prices.</p>
<p>Second, NFT is a digital asset with considerable public interest, but few researchers have explored the price-prediction approach. Although Nadini et al. obtained the NFT trading information from Ethereum and WAX blockchains and analyzed the predictability of NFT sales using a linear regression model, they used some variables that are difficult to obtain by most investors, such as centrality of buyers and sellers in the network of NFT transactions and did not report their prediction error [<xref ref-type="bibr" rid="ref-9">9</xref>]. In the present study, we extended the study of Nadini et al. by using Python programs to collect NFT information that is readily available to most investors and used this collected NFT information as independent variables. Further, we demonstrated the prediction error by empirical analysis (RMSE &#x003D; 0.047, and MAE &#x003D; 0.007).</p>
<p>Third, in addition to the NFT information of interest, such as NFT price history and numbers of favorited and most viewed NFTs, we also evaluated the prediction value provided by the collection and related account information. The results of the classification model for predicting the fluctuations in the NFT sale prices demonstrated the importance of the relevant account information and revealed that the number of creator&#x0027;s favorites, activity status, and number of owner&#x2019;s collections primarily contribute to the prediction results. Collectively, the results of the regression and classification models indicate that these three features are crucial for predicting the NFT prices, and thus, future studies on NFT price fluctuations should be conducted by considering these three features.</p>
</sec>
<sec id="s5_2">
<label>5.2</label>
<title>Practical Implications</title>
<p>In this study, publicly available data and simple machine learning algorithms were used. As a result, the methodology and results of this study can be easily replicated by academics and practitioners aiming to predict future NFT price fluctuations.</p>
<p>The results of this study also have practical implications: First, we identified some important variables on OpenSea to predict price fluctuations of NFTs. The investors can use these NFT and account information to understand the NFT prices more simply and make appropriate investments. More specifically, we demonstrated that the prediction value of the relevant account information is higher than that of the collection information. Thus, investors can rapidly and accurately predict the NFT prices. Further, using only the NFT and related account information can prevent unnecessary data collection and reduce the calculation time. We suggest that investors can focus on the number of creators&#x2019; collections, status of their activity, and number of owners&#x2019; collections, because the NFTs created by more active creators have a greater potential. Thus, in simple words, investors need to find such active creators.</p>
<p>Second, NFT practitioners can produce more attractive NFTs and promote them to ensure more people click the favorite button. In addition, practitioners should pay attention to the role of account activity frequency, because the NFTs created by more active accounts will attract more attention and likes. Therefore, it is necessary to increase the number of favorites given to other NFTs as well as increase the NFT trading activities.</p>
<p>Third, we compared the accuracy of AdaBoost and RF, in predicting the NFT prices and their fluctuations and observed that the practitioners can use the AdaBoost and RF algorithms for predicting the NFT prices and price fluctuations, respectively.</p>
</sec>
</sec>
<sec id="s6">
<label>6</label>
<title>Conclusion and Limitations</title>
<p>In this study, machine learning algorithms were used for predicting the NFT prices and their fluctuations. We used regression models to predict the NFT sale prices and classification models to predict the NFT sale price fluctuations. The regression (RMSE &#x003D; 0.047, MAE &#x003D; 0.007) and classification (AUC &#x003D; 0.956, F1 &#x003D; 0.878, and Precision &#x003D; 0.870, Recall &#x003D; 0.886) models provided accurate predictions using the AdaBoost and RF algorithms, respectively.</p>
<p>The results indicate that, in addition to the historical average price of the NFTs, the relevant account information provides the most predictive power. The results obtained using the classification model indicate that the number of favorites and activity status of the creator are essential for the prediction performance as they attract more public attention and likes. This shows that the investors can pay more attention to the creator&#x2019;s activity frequency to assess the investment potential of the NFTs.</p>
<p>The limitations of this study are as follows:
<list list-type="order">
<list-item>
<p>Only the publicly available information on OpenSea was considered in this study. However, the website is continuously updated as the market develops, and we cannot predict what new information will be publicly available on the site in the future. In future studies, the updates on the website should be followed to obtain updated results.</p></list-item>
<list-item>
<p>We only considered the factors available on the most popular website, OpenSea. However, there are many other NFT trading marketplaces, such as <ext-link ext-link-type="uri" xlink:href="http://Binance.com">Binance.com</ext-link>. Further, as indicated in previous studies, blockchain information has a positive effect on Ethereum price prediction [<xref ref-type="bibr" rid="ref-9">9</xref>]. Thus, the impact of blockchain information on the price of NFT, which is another product of blockchain technology, will be examined in our future study.</p></list-item>
<list-item>
<p>We cannot distinguish between normal and wash trading. The NFT market, as an emerging market, does not have tenable trading rules, which leads to the phenomenon of wash trading in various trading markets.</p></list-item>
</list></p>
</sec>
</body>
<back>
<ack>
<p>This research was supported by the MSIT (Ministry of Science and ICT), Korea, under the Innovative Human Resource Development for Local Intellectualization support program (IITP-2022-RS-2022-00156287) supervised by the IITP (Institute for Information &#x0026; communications Technology Planning &#x0026; Evaluation). This work was supported by Institute for Information &#x0026; communications Technology Planning &#x0026; Evaluation (IITP) grant funded by the Korea government (MSIT) (No.2022-0-01203, Regional strategic Industry convergence security core talent training business).</p>
</ack>
<sec><title>Funding Statement</title>
<p>The authors received no specific funding for this study.</p>
</sec>
<sec sec-type="COI-statement"><title>Conflicts of Interest</title>
<p>The authors declare that they have no conflicts of interest to report regarding the present study.</p>
</sec>
<ref-list content-type="authoryear">
<title>References</title>
<ref id="ref-1"><label>[1]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>B.</given-names> <surname>White</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Mahanti</surname></string-name> and <string-name><given-names>K.</given-names> <surname>Passi</surname></string-name></person-group>, &#x201C;<article-title>Characterizing the OpenSea NFT marketplace</article-title>,&#x201D; in <conf-name>Companion Proc. of the Web Conf. 2022</conf-name>, <publisher-loc>Lyon, France</publisher-loc>, pp. <fpage>25</fpage>&#x2013;<lpage>29</lpage>, <year>2022</year>. </mixed-citation></ref>
<ref id="ref-2"><label>[2]</label><mixed-citation publication-type="other"><collab>NonFungible Corporation, NFT Market Report Q2&#x2013;2022</collab>, <year>2022</year>. [Online]. Available: <ext-link ext-link-type="uri" xlink:href="https://nonfungible.com/market-tracker">https://nonfungible.com/market-tracker</ext-link></mixed-citation></ref>
<ref id="ref-3"><label>[3]</label><mixed-citation publication-type="other"><collab>Google Trends, 2022</collab> (<year>2022</year>. [Online]. Available: <ext-link ext-link-type="uri" xlink:href="https://trends.google.com/trends/explore?q=nft&#x0026;geo=KR">https://trends.google.com/trends/explore?q=nft&#x0026;geo=KR</ext-link></mixed-citation></ref>
<ref id="ref-4"><label>[4]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M.</given-names> <surname>Dowling</surname></string-name></person-group>, &#x201C;<article-title>Fertile LAND: Pricing non-fungible tokens</article-title>,&#x201D; <source>Finance Research Letters</source>, vol. <volume>44</volume>, no. <issue>4</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>5</lpage>, <year>2022</year>. <pub-id pub-id-type="doi">10.1016/j.frl.2021.102096</pub-id></mixed-citation></ref>
<ref id="ref-5"><label>[5]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>D. G.</given-names> <surname>Baur</surname></string-name>, <string-name><given-names>K. H.</given-names> <surname>Hong</surname></string-name> and <string-name><given-names>A. D.</given-names> <surname>Lee</surname></string-name></person-group>, &#x201C;<article-title>Bitcoin: Medium of exchange or speculative assets?</article-title>&#x201D; <source>Journal of International Financial Markets, Institutions and Money</source>, vol. <volume>54</volume>, no. <issue>4</issue>, pp. <fpage>177</fpage>&#x2013;<lpage>189</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-6"><label>[6]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><given-names>Q.</given-names> <surname>Wang</surname></string-name>, <string-name><given-names>R.</given-names> <surname>Li</surname></string-name>, <string-name><given-names>Q.</given-names> <surname>Wang</surname></string-name> and <string-name><given-names>S.</given-names> <surname>Chen</surname></string-name></person-group>, &#x201C;<article-title>Non-fungible token (NFT): Overview, evaluation, opportunities and challenges</article-title>,&#x201D; <comment>arXiv preprint arXiv:2105.07447</comment>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-7"><label>[7]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><given-names>T. W.</given-names> <surname>Lounge</surname></string-name></person-group>, &#x201C;<article-title>Choosing the right blockchain for your NFT</article-title>,&#x201D; <year>2020</year>. [Online]. Available: <ext-link ext-link-type="uri" xlink:href="https://medium.com/phantasticphantasma/choosing-the-right-blockchain-for-your-nft-d1df2bebae91">https://medium.com/phantasticphantasma/choosing-the-right-blockchain-for-your-nft-d1df2bebae91</ext-link></mixed-citation></ref>
<ref id="ref-8"><label>[8]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><collab>DappRadar, NFT Marketplace</collab></person-group>, <year>2022</year>. [Online]. Available: <ext-link ext-link-type="uri" xlink:href="https://dappradar.com/nft/marketplaces">https://dappradar.com/nft/marketplaces</ext-link></mixed-citation></ref>
<ref id="ref-9"><label>[9]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M.</given-names> <surname>Nadini</surname></string-name>, <string-name><given-names>L.</given-names> <surname>Alessandretti</surname></string-name>, <string-name><given-names>D.</given-names> <surname>Giacinto</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Martino</surname></string-name>, <string-name><given-names>L. M.</given-names> <surname>Aiello</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Mapping the NFT revolution: Market trends, trade networks, and visual features</article-title>,&#x201D; <source>Scientific Reports</source>, vol. <volume>11</volume>, no. <issue>1</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>11</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-10"><label>[10]</label><mixed-citation publication-type="other"><collab>NonFungible Corporation</collab>, <article-title>Non-fungible tokens quarterly report Q3&#x2013;2021</article-title>, <year>2021</year>. [Online]. Available: <ext-link ext-link-type="uri" xlink:href="https://nonfungible.com/market-tracker">https://nonfungible.com/market-tracker</ext-link></mixed-citation></ref>
<ref id="ref-11"><label>[11]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>X. J.</given-names> <surname>Jiang</surname></string-name> and <string-name><given-names>F. L.</given-names> <surname>Xiao</surname></string-name></person-group>, &#x201C;<article-title>Cryptokitties transaction network analysis: The rise and fall of the first blockchain game mania</article-title>,&#x201D; <source>Frontiers in Physics</source>, vol. <volume>9</volume>, no. <issue>57</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>12</lpage>, <year>2021</year>. <pub-id pub-id-type="doi">10.3389/fphy.2021.631665</pub-id></mixed-citation></ref>
<ref id="ref-12"><label>[12]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M.</given-names> <surname>Dowling</surname></string-name></person-group>, &#x201C;<article-title>Is non-fungible token pricing driven by cryptocurrencies?</article-title>&#x201D; <source>Finance Research Letters</source>, vol. <volume>44</volume>, no. <issue>534</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>6</lpage>, <year>2022</year>. <pub-id pub-id-type="doi">10.1016/j.frl.2021.102097</pub-id></mixed-citation></ref>
<ref id="ref-13"><label>[13]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>D. Y.</given-names> <surname>Aharon</surname></string-name> and <string-name><given-names>E.</given-names> <surname>Demir</surname></string-name></person-group>, &#x201C;<article-title>NFTs and asset class spillovers: Lessons from the period around the <break/>COVID-19 pandemic</article-title>,&#x201D; <source>Finance Research Letters</source>, vol. <volume>47</volume>, pp. <fpage>1</fpage>&#x2013;<lpage>9</lpage>, <year>2021</year>. <pub-id pub-id-type="doi">10.1016/j.frl.2021.102515</pub-id>; <pub-id pub-id-type="pmid">36406741</pub-id></mixed-citation></ref>
<ref id="ref-14"><label>[14]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>D. C.</given-names> <surname>Mallqui</surname></string-name> and <string-name><given-names>R. A.</given-names> <surname>Fernandes</surname></string-name></person-group>, &#x201C;<article-title>Predicting the direction, maximum, minimum and closing prices of daily Bitcoin exchange rate using machine learning techniques</article-title>,&#x201D; <source>Applied Soft Computing</source>, vol. <volume>75</volume>, no. <issue>2</issue>, pp. <fpage>596</fpage>&#x2013;<lpage>606</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-15"><label>[15]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>H. M.</given-names> <surname>Kim</surname></string-name>, <string-name><given-names>G. W.</given-names> <surname>Bock</surname></string-name> and <string-name><given-names>G.</given-names> <surname>Lee</surname></string-name></person-group>, &#x201C;<article-title>Predicting Ethereum prices with machine learning based on blockchain information</article-title>,&#x201D; <source>Expert Systems with Applications</source>, vol. <volume>184</volume>, no. <issue>3</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>8</lpage>, <year>2021</year>. <pub-id pub-id-type="doi">10.1016/j.eswa.2021.115480</pub-id></mixed-citation></ref>
<ref id="ref-16"><label>[16]</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><given-names>J. D.</given-names> <surname>Kelleher</surname></string-name>, <string-name><given-names>B. M.</given-names> <surname>Namee</surname></string-name> and <string-name><given-names>A.</given-names> <surname>D&#x0027;arcy</surname></string-name></person-group>, <source>Fundamentals of Machine Learning for Predictive Data Analytics: Algorithms, Worked Examples, and Case Studies</source>. <publisher-loc>Cambridge, Massachusetts</publisher-loc>: <publisher-name>MIT press</publisher-name>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-17"><label>[17]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>K.</given-names> <surname>Kourou</surname></string-name>, <string-name><given-names>T. P.</given-names> <surname>Exarchos</surname></string-name>, <string-name><given-names>K. P.</given-names> <surname>Exarchos</surname></string-name>, <string-name><given-names>M. V.</given-names> <surname>Karamouzis</surname></string-name> and <string-name><given-names>D. I.</given-names> <surname>Fotiadis</surname></string-name></person-group>, &#x201C;<article-title>Machine learning applications in cancer prognosis and prediction</article-title>,&#x201D; <source>Computational and Structural Biotechnology Journal</source>, vol. <volume>13</volume>, pp. <fpage>8</fpage>&#x2013;<lpage>17</lpage>, <year>2015</year>; <pub-id pub-id-type="pmid">25750696</pub-id></mixed-citation></ref>
<ref id="ref-18"><label>[18]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J.</given-names> <surname>Soni</surname></string-name>, <string-name><given-names>U.</given-names> <surname>Ansari</surname></string-name>, <string-name><given-names>D.</given-names> <surname>Sharma</surname></string-name> and <string-name><given-names>S.</given-names> <surname>Soni</surname></string-name></person-group>, &#x201C;<article-title>Predictive data mining for medical diagnosis: An overview of heart disease prediction</article-title>,&#x201D; <source>International Journal of Computer Applications</source>, vol. <volume>17</volume>, no. <issue>8</issue>, pp. <fpage>43</fpage>&#x2013;<lpage>48</lpage>, <year>2011</year>.</mixed-citation></ref>
<ref id="ref-19"><label>[19]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S. C. K.</given-names> <surname>Tekouabou</surname></string-name></person-group>, &#x201C;<article-title>Intelligent management of bike sharing in smart cities using machine learning and Internet of Things</article-title>,&#x201D; <source>Sustainable Cities and Society</source>, vol. <volume>67</volume>, pp. <fpage>1</fpage>&#x2013;<lpage>14</lpage>, <year>2021</year>. <pub-id pub-id-type="doi">10.1016/j.scs.2020.102702</pub-id></mixed-citation></ref>
<ref id="ref-20"><label>[20]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>B.</given-names> <surname>Weng</surname></string-name>, <string-name><given-names>L.</given-names> <surname>Lu</surname></string-name>, <string-name><given-names>X.</given-names> <surname>Wang</surname></string-name>, <string-name><given-names>F. M.</given-names> <surname>Megahed</surname></string-name> and <string-name><given-names>W.</given-names> <surname>Martinez</surname></string-name></person-group>, &#x201C;<article-title>Predicting short-term stock prices using ensemble methods and online data sources</article-title>,&#x201D; <source>Expert Systems with Applications</source>, vol. <volume>112</volume>, no. <issue>3</issue>, pp. <fpage>258</fpage>&#x2013;<lpage>273</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-21"><label>[21]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>E. A.</given-names> <surname>Antipov</surname></string-name> and <string-name><given-names>E. B.</given-names> <surname>Pokryshevskaya</surname></string-name></person-group>, &#x201C;<article-title>Mass appraisal of residential apartments: An application of random forest for valuation and a CART-based approach for model diagnostics</article-title>,&#x201D; <source>Expert Systems with Applications</source>, vol. <volume>39</volume>, no. <issue>2</issue>, pp. <fpage>1772</fpage>&#x2013;<lpage>1778</lpage>, <year>2012</year>.</mixed-citation></ref>
<ref id="ref-22"><label>[22]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M.</given-names> <surname>&#x010C;eh</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Kilibarda</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Lisec</surname></string-name> and <string-name><given-names>B.</given-names> <surname>Bajat</surname></string-name></person-group>, &#x201C;<article-title>Estimating the performance of random forest versus multiple regression for predicting prices of the apartments</article-title>,&#x201D; <source>ISPRS International Journal of Geo-information</source>, vol. <volume>7</volume>, no. <issue>5</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>16</lpage>, <year>2018</year>. <pub-id pub-id-type="doi">10.3390/ijgi7050168</pub-id></mixed-citation></ref>
<ref id="ref-23"><label>[23]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>H.</given-names> <surname>Jang</surname></string-name> and <string-name><given-names>J.</given-names> <surname>Lee</surname></string-name></person-group>, &#x201C;<article-title>An empirical study on modeling and prediction of bitcoin prices with bayesian neural networks based on blockchain information</article-title>,&#x201D; <source>IEEE Access</source>, vol. <volume>6</volume>, pp. <fpage>5427</fpage>&#x2013;<lpage>5437</lpage>, <year>2017</year>.</mixed-citation></ref>
<ref id="ref-24"><label>[24]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J.</given-names> <surname>Abraham</surname></string-name>, <string-name><given-names>D.</given-names> <surname>Higdon</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Nelson</surname></string-name> and <string-name><given-names>J.</given-names> <surname>Ibarra</surname></string-name></person-group>, &#x201C;<article-title>Cryptocurrency price prediction using tweet volumes and sentiment analysis</article-title>,&#x201D; <source>SMU Data Science Review</source>, vol. <volume>1</volume>, no. <issue>3</issue>, pp. <fpage>1</fpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-25"><label>[25]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>F.</given-names> <surname>Valencia</surname></string-name>, <string-name><given-names>A.</given-names> <surname>G&#x00F3;mez-Espinosa</surname></string-name> and <string-name><given-names>B.</given-names> <surname>Vald&#x00E9;s-Aguirre</surname></string-name></person-group>, &#x201C;<article-title>Price movement prediction of cryptocurrencies using sentiment analysis and machine learning</article-title>,&#x201D; <source>Entropy</source>, vol. <volume>21</volume>, no. <issue>6</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>12</lpage>, <year>2019</year>. <pub-id pub-id-type="doi">10.3390/e21060589</pub-id>; <pub-id pub-id-type="pmid">33267303</pub-id></mixed-citation></ref>
<ref id="ref-26"><label>[26]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Y.</given-names> <surname>Freund</surname></string-name> and <string-name><given-names>R. E.</given-names> <surname>Schapire</surname></string-name></person-group>, &#x201C;<article-title>A decision-theoretic generalization of on-line learning and an application to boosting</article-title>,&#x201D; <source>Journal of Computer and System Sciences</source>, vol. <volume>55</volume>, no. <issue>1</issue>, pp. <fpage>119</fpage>&#x2013;<lpage>139</lpage>, <year>1997</year>.</mixed-citation></ref>
<ref id="ref-27"><label>[27]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S.</given-names> <surname>Mishra</surname></string-name>, <string-name><given-names>D.</given-names> <surname>Mishra</surname></string-name> and <string-name><given-names>G. H.</given-names> <surname>Santra</surname></string-name></person-group>, &#x201C;<article-title>Adaptive boosting of weak regressors for forecasting of crop production considering climatic variability: An empirical assessment</article-title>,&#x201D; <source>Journal of King Saud University-Computer and Information Sciences</source>, vol. <volume>32</volume>, no. <issue>8</issue>, pp. <fpage>949</fpage>&#x2013;<lpage>964</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-28"><label>[28]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>L.</given-names> <surname>Breiman</surname></string-name></person-group>, &#x201C;<article-title>Random forests</article-title>,&#x201D; <source>Machine Learning</source>, vol. <volume>45</volume>, no. <issue>1</issue>, pp. <fpage>5</fpage>&#x2013;<lpage>32</lpage>, <year>2001</year>.</mixed-citation></ref>
<ref id="ref-29"><label>[29]</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><given-names>A.</given-names> <surname>G&#x00E9;ron</surname></string-name></person-group>, <source>Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems</source>. <publisher-loc>Sebastopol, California</publisher-loc>: <publisher-name>O&#x0027;Reilly Media, Inc press</publisher-name>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-30"><label>[30]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>V.</given-names> <surname>Rodriguez-Galiano</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Sanchez-Castillo</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Chica-Olmo</surname></string-name> and <string-name><given-names>M. J. O. G. R.</given-names> <surname>Chica-Rivas</surname></string-name></person-group>, &#x201C;<article-title>Machine learning predictive models for mineral prospectivity: An evaluation of neural networks, random forest, regression trees and support vector machines</article-title>,&#x201D; <source>Ore Geology Reviews</source>, vol. <volume>71</volume>, pp. <fpage>804</fpage>&#x2013;<lpage>818</lpage>, <year>2015</year>.</mixed-citation></ref>
<ref id="ref-31"><label>[31]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M.</given-names> <surname>Stone</surname></string-name></person-group>, &#x201C;<article-title>Cross-validatory choice and assessment of statistical predictions</article-title>,&#x201D; <source>Journal of the Royal Statistical Society: Series B (Methodological)</source>, vol. <volume>36</volume>, no. <issue>2</issue>, pp. <fpage>111</fpage>&#x2013;<lpage>133</lpage>, <year>1974</year>.</mixed-citation></ref>
<ref id="ref-32"><label>[32]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>R.</given-names> <surname>Kohavi</surname></string-name></person-group>, &#x201C;<article-title>A study of cross-validation and bootstrap for accuracy estimation and model selection</article-title>,&#x201D; <source>IJCAI</source>, vol. <volume>14</volume>, no. <issue>2</issue>, pp. <fpage>1137</fpage>&#x2013;<lpage>1145</lpage>, <year>1995</year>.</mixed-citation></ref>
<ref id="ref-33"><label>[33]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><given-names>A.</given-names> <surname>Galkin</surname></string-name></person-group>, &#x201C;<article-title>What is the difference between test set and validation set</article-title>,&#x201D; <year>2018</year>.</mixed-citation></ref>
<ref id="ref-34"><label>[34]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>G. C.</given-names> <surname>Cawley</surname></string-name> and <string-name><given-names>N. L.</given-names> <surname>Talbot</surname></string-name></person-group>, &#x201C;<article-title>On over-fitting in model selection and subsequent selection bias in performance evaluation</article-title>,&#x201D; <source>The Journal of Machine Learning Research</source>, vol. <volume>11</volume>, pp. <fpage>2079</fpage>&#x2013;<lpage>2107</lpage>, <year>2010</year>.</mixed-citation></ref>
<ref id="ref-35"><label>[35]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S. C. K.</given-names> <surname>Tekouabou</surname></string-name>, <string-name><given-names>W.</given-names> <surname>Cherif</surname></string-name> and <string-name><given-names>H.</given-names> <surname>Silkan</surname></string-name></person-group>, &#x201C;<article-title>Improving parking availability prediction in smart cities with IoT and ensemble-based model</article-title>,&#x201D; <source>Journal of King Saud University-Computer and Information Sciences</source>, vol. <volume>34</volume>, no. <issue>3</issue>, pp. <fpage>687</fpage>&#x2013;<lpage>697</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-36"><label>[36]</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><given-names>A. C.</given-names> <surname>M&#x00FC;ller</surname></string-name> and <string-name><given-names>S.</given-names> <surname>Guido</surname></string-name></person-group>, <source>Introduction to Machine Learning with Python A Guide for Data Scientists</source>. <publisher-loc>Sebastopol, USA</publisher-loc>: <publisher-name>O&#x0027;Reilly Media, Inc.</publisher-name>, <year>2016</year>.</mixed-citation></ref>
<ref id="ref-37"><label>[37]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>R. A.</given-names> <surname>Kohavi</surname></string-name></person-group>, &#x201C;<article-title>Study of cross-validation and bootstrap for accuracy estimation and model selection</article-title>,&#x201D; in <conf-name>Proc. of the Fourteenth Int. Joint Conf. on Artificial Intelligence</conf-name>, <publisher-loc>Montr&#x00E9;al, Canada</publisher-loc>, pp. <fpage>1137</fpage>&#x2013;<lpage>1143</lpage>, <year>1995</year>. </mixed-citation></ref>
</ref-list>
</back>
</article>