Nothing Special   »   [go: up one dir, main page]

You seem to have javascript disabled. Please note that many of the page functionalities won't work as expected without javascript enabled.
 
 
Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (239)

Search Parameters:
Keywords = opinion mining

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
33 pages, 6468 KiB  
Article
Exploring Sentiment Analysis for the Indonesian Presidential Election Through Online Reviews Using Multi-Label Classification with a Deep Learning Algorithm
by Ahmad Nahid Ma’aly, Dita Pramesti, Ariadani Dwi Fathurahman and Hanif Fakhrurroja
Information 2024, 15(11), 705; https://doi.org/10.3390/info15110705 - 5 Nov 2024
Viewed by 522
Abstract
Presidential elections are an important political event that often trigger intense debate. With more than 139 million users, YouTube serves as a significant platform for understanding public opinion through sentiment analysis. This study aimed to implement deep learning techniques for a multi-label sentiment [...] Read more.
Presidential elections are an important political event that often trigger intense debate. With more than 139 million users, YouTube serves as a significant platform for understanding public opinion through sentiment analysis. This study aimed to implement deep learning techniques for a multi-label sentiment analysis of comments on YouTube videos related to the 2024 Indonesian presidential election. Offering a fresh perspective compared to previous research that primarily employed traditional classification methods, this study classifies comments into eight emotional labels: anger, anticipation, disgust, joy, fear, sadness, surprise, and trust. By focusing on the emotional spectrum, this study provides a more nuanced understanding of public sentiment towards presidential candidates. The CRISP-DM method is applied, encompassing stages of business understanding, data understanding, data preparation, modeling, evaluation, and deployment, ensuring a systematic and comprehensive approach. This study employs a dataset comprising 32,000 comments, obtained via YouTube Data API, from the KPU and Najwa Shihab channels. The analysis is specifically centered on comments related to presidential candidate debates. Three deep learning models—Convolutional Neural Network (CNN), Bidirectional Long Short-Term Memory (Bi-LSTM), and a hybrid model combining CNN and Bi-LSTM—are assessed using confusion matrix, Area Under the Curve (AUC), and Hamming loss metrics. The evaluation results demonstrate that the Bi-LSTM model achieved the highest accuracy with an AUC value of 0.91 and a Hamming loss of 0.08, indicating an excellent ability to classify sentiment with high precision and a low error rate. This innovative approach to multi-label sentiment analysis in the context of the 2024 Indonesian presidential election expands the insights into public sentiment towards candidates, offering valuable implications for political campaign strategies. Additionally, this research contributes to the fields of natural language processing and data mining by addressing the challenges associated with multi-label sentiment analysis. Full article
(This article belongs to the Special Issue Machine Learning and Data Mining for User Classification)
Show Figures

Figure 1

Figure 1
<p>YouTube searches for “Election” in Indonesia.</p>
Full article ">Figure 2
<p>Process of the Systematic Literature Review (SLR) illustrated using the PRISMA diagram.</p>
Full article ">Figure 3
<p>CRISP-DM methodology.</p>
Full article ">Figure 4
<p>Distribution of comments grouped by video source.</p>
Full article ">Figure 5
<p>GPT labelling process.</p>
Full article ">Figure 6
<p>CNN confusion matrix.</p>
Full article ">Figure 7
<p>Bi-LSTM confusion matrix.</p>
Full article ">Figure 8
<p>CNN Bi-LSTM confusion matrix.</p>
Full article ">Figure 9
<p>Area Under Curve results.</p>
Full article ">Figure 10
<p>Hamming loss results.</p>
Full article ">Figure 11
<p>Deployment process classification.</p>
Full article ">Figure 12
<p>Deployment classification result.</p>
Full article ">Figure 13
<p>Volume of conversation for each candidate graph.</p>
Full article ">Figure 14
<p>Emotional distribution for each candidate.</p>
Full article ">
14 pages, 1006 KiB  
Article
Correlations and Fractality in Sentence-Level Sentiment Analysis Based on VADER for Literary Texts
by Ricardo Hernández-Pérez, Pablo Lara-Martínez, Bibiana Obregón-Quintana, Larry S. Liebovitch and Lev Guzmán-Vargas
Information 2024, 15(11), 698; https://doi.org/10.3390/info15110698 - 4 Nov 2024
Viewed by 473
Abstract
We perform a sentence-level sentiment analysis study of different literary texts in English language. Each text is converted into a series in which the data points are the sentiment value of each sentence obtained using the sentiment analysis tool (VADER). By applying the [...] Read more.
We perform a sentence-level sentiment analysis study of different literary texts in English language. Each text is converted into a series in which the data points are the sentiment value of each sentence obtained using the sentiment analysis tool (VADER). By applying the Detrended Fluctuation Analysis (DFA) and the Higuchi Fractal Dimension (HFD) methods to these sentiment series, we find that they are monofractal with long-term correlations, which can be explained by the fact that the writing process has memory by construction, with a sentiment evolution that is self-similar. Furthermore, we discretize these series by applying a classification approach which transforms the series into a one on which each data point has only three possible values, corresponding to positive, neutral or negative sentiments. We map these three-states series to a Markov chain and investigate the transitions of sentiment from one sentence to the next, obtaining a state transition matrix for each book that provides information on the probability of transitioning between sentiments from one sentence to the next. This approach shows that there are biases towards increasing the probability of switching to neutral or positive sentences. The two approaches supplement each other, since the long-term correlation approach allows a global assessment of the sentiment of the book, while the state transition matrix approach provides local information about the sentiment evolution along the text. Full article
(This article belongs to the Special Issue Text Mining: Challenges, Algorithms, Tools and Applications)
Show Figures

Figure 1

Figure 1
<p>Example of one of the data series of sentiment values estimated with VADER. In this case, for the book The Adventures of Sherlock Holmes by Arthur Conan Doyle.</p>
Full article ">Figure 2
<p>Diagram depicting the possible transitions between sentiment values of sentences in a text.</p>
Full article ">Figure 3
<p><b>Top row</b>: Log–log plots of <math display="inline"><semantics> <mrow> <mo>〈</mo> <mi>L</mi> <mo>(</mo> <mi>k</mi> <mo>)</mo> <mo>〉</mo> </mrow> </semantics></math> vs. <span class="html-italic">k</span> for the HFD (<b>left</b>) and <math display="inline"><semantics> <mrow> <mi>F</mi> <mo>(</mo> <mi>n</mi> <mo>)</mo> </mrow> </semantics></math> vs <span class="html-italic">n</span> for the DFA (<b>right</b>). Each trace with different color on both plots corresponds to one book. As can be seen, the composite sentiment data series are monofractal and can be characterized by a single scaling exponent, either <span class="html-italic">D</span> or <math display="inline"><semantics> <mi>α</mi> </semantics></math>. <b>Buttom row</b>: Similar log–log plots as in the top row but for the series resulting after shuffling the sentences in the books.</p>
Full article ">Figure 4
<p>Scaling exponents for the HFD and DFA for the original texts and the shuffled ones. The straight line corresponds to the relationship <math display="inline"><semantics> <mrow> <mi>α</mi> <mo>=</mo> <mn>2</mn> <mo>−</mo> <mi>D</mi> </mrow> </semantics></math>.</p>
Full article ">Figure 5
<p>Box plots for the fraction of sentences on each sentiment state (<math display="inline"><semantics> <msub> <mi>f</mi> <mo>+</mo> </msub> </semantics></math> for positive, <math display="inline"><semantics> <msub> <mi>f</mi> <mo>×</mo> </msub> </semantics></math> for neutral and <math display="inline"><semantics> <msub> <mi>f</mi> <mo>−</mo> </msub> </semantics></math> for negative sentiments) with respect to the total number of sentences in the book. The values shown are the median (inside the box) and the lower and upper quartile. The whiskers extend from the box to show the range of the data, while the flier points are those past the end of the whiskers and are considered as outliers.</p>
Full article ">Figure 6
<p>Normalized histogram (<math display="inline"><semantics> <mrow> <mi>F</mi> <mo>(</mo> <mi>c</mi> <mo>)</mo> </mrow> </semantics></math>) for the compound sentiment <span class="html-italic">c</span>, where each trace with different color corresponds to a book. The inset is a scatter plot of the areas under the histograms for the positive and negative sides, which shows that the probability of finding sentences with positive sentiment is higher than with negative sentiment (the identity line is provided as a visual guide).</p>
Full article ">Figure 7
<p>Cumulative distribution function for the compound values for the positive sentiment (<b>left</b>) and the negative one (<b>right</b>). Each trace corresponds to a book, green color for positive values and red for negative ones. The straight line is provided as a visual guide.</p>
Full article ">Figure 8
<p>Box plots for the normalized populations of each state transition coefficient. The values for the original text and its shuffled version are given side by side (box plots in red color represent the shuffled text). The median values are marked within each box plot with a dotted line, while the mean values are marked with a solid line. The whiskers extend from the box to show the range of the data, while the flier points are those past the end of the whiskers and are considered as outliers.</p>
Full article ">
26 pages, 12142 KiB  
Article
A Study of the Evolution of Haze Microblog Concerns Based on a Co-Word Network Analysis
by Haiyue Lu, Xiaoping Rui, Runkui Li, Guangyuan Zhang, Ziqian Zhang and Mingguang Wu
ISPRS Int. J. Geo-Inf. 2024, 13(10), 352; https://doi.org/10.3390/ijgi13100352 - 4 Oct 2024
Viewed by 623
Abstract
Haze is a phenomenon caused by excessive PM2.5 (air-borne particulate matter having a diameter of fewer than 2.5 μm) and other pollutants and results from the interaction between specific climatic conditions and human activities. It significantly impacts human health, transportation, and the natural [...] Read more.
Haze is a phenomenon caused by excessive PM2.5 (air-borne particulate matter having a diameter of fewer than 2.5 μm) and other pollutants and results from the interaction between specific climatic conditions and human activities. It significantly impacts human health, transportation, and the natural environment and has aroused widespread concern. However, the influence of haze on human mental health, being hidden and indirect, is often overlooked. When haze pollution occurs, people express their feelings and concerns about haze events on media such as Weibo. At present, few studies focus on haze public opinion, as well as the changing trends in people’s discussion of haze since its emergence, which is of great significance for haze response and resource management. Based on the perspective of topic analysis, this study explores the psychological impact of haze on people by exploring the feelings of netizens in haze public opinion and investigates the evolution of people’s concerns based on long-term public opinion data. In this study, seven typical provinces and cities in China with severe haze pollution were selected as the research area. Based on data on the “haze” theme from Weibo from 2013 to 2019, first, the microblog posts were preprocessed, and the keyword co-word network was constructed. Second, the Louvain algorithm was used to detect the topic community. Based on this, the cosine similarity was calculated to realize the temporal evolution analysis of topics. The results show that with the development and change in haze pollution, the content and intensity of the topics netizens pay attention to have changed, including five types: merger, split, survival, transformation, and rebirth/extinction. People’s attention to haze shows obvious spatial differences, and it is related to the degree of haze pollution, which is bipolar. Areas with severe haze tend to pay more attention to haze itself and its influence, while areas with light haze pay more attention to haze control. The research results can provide valuable insights for governments and relevant departments in guiding public opinion and resource allocation. Full article
Show Figures

Figure 1

Figure 1
<p>The framework for topic evolution analysis.</p>
Full article ">Figure 2
<p>The geographical location of the study area. Note: The geographical location of the study area was drawn using ArcGIS 10.4.</p>
Full article ">Figure 3
<p>Diagram of the co-word network structure.</p>
Full article ">Figure 4
<p>The co-word network in winter 2013–2019 (<b>a</b>–<b>g</b>). Note: Nodes with the same color belong to the same topic community, and light blue nodes indicate that they do not belong to any topic community.</p>
Full article ">Figure 5
<p>The spatial differences of topic popularity. (<b>a</b>–<b>g</b>) are schematic diagrams of the spatial distribution of topic popularity from 2013 to 2019.</p>
Full article ">Figure 5 Cont.
<p>The spatial differences of topic popularity. (<b>a</b>–<b>g</b>) are schematic diagrams of the spatial distribution of topic popularity from 2013 to 2019.</p>
Full article ">Figure 6
<p>The topic evolution map of Beijing.</p>
Full article ">Figure 7
<p>The topic evolution map of Shandong.</p>
Full article ">Figure 8
<p>The topic evolution map of Liaoning.</p>
Full article ">Figure 9
<p>The topic evolution map of Hebei.</p>
Full article ">Figure 10
<p>The topic evolution map of Shanxi.</p>
Full article ">Figure 11
<p>The topic evolution map of Tianjin.</p>
Full article ">Figure 12
<p>The topic evolution map of Inner Mongolia. Note (this note applies to <a href="#ijgi-13-00352-f006" class="html-fig">Figure 6</a>, <a href="#ijgi-13-00352-f007" class="html-fig">Figure 7</a>, <a href="#ijgi-13-00352-f008" class="html-fig">Figure 8</a>, <a href="#ijgi-13-00352-f009" class="html-fig">Figure 9</a>, <a href="#ijgi-13-00352-f010" class="html-fig">Figure 10</a>, <a href="#ijgi-13-00352-f011" class="html-fig">Figure 11</a> and <a href="#ijgi-13-00352-f012" class="html-fig">Figure 12</a>): The phrase represents the topic community, the color rectangular color block represents the topic community and its importance, and the gray curved color block represents the evolutionary relationship between the topic communities. This figure was drawn using power BI.</p>
Full article ">
27 pages, 5795 KiB  
Article
Modeling and Analysis of Coupled Online–Offline Opinion Dissemination Scenarios Based on Multi-Factor Interaction
by Zhuo Yang, Yan Guo, Yu-Wei She, Fu-Lian Yin and Yue-Wei Wu
Electronics 2024, 13(19), 3829; https://doi.org/10.3390/electronics13193829 - 27 Sep 2024
Viewed by 519
Abstract
In recent years, new media have exacerbated the complexity of online public opinion scenarios through fragmentation of information, diversification of public opinion, rapid diffusion of public opinion, and concealment of information sources, which have posed several serious challenges to the benign development of [...] Read more.
In recent years, new media have exacerbated the complexity of online public opinion scenarios through fragmentation of information, diversification of public opinion, rapid diffusion of public opinion, and concealment of information sources, which have posed several serious challenges to the benign development of online public opinion ecosystems. Therefore, based on diversified public opinion scenarios, we study the interaction between information dissemination and the evolution of group opinions and the dissemination laws to solve the problem of imprecise grasping of the dissemination laws in complex public opinion scenarios. Facing the two-way interaction between online platforms and real society, we constructed a coupled online–offline viewpoint evolution dynamics model, which considers factors such as the user subject level and the network environment level, and combines viewpoint dynamics theory with information dissemination dynamics theory. Based on the real case of dual interaction between online and offline, we carry out the construction of a two-layer coupling network and numerical fitting comparison experiments to study the synergistic and penetration mechanism of public opinion in both online and offline multi-spaces. Based on parametric analysis experiments, the influence of different factors on communication indicators is mined, and the driving effect of the viewpoint environment of offline communication on online public opinion is studied, which reveals the objective role of multi-factors on the law of intralayer communication, cross-network communication, and viewpoint evolution, and provides strategic suggestions for the comprehensive management of public opinion in online–offline large-scale mass incidents. Full article
Show Figures

Figure 1

Figure 1
<p>Schematic diagram of coupled online–offline opinion dissemination scenarios. (The online layer represents cyberspace, and the offline layer represents real space).</p>
Full article ">Figure 2
<p>Ebbinghaus memory and forgetting curve [<a href="#B34-electronics-13-03829" class="html-bibr">34</a>].</p>
Full article ">Figure 3
<p>User energetic value curve.</p>
Full article ">Figure 4
<p>Macro-framework of SLFI-JA opinion evolutionary propagation dynamics modeling.</p>
Full article ">Figure 5
<p>A framework for modeling the dynamics of the evolutionary propagation of offline opinions. (The parameters labeled red in the figure reflect the state transfer probabilities under the action of the opinion values).</p>
Full article ">Figure 6
<p>Propagation of cumulative and effective volume curves.</p>
Full article ">Figure 7
<p>The law of online–offline network construction. (The number indicates the user number).</p>
Full article ">Figure 8
<p>Nodal degree distribution of online and offline layers. (<b>a</b>) Distribution of degrees of nodes at the online level (range of degrees: 2 to 20). (<b>b</b>) Distribution of degrees of nodes at the offline level (range of degrees: 1 to 20).</p>
Full article ">Figure 9
<p>Experimental implementation process for numerical fitting of online–offline coupled models. (<b>a</b>) Overall basic implementation logic; (<b>b</b>) implementation logic within the online and offline networks.</p>
Full article ">Figure 10
<p>Results of model fitting for online dissemination trends. (The red curve represents the simulation results of the model and the green curve represents the direction of the real data.) (<b>a</b>) Online–offline coupling model fitting results. (<b>b</b>) Online SLFI-JA model fitting results.</p>
Full article ">Figure 11
<p>Results of parameter sensitivity analysis for different parameters based on each indicator. (The value of the vertical axis corresponding to blue color represents the S1 value of the parameter; the value of the vertical axis corresponding to orange color represents the ST value of the parameter. The difference between the orange and blue vertical axis values reflects the total interaction value between the parameter and all other parameters.) (<b>a</b>) Based on the scope of online dissemination <math display="inline"><semantics> <mrow> <msub> <mi>C</mi> <mrow> <mi>f</mi> <mi>i</mi> <mi>n</mi> <mi>a</mi> <mi>l</mi> </mrow> </msub> </mrow> </semantics></math>. (<b>b</b>) Based on online propagation peaks <math display="inline"><semantics> <mrow> <msub> <mi>F</mi> <mrow> <mi>m</mi> <mi>a</mi> <mi>x</mi> </mrow> </msub> </mrow> </semantics></math>. (<b>c</b>) Based on the number of interlayer interactions <math display="inline"><semantics> <mrow> <msub> <mi>C</mi> <mrow> <mi>A</mi> <mi>B</mi> </mrow> </msub> </mrow> </semantics></math>. (<b>d</b>) Based on online peak times <math display="inline"><semantics> <mrow> <msub> <mi>t</mi> <mrow> <mi>m</mi> <mi>a</mi> <mi>x</mi> </mrow> </msub> </mrow> </semantics></math>.</p>
Full article ">Figure 12
<p>Scope of online information dissemination. (The red, blue and green lines in the figure represent <math display="inline"><semantics> <mrow> <mi>δ</mi> <mo>=</mo> <mn>0</mn> <mo>,</mo> <mo> </mo> <mn>0.5</mn> <mo>,</mo> <mo> </mo> <mn>1</mn> </mrow> </semantics></math>.) (<b>a</b>) BA-WS coupling network; (<b>b</b>) BA-ER coupling network; (<b>c</b>) BA-BA coupling network.</p>
Full article ">Figure 13
<p>Range of online information propagation in BA-WS coupled networks with different interlayer heterodyne degrees. (Light red, blue, green, dark red, pink, and black are represented in the diagram: <math display="inline"><semantics> <mrow> <mi>δ</mi> <mo>=</mo> <mn>0</mn> <mo>,</mo> <mo> </mo> <mn>0.2</mn> <mo>,</mo> <mo> </mo> <mn>0.4</mn> <mo>,</mo> <mo> </mo> <mn>0.6</mn> <mo>,</mo> <mo> </mo> <mn>0.8</mn> <mo>,</mo> <mo> </mo> <mn>1</mn> </mrow> </semantics></math>).</p>
Full article ">
23 pages, 2512 KiB  
Article
Overview of Salt Cavern Oil Storage Development and Site Suitability Analysis
by Nan Zhang, Xinrong Gao, Baoxu Yan, Yun Zhang, Songtao Ji and Xilin Shi
Processes 2024, 12(8), 1709; https://doi.org/10.3390/pr12081709 - 14 Aug 2024
Cited by 1 | Viewed by 1475
Abstract
Salt cavern storage, characterized by its safety, stability, large scale, economic viability, and efficiency, stands out as a cost-effective and relatively secure method for large-scale petroleum reserves. This paper provides an overview of the current development status of salt cavern storage technologies both [...] Read more.
Salt cavern storage, characterized by its safety, stability, large scale, economic viability, and efficiency, stands out as a cost-effective and relatively secure method for large-scale petroleum reserves. This paper provides an overview of the current development status of salt cavern storage technologies both domestically and internationally, analyzes the advantageous conditions and numerous challenges faced by salt cavern Strategic Petroleum Reserve (SPR) storage in China, and forecasts the development trends of this technology. The conclusions indicate that China possesses all of the necessary conditions for the development of salt cavern storage. Moreover, utilizing the Analytical Hierarchy Process (AHP), a macro suitability hierarchical evaluation system is constructed for the site selection and construction of salt cavern storage facilities. This system quantifies various site selection indicators, integrating expert opinions and findings from relevant theoretical research to establish grading standards for the suitability indices of salt cavern storage construction. Applied to the site evaluation of salt cavern storage at the Jintan Salt Mine in Jiangsu, the results indicate its high suitability for storage construction, making it an ideal location for establishing such facilities. The evaluation results are consistent with expert opinions, demonstrating the rationality of this method. Full article
(This article belongs to the Section Energy Systems)
Show Figures

Figure 1

Figure 1
<p>Outer dependency for oil of China and days of oil availability by country [<a href="#B1-processes-12-01709" class="html-bibr">1</a>].</p>
Full article ">Figure 2
<p>Distribution of U.S. strategic oil storage.</p>
Full article ">Figure 3
<p>Distribution of China’s strategic oil storage.</p>
Full article ">Figure 4
<p>The process of solution mining during the construction of SPR in rock salt.</p>
Full article ">Figure 5
<p>Construction of the judgment matrix in AHP. <span class="html-italic">O</span> represents the decision objective. <span class="html-italic">C<sub>i</sub></span> represents the evaluation criterion.</p>
Full article ">Figure 6
<p>Target hierarchy model of the suitability evaluation of the SPR salt cavern. <span class="html-italic">O</span> represents the decision objective; <span class="html-italic">C<sub>i</sub></span> represents the evaluation criterion; <span class="html-italic">C<sub>ij</sub></span> represents the <span class="html-italic">j</span>th basic indicator under the <span class="html-italic">i</span>th evaluation criterion in the criterion layer.</p>
Full article ">
24 pages, 4686 KiB  
Article
A Novel Risk Assessment Approach for Open-Cast Coal Mines Using Hybrid MCDM Models with Interval Type-2 Fuzzy Sets: A Case Study in Türkiye
by Mert Mutlu, Nazli Ceren Cetin and Seyhan Onder
Systems 2024, 12(8), 267; https://doi.org/10.3390/systems12080267 - 25 Jul 2024
Viewed by 1474
Abstract
Mining is a high-risk industry where occupational accidents are common due to its complex nature. Therefore, providing a more holistic and dynamic risk assessment framework is essential to identify and minimize the potential risks and enhance safety measures. Unfortunately, traditional risk assessment methods [...] Read more.
Mining is a high-risk industry where occupational accidents are common due to its complex nature. Therefore, providing a more holistic and dynamic risk assessment framework is essential to identify and minimize the potential risks and enhance safety measures. Unfortunately, traditional risk assessment methods have limitations and shortcomings, such as uncertainty, differences in experience backgrounds, and insufficiency to articulate the opinions of experts. In this paper, a novel risk assessment method precisely for such cases in the mining sector is proposed, applied, and compared with traditional methods. The objective of this study is to determine the risk scores of Turkish Coal Enterprises, based on non-fatal occupational accidents, which operates eight large-scale open-cast coal mine enterprises in Türkiye. The causes of the accidents were categorized into 25 sub-criteria under 6 main criteria. The risk scores for these criteria were computed using the Pythagorean fuzzy Analytical Hierarchy Process (PFAHP) method. The first shift (8–16 h) (0.6341) for the shift category is ranked highest out of the 25 sub-risk factors, followed by maintenance personnel (0.5633) for the occupation category; the open-cast mining area (0.5524) for the area category, the 45–57 age range (0.5279) for employee age category, and the mining machine (0.4247) for the reason category, respectively. The methodologies proposed in this study not only identify the most important risk factors in enterprises, but also provide a mechanism for risk-based rankings of enterprises by their calculated risk scores. The enterprises were risk-based ranked with the fuzzy Technique for Order Preference by Similarity to Ideal Solution (FTOPSIS) method and Paksoy approach based on interval type-2 fuzzy sets (IT2FSs). The findings indicate that the first three risk score rankings of enterprises are the same for both approaches. To examine the consistency of the applied methods, sensitivity analyses were performed. The results of the study also indicate that the proposed approaches are recommended for effective use in the mining sector due to their ease of application compared to other methods and their dynamic nature in the risk assessment process. Full article
Show Figures

Figure 1

Figure 1
<p>The “Swiss cheese” model (adapted from [<a href="#B17-systems-12-00267" class="html-bibr">17</a>,<a href="#B18-systems-12-00267" class="html-bibr">18</a>,<a href="#B19-systems-12-00267" class="html-bibr">19</a>,<a href="#B20-systems-12-00267" class="html-bibr">20</a>,<a href="#B21-systems-12-00267" class="html-bibr">21</a>,<a href="#B22-systems-12-00267" class="html-bibr">22</a>,<a href="#B23-systems-12-00267" class="html-bibr">23</a>,<a href="#B24-systems-12-00267" class="html-bibr">24</a>]).</p>
Full article ">Figure 2
<p>The upper trapezoidal membership function <math display="inline"><semantics> <mrow> <msubsup> <mover accent="true"> <mi>A</mi> <mo stretchy="false">˜</mo> </mover> <mi>i</mi> <mi>U</mi> </msubsup> </mrow> </semantics></math> and the lower trapezoidal membership function <math display="inline"><semantics> <mrow> <msubsup> <mover accent="true"> <mi>A</mi> <mo stretchy="false">˜</mo> </mover> <mi>i</mi> <mi>L</mi> </msubsup> </mrow> </semantics></math> of the interval type-2 fuzzy set <math display="inline"><semantics> <mrow> <msub> <mover accent="true"> <mi>A</mi> <mo stretchy="false">˜</mo> </mover> <mi>i</mi> </msub> </mrow> </semantics></math>.</p>
Full article ">Figure 3
<p>The flowchart of the proposed approaches [<a href="#B32-systems-12-00267" class="html-bibr">32</a>].</p>
Full article ">Figure 4
<p>Hierarchical structure of the risk-based classification approach.</p>
Full article ">Figure 5
<p>Risk rankings comparison of enterprises according to the two proposed methods [<a href="#B32-systems-12-00267" class="html-bibr">32</a>].</p>
Full article ">Figure 6
<p>Fuzzy TOPSIS results of sensitivity analysis.</p>
Full article ">Figure 7
<p>Paksoy approach results of sensitivity analysis.</p>
Full article ">
17 pages, 18435 KiB  
Article
Dynamic Mining of Consumer Demand via Online Hotel Reviews: A Hybrid Method
by Weiping Yu, Fasheng Cui, Ping Wang and Xin Liao
J. Theor. Appl. Electron. Commer. Res. 2024, 19(3), 1831-1847; https://doi.org/10.3390/jtaer19030090 - 18 Jul 2024
Viewed by 1048
Abstract
This study aims to dynamically mine the demands of hotel consumers. A total of 378,270 online reviews in the cities of Beijing, Chengdu, and Guangzhou in China were crawled using Python. Natural language processing (e.g., opinion mining and the BERT model) and an [...] Read more.
This study aims to dynamically mine the demands of hotel consumers. A total of 378,270 online reviews in the cities of Beijing, Chengdu, and Guangzhou in China were crawled using Python. Natural language processing (e.g., opinion mining and the BERT model) and an improved Kano model (containing One-dimensional, Attractive, Indifferent, and Must-be) were utilised to analyse online hotel reviews. The results indicate that the hotel attributes that consumers care about (e.g., Clean, Breakfast, and Front Desk) are dynamically fluctuating, and the attention and satisfaction of corresponding attributes will also change. This study classified consumer demand into eight types across cities and found that it changes over time. In addition, we also found that hotel attributes, satisfaction and attention, and consumer demands vary among different cities. Existing studies of capturing consumer demand are usually time-consuming and static, and the results are subjective. This study compared and analysed the consumer demands of hotels in different cities via a dynamic perspective, and used hybrid methods to improve the granularity of the analysis, expanding the general applicability of the Kano model. Hotel managers can refer to the results of this article to allocate resources for improvement and create competitive hotel services. Full article
Show Figures

Figure 1

Figure 1
<p>Research flow.</p>
Full article ">Figure 2
<p>Input sequence and vector representation.</p>
Full article ">Figure 3
<p>Network visualisation for each city.</p>
Full article ">Figure 4
<p>Dynamic consumer demand changes for Kano categorisation.</p>
Full article ">
13 pages, 1512 KiB  
Article
A Framework Model of Mining Potential Public Opinion Events Pertaining to Suspected Research Integrity Issues with the Text Convolutional Neural Network model and a Mixed Event Extractor
by Zongfeng Zou, Xiaochen Ji and Yingying Li
Information 2024, 15(6), 303; https://doi.org/10.3390/info15060303 - 24 May 2024
Viewed by 659
Abstract
With the development of the Internet, the oversight of research integrity issues has extended beyond the scientific community to encompass the whole of society. If these issues are not addressed promptly, they can significantly impact the research credibility of both institutions and scholars. [...] Read more.
With the development of the Internet, the oversight of research integrity issues has extended beyond the scientific community to encompass the whole of society. If these issues are not addressed promptly, they can significantly impact the research credibility of both institutions and scholars. This article proposes a text convolutional neural network based on SMOTE to identify short texts of potential public opinion events related to suspected scientific integrity issues from common short texts. The SMOTE comprehensive sampling technique is employed to handle imbalanced datasets. To mitigate the impact of short text length on text representation quality, the Doc2vec embedding model is utilized to represent short text, yielding a one-dimensional dense vector. Additionally, the dimensions of the input layer and convolution kernel of TextCNN are adjusted. Subsequently, a short text event extraction model based on TF-IDF and TextRank is proposed to extract crucial information, for instance, names and research-related institutions, from events and facilitate the identification of potential public opinion events related to suspected scientific integrity issues. Results of experiments have demonstrated that utilizing SMOTE to balance the dataset is able to improve the classification results of TextCNN classifiers. Compared to traditional classifiers, TextCNN exhibits greater robustness in addressing the problems of imbalanced datasets. However, challenges such as low information content, non-standard writing, and polysemy in short texts may impact the accuracy of event extraction. The framework can be further optimized to address these issues in the future. Full article
Show Figures

Figure 1

Figure 1
<p>Flowchart of our approach.</p>
Full article ">Figure 2
<p>The general TextCNN model architecture diagram. Colors differ to indicate the outcomes of applying different convolutional kernels.</p>
Full article ">Figure 3
<p>Flowchart of TF-IDF and TextRank-based Mixed Event Extractor.</p>
Full article ">
23 pages, 617 KiB  
Article
Navigating Land Conservation, Utilization, and Market Solutions: Insights from the Lanyang River Watershed, Taiwan
by Wan-Jiun Chen, Jihn-Fa Jan, Chih-Hsin Chung and Shyue-Cherng Liaw
Sustainability 2024, 16(11), 4326; https://doi.org/10.3390/su16114326 - 21 May 2024
Cited by 1 | Viewed by 1112
Abstract
In the current fraught relationship between nature and human society, land conservation and utilization have spawned intensive conflicts that require mediation. The present study explores this issue of coordination between nature and society in a fragile watershed located in northeastern Taiwan: the Lanyang [...] Read more.
In the current fraught relationship between nature and human society, land conservation and utilization have spawned intensive conflicts that require mediation. The present study explores this issue of coordination between nature and society in a fragile watershed located in northeastern Taiwan: the Lanyang River Watershed. Land zoning in this area has been historically classified and legally implemented, and additional development is constrained by an application review process. Currently, additional land utilization is still in demand in sensitive areas of this watershed, such as for mining and tilling. Due to the geographically, geologically, and climatically fragile characteristics of the watershed, the hillside residents have benefited from the conservation of nature with comprehensive ecosystem services but are at the forefront of the loss of life and property caused by forest ecosystem degradation. They are one of the key local resource users and main stakeholders. Applying the contingent valuation method to survey the hillside residents, the present study assessed the economic value they receive from the comprehensive ecosystem services offered by the natural forest ecosystems. Their opinions are explored using a survey on their awareness of ecosystem damage, their opinions on damage compensation, and on the feasible compensation channels for damage. As the study results ascertained the high value of the comprehensive ecosystem services continuously delivered by the conserved forest ecosystem, the study affirmed that conservation in the area classified and zoned as sensitive is an economic beneficial policy. With a high regard for ecosystem services and awareness of the impact of degradation and of the general agreement for the feasibility of channels of damage compensation, the continuity of conservation for these comprehensive ecosystem services is the preferred strategy for the local hillside residents. To emphasize this further, the opinions of the local community at the intersection of nature and society, where there is a delineated land zoning framework, strongly favor conservation over intensive resource exploitation and agricultural expansion, making further development an unfavorable strategy. Full article
Show Figures

Figure 1

Figure 1
<p>Study site: Sanshing, Yuangshan, and Dongshan Townships in Yilan County, Taiwan.</p>
Full article ">
19 pages, 4615 KiB  
Article
Fake User Detection Based on Multi-Model Joint Representation
by Jun Li, Wentao Jiang, Jianyi Zhang, Yanhua Shao and Wei Zhu
Information 2024, 15(5), 266; https://doi.org/10.3390/info15050266 - 9 May 2024
Viewed by 1229
Abstract
The existing deep learning-based detection of fake information focuses on the transient detection of news itself. Compared to user category profile mining and detection, transient detection is prone to higher misjudgment rates due to the limitations of insufficient temporal information, posing new challenges [...] Read more.
The existing deep learning-based detection of fake information focuses on the transient detection of news itself. Compared to user category profile mining and detection, transient detection is prone to higher misjudgment rates due to the limitations of insufficient temporal information, posing new challenges to social public opinion monitoring tasks such as fake user detection. This paper proposes a multimodal aggregation portrait model (MAPM) based on multi-model joint representation for social media platforms. It constructs a deep learning-based multimodal fake user detection framework by analyzing user behavior datasets within a time retrospective window. It integrates a pre-trained Domain Large Model to represent user behavior data across multiple modalities, thereby constructing a high-generalization implicit behavior feature spectrum for users. In response to the tendency of existing fake user behavior mining to neglect time-series features, this study introduces an improved network called Sequence Interval Detection Net (SIDN) based on Sequence to Sequence (seq2seq) to characterize time interval sequence behaviors, achieving strong expressive capabilities for detecting fake behaviors within the time window. Ultimately, the amalgamation of latent behavioral features and explicit characteristics serves as the input for spectral clustering in detecting fraudulent users. The experimental results on Weibo real dataset demonstrate that the proposed model outperforms the detection utilizing explicit user features, with an improvement of 27.0% in detection accuracy. Full article
(This article belongs to the Special Issue 2nd Edition of Information Retrieval and Social Media Mining)
Show Figures

Figure 1

Figure 1
<p>Network architecture of MAPM.</p>
Full article ">Figure 2
<p>Network architecture of SIDN.</p>
Full article ">Figure 3
<p>The reasoning process of SIDN.</p>
Full article ">Figure 4
<p>Heatmap of sample features for different classes.</p>
Full article ">Figure 5
<p>Box plots of time interval data.</p>
Full article ">Figure 6
<p>Kernel density plot of time intervals for different user categories.</p>
Full article ">Figure 7
<p>T-SNE data distribution plot when excluding certain features as clustering features.</p>
Full article ">Figure 7 Cont.
<p>T-SNE data distribution plot when excluding certain features as clustering features.</p>
Full article ">Figure 8
<p>The visualization results of the ablation experiment’s accuracy and recall.</p>
Full article ">Figure 9
<p>The impact of different parameters on model performance.</p>
Full article ">Figure 10
<p>The accuracy across different categories.</p>
Full article ">
18 pages, 846 KiB  
Article
Distantly Supervised Explainable Stance Detection via Chain-of-Thought Supervision
by Daijun Ding, Genan Dai, Cheng Peng, Xiaojiang Peng, Bowen Zhang and Hu Huang
Mathematics 2024, 12(7), 1119; https://doi.org/10.3390/math12071119 - 8 Apr 2024
Viewed by 1185
Abstract
Investigating public attitudes on social media is crucial for opinion mining systems. Stance detection aims to predict the attitude towards a specific target expressed in a text. However, effective neural stance detectors require substantial training data, which are challenging to curate due to [...] Read more.
Investigating public attitudes on social media is crucial for opinion mining systems. Stance detection aims to predict the attitude towards a specific target expressed in a text. However, effective neural stance detectors require substantial training data, which are challenging to curate due to the dynamic nature of social media. Moreover, deep neural networks (DNNs) lack explainability, rendering them unsuitable for scenarios requiring explanations. We propose a distantly supervised explainable stance detection framework (DS-ESD), comprising an instruction-based chain-of-thought (CoT) method, a generative network, and a transformer-based stance predictor. The CoT method employs prompt templates to extract stance detection explanations from a very large language model (VLLM). The generative network learns the input-explanation mapping, and a transformer-based stance classifier is trained with VLLM-annotated stance labels, implementing distant supervision. We propose a label rectification strategy to mitigate the impact of erroneous labels. Experiments on three benchmark datasets showed that our model outperformed the compared methods, validating its efficacy in stance detection tasks. This research contributes to the advancement of explainable stance detection frameworks, leveraging distant supervision and label rectification strategies to enhance performance and interpretability. Full article
(This article belongs to the Section Mathematics and Computer Science)
Show Figures

Figure 1

Figure 1
<p>The overall structure of propose DS-ESD, including instruct-based CoT (<b>1</b>), generative model (<b>2</b>), and the transformer-based stance network (<b>3</b>).</p>
Full article ">Figure 2
<p><span class="html-italic">1</span>-shot instruction.</p>
Full article ">Figure 3
<p>Example of instruct-based CoT.</p>
Full article ">Figure 4
<p>Framework of label rectification strategy.</p>
Full article ">Figure 5
<p>Experimental results on varying amounts of labeled data; (<b>a</b>) F1 score on “L”; (<b>b</b>) F1 score on “F”.</p>
Full article ">
22 pages, 2213 KiB  
Article
Understanding Customer Opinion Change on Fresh Food E-Commerce Products and Services—Comparative Analysis before and during COVID-19 Pandemic
by Yanlai Li, Zifan Shen, Cuiming Zhao, Kwai-Sang Chin and Xuwei Lang
Sustainability 2024, 16(7), 2699; https://doi.org/10.3390/su16072699 - 25 Mar 2024
Viewed by 1426
Abstract
During the coronavirus disease 2019 (COVID-19) pandemic, non-face-to-face e-commerce has become a significant consumer channel for customers to buy fresh food. However, little is known about customer opinion changes in fresh food e-commerce (FFEC) products and services during COVID-19. This study investigated the [...] Read more.
During the coronavirus disease 2019 (COVID-19) pandemic, non-face-to-face e-commerce has become a significant consumer channel for customers to buy fresh food. However, little is known about customer opinion changes in fresh food e-commerce (FFEC) products and services during COVID-19. This study investigated the changes in expectations and preferences of FFEC customers on products and services before and during the pandemic from online reviews through a text mining approach. We divided the pandemic into two phases, acute and recovery, and found that eight attributes affect customers’ opinions. Some logistic service-related attributes gained customer attention during the acute phase, but product-related attributes gained more attention in the recovery phase. Customers showed a great level of forgiveness on many attributes during the acute phase, but customers’ dissatisfaction was expressed during the recovery phase. Finally, the results of the comparative importance–performance analysis provide improvement strategies for FFEC and help optimize their resource allocation of FFEC and enhance sustainable operation capacity in the case of a crisis. Full article
Show Figures

Figure 1

Figure 1
<p>The text-mining methodology of this study.</p>
Full article ">Figure 2
<p>IPA.</p>
Full article ">Figure 3
<p>CIPA.</p>
Full article ">Figure 4
<p>SSE of Mini-Batch K-means algorithm.</p>
Full article ">Figure 5
<p>Changes in attributes’ importance before and after COVID-19.</p>
Full article ">Figure 6
<p>Changes in attribute performance across dynamics.</p>
Full article ">Figure 7
<p>Results of CIPA.</p>
Full article ">
15 pages, 2849 KiB  
Article
A Novel Influence Analysis-Based University Major Similarity Study
by Ningqi Zhang, Qingyun Li, Sissi Xiaoxiao Wu, Junjie Zhu and Jie Han
Educ. Sci. 2024, 14(3), 337; https://doi.org/10.3390/educsci14030337 - 21 Mar 2024
Cited by 1 | Viewed by 1049
Abstract
In the field of education, investigating the relationships between different majors in universities is an important topic in current educational research. The application of social networks from informatics provides new opportunities and potentials for the field of education. Due to the complexity of [...] Read more.
In the field of education, investigating the relationships between different majors in universities is an important topic in current educational research. The application of social networks from informatics provides new opportunities and potentials for the field of education. Due to the complexity of social interactions, the social network connections surrounding individuals exert a significant influence on their daily decision-making processes. This paper aims to introduce the social network and influence analysis theories from informatics into the field of education, regarding major as a variable, and comparing and analyzing the influence relationships between majors. An empirical study was conducted, involving the collection of questionnaire data on graduates’ evaluations of various aspects of their university experiences across different majors. The evolution of this model follows the DeGroot opinion dynamics with the inclusion of stubborn nodes. By defining leader majors and general majors based on the data and modeling the questionnaire data as the outcome of a discrete random process, an influence matrix is ultimately generated through the opinion dynamic model. Through this modeling approach, we revealed the underlying influence relationships between different disciplines (majors). These findings provide schools with insights to adjust the directions of discipline cultivation, and offer new perspectives and methods for the study of majors in higher education. Full article
(This article belongs to the Special Issue Challenges and Trends for Modern Higher Education)
Show Figures

Figure 1

Figure 1
<p>Each pixel could be considered as a triple composed of RGB.</p>
Full article ">Figure 2
<p>A tensor is an N-dimensional array of data.</p>
Full article ">Figure 3
<p>Multiply tensor and vector to reduce the dimension.</p>
Full article ">Figure 4
<p>The influence matrix among majors.</p>
Full article ">Figure 5
<p>The first example of the influence matrix: the mutual influence among similar majors.</p>
Full article ">Figure 6
<p>The second example of the influence matrix: the mutual influence among related majors.</p>
Full article ">Figure 7
<p>Visualization of majors’ influence relationships with Gephi.</p>
Full article ">
24 pages, 2186 KiB  
Article
A Machine Learning-Based Pipeline for the Extraction of Insights from Customer Reviews
by Róbert Lakatos, Gergő Bogacsovics, Balázs Harangi, István Lakatos, Attila Tiba, János Tóth, Marianna Szabó and András Hajdu
Big Data Cogn. Comput. 2024, 8(3), 20; https://doi.org/10.3390/bdcc8030020 - 22 Feb 2024
Viewed by 2039
Abstract
The efficiency of natural language processing has improved dramatically with the advent of machine learning models, particularly neural network-based solutions. However, some tasks are still challenging, especially when considering specific domains. This paper presents a model that can extract insights from customer reviews [...] Read more.
The efficiency of natural language processing has improved dramatically with the advent of machine learning models, particularly neural network-based solutions. However, some tasks are still challenging, especially when considering specific domains. This paper presents a model that can extract insights from customer reviews using machine learning methods integrated into a pipeline. For topic modeling, our composite model uses transformer-based neural networks designed for natural language processing, vector-embedding-based keyword extraction, and clustering. The elements of our model have been integrated and tailored to better meet the requirements of efficient information extraction and topic modeling of the extracted information for opinion mining. Our approach was validated and compared with other state-of-the-art methods using publicly available benchmark datasets. The results show that our system performs better than existing topic modeling and keyword extraction methods in this task. Full article
(This article belongs to the Special Issue Artificial Intelligence and Natural Language Processing)
Show Figures

Figure 1

Figure 1
<p>Embedded vector space from the Amazon Reviews dataset visualized using PCA.</p>
Full article ">Figure 2
<p>Embedded vector space of the Amazon Reviews dataset visualized using TSNE.</p>
Full article ">Figure 3
<p>Embedded vector space of the 20 Newsgroups dataset visualized using PCA.</p>
Full article ">Figure 4
<p>Embedded vector space of the 20 Newsgroups dataset visualized using TSNE.</p>
Full article ">Figure 5
<p>Extracting a keyphrase candidate from a simple review using an N-gram-based approach (<math display="inline"><semantics> <mrow> <mi>N</mi> <mo>=</mo> <mn>3</mn> </mrow> </semantics></math>).</p>
Full article ">Figure 6
<p>An example of dependency parsing.</p>
Full article ">Figure 7
<p>An example for keywords extracted by KeyBERT.</p>
Full article ">Figure 8
<p>The BERT+R architecture modified for our system.</p>
Full article ">Figure 9
<p>Recursive clustering considered in our system.</p>
Full article ">Figure 10
<p>Flow diagram of the pipeline.</p>
Full article ">
16 pages, 708 KiB  
Article
Leveraging Chain-of-Thought to Enhance Stance Detection with Prompt-Tuning
by Daijun Ding, Xianghua Fu, Xiaojiang Peng, Xiaomao Fan, Hu Huang and Bowen Zhang
Mathematics 2024, 12(4), 568; https://doi.org/10.3390/math12040568 - 13 Feb 2024
Viewed by 1457
Abstract
Investigating public attitudes towards social media is crucial for opinion mining systems to gain valuable insights. Stance detection, which aims to discern the attitude expressed in an opinionated text towards a specific target, is a fundamental task in opinion mining. Conventional approaches mainly [...] Read more.
Investigating public attitudes towards social media is crucial for opinion mining systems to gain valuable insights. Stance detection, which aims to discern the attitude expressed in an opinionated text towards a specific target, is a fundamental task in opinion mining. Conventional approaches mainly focus on sentence-level classification techniques. Recent research has shown that the integration of background knowledge can significantly improve stance detection performance. Despite the significant improvement achieved by knowledge-enhanced methods, applying these techniques in real-world scenarios remains challenging for several reasons. Firstly, existing methods often require the use of complex attention mechanisms to filter out noise and extract relevant background knowledge, which involves significant annotation efforts. Secondly, knowledge fusion mechanisms typically rely on fine-tuning, which can introduce a gap between the pre-training phase of pre-trained language models (PLMs) and the downstream stance detection tasks, leading to the poor prediction accuracy of the PLMs. To address these limitations, we propose a novel prompt-based stance detection method that leverages the knowledge acquired using the chain-of-thought method, which we refer to as PSDCOT. The proposed approach consists of two stages. The first stage is knowledge extraction, where instruction questions are constructed to elicit background knowledge from a VLPLM. The second stage is the multi-prompt learning network (M-PLN) for knowledge fusion, which learns model performance based on the background knowledge and the prompt learning framework. We evaluated the performance of PSDCOT on publicly available benchmark datasets to assess its effectiveness in improving stance detection performance. The results demonstrate that the proposed method achieves state-of-the-art results in in-domain, cross-target, and zero-shot learning settings. Full article
Show Figures

Figure 1

Figure 1
<p>The overall structure of the proposed PSDCOT.</p>
Full article ">Figure 2
<p>F1<span class="html-italic"><sub>avg</sub></span> of the ablation test.</p>
Full article ">Figure 2 Cont.
<p>F1<span class="html-italic"><sub>avg</sub></span> of the ablation test.</p>
Full article ">
Back to TopTop