Nothing Special   »   [go: up one dir, main page]

You seem to have javascript disabled. Please note that many of the page functionalities won't work as expected without javascript enabled.
 
 
Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (105)

Search Parameters:
Keywords = wikis

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
14 pages, 1548 KiB  
Article
Document-Level Event Argument Extraction with Sparse Representation Attention
by Mengxi Zhang and Honghui Chen
Mathematics 2024, 12(17), 2636; https://doi.org/10.3390/math12172636 - 25 Aug 2024
Viewed by 689
Abstract
Document-level Event Argument Extraction (DEAE) aims to extract structural event knowledge composed of arguments and roles beyond the sentence level. Existing methods mainly focus on designing prompts and using Abstract Meaning Representation (AMR) graph structure as additional features to enrich event argument representation. [...] Read more.
Document-level Event Argument Extraction (DEAE) aims to extract structural event knowledge composed of arguments and roles beyond the sentence level. Existing methods mainly focus on designing prompts and using Abstract Meaning Representation (AMR) graph structure as additional features to enrich event argument representation. However, two challenges still remain: (1) the long-range dependency between event trigger and event arguments and (2) the distracting context in the document towards an event that can mislead the argument classification. To address these issues, we propose a novel document-level event argument extraction model named AMR Parser and Sparse Representation (APSR). Specifically, APSR sets inter- and intra-sentential encoders to capture the contextual information in different scopes. Especially, in the intra-sentential encoder, APSR designs three types of sparse event argument attention mechanisms to extract the long-range dependency. Then, APSR constructs AMR semantic graphs, which capture the interactions among concepts well. Finally, APSR fuses the inter- and intra-sentential representations and predicts what role a candidate span plays. Experimental results on the RAMS and WikiEvents datasets demonstrate that APSR achieves a superior performance compared with competitive baselines in terms of F1 by 1.27% and 3.12%, respectively. Full article
Show Figures

Figure 1

Figure 1
<p>Example of DEAE. The event trigger word is marked in <b>bold</b> text, and the arguments are marked in red text with <span class="underline">underlines</span>.</p>
Full article ">Figure 2
<p>Overview of our APSR. Giving a document as the input, we first encode the document with inter-sentential encoder and intra-sentential encoder with a mask matrix <span class="html-italic">M</span>. The detail of the sparse argument representation mask matrix <span class="html-italic">M</span> can be seen in the dashed box below. Then, the AMR parser module constructs semantic graphs to facilitate semantic interaction. Finally, we fuse the argument representations from two encoders and predict what argument role the candidate span plays.</p>
Full article ">Figure 3
<p>Case study, where we pick an instance from the RAMS test set. The event trigger word is marked in red <b>bold</b> text, and the arguments are marked in <b>bold</b> text with <span class="underline">underlines</span>.</p>
Full article ">
32 pages, 7299 KiB  
Article
Analysing A/O Possession in Māori-Language Tweets
by David Trye, Andreea S. Calude, Ray Harlow and Te Taka Keegan
Languages 2024, 9(8), 271; https://doi.org/10.3390/languages9080271 - 6 Aug 2024
Viewed by 964
Abstract
This article contributes the first corpus-based study of possession in Māori, the indigenous language of Aotearoa New Zealand. Like most Polynesian languages, Māori has a dual possessive system involving a choice between the so-called A and O categories. While Māori grammars describe these [...] Read more.
This article contributes the first corpus-based study of possession in Māori, the indigenous language of Aotearoa New Zealand. Like most Polynesian languages, Māori has a dual possessive system involving a choice between the so-called A and O categories. While Māori grammars describe these categories in terms of the inherent semantic relationship between the possessum and possessor, there have been no large-scale corpus analyses demonstrating their use in natural contexts. Social media provide invaluable opportunities for such linguistic studies, capturing contemporary language use while alleviating the burden of gathering data through traditional means. We operationalise semantic distinctions to investigate possession in Māori-language tweets, focusing on the [possessum a/o possessor] construction (e.g., te tīmatanga o te wiki ‘the beginning of the week’). In our corpus comprising 2500 tweets produced by more than 200 individuals, we find that users leverage a wide array of noun types encompassing many different semantic relationships. We observe not only the expected predominance of the O category, but also a tendency for examples described by Māori grammars as A-marked to instead be O-marked (59%). Although the A category persists in the corpus, our findings suggest that language change could be underway. Our primary dataset can be explored interactively online. Full article
(This article belongs to the Special Issue Linguistics of Social Media)
Show Figures

Figure 1

Figure 1
<p>A visual overview of our data curation process. Some steps were performed computationally (orange), while others required manual processing (purple).</p>
Full article ">Figure 2
<p>The number of tweets per user in the <span class="html-italic">Mixed Dataset</span>.</p>
Full article ">Figure 3
<p>The number of tweets per user in the <span class="html-italic">A-Only Dataset</span>.</p>
Full article ">Figure 4
<p>Overview of semantic categories used in existing research and in the present study. Grey boxes denote classes that are not semantically coherent.</p>
Full article ">Figure 5
<p>Our algorithm for determining the <span class="html-italic">Predicted</span> marker for each possessive phrase.</p>
Full article ">Figure 6
<p>Grouped bar chart showing the frequency of each semantic category for both the possessum (blue) and possessor (red).</p>
Full article ">Figure 7
<p>All 13 semantic relationships ordered by frequency.</p>
Full article ">Figure 8
<p>Noun phrases that appear at least 20 times in the <span class="html-italic">Mixed Dataset</span>.</p>
Full article ">Figure 9
<p><span class="html-italic">MultiCat</span> visualisation of the 20 most frequent semantic category combinations for the possessum (<span class="html-italic">PSSM</span>), possessor (<span class="html-italic">PSSR</span>), and relationship (<span class="html-italic">RELA</span>) between the two.</p>
Full article ">Figure 10
<p><span class="html-italic">MultiCat</span> visualisation of the most frequent semantic category combinations in the <span class="html-italic">A-Only</span> Dataset.</p>
Full article ">Figure 11
<p>Spine plot showing the proportions of predicted <span class="html-italic">a</span> and <span class="html-italic">o</span> markers, as well as the percentage of unexpected values (red) for each marker type.</p>
Full article ">Figure 12
<p>Spine plot showing semantic relationships grouped by predicted marker. Red indicates the proportion of each category with an ‘unexpected’ marker.</p>
Full article ">Figure 13
<p>Recurrent configurations in which an <span class="html-italic">o</span> marker was used instead of an <span class="html-italic">a</span> marker.</p>
Full article ">Figure 14
<p>The most common configurations in the <span class="html-italic">A-Only Dataset</span> in which an <span class="html-italic">a</span> marker was used instead of an <span class="html-italic">o</span> marker.</p>
Full article ">Figure 15
<p>100% stacked bar charts showing the proportions of expected and unexpected forms for each marker in the <span class="html-italic">Mixed Dataset</span>.</p>
Full article ">Figure 16
<p>Heatmap Matrix visualisation showing information about tweeters in the <span class="html-italic">Mixed Dataset</span>.</p>
Full article ">Figure 17
<p>Proportions of ‘unexpected’ markers (red) for each predicted marker type, broken down by gender.</p>
Full article ">Figure 18
<p>Visual comparison of proportions of ‘unexpected’ markers (red) for each relationship, broken down by gender. The same bin size is used for ease of comparison.</p>
Full article ">
24 pages, 5520 KiB  
Review
Drug-Induced Fatty Liver Disease (DIFLD): A Comprehensive Analysis of Clinical, Biochemical, and Histopathological Data for Mechanisms Identification and Consistency with Current Adverse Outcome Pathways
by Ernesto López-Pascual, Ivan Rienda, Judith Perez-Rojas, Anna Rapisarda, Guillem Garcia-Llorens, Ramiro Jover and José V. Castell
Int. J. Mol. Sci. 2024, 25(10), 5203; https://doi.org/10.3390/ijms25105203 - 10 May 2024
Cited by 1 | Viewed by 2978
Abstract
Drug induced fatty liver disease (DIFLD) is a form of drug-induced liver injury (DILI), which can also be included in the more general metabolic dysfunction-associated steatotic liver disease (MASLD), which specifically refers to the accumulation of fat in the liver unrelated to alcohol [...] Read more.
Drug induced fatty liver disease (DIFLD) is a form of drug-induced liver injury (DILI), which can also be included in the more general metabolic dysfunction-associated steatotic liver disease (MASLD), which specifically refers to the accumulation of fat in the liver unrelated to alcohol intake. A bi-directional relationship between DILI and MASLD is likely to exist: while certain drugs can cause MASLD by acting as pro-steatogenic factors, MASLD may make hepatocytes more vulnerable to drugs. Having a pre-existing MASLD significantly heightens the likelihood of experiencing DILI from certain medications. Thus, the prevalence of steatosis within DILI may be biased by pre-existing MASLD, and it can be concluded that the genuine true incidence of DIFLD in the general population remains unknown. In certain individuals, drug-induced steatosis is often accompanied by concomitant injury mechanisms such as oxidative stress, cell death, and inflammation, which leads to the development of drug-induced steatohepatitis (DISH). DISH is much more severe from the clinical point of view, has worse prognosis and outcome, and resembles MASH (metabolic-associated steatohepatitis), as it is associated with inflammation and sometimes with fibrosis. A literature review of clinical case reports allowed us to examine and evaluate the clinical features of DIFLD and their association with specific drugs, enabling us to propose a classification of DIFLD drugs based on clinical outcomes and pathological severity: Group 1, drugs with low intrinsic toxicity (e.g., ibuprofen, naproxen, acetaminophen, irinotecan, methotrexate, and tamoxifen), but expected to promote/aggravate steatosis in patients with pre-existing MASLD; Group 2, drugs associated with steatosis and only occasionally with steatohepatitis (e.g., amiodarone, valproic acid, and tetracycline); and Group 3, drugs with a great tendency to transit to steatohepatitis and further to fibrosis. Different mechanisms may be in play when identifying drug mode of action: (1) inhibition of mitochondrial fatty acid β-oxidation; (2) inhibition of fatty acid transport across mitochondrial membranes; (3) increased de novo lipid synthesis; (4) reduction in lipid export by the inhibition of microsomal triglyceride transfer protein; (5) induction of mitochondrial permeability transition pore opening; (6) dissipation of the mitochondrial transmembrane potential; (7) impairment of the mitochondrial respiratory chain/oxidative phosphorylation; (8) mitochondrial DNA damage, degradation and depletion; and (9) nuclear receptors (NRs)/transcriptomic alterations. Currently, the majority of, if not all, adverse outcome pathways (AOPs) for steatosis in AOP-Wiki highlight the interaction with NRs or transcription factors as the key molecular initiating event (MIE). This perspective suggests that chemical-induced steatosis typically results from the interplay between a chemical and a NR or transcription factors, implying that this interaction represents the primary and pivotal MIE. However, upon conducting this exhaustive literature review, it became evident that the current AOPs tend to overly emphasize this interaction as the sole MIE. Some studies indeed support the involvement of NRs in steatosis, but others demonstrate that such NR interactions alone do not necessarily lead to steatosis. This view, ignoring other mitochondrial-related injury mechanisms, falls short in encapsulating the intricate biological mechanisms involved in chemically induced liver steatosis, necessitating their consideration as part of the AOP’s map road as well. Full article
(This article belongs to the Special Issue Molecular Mechanisms of Hepatotoxicity—2nd Edition)
Show Figures

Figure 1

Figure 1
<p>Scoring of the degree of steatosis in human liver biopsies. Steatosis refers to the abnormal accumulation of lipids in hepatocytes after liver biopsy staining with haematoxylin–eosin (H&amp;E). The grading of steatosis typically follows a scale from (<b>S0</b>–<b>S3</b>), indicating the severity of the condition. Micrographs show (10× magnification), representative images, black bar equals 200 μm.</p>
Full article ">Figure 2
<p>Pathological changes in liver tissue in DIFLD. Micrographs were obtained from liver biopsies of DIFLD patients, after processing and H&amp;E staining. Several typical representative patterns of the disease are shown; black bar equals 200 μm: unaffected ((<b>S0</b>), 10× magnification), microsteatosis ((<b>m</b>), 10×), macrosteatosis ((<b>M</b>), 10×), steatohepatitis ((<b>SH</b>), 20×), incipient fibrosis ((<b>iF</b>), 20×) and advanced fibrosis ((<b>aF</b>), 10×).</p>
Full article ">Figure 3
<p>Steatotic pattern in toxic hepatic injury. Steatosis and steatohepatitis can be found in combination with other patterns of hepatic injury, like acute coagulative necrosis ((<b>a</b>), 10× magnification). While periportal parenchymal areas are typically respected ((<b>b</b>), 20×), centroacinar (perivenular) zones are typically affected by steatotic and steatohepatitic lesions ((<b>c</b>), 20×; (<b>d</b>), 40× and (<b>e</b>), 40×). Both, predominantly macrovesicular ((<b>c</b>), 20×) or microvesicular steatosis ((<b>e</b>), 40×) can constitute the major finding in this pattern of lesion. The histological findings of DIFLD are indistinguishable from those of other causes of steatosis. Black bar equals 200 μm. Blue bars equals 75 μm.</p>
Full article ">Figure 4
<p>DIFLD mechanisms and associated drugs. (<b>A</b>) Different mechanisms have been identified or proposed to explain the onset of DIFLD by different drugs. They include: (1) impairment of FAO; (2) inhibition of fatty acid transport across the mitochondrial membranes; (3) increased de novo lipid synthesis; (4) reduction of lipid export by the inhibition of MTP; (5) induction of the MPT pore opening; (6) dissipation of the MTP; (7) impairment of MRC/OXPHOS (I–IV represent the respective complexes of the MRC, while C represents cytochrome c); (8) mtDNA damage and depletion; (9) NR/transcriptomic alterations (alteration of TFs/NRs by modifying their expression levels or by direct agonist/antagonist activity). The triangle and the colour gradient/font size represents the relative number of times the association between the drug and the mechanism has been reported in literature reviews. (<b>B</b>) Percentage of drug participation in the different steatogenic mechanisms, based on prevalence in reviewed literature. Most drugs are associated with more than one mechanism, yet some of them are more frequently reported. Drug groups: (A) drugs predominantly altering β-oxidation or lipid transport; (B) drugs predominantly involved in β-oxidation or MRC impairment; (C) drugs involved in β-oxidation, triglyceride synthesis, and lipid export primarily; (D) not involved in β-oxidation but implicated in various other mechanisms; and (E) drugs involved in β-oxidation, fatty acid transport, lipid export, and several other mechanisms. For more details, see <a href="#app1-ijms-25-05203" class="html-app">Table S1</a>.</p>
Full article ">Figure 5
<p>Hierarchical dendrogram of DIFLD drugs. Input data for each drug included the number of reports in literature reviews associated with a given mechanism of toxicity and the DIFLD outcomes as shown in <a href="#app1-ijms-25-05203" class="html-app">Supplementary Table S1</a>. The dendrogram was generated by MetaboAnalyst, where the distance measured by Pearson is shown. Two major clusters became evident. Cluster A included several prototypical steatotic drugs that most frequently cause MRC/OXPHOS impairment and steatohepatitis. Cluster B drugs were all associated with an impaired lipid export. ACM (Acetaminophen), AMD (Amiodarone), AMP (Amineptine), DDN (Didanosine), DXC (Doxycycline), FLR (Fialuridine), FLU (Fluorouracil), GLC (Glucocorticoids), IBU (Ibuprofen), IRI (Irinotecan), MTX (Methotrexate), NAP (Naproxen), PER (Perhexiline), PIR (Pirprofen), SA (Salicylic), STV (Stavudine), TET (Tetracycline), TGL (Troglitazone), TMF (Tamoxifen), TNP (Tianeptine), VPA (Valproic) and ZDV (Zidovudine).</p>
Full article ">Figure 6
<p>An integrative perspective on acknowledged mechanisms, as well as other potential MIEs influencing liver steatosis AOPs. The blue box serves to gather MIEs and intermediate effects occurring exclusively within the mitochondria, consequently impacting mitochondrial functionalities. FA (fatty acid), TG (triglyceride).</p>
Full article ">
24 pages, 5507 KiB  
Article
Comparison and Evaluation of Five Global Land Cover Products on the Tibetan Plateau
by Yongjie Pan, Danyun Wang, Xia Li, Yong Liu and He Huang
Land 2024, 13(4), 522; https://doi.org/10.3390/land13040522 - 14 Apr 2024
Cited by 1 | Viewed by 1250
Abstract
The Tibetan Plateau (TP) region contains maximal alpine grassland ecology at the mid-latitudes. This region is also recognized as an ecologically fragile and sensitive area under the effects of global warming. Regional climate modeling and ecosystem research depend on accurate land cover (LC) [...] Read more.
The Tibetan Plateau (TP) region contains maximal alpine grassland ecology at the mid-latitudes. This region is also recognized as an ecologically fragile and sensitive area under the effects of global warming. Regional climate modeling and ecosystem research depend on accurate land cover (LC) information. In order to obtain accurate LC information over the TP, the reliability and precision of five moderate/high-resolution LC products (MCD12Q1, C3S-LC, GlobeLand30, GLC_FCS30, and ESA2020 in 2020) were analyzed and evaluated in this study. The different LC products were compared with each other in terms of areal/spatial consistency and assessed with four reference sample datasets (Geo-Wiki, GLCVSS, GOFC-GOLD, and USGS) using the confusion matrix method for accuracy evaluation over the TP. Based on the paired comparison of these five LC datasets, all five LC products show that grass is the major land cover type on the TP, but the range of grass coverage identified by the different products varies noticeably, from 43.35% to 65.49%. The fully consistent spatial regions account for 43.72% of the entire region of the TP, while, in the transition area between grass and bare soil, there is still a large area of medium-to-low consistency. In addition, a comparison of LC datasets using integrated reference datasets shows that the overall accuracies of MCD12Q1, C3S-LC, GlobeLand30, GLC_FCS30, and ESA2020 are 54.29%, 49.32%, 53.03%, 53.73%, and 60.11%, respectively. The producer accuracy of the five products is highest for grass, while glaciers have the most reliable and accurate characteristics among all LC products for users. These findings provide valuable insights for the selection of rational and appropriate LC datasets for studying land-atmosphere interactions and promoting ecological preservation in the TP. Full article
Show Figures

Figure 1

Figure 1
<p>Digital elevation model (DEM) map and the location of Tibetan Plateau.</p>
Full article ">Figure 2
<p>(<b>a</b>) Spatial distribution of the four reference validation samples, (<b>b</b>) and their land cover classification.</p>
Full article ">Figure 3
<p>Spatial patterns for the five LC products on TP: (<b>a</b>) MCD12Q1, (<b>b</b>) C3S-LC, (<b>c</b>) GlobeLand30, (<b>d</b>) GLC_FCS30, and (<b>e</b>) ESA2020.</p>
Full article ">Figure 4
<p>Percentage of separate classes area for five LC products in 2020.</p>
Full article ">Figure 5
<p>Spatial confusion of LC types for each of two different products (ft stands for confounding relationships between two LC datasets).</p>
Full article ">Figure 6
<p>(<b>a</b>) Distribution of consistency of five LC products on the TP, and (<b>b</b>) percentage of spatial consistency at different elevations.</p>
Full article ">Figure 7
<p>Spatial consistency distribution with individual LC types in the TP.</p>
Full article ">Figure 8
<p>Spatial distribution of low and moderate consistency in grass identified by five land cover products (<b>a</b>), and other land cover types besides grass (<b>b</b>).</p>
Full article ">Figure 9
<p>Spatial consistency distribution between different LC datasets and reference validation samples in the TP.</p>
Full article ">
19 pages, 2901 KiB  
Article
TOMDS (Topic-Oriented Multi-Document Summarization): Enabling Personalized Customization of Multi-Document Summaries
by Xin Zhang, Qiyi Wei, Qing Song and Pengzhou Zhang
Appl. Sci. 2024, 14(5), 1880; https://doi.org/10.3390/app14051880 - 25 Feb 2024
Viewed by 1008
Abstract
In a multi-document summarization task, if the user can decide on the summary topic, the generated summary can better align with the reader’s specific needs and preferences. This paper addresses the issue of overly general content generation by common multi-document summarization models and [...] Read more.
In a multi-document summarization task, if the user can decide on the summary topic, the generated summary can better align with the reader’s specific needs and preferences. This paper addresses the issue of overly general content generation by common multi-document summarization models and proposes a topic-oriented multi-document summarization (TOMDS) approach. The method is divided into two stages: extraction and abstraction. During the extractive stage, it primarily identifies and retrieves paragraphs relevant to the designated topic, subsequently sorting them based on their relevance to the topic and forming an initial subset of documents. In the abstractive stage, building upon the transformer architecture, the process includes two parts: encoding and decoding. In the encoding part, we integrated an external discourse parsing module that focuses on both micro-level within-paragraph semantic relationships and macro-level inter-paragraph connections, effectively combining these with the implicit relationships in the source document to produce more enriched semantic features. In the decoding part, we incorporated a topic-aware attention mechanism that dynamically zeroes in on information pertinent to the chosen topic, thus guiding the summary generation process more effectively. The proposed model was primarily evaluated using the standard text summary dataset WikiSum. The experimental results show that our model significantly enhanced the thematic relevance and flexibility of the summaries and improved the accuracy of grammatical and semantic comprehension in the generated summaries. Full article
(This article belongs to the Section Computing and Artificial Intelligence)
Show Figures

Figure 1

Figure 1
<p>The overall framework of TOMDS.</p>
Full article ">Figure 2
<p>TOMDS generates partial structure.</p>
Full article ">Figure 3
<p>Example of discourse segmentation and RST tree conversion. The original paragraph is segmented into 9 EDUs in box (<b>a</b>) and then parsed into an RST discourse tree in box (<b>b</b>). Elab, Interp, Conc, Seq, and Cond are relation labels (Elab = elaboration, Interp = interpretation, Conc = concession, Seq = sequence, and Cond = condition). The converted primary–secondary relationship-based RST discourse tree is shown in box (<b>c</b>). Nucleus nodes including ①, ③, ⑤, ⑦, and ⑨, and satellite nodes including ②, ④, ⑥, and ⑧, are denoted by solid lines and dashed lines.</p>
Full article ">Figure 4
<p>Discourse-aware attention (DaAtt) mechanism.</p>
Full article ">Figure 5
<p>Discourse graph attention (DGAtt) mechanism.</p>
Full article ">Figure 6
<p>Topic attention mechanism.</p>
Full article ">
20 pages, 927 KiB  
Article
A Dynamic Emotional Propagation Model over Time for Competitive Environments
by Zhihao Chen, Bingbing Xu, Tiecheng Cai, Zhou Yang and Xiangwen Liao
Electronics 2023, 12(24), 4937; https://doi.org/10.3390/electronics12244937 - 8 Dec 2023
Viewed by 1304
Abstract
Emotional propagation research aims to discover and show the laws of opinion evolution in social networks. The short-term observation of the emotional propagation process for a predetermined time window ignores situations in which users with different emotions compete over a long diffusion time. [...] Read more.
Emotional propagation research aims to discover and show the laws of opinion evolution in social networks. The short-term observation of the emotional propagation process for a predetermined time window ignores situations in which users with different emotions compete over a long diffusion time. To that end, we propose a dynamic emotional propagation model based on an independent cascade. The proposed model is inspired by the interpretable factors of the reinforced Poisson process, portraying the “rich-get-richer” phenomenon within a social network. Specifically, we introduce a time-decay mechanism to illustrate the change in influence over time. Meanwhile, we propose an emotion-exciting mechanism allowing prior users to affect the emotions of subsequent users. Finally, we conduct experiments on an artificial network and two real-world datasets—Wiki, with 7194 nodes, and Bitcoin-OTC, with 5881 nodes—to verify the effectiveness of our proposed model. The proposed method improved the F1-score by 3.5% and decreased the MAPE by 0.059 on the Wiki dataset. And the F1-score improved by 0.4% and the MAPE decreased by 0.013 on the Bitcoin-OTC dataset. In addition, the experimental results indicate a phenomenon of emotions in social networks tending to converge under the influence of opinion leaders after a long enough time. Full article
(This article belongs to the Special Issue New Advances in Affective Computing)
Show Figures

Graphical abstract

Graphical abstract
Full article ">Figure 1
<p>A motivating example of emotional propagation based on the IC model. A and B are the original active users with positive and negative emotions, respectively. It is possible that active user D is influenced by A, but there is also a potential scenario where B influences D through C over time. E is inactive as it disapproves of B.</p>
Full article ">Figure 2
<p>The process of emotional propagation over time.</p>
Full article ">Figure 3
<p>The degree distributions of different networks.</p>
Full article ">Figure 4
<p>How the activation intensity <math display="inline"><semantics> <mi>α</mi> </semantics></math> affects the growth of the proportion of positive emotions with different parameters.</p>
Full article ">Figure 5
<p>How the time-decay functions affect the growth of the proportion of positive emotions with different parameters.</p>
Full article ">Figure 6
<p>The emotional propagation process of proposed model at each step.</p>
Full article ">Figure 7
<p>The process of emotional propagation of the proposed model in the Bitcoin-OTC dataset. Users with with positive, neutral, and negative emotions are represented by red, yellow, and blue color, respectively. The larger the node in the graph, the greater its out-degree, which means more powerful influence.</p>
Full article ">
20 pages, 2651 KiB  
Article
Transcriptional Profiling of SARS-CoV-2-Infected Calu-3 Cells Reveals Immune-Related Signaling Pathways
by Eric Petterson Viana Pereira, Stela Mirla da Silva Felipe, Raquel Martins de Freitas, José Ednésio da Cruz Freire, Antonio Edson Rocha Oliveira, Natália Canabrava, Paula Matias Soares, Mauricio Fraga van Tilburg, Maria Izabel Florindo Guedes, Chad Eric Grueter and Vânia Marilande Ceccatto
Pathogens 2023, 12(11), 1373; https://doi.org/10.3390/pathogens12111373 - 20 Nov 2023
Cited by 1 | Viewed by 2151
Abstract
The COVID-19 disease, caused by the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), emerged in late 2019 and rapidly spread worldwide, becoming a pandemic that infected millions of people and caused significant deaths. COVID-19 continues to be a major threat, and there is [...] Read more.
The COVID-19 disease, caused by the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), emerged in late 2019 and rapidly spread worldwide, becoming a pandemic that infected millions of people and caused significant deaths. COVID-19 continues to be a major threat, and there is a need to deepen our understanding of the virus and its mechanisms of infection. To study the cellular responses to SARS-CoV-2 infection, we performed an RNA sequencing of infected vs. uninfected Calu-3 cells. Total RNA was extracted from infected (0.5 MOI) and control Calu-3 cells and converted to cDNA. Sequencing was performed, and the obtained reads were quality-analyzed and pre-processed. Differential expression was assessed with the EdgeR package, and functional enrichment was performed in EnrichR for Gene Ontology, KEGG pathways, and WikiPathways. A total of 1040 differentially expressed genes were found in infected vs. uninfected Calu-3 cells, of which 695 were up-regulated and 345 were down-regulated. Functional enrichment analyses revealed the predominant up-regulation of genes related to innate immune response, response to virus, inflammation, cell proliferation, and apoptosis. These transcriptional changes following SARS-CoV-2 infection may reflect a cellular response to the infection and help to elucidate COVID-19 pathogenesis, in addition to revealing potential biomarkers and drug targets. Full article
Show Figures

Figure 1

Figure 1
<p>Volcano plot showing differentially expressed genes (DEGs) in SARS-CoV-2-infected Calu-3 cells vs. control (uninfected) cells. The most expressive DEGs are identified. de: Differential expression. down: Down-regulated. no_sig: Non-significant. up: Up-regulated.</p>
Full article ">Figure 2
<p>Heatmap of the top 20 DEGs in Calu-3 cells. Expression patterns of genes are compared between control (C1, C2, and C5) and infected (I1, I2, and I5) samples. For each gene, the relative values of gene expression are depicted in a blue-red scale, in which red tones are representative of higher expression, and blue tones, of lower expression. FPKM: Fragments per Kilobase Million.</p>
Full article ">Figure 3
<p>Functional enrichment of up-regulated DEGs. Enriched terms for Gene Ontology’s (GO) biological process (<b>A</b>). Enriched terms for KEGG Pathways and WikiPathways (<b>B</b>).</p>
Full article ">Figure 4
<p>Calu-3 cells 24 h after SARS-CoV-2 infection. The transcriptional features of the infected cells indicate a metabolic model with the activation of inflammatory and antiviral signaling, in addition to both apoptotic and cytoprotective/proliferative signaling. Up-regulated genes include a potential SARS-CoV-2 receptor (EPHA4), a chaperone (HSPA6), pro-inflammatory transcription factors (IRF3 and FOS), and inflammatory mediators (ACE, MMP17, IL6, SECTM1, and ISG15).</p>
Full article ">
592 KiB  
Proceeding Paper
Comparison of the Effectiveness and Performance of Student Workgroups in Online Wiki Activities with and without AI
by Giacomo Nalli and Serengul Smith
Eng. Proc. 2023, 56(1), 248; https://doi.org/10.3390/ASEC2023-16273 - 15 Nov 2023
Cited by 1 | Viewed by 482
Abstract
Collaborative learning has been widely acknowledged as a successful teaching method within the education field, with research indicating its positive impact on student outcomes. During the COVID-19 pandemic, when all courses transitioned online due to lockdown measures, many universities employed learning management systems [...] Read more.
Collaborative learning has been widely acknowledged as a successful teaching method within the education field, with research indicating its positive impact on student outcomes. During the COVID-19 pandemic, when all courses transitioned online due to lockdown measures, many universities employed learning management systems to facilitate continued group work among students. However, forming effective student groups remained challenging, particularly given the large number of enrolled students. To address this issue, this study proposes the application of an artificial intelligence (machine learning) solution to automatically group students based on their behaviours and interactions within an e-learning environment. This paper explores the potential of machine learning (ML) algorithms in assisting educators to create heterogeneous groups, considering various student attributes, such as behaviour and performance, to optimise collaborative learning outcomes. Students’ performance within a module was compared using a wiki activity that employed group work over the course of two academic years. In the first experiment, groups were formed randomly, while in the second experiment, students with similar behaviours were firstly identified using a clustering algorithm and then organised by an additional algorithm into heterogeneous groups. The results demonstrate the efficacy of the machine learning solution compared to the random approach in assisting educators with group formation for a collaborative activity such as the wiki, confirmed by a comparative analysis showing an improvement in student performance and satisfaction. This research contributes to the advancement of online education through the creation of more effective group dynamics using machine learning algorithms, thereby improving overall student learning. Full article
(This article belongs to the Proceedings of The 4th International Electronic Conference on Applied Sciences)
Show Figures

Figure 1

Figure 1
<p>Scores obtained by students of the User Experience (UX) Design 2020/2021 (<b>a</b>), and students’ grades involved in the User Experience (UX) Design 2021/2022 (<b>b</b>).</p>
Full article ">
27 pages, 3333 KiB  
Article
Artistic Style Recognition: Combining Deep and Shallow Neural Networks for Painting Classification
by Saqib Imran, Rizwan Ali Naqvi, Muhammad Sajid, Tauqeer Safdar Malik, Saif Ullah, Syed Atif Moqurrab and Dong Keon Yon
Mathematics 2023, 11(22), 4564; https://doi.org/10.3390/math11224564 - 7 Nov 2023
Cited by 2 | Viewed by 3116
Abstract
This study’s main goal is to create a useful software application for finding and classifying fine art photos in museums and art galleries. There is an increasing need for tools to swiftly analyze and arrange art collections based on their artistic styles as [...] Read more.
This study’s main goal is to create a useful software application for finding and classifying fine art photos in museums and art galleries. There is an increasing need for tools to swiftly analyze and arrange art collections based on their artistic styles as a result of the digitization of art collections. To increase the accuracy of the style categorization, the suggested technique involves two parts. The input image is split into five sub-patches in the first stage. A DCNN that has been particularly trained for this task is then used to classify each patch individually. A decision-making module using a shallow neural network is part of the second phase. Probability vectors acquired from the first-phase classifier are used to train this network. The results from each of the five patches are combined in this phase to deduce the final style classification for the input image. One key advantage of this approach is employing probability vectors rather than images, and the second phase is trained separately from the first. This helps compensate for any potential errors made during the first phase, improving accuracy in the final classification. To evaluate the proposed method, six various already-trained CNN models, namely AlexNet, VGG-16, VGG-19, GoogLeNet, ResNet-50, and InceptionV3, were employed as the first-phase classifiers. The second-phase classifier was implemented as a shallow neural network. By using four representative art datasets, experimental trials were conducted using the Australian Native Art dataset, the WikiArt dataset, ILSVRC, and Pandora 18k. The findings showed that the recommended strategy greatly surpassed existing methods in terms of style categorization accuracy and precision. Overall, the study assists in creating efficient software systems for analyzing and categorizing fine art images, making them more accessible to the general public through digital platforms. Using pre-trained models, we were able to attain an accuracy of 90.7. Our model performed better with a higher accuracy of 96.5 as a result of fine-tuning and transfer learning. Full article
(This article belongs to the Section Mathematics and Computer Science)
Show Figures

Figure 1

Figure 1
<p>Basic architecture of shallow neural network.</p>
Full article ">Figure 2
<p>Process of patch extraction in phase 1.</p>
Full article ">Figure 3
<p>Assembling of probability vectors in phase 3.</p>
Full article ">Figure 4
<p>Final style classification labeling process in phase 4.</p>
Full article ">Figure 5
<p>Digitized Artworks Sample taken from WikiArt Dataset.</p>
Full article ">Figure 6
<p>Sample taken from Australian Aboriginal paintings dataset.</p>
Full article ">Figure 7
<p>Distribution of each style in percentage in WikiArt dataset.</p>
Full article ">Figure 8
<p>Distribution of each style in percentage in Pandora 18K dataset.</p>
Full article ">Figure 9
<p>Painting classification architecture.</p>
Full article ">Figure 10
<p>Training and testing setup for the method proposed.</p>
Full article ">Figure 11
<p>Structure of CNN architecture with layers.</p>
Full article ">Figure 12
<p>Paint classification model training and testing environment.</p>
Full article ">Figure 13
<p>Fine-tuning process for source model and target model.</p>
Full article ">Figure 14
<p>Differences between the proposed method’s accuracy (Case 6) and the benchmark (Case 1) while employing various CNN models.</p>
Full article ">Figure 15
<p>Confusion arrays adjusted to Dataset 1 with the Inceptionv3 CNN model. The suggested course of action (Case 6 patches only). Patch-based baseline (Case 2) is option B. The table cells’ shade intensity rises as the percentage accuracy value rises since the numbers represent percentage accuracy (divided by 100).</p>
Full article ">
23 pages, 1636 KiB  
Article
Enhancing Semantic Web Technologies Using Lexical Auditing Techniques for Quality Assurance of Biomedical Ontologies
by Rashmi Burse, Michela Bertolotto and Gavin McArdle
BioMedInformatics 2023, 3(4), 962-984; https://doi.org/10.3390/biomedinformatics3040059 - 1 Nov 2023
Viewed by 1039
Abstract
Semantic web technologies (SWT) represent data in a format that is easier for machines to understand. Validating the knowledge in data graphs created using SWT is critical to ensure that the axioms accurately represent the so-called “real” world. However, data graph validation is [...] Read more.
Semantic web technologies (SWT) represent data in a format that is easier for machines to understand. Validating the knowledge in data graphs created using SWT is critical to ensure that the axioms accurately represent the so-called “real” world. However, data graph validation is a significant challenge in the semantic web domain. The Shapes Constraint Language (SHACL) is the latest W3C standard developed with the goal of validating data-graphs. SHACL (pronounced as shackle) is a relatively new standard and hitherto has predominantly been employed to validate generic data graphs like WikiData and DBPedia. In generic data graphs, the name of a class does not affect the shape of a class, but this is not the case with biomedical ontology data graphs. The shapes of classes in biomedical ontology data graphs are highly influenced by the names of the classes, and the SHACL shape creation methods developed for generic data graphs fail to consider this characteristic difference. Thus, the existing SHACL shape creation methods do not perform well for domain-specific biomedical ontology data graphs. Maintaining the quality of biomedical ontology data graphs is crucial to ensure accurate analysis in safety-critical applications like Electronic Health Record (EHR) systems referencing such data graphs. Thus, in this work, we present a novel method to create enhanced SHACL shapes that consider the aforementioned characteristic difference to better validate biomedical ontology data graphs. We leverage the knowledge available from lexical auditing techniques for biomedical ontologies and incorporate this knowledge to create smart SHACL shapes. We also create SHACL shapes (baseline SHACL graph) without incorporating the lexical knowledge of the class names, as is performed by existing methods, and compare the performance of our enhanced SHACL shapes with the baseline SHACL shapes. The results demonstrate that the enhanced SHACL shapes augmented with lexical knowledge of the class names identified 176 violations which the baseline SHACL shapes, void of this lexical knowledge, failed to detect. Thus, the enhanced SHACL shapes presented in this work significantly improve the validation performance of biomedical ontology data graphs, thereby reducing the errors present in such data graphs and ensuring safe use in the life-critical applications referencing them. Full article
(This article belongs to the Special Issue Feature Papers in Applied Biomedical Data Science)
Show Figures

Figure 1

Figure 1
<p>OWL (OWA) vs. SHACL (CWA).</p>
Full article ">Figure 2
<p>Validation using Shapes and Constraint Language (SHACL).</p>
Full article ">Figure 3
<p>Example to illustrate the characteristic difference of a biomedical ontology data graph. The shape of a disorder is dependent on the lexical features in its name. (<b>a</b>) Burn caused by fire (disorder). (<b>b</b>) Burn of skin (disorder).</p>
Full article ">Figure 4
<p>(<b>a</b>) SHACL validation process without considering internal lexical features of class names (baseline). (<b>b</b>) SHACL validation process after augmenting internal lexical features of class names in the SHACL shape construction.</p>
Full article ">Figure 5
<p>(<b>a</b>) RDF data graph structure representing SNOMED-CT concepts targeted by non-conjunctive stop word method. (<b>b</b>) RDFLex data graph structure representing SNOMED-CT concepts targeted by non-conjunctive stop word method.</p>
Full article ">Figure 6
<p>(<b>a</b>) Knowledge base files representing the insights from [<a href="#B27-biomedinformatics-03-00059" class="html-bibr">27</a>]. (<b>b</b>) Knowledge base files representing the insights from [<a href="#B28-biomedinformatics-03-00059" class="html-bibr">28</a>].</p>
Full article ">Figure 7
<p>SHACL vs. SHACLex validation report for the concept Injury due to sword (SCTID:243051008) belonging to the sample set “Disorder due to Object” (DdtO). Missing mandatory attributes due to (SCTID:42752001) and causative agent (SCTID:246075003) were caught as violations only by SHACLex.</p>
Full article ">Figure 8
<p>SHACL vs. SHACLex validation report for the concept Pulmonary hypostasis (SCTID:196116008) conforming to the POS pattern “ADJ NOUN”. Missing hierarchical relationship with Hypostasis (SCTID:72127003) was caught as a violation only by SHACLex.</p>
Full article ">Figure A1
<p>SNOMED-CT hierarchies [<a href="#B32-biomedinformatics-03-00059" class="html-bibr">32</a>].</p>
Full article ">Figure A2
<p>SNOMED-CT logical nodel [<a href="#B2-biomedinformatics-03-00059" class="html-bibr">2</a>].</p>
Full article ">Figure A3
<p>Overview of methodology in [<a href="#B27-biomedinformatics-03-00059" class="html-bibr">27</a>].</p>
Full article ">Figure A4
<p>Overview of methodology in [<a href="#B28-biomedinformatics-03-00059" class="html-bibr">28</a>].</p>
Full article ">
10 pages, 772 KiB  
Article
Mendelian Randomization Study of Lipid Metabolites Reveals Causal Associations with Heel Bone Mineral Density
by Mingxin Wu, Yufei Du, Chi Zhang, Zhen Li, Qingyang Li, Enlin Qi, Wendong Ruan, Shiqing Feng and Hengxing Zhou
Nutrients 2023, 15(19), 4160; https://doi.org/10.3390/nu15194160 - 27 Sep 2023
Cited by 3 | Viewed by 2749
Abstract
Background: Osteoporosis, which is a bone disease, is characterized by low bone mineral density and an increased risk of fractures. The heel bone mineral density is often used as a representative measure of overall bone mineral density. Lipid metabolism, which includes processes such [...] Read more.
Background: Osteoporosis, which is a bone disease, is characterized by low bone mineral density and an increased risk of fractures. The heel bone mineral density is often used as a representative measure of overall bone mineral density. Lipid metabolism, which includes processes such as fatty acid metabolism, glycerol metabolism, inositol metabolism, bile acid metabolism, carnitine metabolism, ketone body metabolism, sterol and steroid metabolism, etc., may have an impact on changes in bone mineral density. While some studies have reported correlations between lipid metabolism and heel bone mineral density, the overall causal relationship between metabolites and heel bone mineral density remains unclear. Objective: to investigate the causal relationship between lipid metabolites and heel bone mineral density using two-sample Mendelian randomization analysis. Methods: Summary-level data from large-scale genome-wide association studies were extracted to identify genetic variants linked to lipid metabolite levels. These genetic variants were subsequently employed as instrumental variables in Mendelian randomization analysis to estimate the causal effects of each lipid metabolite on heel bone mineral density. Furthermore, metabolites that could potentially be influenced by causal relationships with bone mineral density were extracted from the KEGG and WikiPathways databases. The causal associations between these downstream metabolites and heel bone mineral density were then examined. Lastly, a sensitivity analysis was conducted to evaluate the robustness of the results and address potential sources of bias. Results: A total of 130 lipid metabolites were analyzed, and it was found that acetylcarnitine, propionylcarnitine, hexadecanedioate, tetradecanedioate, myo-inositol, 1-arachidonoylglycerophosphorine, 1-linoleoylglycerophoethanolamine, and epiandrosterone sulfate had a causal relationship with heel bone mineral density (p < 0.05). Furthermore, our findings also indicate an absence of causal association between the downstream metabolites associated with the aforementioned metabolites identified in the KEGG and WikiPathways databases and heel bone mineral density. Conclusion: This work supports the hypothesis that lipid metabolites have an impact on bone health through demonstrating a causal relationship between specific lipid metabolites and heel bone mineral density. This study has significant implications for the development of new strategies to osteoporosis prevention and treatment. Full article
(This article belongs to the Section Lipids)
Show Figures

Figure 1

Figure 1
<p>Causal association identified between eight lipid metabolites and H-BMD. The range of OR values for eight lipid metabolites are shown in forest plots. The OR values of each lipid metabolite are shown by the green (OR values &gt; 1) and red (OR values &lt; 1) points, respectively, with the vertical lines on each side of the point denoting the 95% confidence interval. OR, odds ratio; SNP, single-nucleotide polymorphism; 95%CI, 95% confidence interval.</p>
Full article ">Figure 2
<p>Causal association identified between downstream metabolites and H-BMD. The range of OR values for downstream metabolites are shown in forest plots. The green point (OR values &gt; 1) represents the OR values of each downstream metabolite. The vertical lines on each side of the green point reflect the 95% confidence interval. OR, odds ratio; SNP, single-nucleotide polymorphism; 95%CI, 95% confidence interval.</p>
Full article ">
18 pages, 4099 KiB  
Article
Novel Paintings from the Latent Diffusion Model through Transfer Learning
by Dayin Wang, Chong Ma and Siwen Sun
Appl. Sci. 2023, 13(18), 10379; https://doi.org/10.3390/app131810379 - 16 Sep 2023
Cited by 1 | Viewed by 2069
Abstract
With the development of deep learning, image synthesis has achieved unprecedented achievements in the past few years. Image synthesis models, represented by diffusion models, demonstrated stable and high-fidelity image generation. However, the traditional diffusion model computes in pixel space, which is memory-heavy and [...] Read more.
With the development of deep learning, image synthesis has achieved unprecedented achievements in the past few years. Image synthesis models, represented by diffusion models, demonstrated stable and high-fidelity image generation. However, the traditional diffusion model computes in pixel space, which is memory-heavy and computing-heavy. Therefore, to ease the expensive computing and improve the accessibility of diffusion models, we train the diffusion model in latent space. In this paper, we are devoted to creating novel paintings from existing paintings based on powerful diffusion models. Because the cross-attention layer is adopted in the latent diffusion model, we can create novel paintings with conditional text prompts. However, direct training of the diffusion model on the limited dataset is non-trivial. Therefore, inspired by the transfer learning, we train the diffusion model with the pre-trained weights, which eases the training process and enhances the image synthesis results. Additionally, we introduce the GPT-2 model to expand text prompts for detailed image generation. To validate the performance of our model, we train the model on paintings of the specific artist from the dataset WikiArt. To make up for missing image context descriptions of the WikiArt dataset, we adopt a pre-trained language model to generate corresponding context descriptions automatically and clean wrong descriptions manually, and we will make it available to the public. Experimental results demonstrate the capacity and effectiveness of the model. Full article
(This article belongs to the Section Computing and Artificial Intelligence)
Show Figures

Figure 1

Figure 1
<p>The generative adversarial network.</p>
Full article ">Figure 2
<p>The variational autoencoder.</p>
Full article ">Figure 3
<p>The flow-based generative models.</p>
Full article ">Figure 4
<p>The diffusion and denoising process.</p>
Full article ">Figure 5
<p>The overview of framework.</p>
Full article ">Figure 6
<p>The flowchart of our system.</p>
Full article ">Figure 7
<p>Sample images and text descriptions of <span class="html-italic">WikiArt</span>.</p>
Full article ">Figure 8
<p>The generated novel paintings of input text prompts “An astronaut riding a horse” and “A woman holding a knife in her hand”.</p>
Full article ">Figure 9
<p>The generated novel paintings of input text prompts “An emotional dog next to an alien landscape” and “An ethereal swan perched by a planet”.</p>
Full article ">Figure 10
<p>The generated novel paintings of input text prompts “A gothic sea monster in the high seas” and “A drone hovering over the sky”.</p>
Full article ">Figure 11
<p>The generated novel paintings of input text prompts “A tank in the lake” and “A computer keyboard with various keys”.</p>
Full article ">
19 pages, 1018 KiB  
Article
Towards Robust Neural Rankers with Large Language Model: A Contrastive Training Approach
by Ziyang Pan, Kangjia Fan, Rongyu Liu and Daifeng Li
Appl. Sci. 2023, 13(18), 10148; https://doi.org/10.3390/app131810148 - 8 Sep 2023
Viewed by 1767
Abstract
Pre-trained language model-based neural rankers have been widely applied in information retrieval (IR). However, the robustness issue of current IR models has not received sufficient attention, which could significantly impact the user experience in practical applications. In this study, we focus on the [...] Read more.
Pre-trained language model-based neural rankers have been widely applied in information retrieval (IR). However, the robustness issue of current IR models has not received sufficient attention, which could significantly impact the user experience in practical applications. In this study, we focus on the defensive ability of IR models against query attacks while guaranteeing their retrieval performance. We discover that improving the robustness of IR models not only requires a focus on model architecture and training methods but is also closely related to the quality of data. Different from previous research, we use large language models (LLMs) to generate query variations with the same intent, which exhibit richer and more realistic expressions while maintaining consistent query intent. Based on LLM-generated query variations, we propose a novel contrastive training framework that substantially enhances the robustness of IR models to query perturbations. Specifically, we combine the contrastive loss in the representation space of query variations with the ranking loss in the retrieval training stage to improve the model’s ability to understand the underlying semantic information of queries. Experimental results on two public datasets, WikiQA and ANTIQUE, demonstrate that the proposed contrastive training approach effectively improves the robustness of models facing query attack scenarios while outperforming baselines in retrieval performance. Compared with the best baseline approach, the improvements in average robustness performance of Reranker IR models are 24.9%, 26.5%, 27.0%, and 75.0% on WikiQA and 8.7%, 1.9%, 6.3%, and 13.6% on ANTIQUE, in terms of the MAP (Mean Average Precision), MRR (Mean Reciprocal Rank), nDCG@10 (Normalized Discounted Cumulative Gain) and P@10 (Precision), respectively. Full article
Show Figures

Figure 1

Figure 1
<p>The training paradigm for IR ranking models.</p>
Full article ">Figure 2
<p>Contrastive training framework for a more robust PLM-based IR model. The original queries are used to generate different expressions with the same intent during the query variation phase, utilizing a LLM. The retrieval structure is illustrated on Reranker as an example, where BPR Loss serves as the loss function for the ranking task and Normalized Temperature-Scaled Cross Entropy Loss serves for the intent alignment task.</p>
Full article ">Figure 3
<p>The avg d. and worst d. of MAP, MRR, P@10, and NDCG@10 of different IR models. We selected a subset of traditional retrieval models from the baselines, as well as PLM-based IR models, represented by BERT. Both BERT and our proposed method employed Cross Encoder (Reranker). The data in the top two rows represent the results for WikiQA, while the data in the bottom two rows represent the results for ANTIQUE. The values shown in the subfigures represent the absolute values of the actual results. A higher value indicates poorer robustness. Each color signifies the robustness of the corresponding model under different datasets and evaluation metrics.</p>
Full article ">
19 pages, 2313 KiB  
Article
Few-Shot Knowledge Graph Completion Model Based on Relation Learning
by Weijun Li, Jianlai Gu, Ang Li, Yuxiao Gao and Xinyong Zhang
Appl. Sci. 2023, 13(17), 9513; https://doi.org/10.3390/app13179513 - 22 Aug 2023
Viewed by 1498
Abstract
Considering the complexity of entity pair relations and the information contained in the target neighborhood in few-shot knowledge graphs (KG), existing few-shot KG completion methods generally suffer from insufficient relation representation learning capabilities and neglecting the contextual semantics of entities. To tackle the [...] Read more.
Considering the complexity of entity pair relations and the information contained in the target neighborhood in few-shot knowledge graphs (KG), existing few-shot KG completion methods generally suffer from insufficient relation representation learning capabilities and neglecting the contextual semantics of entities. To tackle the above problems, we propose a Few-shot Relation Learning-based Knowledge Graph Completion model (FRL-KGC). First, a gating mechanism is introduced during the aggregation of higher-order neighborhoods of entities in formation, enriching the central entity representation while reducing the adverse effects of noisy neighbors. Second, during the relation representation learning stage, a more accurate relation representation is learned by using the correlation between entity pairs in the reference set. Finally, an LSTM structure is incorporated into the Transformer learner to enhance its ability to learn the contextual semantics of entities and relations and predict new factual knowledge. We conducted comparative experiments on the publicly available NELL-One and Wiki-One datasets, comparing FRL-KGC with six few-shot knowledge graph completion models and five traditional knowledge graph completion models for five-shot link prediction. The results showed that FRL-KGC outperformed all comparison models in terms of MRR, Hits@10, Hits@5, and Hits@1 metrics. Full article
(This article belongs to the Special Issue Knowledge Graphs: State-of-the-Art and Applications)
Show Figures

Figure 1

Figure 1
<p>An example of a five-shot KGC task.</p>
Full article ">Figure 2
<p>Overview of the FRL-KGC framework.</p>
Full article ">Figure 3
<p>The main structure of the high-order neighborhood entity encoder based on the gating mechanism.</p>
Full article ">Figure 4
<p>The main structure of the relation representation encoder.</p>
Full article ">Figure 5
<p>The main structure of a learning framework composed of a Transformer and an LSTM.</p>
Full article ">Figure 6
<p>The schematic diagram of matching process calculation.</p>
Full article ">Figure 7
<p>Impact of few-shot size <span class="html-italic">K</span> in the performance of FKGC methods on Wiki-One dataset.</p>
Full article ">
28 pages, 1207 KiB  
Article
Graph-Based Extractive Text Summarization Sentence Scoring Scheme for Big Data Applications
by Jai Prakash Verma, Shir Bhargav, Madhuri Bhavsar, Pronaya Bhattacharya, Ali Bostani, Subrata Chowdhury, Julian Webber and Abolfazl Mehbodniya
Information 2023, 14(9), 472; https://doi.org/10.3390/info14090472 - 22 Aug 2023
Cited by 1 | Viewed by 3454
Abstract
The recent advancements in big data and natural language processing (NLP) have necessitated proficient text mining (TM) schemes that can interpret and analyze voluminous textual data. Text summarization (TS) acts as an essential pillar within recommendation engines. Despite the prevalent use of abstractive [...] Read more.
The recent advancements in big data and natural language processing (NLP) have necessitated proficient text mining (TM) schemes that can interpret and analyze voluminous textual data. Text summarization (TS) acts as an essential pillar within recommendation engines. Despite the prevalent use of abstractive techniques in TS, an anticipated shift towards a graph-based extractive TS (ETS) scheme is becoming apparent. The models, although simpler and less resource-intensive, are key in assessing reviews and feedback on products or services. Nonetheless, current methodologies have not fully resolved concerns surrounding complexity, adaptability, and computational demands. Thus, we propose our scheme, GETS, utilizing a graph-based model to forge connections among words and sentences through statistical procedures. The structure encompasses a post-processing stage that includes graph-based sentence clustering. Employing the Apache Spark framework, the scheme is designed for parallel execution, making it adaptable to real-world applications. For evaluation, we selected 500 documents from the WikiHow and Opinosis datasets, categorized them into five classes, and applied the recall-oriented understudying gisting evaluation (ROUGE) parameters for comparison with measures ROUGE-1, 2, and L. The results include recall scores of 0.3942, 0.0952, and 0.3436 for ROUGE-1, 2, and L, respectively (when using the clustered approach). Through a juxtaposition with existing models such as BERTEXT (with 3-gram, 4-gram) and MATCHSUM, our scheme has demonstrated notable improvements, substantiating its applicability and effectiveness in real-world scenarios. Full article
(This article belongs to the Special Issue Text Mining: Challenges, Algorithms, Tools and Applications)
Show Figures

Figure 1

Figure 1
<p>A classification taxonomy of ATS ecosystem [<a href="#B14-information-14-00472" class="html-bibr">14</a>].</p>
Full article ">Figure 2
<p>A generic model for extractive ATS [<a href="#B14-information-14-00472" class="html-bibr">14</a>].</p>
Full article ">Figure 3
<p>Sentence-based model proposed by [<a href="#B37-information-14-00472" class="html-bibr">37</a>].</p>
Full article ">Figure 4
<p>An example to generate sentence-centric model.</p>
Full article ">Figure 5
<p>Word-based model proposed by [<a href="#B38-information-14-00472" class="html-bibr">38</a>].</p>
Full article ">Figure 6
<p>Example of word graph of a sentence.</p>
Full article ">Figure 7
<p>Graph-based extractive text summarization architecture.</p>
Full article ">Figure 8
<p>Flowchart for execution flow of proposed scheme.</p>
Full article ">Figure 9
<p>ROUGE-1 score comparison between models M1 and M2 for WikiHow dataset.</p>
Full article ">Figure 10
<p>ROUGE-2 score comparison between models M1 and M2 for WikiHow dataset.</p>
Full article ">Figure 11
<p>ROUGE-L score comparison between models M1 and M2 for wikiHow dataset.</p>
Full article ">Figure 12
<p>ROUGE score comparison between model proposed scheme (average of all five clusters) and base model [<a href="#B89-information-14-00472" class="html-bibr">89</a>].</p>
Full article ">Figure 13
<p>ROUGE score comparison between models M1 and M2 for Opinosis dataset.</p>
Full article ">
Back to TopTop