<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1 20151215//EN" "http://jats.nlm.nih.gov/publishing/1.1/JATS-journalpublishing1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xml:lang="en" article-type="research-article" dtd-version="1.1">
<front>
<journal-meta>
<journal-id journal-id-type="pmc">CMC</journal-id>
<journal-id journal-id-type="nlm-ta">CMC</journal-id>
<journal-id journal-id-type="publisher-id">CMC</journal-id>
<journal-title-group>
<journal-title>Computers, Materials &#x0026; Continua</journal-title>
</journal-title-group>
<issn pub-type="epub">1546-2226</issn>
<issn pub-type="ppub">1546-2218</issn>
<publisher>
<publisher-name>Tech Science Press</publisher-name>
<publisher-loc>USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">60517</article-id>
<article-id pub-id-type="doi">10.32604/cmc.2025.060517</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Syntax-Enhanced Entity Relation Extraction with Complex Knowledge</article-title>
<alt-title alt-title-type="left-running-head">Syntax-Enhanced Entity Relation Extraction with Complex Knowledge</alt-title>
<alt-title alt-title-type="right-running-head">Syntax-Enhanced Entity Relation Extraction with Complex Knowledge</alt-title>
</title-group>
<contrib-group>
<contrib id="author-1" contrib-type="author">
<name name-style="western"><surname>Bi</surname><given-names>Mingwen</given-names></name><xref ref-type="aff" rid="aff-1">1</xref></contrib>
<contrib id="author-2" contrib-type="author" corresp="yes">
<name name-style="western"><surname>Chen</surname><given-names>Hefei</given-names></name><xref ref-type="aff" rid="aff-2">2</xref><email>hff125125@163.com</email></contrib>
<contrib id="author-3" contrib-type="author" corresp="yes">
<name name-style="western"><surname>Yang</surname><given-names>Zhenghong</given-names></name><xref ref-type="aff" rid="aff-3">3</xref><email>yangjohn@cau.edu.cn</email></contrib>
<aff id="aff-1"><label>1</label><institution>Business School, Beijing Wuzi University</institution>, <addr-line>Beijing, 101149</addr-line>, <country>China</country></aff>
<aff id="aff-2"><label>2</label><institution>National Engineering Research Center for Agri-product Quality Traceability, Beijing Technology and Business University</institution>, <addr-line>Beijing, 100048</addr-line>, <country>China</country></aff>
<aff id="aff-3"><label>3</label><institution>College of Information and Electrical Engineering, China Agricultural University</institution>, <addr-line>Beijing, 100083</addr-line>, <country>China</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>&#x002A;</label>Corresponding Authors: Hefei Chen. Email: <email>hff125125@163.com</email>; Zhenghong Yang. Email: <email>yangjohn@cau.edu.cn</email></corresp>
</author-notes>
<pub-date date-type="collection" publication-format="electronic">
<year>2025</year>
</pub-date>
<pub-date date-type="pub" publication-format="electronic">
<day>26</day><month>03</month><year>2025</year>
</pub-date>
<volume>83</volume>
<issue>1</issue>
<fpage>861</fpage>
<lpage>876</lpage>
<history>
<date date-type="received">
<day>03</day>
<month>11</month>
<year>2024</year>
</date>
<date date-type="accepted">
<day>08</day>
<month>1</month>
<year>2025</year>
</date>
</history>
<permissions>
<copyright-statement>&#x00A9; 2025 The Authors.</copyright-statement>
<copyright-year>2025</copyright-year>
<copyright-holder>Published by Tech Science Press.</copyright-holder>
<license xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>This work is licensed under a <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International License</ext-link>, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<self-uri content-type="pdf" xlink:href="TSP_CMC_60517.pdf"></self-uri>
<abstract>
<p>Entity relation extraction, a fundamental and essential task in natural language processing (NLP), has garnered significant attention over an extended period., aiming to extract the core of semantic knowledge from unstructured text, i.e., entities and the relations between them. At present, the main dilemma of Chinese entity relation extraction research lies in nested entities, relation overlap, and lack of entity relation interaction. This dilemma is particularly prominent in complex knowledge extraction tasks with high-density knowledge, imprecise syntactic structure, and lack of semantic roles. To address these challenges, this paper presents an innovative &#x201C;character-level&#x201D; Chinese part-of-speech (CN-POS) tagging approach and incorporates part-of-speech (POS) information into the pre-trained model, aiming to improve its semantic understanding and syntactic information processing capabilities. Additionally, A relation reference filling mechanism (RF) is proposed to enhance the semantic interaction between relations and entities, utilize relations to guide entity modeling, improve the boundary prediction ability of entity models for nested entity phenomena, and increase the cascading accuracy of entity-relation triples. Meanwhile, the &#x201C;Queue&#x201D; sub-task connection strategy is adopted to alleviate triplet cascading errors caused by overlapping relations, and a Syntax-enhanced entity relation extraction model (SE-RE) is constructed. The model showed excellent performance on the self-constructed E-commerce Product Information dataset (EPI) in this article. The results demonstrate that integrating POS enhancement into the pre-trained encoding model significantly boosts the performance of entity relation extraction models compared to baseline methods. Specifically, the F1-score fluctuation in subtasks caused by error accumulation was reduced by 3.21%, while the F1-score for entity-relation triplet extraction improved by 1.91%.</p>
</abstract>
<kwd-group kwd-group-type="author">
<kwd>Entity relation extraction</kwd>
<kwd>complex knowledge</kwd>
<kwd>syntax-enhanced</kwd>
<kwd>semantic interaction</kwd>
<kwd>pre-trained BERT</kwd>
</kwd-group>
<funding-group>
<award-group id="awg1">
<funding-source>National Key Technology R&#x0026;D Program of China</funding-source>
<award-id>2021YFD2100605</award-id>
</award-group>
<award-group id="awg2">
<funding-source>National Natural Science Foundation of China</funding-source>
<award-id>62433002</award-id>
</award-group>
<award-group id="awg3">
<funding-source>Project of Construction and Support for High-Level Innovative Teams of Beijing Municipal Institutions</funding-source>
<award-id>BPHR20220104</award-id>
</award-group>
<award-group id="awg4">
<funding-source>Beijing Scholars Program</funding-source>
<award-id>099</award-id>
</award-group>
</funding-group>
</article-meta>
</front>
<body>
<sec id="s1">
<label>1</label>
<title>Introduction</title>
<p>In past years, the development of knowledge graphs tailored to specific vertical domains has demonstrated significant application potential and economic worth. Although companies can easily utilize tools to dump structured data directly into the knowledge base, most of the knowledge still exists in unstructured text. Improving the automatic extraction efficiency of entities and relations has emerged as the key to fixing the problem [<xref ref-type="bibr" rid="ref-1">1</xref>]. Entity relation extraction, the central goal in the domain of information extraction, tries to extract set types of entities and relations of unorganized text and construct a structured knowledge triplet (main entity, relation, and guest entity), which is of great significance for downstream applications such as knowledge graph construction [<xref ref-type="bibr" rid="ref-2">2</xref>,<xref ref-type="bibr" rid="ref-3">3</xref>], intelligent question answering [<xref ref-type="bibr" rid="ref-4">4</xref>], and public opinion analysis [<xref ref-type="bibr" rid="ref-5">5</xref>]. Researchers typically categorize the problem into two main subtasks: named entity recognition [<xref ref-type="bibr" rid="ref-6">6</xref>&#x2013;<xref ref-type="bibr" rid="ref-8">8</xref>] and relation extraction [<xref ref-type="bibr" rid="ref-9">9</xref>,<xref ref-type="bibr" rid="ref-10">10</xref>]. The two dominant research frameworks are the pipeline approach and the joint method.</p>
<p>As an early modeling method, the pipeline method trains two models to extract entities and classify relations, respectively. The entity model and the relation model can be trained using independent data sets, and it is not necessary to label them at the same time. According to the different execution orders of the model in the framework, it can be divided into the traditional entity relation model and the relation-entity model. However, because the two models are trained independently, and the post model is dependent upon the earlier model&#x2019;s prediction outcomes, this leads to the problems of error accumulation and lack of interaction in this method [<xref ref-type="bibr" rid="ref-11">11</xref>]. In addition, the traditional pipeline method also has some entity redundancy [<xref ref-type="bibr" rid="ref-12">12</xref>].</p>
<p>Considering the aforementioned issues with the pipeline method, the end-to-end joint method as a novel modeling method has received a lot of attention [<xref ref-type="bibr" rid="ref-13">13</xref>,<xref ref-type="bibr" rid="ref-14">14</xref>]. According to the number of framework stages, the joint method can be divided into multi-task learning (two-stage) and structured prediction (single-stage) [<xref ref-type="bibr" rid="ref-15">15</xref>].</p>
<p><bold>Multi-task learning.</bold> Although this approach adopts an end-to-end training framework, it inherently involves cascading tasks, making it fundamentally similar to the pipeline method. Specifically, it still constructs separate models for entity recognition and relation extraction, but with a key distinction: both tasks share a unified encoder and leverage parameter sharing for joint optimization. For example, reference [<xref ref-type="bibr" rid="ref-16">16</xref>] employed Conditional Random Fields (CRF) to reframe entity recognition and relation extraction as a multi-head selection task, effectively addressing multi-relation challenges. In a related effort, reference [<xref ref-type="bibr" rid="ref-17">17</xref>] proposed the CASREL framework, which utilizes a cascading binary markup strategy to model relations as functions mapping subjects to objects within sentences, thereby resolving the issue of overlapping relation triples. Similarly, reference [<xref ref-type="bibr" rid="ref-18">18</xref>] introduced SPERT, a span-based joint entity and relation extraction model utilizing an attention mechanism. This model mitigates the problem of entity overlap by incorporating advanced entity recognition and filtering mechanisms. Further innovations include the work of [<xref ref-type="bibr" rid="ref-19">19</xref>], which approached joint entity and relation extraction as a direct set prediction problem. This strategy eliminates the complexity of predicting the sequence of multiple triples, streamlining the extraction process. To tackle entity overlap more effectively, reference [<xref ref-type="bibr" rid="ref-20">20</xref>] proposed an end-to-end learning model integrated with a replication mechanism, enabling the simultaneous extraction of related facts from sentences of varying complexity. Moreover, a novel reading comprehension-based paradigm has gained prominence, reframing entity and relation extraction as a multi-turn question-answering task. This method transforms the extraction process into one of identifying the answer&#x2019;s scope within the given context, offering a flexible and intuitive framework for handling intricate extraction scenarios [<xref ref-type="bibr" rid="ref-21">21</xref>]. However, similar to traditional pipeline methods, multi-task learning methods still have significant gaps in training and inference processes. This method has inherent leakage bias [<xref ref-type="bibr" rid="ref-22">22</xref>], and there is still an error accumulation problem in inference, thereby affecting the accuracy and reliability of the final results.</p>
<p><bold>Structured prediction.</bold> Reference [<xref ref-type="bibr" rid="ref-23">23</xref>] introduced a relation-specific attention network (RSAN) that employs a relation-aware attention mechanism to generate tailored phrase representations for different relations. This model utilizes sequence labeling to identify and extract corresponding head and tail elements. Similarly, reference [<xref ref-type="bibr" rid="ref-24">24</xref>] proposed an innovative labeling scheme that reformulates the joint extraction task as a labeling problem, enabling the direct extraction of entities and their relations in a unified manner. Additionally, reference [<xref ref-type="bibr" rid="ref-22">22</xref>] presented a single-stage joint extraction model, TPLinker, which redefines the task as a token pair linking problem, effectively addressing the issue of exposure bias in the extraction process.</p>
<p>While the structured prediction method successfully mitigates issues such as exposure bias, entity overlap, and missing interactions, it does not necessarily imply that the joint method outperforms the pipeline method [<xref ref-type="bibr" rid="ref-15">15</xref>]. In fact, current methods for entity and relation extraction have some limitations, such as difficulty in handling nested entity and relation overlap, which pose challenges for models in extracting complex knowledge. In addition, when the encoding dimension of the data is very high, the model may encounter difficulties in training due to extremely sparse data. Consequently, the development of novel syntax-enhanced models is essential to effectively overcome these limitations and enhance the accuracy and robustness of entity relation extraction.</p>
<p>This paper adopts the pipeline extraction method of relation before entity, based on the Chinese pre-trained BERT [<xref ref-type="bibr" rid="ref-25">25</xref>] (i.e., Bidirectional Encoder Representation from Transformers) model called &#x201C;BERT-wwm-ext&#x201D; [<xref ref-type="bibr" rid="ref-26">26</xref>]. Two independently trained models are included in the pipeline extraction method, which are called entity model and relation model. The predicted value of the relation model is supplemented by the input of the entity model as an additional semantic feature and combined with the &#x201C;Queue&#x201D; input scheme proposed in this paper. The model effectively solves the problems of relation overlap and entity overlap that were neglected by previous research work. In addition, it benefits from the excellent performance of the pre-trained model BERT in the field of text classification (96%), we combine BERT with POS information to further enhance the performance of the relation model. On the basis of alleviating the exposure bias of the pipeline method as much as possible, it effectively avoids the limitation of the joint extraction model in processing complex samples.</p>
<p>In this paper, a comprehensive comparative experiment is carried out on the self-constructed data set EPI. The experimental results show that the performance fluctuation of subtasks caused by error accumulation decreased by 3.21%, while the entity relation triplet extraction performance increased by 1.91%, which proves the effectiveness of the model.</p>
<p>The main contributions of this paper are as follows:</p>
<p>(1) This paper proposes CN-POS and introduces the POS information represented by it in the pre-trained language model, which solves the problem that Chinese POS and character-level encoding cannot be aligned, and enhances the model&#x2019;s semantic understanding ability of syntactic information.</p>
<p>(2) This paper introduces the RF framework, which employs a &#x201C;Queue&#x201D; scheme to link the relation model and entity model. This approach directs the model&#x2019;s attention to specific parts of the input sentence corresponding to different relations, thereby strengthening the semantic interaction between relations and entities and mitigating the effects of knowledge overlap.</p>
<p>(3) This paper constructs an EPI dataset and proposes SE-RE, achieving optimal performance on the dataset.</p>
<p>The remainder of this paper is structured as follows: <xref ref-type="sec" rid="s2">Section 2</xref> presents the dataset and task description. <xref ref-type="sec" rid="s3">Section 3</xref> details the proposed model and algorithm. <xref ref-type="sec" rid="s4">Section 4</xref> outlines the experimental setup, followed by results and discussion in <xref ref-type="sec" rid="s5">Section 5</xref>. Finally, <xref ref-type="sec" rid="s6">Section 6</xref> concludes the paper.</p>
</sec>
<sec id="s2">
<label>2</label>
<title>Dataset and Task</title>
<p>The starting point of this paper is to provide a high-performance model of automatic extraction of entity relation for the vertical domain knowledge graph based on the construction of the ontology layer [<xref ref-type="bibr" rid="ref-27">27</xref>]. However, most of the existing public datasets are oriented to the public domain and rarely involve professional fields. Therefore, they are mostly simple sentences with low semantic complexity, which are reflected in the following characteristics: (1) short sentences (2) each sentence contains only one or two knowledge triples (3) the semantic differences of the entities in sentences are obvious and easy to recognize (e.g., person, location, organization, time, etc.), and knowledge extraction can be achieved through some excellent NLP tools.</p>
<p>A dataset called EPI describing e-commerce listings was constructed in this paper, which is a complex sample with high-density knowledge in the e-commerce field. The data for the EPI dataset is obtained from commonly used e-commerce platforms without additional processing or reduction, preserving the integrity of the original data. This characteristic ensures its applicability in a wide range of scenarios, accurately reflecting the features and complexities of real-world e-commerce data. We extracted information about 1653 products and obtained 5498 sample data through sentence segmentation. Each sample describes the information on related products. A total of 14 relation types and 12 entity categories are defined, which constitute 14 schemas. The goal is to extract structured knowledge triple RDF (Resource Description Framework). A specific example is shown in <xref ref-type="table" rid="table-1">Table 1</xref>.</p>
<table-wrap id="table-1">
<label>Table 1</label>
<caption>
<title>A specific example in EPI</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Sample</th>
<th>RDF</th>
</tr>
</thead>
<tbody>
<tr>
<td><bold>ZH:</bold> &#x534E;&#x7855;&#x7075;&#x8000;&#x662F;&#x7B14;&#x8BB0;&#x672C;&#x7535;&#x8111;, &#x914D;&#x5907;&#x9510;&#x9F99;&#x5904;&#x7406;&#x5668;, &#x9002;&#x7528;&#x4E8E;&#x529E;&#x516C;&#x548C;&#x5A31;&#x4E50;&#x3002;</td>
<td><bold>ZH:</bold> (&#x534E;&#x7855;&#x7075;&#x8000;, &#x7C7B;&#x522B;, &#x7B14;&#x8BB0;&#x672C;&#x7535;&#x8111;), (&#x534E;&#x7855;&#x7075;&#x8000;, &#x54C1;&#x724C;, &#x534E;&#x7855;), (&#x534E;&#x7855;&#x7075;&#x8000;, &#x89C4;&#x683C;, &#x9510;&#x9F99;&#x5904;&#x7406;&#x5668;), (&#x534E;&#x7855;&#x7075;&#x8000;, &#x9002;&#x7528;&#x573A;&#x666F;, &#x529E;&#x516C;), (&#x534E;&#x7855;&#x7075;&#x8000;, &#x9002;&#x7528;&#x573A;&#x666F;, &#x5A31;&#x4E50;)</td>
</tr>
<tr>
<td><bold>EN:</bold> ASUS Zenbook is a laptop that is equipped with a Ryzen processor for both work and play</td>
<td><bold>EN:</bold> (ASUS Zenbook, Category, laptop), (ASUS Zenbook, Brand, ASUS), (ASUS Zenbook, Specification, Ryzen processor), (ASUS Zenbook, Usage, work), (ASUS Zenbook, Usage, play)</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>It is worth noting that: (1) A product can have multiple aliases, be sold on multiple platforms, and be suitable for a variety of scenarios (2) Different styles and colors of a product will interfere with the price (3) Different models of a product have differences in function, material and price (4) The promotional price of the product in different time periods is different (5) Because Chinese places a strong focus on semantics and less on framework, there are a large number of reference omissions and irregular splicing vocabulary in the samples.</p>
</sec>
<sec id="s3">
<label>3</label>
<title>Proposed Model</title>
<p>Considering the limitations of existing entity relation extraction methods in effectively modeling the interactions between entities and relations, this paper proposes SE-RE, a novel approach designed to tackle the challenges of entity nesting and relation overlap in complex semantic texts. The overall structure of the model is illustrated in <xref ref-type="fig" rid="fig-1">Fig. 1</xref>, while <xref ref-type="fig" rid="fig-2">Fig. 2</xref> provides a detailed case study. The model architecture comprises the following three main components: (1) To enhance the semantic modeling ability of each extraction model for Chinese syntactic information, this paper proposes CN-POS and introduces it into the Chinese pre-trained BERT model to compensate for its shortcomings in POS feature modeling. (2) To improve the interaction between entity recognition and relation extraction models, this paper proposes a &#x201C;Queue&#x201D; input scheme and RF to achieve syntactic and semantic enhancement based on different relations. (3) To alleviate the problems of relation overlap and entity redundancy, this paper employs a &#x201C;Queue&#x201D; input framework to feed the strengthened input sequences into the entity recognition model in order to obtain entity pairs cascaded by the current relation.</p>
<fig id="fig-1">
<label>Figure 1</label>
<caption>
<title>The overall architecture of the SE-RE model</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_60517-fig-1.tif"/>
</fig><fig id="fig-2">
<label>Figure 2</label>
<caption>
<title>A specifific case study</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_60517-fig-2.tif"/>
</fig>
<sec id="s3_1">
<label>3.1</label>
<title>Problem Definition</title>
<p>Traditional entity relation extraction methods only extract entities and relations from the input sequence, making it difficult to achieve cascading matching between entities and relations. But in this paper, for a given input sequence <inline-formula id="ieqn-1"><mml:math id="mml-ieqn-1"><mml:mi>S</mml:mi><mml:mo>=</mml:mo><mml:mo fence="false" stretchy="false">{</mml:mo><mml:msub><mml:mi>W</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>W</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>W</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo fence="false" stretchy="false">}</mml:mo></mml:math></inline-formula>, the SE-RE can extract all knowledge triples RDF that conform to the predefined schema, Equations such as <xref ref-type="disp-formula" rid="eqn-1">Eq. (1)</xref>.
<disp-formula id="eqn-1"><label>(1)</label><mml:math id="mml-eqn-1" display="block"><mml:mi>R</mml:mi><mml:mi>D</mml:mi><mml:mi>F</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>s</mml:mi><mml:mi>u</mml:mi><mml:mi>b</mml:mi><mml:mi>j</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>t</mml:mi><mml:mo>,</mml:mo><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>l</mml:mi><mml:mi>a</mml:mi><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>o</mml:mi><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>o</mml:mi><mml:mi>b</mml:mi><mml:mi>j</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>j</mml:mi><mml:mo>&#x2208;</mml:mo><mml:mrow><mml:mo>[</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mi>m</mml:mi><mml:mo>]</mml:mo></mml:mrow><mml:mo>}</mml:mo></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>s</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi>R</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>o</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x2208;</mml:mo><mml:mi>S</mml:mi><mml:mi>c</mml:mi><mml:mi>h</mml:mi><mml:mi>e</mml:mi><mml:mi>m</mml:mi><mml:mi>a</mml:mi><mml:mo>,</mml:mo></mml:math></disp-formula>where <inline-formula id="ieqn-2"><mml:math id="mml-ieqn-2"><mml:mrow><mml:mo>(</mml:mo><mml:mi>s</mml:mi><mml:mi>u</mml:mi><mml:mi>b</mml:mi><mml:mi>j</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>t</mml:mi><mml:mo>,</mml:mo><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>l</mml:mi><mml:mi>a</mml:mi><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>o</mml:mi><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>o</mml:mi><mml:mi>b</mml:mi><mml:mi>j</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> represents the knowledge triples extracted from the input sequence, <inline-formula id="ieqn-3"><mml:math id="mml-ieqn-3"><mml:mi>m</mml:mi></mml:math></inline-formula> represents the total number of triples, <inline-formula id="ieqn-4"><mml:math id="mml-ieqn-4"><mml:mi>S</mml:mi><mml:mi>c</mml:mi><mml:mi>h</mml:mi><mml:mi>e</mml:mi><mml:mi>m</mml:mi><mml:mi>a</mml:mi></mml:math></inline-formula> is a predefined set of triplet categories, and <inline-formula id="ieqn-5"><mml:math id="mml-ieqn-5"><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>s</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, <inline-formula id="ieqn-6"><mml:math id="mml-ieqn-6"><mml:mi>R</mml:mi></mml:math></inline-formula>, and <inline-formula id="ieqn-7"><mml:math id="mml-ieqn-7"><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>o</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> respectively represents the categories of the main entity, relation, and guest entity set in the schema.</p>
</sec>
<sec id="s3_2">
<label>3.2</label>
<title>Input Representation</title>
<p>For the given text sequence <inline-formula id="ieqn-8"><mml:math id="mml-ieqn-8"><mml:mi>S</mml:mi><mml:mo>=</mml:mo><mml:mo fence="false" stretchy="false">{</mml:mo><mml:msub><mml:mi>W</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>W</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>W</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo fence="false" stretchy="false">}</mml:mo></mml:math></inline-formula>, this paper uses the large-scale Chinese pre-trained BERT model to encode it into a vector <inline-formula id="ieqn-9"><mml:math id="mml-ieqn-9"><mml:msup><mml:mi>S</mml:mi><mml:mrow><mml:mi>e</mml:mi><mml:mi>m</mml:mi><mml:mi>b</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>. To meet the special input format of the BERT model, special identifiers <inline-formula id="ieqn-10"><mml:math id="mml-ieqn-10"><mml:mo stretchy="false">[</mml:mo><mml:mi>C</mml:mi><mml:mi>L</mml:mi><mml:mi>S</mml:mi><mml:mo stretchy="false">]</mml:mo></mml:math></inline-formula> and <inline-formula id="ieqn-11"><mml:math id="mml-ieqn-11"><mml:mo stretchy="false">[</mml:mo><mml:mi>S</mml:mi><mml:mi>E</mml:mi><mml:mi>P</mml:mi><mml:mo stretchy="false">]</mml:mo></mml:math></inline-formula> are first added at the beginning and end of <inline-formula id="ieqn-12"><mml:math id="mml-ieqn-12"><mml:mi>S</mml:mi></mml:math></inline-formula> to form <inline-formula id="ieqn-13"><mml:math id="mml-ieqn-13"><mml:mo fence="false" stretchy="false">{</mml:mo><mml:mrow><mml:mo>[</mml:mo><mml:mi>C</mml:mi><mml:mi>L</mml:mi><mml:mi>S</mml:mi><mml:mo>]</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:msub><mml:mi>W</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>W</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>W</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo stretchy="false">[</mml:mo><mml:mi>S</mml:mi><mml:mi>E</mml:mi><mml:mi>P</mml:mi><mml:mo stretchy="false">]</mml:mo><mml:mo fence="false" stretchy="false">}</mml:mo></mml:math></inline-formula>. If there is context in the input sequence, that is, <inline-formula id="ieqn-14"><mml:math id="mml-ieqn-14"><mml:mi>S</mml:mi></mml:math></inline-formula> covers two different contexts of text sequence <inline-formula id="ieqn-15"><mml:math id="mml-ieqn-15"><mml:mi>t</mml:mi><mml:mi>e</mml:mi><mml:mi>x</mml:mi><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mi>t</mml:mi><mml:mi>e</mml:mi><mml:mi>x</mml:mi><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, we need to add the <inline-formula id="ieqn-16"><mml:math id="mml-ieqn-16"><mml:mo stretchy="false">[</mml:mo><mml:mi>S</mml:mi><mml:mi>E</mml:mi><mml:mi>P</mml:mi><mml:mo stretchy="false">]</mml:mo></mml:math></inline-formula> mark between the sentences to form <inline-formula id="ieqn-17"><mml:math id="mml-ieqn-17"><mml:mo fence="false" stretchy="false">{</mml:mo><mml:mo stretchy="false">[</mml:mo><mml:mi>C</mml:mi><mml:mi>L</mml:mi><mml:mi>S</mml:mi><mml:mo stretchy="false">]</mml:mo><mml:mo>,</mml:mo><mml:mi>t</mml:mi><mml:mi>e</mml:mi><mml:mi>x</mml:mi><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo stretchy="false">[</mml:mo><mml:mi>S</mml:mi><mml:mi>E</mml:mi><mml:mi>P</mml:mi><mml:mo stretchy="false">]</mml:mo><mml:mo>,</mml:mo><mml:mi>t</mml:mi><mml:mi>e</mml:mi><mml:mi>x</mml:mi><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo stretchy="false">[</mml:mo><mml:mi>S</mml:mi><mml:mi>E</mml:mi><mml:mi>P</mml:mi><mml:mo stretchy="false">]</mml:mo><mml:mo fence="false" stretchy="false">}</mml:mo></mml:math></inline-formula>.</p>
</sec>
<sec id="s3_3">
<label>3.3</label>
<title>CN-POS Tagging Strategy</title>
<p>Due to BERT being a character-level encoding model, while Chinese POS information is a word-level feature. For instance, the Chinese &#x201C;&#x7C7B;&#x522B;&#x201D; corresponds to the English word &#x201C;Category&#x201D;. As a token, the POS of &#x201C;Category&#x201D; is noun (n). Similarly, the POS of &#x201C;&#x7C7B;&#x522B;&#x201D; is also n. But as tokens, &#x201C;&#x7C7B;&#x201D; and &#x201C;&#x522B;&#x201D; obviously cannot be tagged in a traditional manner, because the characters do not express complete semantics. In addition, unlike the clear space boundaries between English words, Chinese needs additional word segmentation.</p>
<p>This paper proposes CN-POS to achieve alignment between Chinese POS information and token sequences. The specific process is shown in <xref ref-type="fig" rid="fig-3">Fig. 3</xref>. For the input sequence <inline-formula id="ieqn-18"><mml:math id="mml-ieqn-18"><mml:mi>S</mml:mi><mml:mo>=</mml:mo><mml:mo fence="false" stretchy="false">{</mml:mo><mml:msub><mml:mi>W</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>W</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>W</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo fence="false" stretchy="false">}</mml:mo></mml:math></inline-formula>, CN-POS first uses a hybrid NLP tool based on &#x201C;Jieba Segmentor, custom dictionary and LTP(Language Technology Plantform) POS-tagger&#x201D; to perform word segmentation and POS tagging, resulting in a sequence A of words and their corresponding POS tags <inline-formula id="ieqn-19"><mml:math id="mml-ieqn-19"><mml:mi>C</mml:mi><mml:mi>P</mml:mi><mml:mo>=</mml:mo><mml:mo fence="false" stretchy="false">{</mml:mo><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x003A;</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:msubsup><mml:mo fence="false" stretchy="false">}</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula>. For each character <inline-formula id="ieqn-20"><mml:math id="mml-ieqn-20"><mml:mi>t</mml:mi><mml:mi>o</mml:mi><mml:mi>k</mml:mi><mml:mi>e</mml:mi><mml:msubsup><mml:mi>n</mml:mi><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula> in any word <inline-formula id="ieqn-21"><mml:math id="mml-ieqn-21"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, CN-POS assigns a POS label <inline-formula id="ieqn-22"><mml:math id="mml-ieqn-22"><mml:mi>P</mml:mi><mml:mo>,</mml:mo><mml:mi>P</mml:mi><mml:mo>&#x2208;</mml:mo><mml:mo fence="false" stretchy="false">{</mml:mo><mml:mi>B</mml:mi><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi>I</mml:mi><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mrow></mml:msub><mml:mo fence="false" stretchy="false">}</mml:mo></mml:math></inline-formula>, which indicates the beginning and inside of the current POS, respectively. For example, for the noun &#x201C;Category&#x201D; with a part of speech of <inline-formula id="ieqn-23"><mml:math id="mml-ieqn-23"><mml:mi>n</mml:mi></mml:math></inline-formula>, the alignment of its token sequence is <inline-formula id="ieqn-24"><mml:math id="mml-ieqn-24"><mml:mo fence="false" stretchy="false">{</mml:mo><mml:mi>B</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>I</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mi>n</mml:mi><mml:mo fence="false" stretchy="false">}</mml:mo></mml:math></inline-formula>.</p>
<fig id="fig-3">
<label>Figure 3</label>
<caption>
<title>A BERT pre-training flowchart of CN-POS annotation and POS enhancement</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_60517-fig-3.tif"/>
</fig>
</sec>
<sec id="s3_4">
<label>3.4</label>
<title>POS Embedding</title>
<p>This paper references the POS embedding method in [<xref ref-type="bibr" rid="ref-28">28</xref>]. Specifically: we obtained the input POS tagging sequence <inline-formula id="ieqn-25"><mml:math id="mml-ieqn-25"><mml:mi>P</mml:mi><mml:mo>=</mml:mo><mml:mo fence="false" stretchy="false">{</mml:mo><mml:msub><mml:mi>p</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>p</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>p</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo fence="false" stretchy="false">}</mml:mo></mml:math></inline-formula>. The POS embedding layer leverages the sparse vector representation <inline-formula id="ieqn-26"><mml:math id="mml-ieqn-26"><mml:mi>P</mml:mi></mml:math></inline-formula> to generate a corresponding dense vector representation, enabling more efficient encoding of part-of-speech information <inline-formula id="ieqn-27"><mml:math id="mml-ieqn-27"><mml:msup><mml:mi>V</mml:mi><mml:mrow><mml:mi>P</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>, equations such as <xref ref-type="disp-formula" rid="eqn-2">Eq. (2)</xref>.
<disp-formula id="eqn-2"><label>(2)</label><mml:math id="mml-eqn-2" display="block"><mml:msup><mml:mi>V</mml:mi><mml:mrow><mml:mi>P</mml:mi></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mo fence="false" stretchy="false">{</mml:mo><mml:msubsup><mml:mi>V</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>p</mml:mi></mml:mrow></mml:msubsup><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>i</mml:mi><mml:mo>&#x2208;</mml:mo><mml:mo stretchy="false">[</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mi>n</mml:mi><mml:mo stretchy="false">]</mml:mo><mml:mo fence="false" stretchy="false">}</mml:mo><mml:mo>,</mml:mo></mml:math></disp-formula>where <inline-formula id="ieqn-28"><mml:math id="mml-ieqn-28"><mml:msubsup><mml:mi>V</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>p</mml:mi></mml:mrow></mml:msubsup><mml:mo>&#x2208;</mml:mo><mml:msup><mml:mi>R</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi mathvariant="normal">&#x005F;</mml:mi><mml:mi>p</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:mi mathvariant="normal">&#x005F;</mml:mi><mml:mi>e</mml:mi><mml:mi>m</mml:mi><mml:mi>b</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>, and <inline-formula id="ieqn-29"><mml:math id="mml-ieqn-29"><mml:mi>h</mml:mi><mml:mi mathvariant="normal">&#x005F;</mml:mi><mml:mi>p</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:mi mathvariant="normal">&#x005F;</mml:mi><mml:mi>e</mml:mi><mml:mi>m</mml:mi><mml:mi>b</mml:mi></mml:math></inline-formula> denotes the size of the hidden layer in the POS embedding. Subsequently, the self-attention layer is employed to capture and analyze the entire sequence of POS tags, facilitating a comprehensive understanding of their contextual relationships and extract the grammatical dependencies in the input sentence <inline-formula id="ieqn-30"><mml:math id="mml-ieqn-30"><mml:msup><mml:mi>h</mml:mi><mml:mrow><mml:mi>P</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>.</p>
</sec>
<sec id="s3_5">
<label>3.5</label>
<title>Relation Model</title>
<p>The relation extraction model aims to predict the relation categories contained in the input sequence. As shown in the &#x201C;Relation Model&#x201D; section of <xref ref-type="fig" rid="fig-1">Fig. 1</xref>, the common &#x201C;BERT&#x002B;Bi-LSTM&#x002B;FC&#x201D; (Bi-directional Long Short-Term Memory, Fully Connected) network architecture is used for the relation extraction task. For each input sequence <inline-formula id="ieqn-31"><mml:math id="mml-ieqn-31"><mml:mi>S</mml:mi></mml:math></inline-formula>, the input feature encoder is employed to obtain the character encoding <inline-formula id="ieqn-32"><mml:math id="mml-ieqn-32"><mml:msup><mml:mi>S</mml:mi><mml:mrow><mml:mi>e</mml:mi><mml:mi>m</mml:mi><mml:mi>b</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> and the POS encoding <inline-formula id="ieqn-33"><mml:math id="mml-ieqn-33"><mml:msup><mml:mi>P</mml:mi><mml:mrow><mml:mi>e</mml:mi><mml:mi>m</mml:mi><mml:mi>b</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>. Unlike the simple vector concatenation method used in previous research, this paper further maps the POS encoding <inline-formula id="ieqn-34"><mml:math id="mml-ieqn-34"><mml:msup><mml:mi>P</mml:mi><mml:mrow><mml:mi>e</mml:mi><mml:mi>m</mml:mi><mml:mi>b</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> to the <inline-formula id="ieqn-35"><mml:math id="mml-ieqn-35"><mml:mi>i</mml:mi><mml:mi>n</mml:mi><mml:mi>p</mml:mi><mml:mi>u</mml:mi><mml:mi>t</mml:mi><mml:mi mathvariant="normal">&#x005F;</mml:mi><mml:mi>i</mml:mi><mml:mi>d</mml:mi><mml:mi>s</mml:mi></mml:math></inline-formula> sequence of the BERT model to alleviate its interference with the character encoding <inline-formula id="ieqn-36"><mml:math id="mml-ieqn-36"><mml:msup><mml:mi>S</mml:mi><mml:mrow><mml:mi>e</mml:mi><mml:mi>m</mml:mi><mml:mi>b</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>, and utilizes the fine-tuning of the BERT model during the training process to optimize it. Then, the concatenated vector of the two is further passed through a layer of Bi-LSTM to obtain the final hidden layer state vector representation <inline-formula id="ieqn-37"><mml:math id="mml-ieqn-37"><mml:mi>H</mml:mi><mml:mo>=</mml:mo><mml:mo fence="false" stretchy="false">{</mml:mo><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo fence="false" stretchy="false">}</mml:mo></mml:math></inline-formula> of the input sequence. Afterwards, it is mapped onto the target relation category set through a FC network, and its probability distribution logit is calculated using the sigmoid function. Finally, the predicted set of relation categories <inline-formula id="ieqn-38"><mml:math id="mml-ieqn-38"><mml:mo fence="false" stretchy="false">{</mml:mo><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo fence="false" stretchy="false">}</mml:mo></mml:math></inline-formula> is output by comparing it to a pre-set threshold.</p>

<p>In the training stage of the model, <inline-formula id="ieqn-39"><mml:math id="mml-ieqn-39"><mml:mi>B</mml:mi><mml:mi>C</mml:mi><mml:mi>E</mml:mi><mml:mi>W</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi><mml:mi>h</mml:mi><mml:mi>L</mml:mi><mml:mi>o</mml:mi><mml:mi>g</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi><mml:mi>s</mml:mi><mml:mi>L</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:mi>s</mml:mi></mml:math></inline-formula> is used as the LOSS function of the model training, which combines the <inline-formula id="ieqn-40"><mml:math id="mml-ieqn-40"><mml:mi>S</mml:mi><mml:mi>i</mml:mi><mml:mi>g</mml:mi><mml:mi>m</mml:mi><mml:mi>o</mml:mi><mml:mi>i</mml:mi><mml:mi>d</mml:mi></mml:math></inline-formula> layer and <inline-formula id="ieqn-41"><mml:math id="mml-ieqn-41"><mml:mi>B</mml:mi><mml:mi>C</mml:mi><mml:mi>E</mml:mi><mml:mi>L</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:mi>s</mml:mi></mml:math></inline-formula>, It is widely used in classification tasks, and has excellent performance in processing imbalanced samples, equations such as <xref ref-type="disp-formula" rid="eqn-3">Eqs. (3)</xref> and <xref ref-type="disp-formula" rid="eqn-4">(4)</xref>.
<disp-formula id="eqn-3"><label>(3)</label><mml:math id="mml-eqn-3" display="block"><mml:mtable columnalign="right left right left right left right left right left right left" rowspacing="3pt" columnspacing="0em 2em 0em 2em 0em 2em 0em 2em 0em 2em 0em" displaystyle="true"><mml:mtr><mml:mtd /><mml:mtd><mml:mi>L</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:msub><mml:mi>s</mml:mi><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo fence="false" stretchy="false">{</mml:mo><mml:mrow><mml:mi>L</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:msub><mml:mi>s</mml:mi><mml:mrow><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mi>c</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:mi>L</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:msub><mml:mi>s</mml:mi><mml:mrow><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mi>c</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:mrow><mml:mi>L</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:msub><mml:mi>s</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>c</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:msup><mml:mo fence="false" stretchy="false">}</mml:mo><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="eqn-4"><label>(4)</label><mml:math id="mml-eqn-4" display="block"><mml:mtable columnalign="right left right left right left right left right left right left" rowspacing="3pt" columnspacing="0em 2em 0em 2em 0em 2em 0em 2em 0em 2em 0em" displaystyle="true"><mml:mtr><mml:mtd /><mml:mtd><mml:mi>L</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:msub><mml:mi>s</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>c</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mi>W</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>c</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">[</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>c</mml:mi></mml:mrow></mml:msub><mml:mo>&#x22C5;</mml:mo><mml:mi>log</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mi>l</mml:mi><mml:mi>o</mml:mi><mml:mi>g</mml:mi><mml:mi>i</mml:mi><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>c</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:mn>1</mml:mn><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>c</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo>&#x22C5;</mml:mo><mml:mi>log</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mn>1</mml:mn><mml:mo>&#x2212;</mml:mo><mml:mi>l</mml:mi><mml:mi>o</mml:mi><mml:mi>g</mml:mi><mml:mi>i</mml:mi><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>c</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo stretchy="false">]</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>where <inline-formula id="ieqn-42"><mml:math id="mml-ieqn-42"><mml:mi>c</mml:mi></mml:math></inline-formula> is the class number, <inline-formula id="ieqn-43"><mml:math id="mml-ieqn-43"><mml:mi>n</mml:mi></mml:math></inline-formula> is the number of the sample in the batch and <inline-formula id="ieqn-44"><mml:math id="mml-ieqn-44"><mml:msub><mml:mi>p</mml:mi><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is the weight of the positive answer for the class <inline-formula id="ieqn-45"><mml:math id="mml-ieqn-45"><mml:mi>c</mml:mi></mml:math></inline-formula>. <inline-formula id="ieqn-46"><mml:math id="mml-ieqn-46"><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>c</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is the true value, <inline-formula id="ieqn-47"><mml:math id="mml-ieqn-47"><mml:mi>l</mml:mi><mml:mi>o</mml:mi><mml:mi>g</mml:mi><mml:mi>i</mml:mi><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>c</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is the predicted value, and <inline-formula id="ieqn-48"><mml:math id="mml-ieqn-48"><mml:msub><mml:mi>W</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>c</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is the loss weight.</p>
</sec>
<sec id="s3_6">
<label>3.6</label>
<title>&#x201C;Queue&#x201D; Scheme and RF Mechanism</title>
<p>RF mechanism aims to alleviate the phenomenon of entity nesting and relation overlap. The essence of the entity relation extraction task is to strengthen the interaction between entities and relations by leveraging the strong correlations within triplets. This approach aims to enhance the performance of subtask models and improve the accuracy of cascading processes. Inspired by the semantic core of predicates in Chinese [<xref ref-type="bibr" rid="ref-29">29</xref>], this paper proposes the RF using a &#x201C;Queue&#x201D; input to model the interaction between entity and relation. Specifically, as shown in the &#x201C;Filling and Queue&#x201D; section of <xref ref-type="fig" rid="fig-1">Fig. 1</xref>, for the input sequence <inline-formula id="ieqn-49"><mml:math id="mml-ieqn-49"><mml:mi>S</mml:mi></mml:math></inline-formula> and the relation category set <inline-formula id="ieqn-50"><mml:math id="mml-ieqn-50"><mml:mi>R</mml:mi><mml:mo>=</mml:mo><mml:mo fence="false" stretchy="false">{</mml:mo><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo fence="false" stretchy="false">}</mml:mo></mml:math></inline-formula> output by the relation extraction model, each relation <inline-formula id="ieqn-51"><mml:math id="mml-ieqn-51"><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is filled into the beginning of each clause in <inline-formula id="ieqn-52"><mml:math id="mml-ieqn-52"><mml:mi>S</mml:mi></mml:math></inline-formula> (clauses are sub clauses separated by commas, semicolons, and periods), forming <inline-formula id="ieqn-53"><mml:math id="mml-ieqn-53"><mml:mi>n</mml:mi></mml:math></inline-formula> sequences <inline-formula id="ieqn-54"><mml:math id="mml-ieqn-54"><mml:mi>S</mml:mi><mml:mi>R</mml:mi><mml:mo>=</mml:mo><mml:mo fence="false" stretchy="false">{</mml:mo><mml:mi>S</mml:mi><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi>S</mml:mi><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:mi>S</mml:mi><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo fence="false" stretchy="false">}</mml:mo></mml:math></inline-formula> with enhanced relation semantics, where <inline-formula id="ieqn-55"><mml:math id="mml-ieqn-55"><mml:mi>S</mml:mi><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo fence="false" stretchy="false">{</mml:mo><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi>W</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi>W</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi>W</mml:mi><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo fence="false" stretchy="false">}</mml:mo></mml:math></inline-formula>, which aims to use relations to guide the entity recognition model to focus on the corresponding entity categories. Therefore, based on relational semantics and entity category semantics, the impact of nested entities on model performance can be mitigated. Formally, the sequence representation formula for relational semantic enhancement is in <xref ref-type="disp-formula" rid="eqn-5">Eq. (5)</xref>.
<disp-formula id="eqn-5"><label>(5)</label><mml:math id="mml-eqn-5" display="block"><mml:mi>S</mml:mi><mml:mi>R</mml:mi><mml:mo>=</mml:mo><mml:mo fence="false" stretchy="false">{</mml:mo><mml:mi>R</mml:mi><mml:mi>F</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>S</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2208;</mml:mo><mml:mi>R</mml:mi><mml:mo fence="false" stretchy="false">}</mml:mo><mml:mo>,</mml:mo></mml:math></disp-formula></p>

<p>In addition, to address the issue of relation overlap in entity relation extraction tasks, this paper inputs the <inline-formula id="ieqn-56"><mml:math id="mml-ieqn-56"><mml:mi>n</mml:mi></mml:math></inline-formula> semantically enhanced sequences constructed above into the subsequent entity recognition model one by one. Since each input sequence <inline-formula id="ieqn-57"><mml:math id="mml-ieqn-57"><mml:mi>S</mml:mi><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> highlights the semantics of only a single relation, the entity recognition model can generate an independent semantic understanding of the same initial sequence <inline-formula id="ieqn-58"><mml:math id="mml-ieqn-58"><mml:mi>S</mml:mi></mml:math></inline-formula>, thereby avoiding the impact of relation overlap on model performance while maintaining flexible execution of subtasks.</p>
</sec>
<sec id="s3_7">
<label>3.7</label>
<title>Entity Model</title>
<p>The entity model aims to identify entities related to specific relations by analyzing the contextual connections between various vocabulary and phrases. For the aforementioned relational semantic-enhanced text sequence <inline-formula id="ieqn-59"><mml:math id="mml-ieqn-59"><mml:mi>S</mml:mi><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> with &#x201C;Queue&#x201D; input, the goal of the entity recognition model is to extract the entity pairs that have a semantic cascade relation with the relational <inline-formula id="ieqn-60"><mml:math id="mml-ieqn-60"><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>. As shown in the &#x201C;Entity Model&#x201D; section of <xref ref-type="fig" rid="fig-1">Fig. 1</xref>, the network architecture of &#x201C;BERT&#x002B;Bi-LSTM&#x002B;CRF&#x201D; [<xref ref-type="bibr" rid="ref-30">30</xref>,<xref ref-type="bibr" rid="ref-31">31</xref>] is used to perform the entity recognition task, and the specific execution process of the model is as follows:</p>

<p><bold>Input layer.</bold> For each input sequence <inline-formula id="ieqn-61"><mml:math id="mml-ieqn-61"><mml:mi>S</mml:mi><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo fence="false" stretchy="false">{</mml:mo><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>W</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>W</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo fence="false" stretchy="false">}</mml:mo></mml:math></inline-formula>, the complete annotation sequence is <inline-formula id="ieqn-62"><mml:math id="mml-ieqn-62"><mml:mi>y</mml:mi><mml:mo>=</mml:mo><mml:mo fence="false" stretchy="false">{</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo fence="false" stretchy="false">}</mml:mo></mml:math></inline-formula>, <inline-formula id="ieqn-63"><mml:math id="mml-ieqn-63"><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2208;</mml:mo><mml:mo fence="false" stretchy="false">{</mml:mo><mml:mi>B</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mi>t</mml:mi><mml:mi>y</mml:mi><mml:mi>p</mml:mi><mml:mi>e</mml:mi><mml:mo>,</mml:mo><mml:mi>I</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mi>t</mml:mi><mml:mi>y</mml:mi><mml:mi>p</mml:mi><mml:mi>e</mml:mi><mml:mo>,</mml:mo><mml:mi>O</mml:mi><mml:mo>,</mml:mo><mml:mi>X</mml:mi><mml:mo fence="false" stretchy="false">}</mml:mo></mml:math></inline-formula>, where <inline-formula id="ieqn-64"><mml:math id="mml-ieqn-64"><mml:mo fence="false" stretchy="false">{</mml:mo><mml:mi>B</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mi>t</mml:mi><mml:mi>y</mml:mi><mml:mi>p</mml:mi><mml:mi>e</mml:mi><mml:mo>,</mml:mo><mml:mi>I</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mi>t</mml:mi><mml:mi>y</mml:mi><mml:mi>p</mml:mi><mml:mi>e</mml:mi><mml:mo>,</mml:mo><mml:mi>O</mml:mi><mml:mo fence="false" stretchy="false">}</mml:mo></mml:math></inline-formula> represents the entity beginning, the entity interior and the entity exterior of the current <inline-formula id="ieqn-65"><mml:math id="mml-ieqn-65"><mml:msub><mml:mi>W</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>. In order to eliminate the influence of relation words on label sequence, <italic>X</italic> is assigned to <inline-formula id="ieqn-66"><mml:math id="mml-ieqn-66"><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> as a special entity label. Similarly, to avoid the influence of <inline-formula id="ieqn-67"><mml:math id="mml-ieqn-67"><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> on the POS tagging sequence and the inherent <inline-formula id="ieqn-68"><mml:math id="mml-ieqn-68"><mml:mi>i</mml:mi><mml:mi>n</mml:mi><mml:mi>p</mml:mi><mml:mi>u</mml:mi><mml:mi>t</mml:mi><mml:mi mathvariant="normal">&#x005F;</mml:mi><mml:mi>i</mml:mi><mml:mi>d</mml:mi><mml:mi>s</mml:mi></mml:math></inline-formula> sequence in BERT, this paper performs the following relational word mapping operations.</p>
<p>Input_ids sequence: <inline-formula id="ieqn-69"><mml:math id="mml-ieqn-69"><mml:mi>i</mml:mi><mml:mi>n</mml:mi><mml:mi>p</mml:mi><mml:mi>u</mml:mi><mml:mi>t</mml:mi><mml:mi mathvariant="normal">&#x005F;</mml:mi><mml:mi>i</mml:mi><mml:mi>d</mml:mi><mml:mi>s</mml:mi></mml:math></inline-formula> is the encoding mapping strategy used in the BERT model to convert tokens into corresponding ids in the vocabulary verbs. Its value range is <inline-formula id="ieqn-70"><mml:math id="mml-ieqn-70"><mml:mi>i</mml:mi><mml:mi>n</mml:mi><mml:mi>p</mml:mi><mml:mi>u</mml:mi><mml:mi>t</mml:mi><mml:mi mathvariant="normal">&#x005F;</mml:mi><mml:mi>i</mml:mi><mml:mi>d</mml:mi><mml:mi>s</mml:mi><mml:mo>&#x2208;</mml:mo><mml:mo stretchy="false">[</mml:mo><mml:mn>100</mml:mn><mml:mo>,</mml:mo><mml:mi>l</mml:mi><mml:mi>e</mml:mi><mml:mi>n</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>v</mml:mi><mml:mi>e</mml:mi><mml:mi>r</mml:mi><mml:mi>b</mml:mi><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">]</mml:mo></mml:math></inline-formula>, where <inline-formula id="ieqn-71"><mml:math id="mml-ieqn-71"><mml:mi>l</mml:mi><mml:mi>e</mml:mi><mml:mi>n</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>v</mml:mi><mml:mi>e</mml:mi><mml:mi>r</mml:mi><mml:mi>b</mml:mi><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> represents the total number of words in the BERT vocabulary. To avoid inserting the relational word <inline-formula id="ieqn-72"><mml:math id="mml-ieqn-72"><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> in verbs or affecting the encoding mapping of the original input sequence, This paper does not perform character-level tokenization operations on <inline-formula id="ieqn-73"><mml:math id="mml-ieqn-73"><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>. Instead, it is treated as a complete word and assigned a special <inline-formula id="ieqn-74"><mml:math id="mml-ieqn-74"><mml:mi>i</mml:mi><mml:mi>n</mml:mi><mml:mi>p</mml:mi><mml:mi>u</mml:mi><mml:mi>t</mml:mi><mml:mi mathvariant="normal">&#x005F;</mml:mi><mml:mi>i</mml:mi><mml:mi>d</mml:mi><mml:mi>s</mml:mi></mml:math></inline-formula> encoding mapping, with a value range of <inline-formula id="ieqn-75"><mml:math id="mml-ieqn-75"><mml:mi>i</mml:mi><mml:mi>n</mml:mi><mml:mi>p</mml:mi><mml:mi>u</mml:mi><mml:mi>t</mml:mi><mml:mi mathvariant="normal">&#x005F;</mml:mi><mml:mi>i</mml:mi><mml:mi>d</mml:mi><mml:mi>s</mml:mi><mml:mo>&#x2208;</mml:mo><mml:mo stretchy="false">[</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mi>n</mml:mi><mml:mo stretchy="false">]</mml:mo></mml:math></inline-formula>, and <inline-formula id="ieqn-76"><mml:math id="mml-ieqn-76"><mml:mi>n</mml:mi></mml:math></inline-formula> being the total number of relation categories in the pre-set schema. It is worth noting that the above method is only applicable when <inline-formula id="ieqn-77"><mml:math id="mml-ieqn-77"><mml:mi>n</mml:mi><mml:mo>&#x003C;</mml:mo><mml:mn>100</mml:mn></mml:math></inline-formula>.</p>
<p>POS tagging: For the token <inline-formula id="ieqn-78"><mml:math id="mml-ieqn-78"><mml:msub><mml:mi>W</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> in the initial text sequence <inline-formula id="ieqn-79"><mml:math id="mml-ieqn-79"><mml:mi>S</mml:mi></mml:math></inline-formula>, its POS label adheres to the CN-POS proposed in this paper. While for the relational word <inline-formula id="ieqn-80"><mml:math id="mml-ieqn-80"><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, it is given a special POS label in this paper.</p>
<p><bold>Context embedding layer.</bold> Through operations performed at the input layer, the model eliminates the influence of the additional inserted relational words on each feature sequence. At the context encoding layer, the entity recognition model obtains the word vector encoding <inline-formula id="ieqn-81"><mml:math id="mml-ieqn-81"><mml:mi>S</mml:mi><mml:msup><mml:mi>R</mml:mi><mml:mrow><mml:mi>e</mml:mi><mml:mi>m</mml:mi><mml:mi>b</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> and the POS encoding <inline-formula id="ieqn-82"><mml:math id="mml-ieqn-82"><mml:mi>P</mml:mi><mml:mi>O</mml:mi><mml:msup><mml:mi>S</mml:mi><mml:mrow><mml:mi>e</mml:mi><mml:mi>m</mml:mi><mml:mi>b</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> corresponding to the input sequences through the input feature encoder. The two are concatenated to obtain the hidden layer vector representation, which is then used to derive the final hidden layer state <inline-formula id="ieqn-83"><mml:math id="mml-ieqn-83"><mml:mi>H</mml:mi><mml:mo>=</mml:mo><mml:mo fence="false" stretchy="false">{</mml:mo><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo fence="false" stretchy="false">}</mml:mo></mml:math></inline-formula> is obtained by Bi-LSTM.</p>
<p><bold>Output layer.</bold> The entity determination for the named entity recognition task is based on the prediction results of the entity label sequence corresponding to the token sequence. To enhance the temporal dependency of entity label sequences and avoid model outputs with illogical label orders (e.g., <inline-formula id="ieqn-84"><mml:math id="mml-ieqn-84"><mml:mo fence="false" stretchy="false">{</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mi>O</mml:mi><mml:mi>I</mml:mi><mml:mo>&#x2026;</mml:mo><mml:mo fence="false" stretchy="false">}</mml:mo></mml:math></inline-formula>), this paper uses CRF to decode the final hidden layer state vector <inline-formula id="ieqn-85"><mml:math id="mml-ieqn-85"><mml:mi>H</mml:mi></mml:math></inline-formula> output by the Bi-LSTM network. The training loss function is calculated as in <xref ref-type="disp-formula" rid="eqn-6">Eq. (6)</xref>.
<disp-formula id="eqn-6"><label>(6)</label><mml:math id="mml-eqn-6" display="block"><mml:mi>L</mml:mi><mml:mi>O</mml:mi><mml:mi>S</mml:mi><mml:mi>S</mml:mi><mml:mo>=</mml:mo><mml:msubsup><mml:mo movablelimits="false">&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msubsup><mml:mi>l</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>C</mml:mi><mml:mi>R</mml:mi><mml:mi>F</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo></mml:math></disp-formula>where <inline-formula id="ieqn-86"><mml:math id="mml-ieqn-86"><mml:mi>l</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>C</mml:mi><mml:mi>R</mml:mi><mml:mi>F</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> is the negative <inline-formula id="ieqn-87"><mml:math id="mml-ieqn-87"><mml:mi>l</mml:mi><mml:mi>o</mml:mi><mml:mi>g</mml:mi><mml:mi mathvariant="normal">&#x005F;</mml:mi><mml:mi>l</mml:mi><mml:mi>i</mml:mi><mml:mi>k</mml:mi><mml:mi>e</mml:mi><mml:mi>h</mml:mi><mml:mi>o</mml:mi><mml:mi>o</mml:mi><mml:mi>d</mml:mi></mml:math></inline-formula> of the comparison between the predicted value <inline-formula id="ieqn-88"><mml:math id="mml-ieqn-88"><mml:mi>C</mml:mi><mml:mi>R</mml:mi><mml:mi>F</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> of the entity model and the true label <inline-formula id="ieqn-89"><mml:math id="mml-ieqn-89"><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> of the entity label.</p>
</sec>
</sec>
<sec id="s4">
<label>4</label>
<title>Experiment</title>
<p><bold>Dataset.</bold> This paper verifies the proposed SE-RE model on the self-constructed EPI data set, which is oriented to complex samples with high-density knowledge in the e-commerce domain. The dataset is divided into training, development, and testing sets using random sampling while ensuring that the sample distributions accurately represent the characteristics of the original dataset. This approach maintains consistency in data patterns across all subsets, enabling robust model training and evaluation (See <xref ref-type="sec" rid="s2">Section 2</xref> for a detailed introduction of the dataset). <xref ref-type="table" rid="table-2">Table 2</xref> presents the sample distribution for the relation model, illustrating the frequency and proportions of various relation types within the dataset. Similarly, <xref ref-type="table" rid="table-3">Table 3</xref> details the sample distribution for the entity model (i.e., the single relation sample obtained by the RF mechanism), showcasing the distribution of different entity categories. These tables provide a comprehensive overview of the dataset&#x2019;s composition for both models, ensuring transparency in the data preparation process. The overall sample distribution of each relation can be observed in <xref ref-type="table" rid="table-4">Table 4</xref>.</p>
<table-wrap id="table-2">
<label>Table 2</label>
<caption>
<title>The sample distribution of the relation model and the average length of each sample</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th></th>
<th>Train</th>
<th>Dev</th>
<th>Test</th>
</tr>
</thead>
<tbody>
<tr>
<td>Total number</td>
<td>3668</td>
<td>814</td>
<td>1016</td>
</tr>
<tr>
<td>Average length</td>
<td>29.47</td>
<td>29.10</td>
<td>29.10</td>
</tr>
</tbody>
</table>
</table-wrap><table-wrap id="table-3">
<label>Table 3</label>
<caption>
<title>The sample distribution of the entity model and the total amount of knowledge triples</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th></th>
<th>Train</th>
<th>Dev</th>
<th>Test</th>
</tr>
</thead>
<tbody>
<tr>
<td>Total number</td>
<td>9017</td>
<td>1914</td>
<td>2385</td>
</tr>
<tr>
<td>RDF num</td>
<td>12,277</td>
<td>2639</td>
<td>3273</td>
</tr>
</tbody>
</table>
</table-wrap><table-wrap id="table-4">
<label>Table 4</label>
<caption>
<title>Sample distribution of each relation, corresponding entity label, and F1-score of relation classification</title>
</caption>
<table>
<colgroup>
<col align="center"/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Relation</th>
<th>Total number</th>
<th>Entity_label</th>
<th>F1-score (%)</th>
</tr>
</thead>
<tbody>
<tr>
<td>&#x7C7B;&#x522B; (Category)</td>
<td>973</td>
<td>CAT</td>
<td>98.53</td>
</tr>
<tr>
<td>&#x522B;&#x540D; (Nickname)</td>
<td>1056</td>
<td>PNN</td>
<td>99.73</td>
</tr>
<tr>
<td>&#x529F;&#x80FD; (Application)</td>
<td>1204</td>
<td>APP</td>
<td>97.87</td>
</tr>
<tr>
<td>&#x53D1;&#x8D27;&#x5730;&#x70B9; (Location)</td>
<td>778</td>
<td>LOC</td>
<td>99.63</td>
</tr>
<tr>
<td>&#x54C1;&#x724C; (Brand)</td>
<td>1541</td>
<td>BRD</td>
<td>99.45</td>
</tr>
<tr>
<td>&#x4EF7;&#x683C; (Price)</td>
<td>1475</td>
<td>PRC</td>
<td>99.63</td>
</tr>
<tr>
<td>&#x552E;&#x540E; (After-sales Service)</td>
<td>60</td>
<td>AFS</td>
<td>93.31</td>
</tr>
<tr>
<td>&#x9500;&#x552E;&#x5E73;&#x53F0; (Platform)</td>
<td>313</td>
<td>PLT</td>
<td>96.36</td>
</tr>
<tr>
<td>&#x9002;&#x7528;&#x573A;&#x666F; (Usage)</td>
<td>174</td>
<td>APP</td>
<td>98.41</td>
</tr>
<tr>
<td>&#x6750;&#x6599; (Material)</td>
<td>836</td>
<td>MAT</td>
<td>100</td>
</tr>
<tr>
<td>&#x914D;&#x4EF6; (Accessory)</td>
<td>1108</td>
<td>ACC</td>
<td>98.58</td>
</tr>
<tr>
<td>&#x89C4;&#x683C; (Specification)</td>
<td>1553</td>
<td>SPEC</td>
<td>100</td>
</tr>
<tr>
<td>&#x8BC4;&#x4EF7; (Review)</td>
<td>1155</td>
<td>REV</td>
<td>98.63</td>
</tr>
<tr>
<td>&#x578B;&#x53F7; (Model)</td>
<td>1097</td>
<td>MDL</td>
<td>99.75</td>
</tr>
</tbody>
</table>
</table-wrap>
<p><bold>Parameter setting.</bold> During the training phase, the model employs an Adam optimizer with a linear scheduler for parameter adjustment, incorporating a warmup ratio of 0.1. The training process spans 10 epochs with a dropout rate of 0.1 (keep ratio of 0.9) and a batch size of 32. The learning rate is set at 5e &#x2212; 5, the maximum sequence length is limited to 128, and the hidden layer dimension of the Bi-LSTM network is 256. Notably, the output of the relation extraction model plays a pivotal role in determining the overall performance of the SE-RE model. Therefore, in order to improve the performance of the relation extraction model as much as possible, this paper incorporates an F1-score threshold into the training of each epoch. Finally, when the threshold is 0.45, the F1-score reaches the optimal value.</p>
<p><bold>Baselines.</bold> This article conducts comparative experiments between the SE-RE model proposed for the entity relation extraction task and the most representative state-of-the-art models on the EPI dataset. The detailed introduction of the baseline models is as follows:</p>
<p>Multi-head [<xref ref-type="bibr" rid="ref-16">16</xref>]: As a standard method for joint entity relation extraction, Multi-head adopts a span-based extraction framework that can simultaneously perform entity recognition and relation extraction without any manual input or additional tools, effectively addressing the situation in which the same entity connects to multiple relations.</p>
<p>Pipeline [<xref ref-type="bibr" rid="ref-15">15</xref>]: The entity recognition stage is carried out before the relation extraction step in this traditional pipelined approach to entity-relation extraction, which splits the operation into two separate subtasks. Special entity boundary labels are then constructed from the output of the entity model to model the dependency relation between entity pairs.</p>
<p>SpERT.PL [<xref ref-type="bibr" rid="ref-32">32</xref>]: This method further enriches the context embedding of pre-trained converters by embedding POS tags. Specifically, The method assigns the POS of the entity to each token that constitutes the entity and incorporates the POS into the Transformer-based pre-trained language model. Additionally, the method introduces a large number of additional features, such as the span range feature within the entity, the overall span feature, the correlation feature between entities, and the correlation feature between entity categories and relations.</p>
<p>SE-RE(PL): To further demonstrate the effectiveness of the proposed CN-POS, this paper replaces the POS feature encoding strategy in the SE-RE model with the direct splicing method used in the SpERT.PL model.</p>
<p>SE-RE(MRC): To further demonstrate the effectiveness of the RF mechanism proposed in this paper, the relation syntax-enhanced strategy in the SE-RE model is replaced with the contextual sentence format used in the BERT-MRC model, namely &#x201C;text_b: original text sequence&#x201D;.</p>
<p>SE-RE(-POS): To further validate the effectiveness of incorporating POS information into the entity relation extraction task and highlight the advancements achieved by the CN-POS enhanced pre-trained encoding model (i.e., the improved version of the traditional BERT-wwm-ext model) proposed in this paper, detailed experiments and evaluations were conducted. We remove all POS information from the SE-RE model for comparative analysis.</p>
<p><bold>Evaluation.</bold> In this paper, we use the standard evaluation metrics for the entity relation extraction task: Precision (P), Recall (R), and F1-score (F1) to evaluate the relation extraction model, the entity recognition model, and the final ternary cascade results. It is worth noting that some baseline models adopt a span-based modeling strategy. Therefore, in the entity recognition task, this paper adopts a strict evaluation principle, which means that the output entity span results need to be completely consistent with the true values in order to be considered accurate prediction results.</p>
</sec>
<sec id="s5">
<label>5</label>
<title>Results and Discussion</title>
<p>In this section, we analyze the experimental results from three key perspectives: the entity-relation extraction framework, the entity-relation model feature interaction strategy, and the ablation experiment.</p>
<sec id="s5_1">
<label>5.1</label>
<title>Entity Relation Extraction Framework</title>
<p><xref ref-type="table" rid="table-5">Table 5</xref> shows the experimental results of the SE-RE model proposed in this paper, compared with other baseline models on the EPI dataset. Among them, Pipeline, SE-RE, and its variants use independent encoders to execute entity and relational models, while Multi-head and SpERT.PL uses a unified encoder with shared parameters. In addition, Multi-head, Pipeline, and SpERT.PL all adopt the span-based entity before the relation extraction framework, while the SE-RE series model adopts the execution order of relation before the entity.</p>
<table-wrap id="table-5">
<label>Table 5</label>
<caption>
<title>Performance comparison of entity-relation extraction models on EPI</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Model</th>
<th colspan="3">Relation model</th>
<th colspan="3">Entity model</th>
<th>RDF</th>
</tr>
<tr>
<th></th>
<th>P (%)</th>
<th>R (%)</th>
<th>F1 (%)</th>
<th>P (%)</th>
<th>R (%)</th>
<th>F1 (%)</th>
<th>F1 (%)</th>
</tr>
</thead>
<tbody>
<tr>
<td>Multi-head [<xref ref-type="bibr" rid="ref-16">16</xref>]</td>
<td>79.16</td>
<td>83.88</td>
<td>81.45</td>
<td>85.47</td>
<td>90.05</td>
<td>87.70</td>
<td>82.15</td>
</tr>
<tr>
<td>Pipeline [<xref ref-type="bibr" rid="ref-15">15</xref>]</td>
<td>82.04</td>
<td>87.39</td>
<td>84.63</td>
<td>89.94</td>
<td>91.70</td>
<td>90.81</td>
<td>86.84</td>
</tr>
<tr>
<td>SpERT.PL [<xref ref-type="bibr" rid="ref-32">32</xref>]</td>
<td>84.88</td>
<td>88.68</td>
<td>86.74</td>
<td>91.39</td>
<td>92.76</td>
<td>92.07</td>
<td>88.41</td>
</tr>
<tr>
<td>SE-RE</td>
<td>88.49</td>
<td>92.73</td>
<td>90.56</td>
<td>91.79</td>
<td>93.59</td>
<td>92.68</td>
<td>90.32</td>
</tr>
<tr>
<td>SE-RE(PL)</td>
<td>87.97</td>
<td>92.27</td>
<td>90.07</td>
<td>90.54</td>
<td>93.36</td>
<td>91.93</td>
<td>89.33</td>
</tr>
<tr>
<td>SE-RE(MRC)</td>
<td>88.49</td>
<td>92.73</td>
<td>90.56</td>
<td>91.44</td>
<td>92.97</td>
<td>92.20</td>
<td>89.76</td>
</tr>
<tr>
<td>SE-RE(-POS)</td>
<td>86.42</td>
<td>91.80</td>
<td>89.03</td>
<td>90.26</td>
<td>92.47</td>
<td>91.35</td>
<td>88.70</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>From the model performance shown in <xref ref-type="table" rid="table-5">Table 5</xref>, it can be observed that: (1) In all subtasks and the triplet extraction task as a whole, the SE-RE model put forward in this study performs the best. The F1-score of the most representative RDF improves by 1.91% compared to that of the SpERT.PL model (The F1-score is 90.32% and 88.41%, respectively), which proves the effectiveness of the proposed method. (2) Models based on the entity before the relation extraction framework (Multi-head, Pipeline, SpERT.PL) rely on the output results of the entity model, and incorrect entity predictions and redundant candidate entity pairs can cause serious error accumulation problems. The performance of the relation models in these frameworks is generally poor (The F1-scores of the relation models are 81.45%, 84.63%, and 86.74%, respectively), which in turn affects the final triplet cascade matching results. (3) The SE-RE proposed in this paper (90.56% for the relation model F1-score) does not affect the label prediction performance of the entity model itself (the entity model F1-score reaches an optimal 92.68%), although the output results of the relation model are still used as part of the entity model&#x2019;s inputs, and it only affects the final triplet cascade results, which further proves that the relation-based semantic core of SE-RE model can effectively alleviate the relation overlap problem through the pipeline-based relation input method.</p>

<p>To further validate the stability and generalization ability of the proposed SE-RE model across different datasets and complex scenarios, we conducted additional comparative experiments on the publicly available FS-QA [<xref ref-type="bibr" rid="ref-33">33</xref>] dataset. FS-QA focuses on the Chinese food safety domain and serves as an authoritative benchmark for information extraction and coreference resolution tasks. As shown in <xref ref-type="table" rid="table-6">Table 6</xref>, the experimental results demonstrate that the SE-RE model significantly outperforms existing baseline methods, achieving the best performance. Notably, the F1-score for the most representative RDF task improves by 1.12% compared to the SpERT.PL model. These findings further confirm the superiority of the SE-RE model in cross-domain applications, highlighting its robustness and strong generalization capability.</p>
<table-wrap id="table-6">
<label>Table 6</label>
<caption>
<title>Performance comparison of entity-relation extraction models on FS-QA</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Model</th>
<th colspan="3">Relation model</th>
<th colspan="3">Entity model</th>
<th>RDF</th>
</tr>
<tr>
<th/>
<th>P (%)</th>
<th>R (%)</th>
<th>F1 (%)</th>
<th>P (%)</th>
<th>R (%)</th>
<th>F1 (%)</th>
<th>F1 (%)</th>
</tr>
</thead>
<tbody>
<tr>
<td>Multi-head [<xref ref-type="bibr" rid="ref-16">16</xref>]</td>
<td>81.42</td>
<td>86.26</td>
<td>83.77</td>
<td>85.98</td>
<td>90.41</td>
<td>88.14</td>
<td>82.58</td>
</tr>
<tr>
<td>Pipeline [<xref ref-type="bibr" rid="ref-15">15</xref>]</td>
<td>83.94</td>
<td>88.65</td>
<td>86.23</td>
<td>90.56</td>
<td>91.65</td>
<td>91.10</td>
<td>87.24</td>
</tr>
<tr>
<td>SpERT.PL [<xref ref-type="bibr" rid="ref-32">32</xref>]</td>
<td>86.43</td>
<td>90.86</td>
<td>88.59</td>
<td>92.07</td>
<td>92.85</td>
<td>92.46</td>
<td>90.02</td>
</tr>
<tr>
<td>SE-RE</td>
<td>89.23</td>
<td>93.62</td>
<td>91.37</td>
<td>92.25</td>
<td>93.93</td>
<td>93.08</td>
<td>91.14</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>In addition, the results in <xref ref-type="table" rid="table-5">Tables 5</xref> and <xref ref-type="table" rid="table-6">6</xref> indicate that the SE-RE model achieves a significant improvement over the baseline methods. Statistical tests confirm the robustness of these results, with <italic>p</italic>-values &#x003C; 0.01 in paired <italic>t</italic>-tests, and 95% confidence intervals consistently placing the SE-RE model&#x2019;s performance above the baseline.</p>

</sec>
<sec id="s5_2">
<label>5.2</label>
<title>Entity-Relation Model Feature Interaction Strategy</title>
<p>By comparing and analyzing the performance of each model in <xref ref-type="table" rid="table-5">Table 5</xref>, it can be concluded that: (1) In the span-based entity before relation extraction framework, the SpERT.PL and Pipeline models outperform the Multi-head (F1-scores of RDF are 88.41%, 86.84%, and 82.15%, respectively), which is mainly due to the fact that the first two models consider the interactive dependencies between entities and model the internal features of entity spans, such as span width. Another reason for the better performance achieved by SpERT.PL is the further incorporation of the POS feature and the overall feature coding of the entity span. (2) The SE-RE model utilizes the CN-POS strategy to also take into account the POS features of the input sequences (F1-score of RDF is 90.32%) and further models the effects of relations on entities ignored by SpERT.PL through the RF mechanism, and utilizes the relations to guide the entity model to output the entity types oriented to a specific relation, which alleviates the entity redundancy problem facing the SpERT.PL model and to a certain extent avoids the interference of nested entities on the entity model. Therefore, in the entity recognition task, the SE-RE model (precision and recall of entity model are 91.79% and 93.59%, respectively) has higher recall while maintaining comparable precision compared to the SpERT.PL model (precision and recall of entity model are 91.39% and 92.76%, respectively). In addition, under the influence of the RF mechanism, the SE-RE model can greatly alleviate the problem of entity-relation cascade errors.</p>

</sec>
<sec id="s5_3">
<label>5.3</label>
<title>Ablation Experiment</title>
<p>The comparative analysis of the performance between the SE-RE model and its variants in <xref ref-type="table" rid="table-5">Table 5</xref> shows that: (1) The performance of the SE-RE model is higher than that of SE-RE(PL) (F1-scores of RDF is 90.32% and 89.33%, respectively), which indicates that the CN-POS strategy proposed in this paper is more effective than the direct splicing of POS features used in the SpERT.PL model. (2) Since SE-RE(-POS) does not consider POS features, it has the lowest performance (F1-score of RDF is 88.70%), which suggests that POS features can effectively contribute to the performance of entity relation extraction models. (3) Since SE-RE (MRC) only changes the form of relation-to-entity interactions, its relational modeling performance remains consistent with that of the SE-RE model (90.56% for both relational model F1-scores). However, with the entity-relation interaction model based on the MRC framework, additional relation features introduce noisy data. Moreover, the interaction distance between post-sequential entities and relations is too long, and cross-sentence dependency modeling reduces the ability of relations to guide the entity model, causing the model&#x2019;s performance to decline (F1-scores of the entity model are 92.20% and 92.68%, respectively, a decrease of 0.48%). (4) The SE-RE(MRC) outperforms SE-RE(PL) (F1-scores of RDF are 89.76% and 89.33%, respectively), which suggests that the CN-POS strategy is able to mitigate the effect of noise due to relation features to some extent.</p>

</sec>
<sec id="s5_4">
<label>5.4</label>
<title>Error Analysis</title>
<p>We also provide a failure case (as shown in <xref ref-type="fig" rid="fig-4">Fig. 4</xref>). In this example, &#x201C;Apple iPhone 13 Pro Max 512 GB &#x94F6;&#x8272;&#x7248;&#x672C;&#x201D; is a complete and specific entity, while &#x201C;iPhone 13 &#x7CFB;&#x5217;&#x201D; is a generalized entity that encompasses multiple models. Although the SE-RE model, under the RF mechanism, can leverage the &#x201C;&#x7C7B;&#x522B;&#x201D; relation to model long-distance cross-sentence entity dependencies, the model may prioritize &#x201C;iPhone 13&#x7CFB;&#x5217;&#x201D; as the entity matching &#x201C;&#x65D7;&#x8230;&#x624B;&#x673A;.&#x201D; This prioritization overlooks the more precise boundary of &#x201C;Apple iPhone 13 Pro Max 512 GB &#x94F6;&#x8272;&#x7248;&#x672C;,&#x201D; leading to errors in triplet extraction. This issue highlights the current limitations of the model in handling complex nested entities.</p>
<fig id="fig-4">
<label>Figure 4</label>
<caption>
<title>A failure case</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_60517-fig-4.tif"/>
</fig>
<p>To address this problem, future improvements should focus on enhancing the capacity of the model to discern entity boundaries at a finer granularity. Approaches such as multi-task learning or hierarchical representation methods could be adopted to balance relation guidance with precise entity semantic matching. Additionally, exploring the incorporation of an entity granularity control mechanism to prioritize entities at different granularities from a semantic perspective could further improve recognition accuracy.</p>
</sec>
</sec>
<sec id="s6">
<label>6</label>
<title>Conclusion</title>
<p>This paper proposes SE-RE to address entity nesting and relation overlap in complex semantic texts. The aim is to extract structured entity relation triplets from text descriptions in the unstructured e-commerce product information domain. Taking inspiration from previous research that used POS information to enhance model performance, this paper constructs CN-POS and introduces it into the Chinese pre-trained BERT model, the problem of POS noise and insufficient feature training introduced by the traditional feature splicing approach is effectively alleviated. Inspired by the semantic core of predicates in Chinese, this paper adopts a model framework of relation before entity, effectively alleviating the problem of error accumulation. On this basis, the use of the &#x201C;Queue&#x201D; scheme and RF mechanism enhances the interaction between relation and entity, greatly improving the cascading errors caused by relation overlap. Additionally, this strategy can guide the entity model to obtain more accurate boundaries of nested entities.</p>
</sec>
</body>
<back>
<ack>
<p>Not applicable.</p>
</ack>
<sec>
<title>Funding Statement</title>
<p>This research was funded by the National Key Technology R&#x0026;D Program of China under Grant No. 2021YFD2100605, the National Natural Science Foundation of China under Grant No. 62433002, the Project of Construction and Support for High-Level Innovative Teams of Beijing Municipal Institutions under Grant No. BPHR20220104, Beijing Scholars Program under Grant No. 099.</p>
</sec>
<sec>
<title>Author Contributions</title>
<p>The authors confirm their contributions to the paper as follows: Study conception and design: Mingwen Bi, Zhenghong Yang; Data collection: Hefei Chen; Analysis and interpretation of results: Mingwen Bi, Hefei Chen; Draft manuscript preparation: Mingwen Bi, Hefei Chen. All authors reviewed the results and approved the final version of the manuscript.</p>
</sec>
<sec sec-type="data-availability">
<title>Availability of Data and Materials</title>
<p>The data that support the findings of this study are available from the corresponding author, Hefei Chen, upon reasonable request.</p>
</sec>
<sec>
<title>Ethics Approval</title>
<p>Not applicable.</p>
</sec>
<sec sec-type="COI-statement">
<title>Conflicts of Interest</title>
<p>The authors declare no conflicts of interest to report regarding the present study.</p>
</sec>
<ref-list content-type="authoryear">
<title>References</title>
<ref id="ref-1"><label>[1]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Takanobu</surname> <given-names>R</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>T</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>J</given-names></string-name>, <string-name><surname>Huang</surname> <given-names>M</given-names></string-name></person-group>. <article-title>A hierarchical framework for relation extraction with reinforcement learning</article-title>. <source>AAAI Conf Artif Intell</source>. <year>2019</year>;<volume>33</volume>(<issue>1</issue>):<fpage>7072</fpage>&#x2013;<lpage>9</lpage>. doi:<pub-id pub-id-type="doi">10.1609/aaai.v33i01.33017072</pub-id>.</mixed-citation></ref>
<ref id="ref-2"><label>[2]</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><surname>Mao</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Chuan</surname> <given-names>T</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>J</given-names></string-name>, <string-name><surname>Su</surname> <given-names>WW</given-names></string-name>, <string-name><surname>He</surname> <given-names>YJ</given-names></string-name>, <string-name><surname>Mei</surname> <given-names>DH</given-names></string-name>, <etal>et al.</etal></person-group> <chapter-title>Knowledge graph-based approach to unstructured multi-source data fusion</chapter-title>. In: <source>Advances in artificial intelligence big data and algorithms</source>. <publisher-name>IOS Press</publisher-name>; <year>2023</year>. p. <fpage>663</fpage>&#x2013;<lpage>9</lpage>.</mixed-citation></ref>
<ref id="ref-3"><label>[3]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Chen</surname> <given-names>X</given-names></string-name>, <string-name><surname>Li</surname> <given-names>L</given-names></string-name>, <string-name><surname>Qiao</surname> <given-names>S</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>N</given-names></string-name>, <string-name><surname>Tan</surname> <given-names>C</given-names></string-name>, <string-name><surname>Jiang</surname> <given-names>Y</given-names></string-name>, <etal>et al.</etal></person-group> <article-title>One model for all domains: collaborative domain-prefix tuning for cross-domain NER</article-title>. <comment>arXiv:2301.10410</comment>. <year>2023</year>.</mixed-citation></ref>
<ref id="ref-4"><label>[4]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Ye</surname> <given-names>R</given-names></string-name>, <string-name><surname>Ge</surname> <given-names>R</given-names></string-name>, <string-name><surname>Fengting</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Chai</surname> <given-names>J</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>S</given-names></string-name></person-group>. <article-title>Leveraging unstructured text data for federated instruction tuning of large language models</article-title>. <comment>arXiv:2409.07136</comment>. <year>2024</year>.</mixed-citation></ref>
<ref id="ref-5"><label>[5]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Nemes</surname> <given-names>L</given-names></string-name>, <string-name><surname>Kiss</surname> <given-names>A</given-names></string-name></person-group>. <article-title>Information extraction and named entity recognition supported social media sentiment analysis during the COVID-19 pandemic</article-title>. <source>Appl Sci</source>. <year>2021</year>;<volume>11</volume>(<issue>22</issue>):<fpage>11017</fpage>. doi:<pub-id pub-id-type="doi">10.3390/app112211017</pub-id>.</mixed-citation></ref>
<ref id="ref-6"><label>[6]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Cui</surname> <given-names>X</given-names></string-name>, <string-name><surname>Song</surname> <given-names>C</given-names></string-name>, <string-name><surname>Li</surname> <given-names>D</given-names></string-name>, <string-name><surname>Qu</surname> <given-names>X</given-names></string-name>, <string-name><surname>Long</surname> <given-names>J</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>Y</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>RoBGP: a Chinese nested biomedical named entity recognition model based on roberta and global pointer</article-title>. <source>Comput Mater Contin</source>. <year>2024</year>;<volume>78</volume>(<issue>3</issue>):<fpage>3603</fpage>&#x2013;<lpage>18</lpage>. doi:<pub-id pub-id-type="doi">10.32604/cmc.2024.047321</pub-id>.</mixed-citation></ref>
<ref id="ref-7"><label>[7]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Dash</surname> <given-names>A</given-names></string-name>, <string-name><surname>Darshana</surname> <given-names>S</given-names></string-name>, <string-name><surname>Yadav</surname> <given-names>DK</given-names></string-name>, <string-name><surname>Gupta</surname> <given-names>V</given-names></string-name></person-group>. <article-title>A clinical named entity recognition model using pretrained word embedding and deep neural networks</article-title>. <source>Decis Anal J</source>. <year>2024</year>;<volume>10</volume>:<fpage>100426</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.dajour.2024.100426</pub-id>.</mixed-citation></ref>
<ref id="ref-8"><label>[8]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Xiang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Zhao</surname> <given-names>X</given-names></string-name>, <string-name><surname>Guo</surname> <given-names>J</given-names></string-name>, <string-name><surname>Shi</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>E</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>X</given-names></string-name></person-group>. <article-title>A U-shaped network-based grid tagging model for Chinese named entity recognition</article-title>. <source>Comput Mater Contin</source>. <year>2024</year>;<volume>79</volume>(<issue>3</issue>):<fpage>4149</fpage>&#x2013;<lpage>67</lpage>. doi:<pub-id pub-id-type="doi">10.32604/cmc.2024.050229</pub-id>.</mixed-citation></ref>
<ref id="ref-9"><label>[9]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Wei</surname> <given-names>C</given-names></string-name>, <string-name><surname>Li</surname> <given-names>J</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Wan</surname> <given-names>S</given-names></string-name>, <string-name><surname>Guo</surname> <given-names>M</given-names></string-name></person-group>. <article-title>Graph convolutional networks embedding textual structure information for relation extraction</article-title>. <source>Comput Mater Contin</source>. <year>2024</year>;<volume>79</volume>(<issue>2</issue>):<fpage>3299</fpage>&#x2013;<lpage>314</lpage>. doi:<pub-id pub-id-type="doi">10.32604/cmc.2024.047811</pub-id>.</mixed-citation></ref>
<ref id="ref-10"><label>[10]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Wang</surname> <given-names>X</given-names></string-name>, <string-name><surname>El-Gohary</surname> <given-names>N</given-names></string-name></person-group>. <article-title>Deep learning-based relation extraction and knowledge graph-based representation of construction safety requirements</article-title>. <source>Autom Constr</source>. <year>2023</year>;<volume>147</volume>:<fpage>104696</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.autcon.2022.104696</pub-id>.</mixed-citation></ref>
<ref id="ref-11"><label>[11]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Li</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Ji</surname> <given-names>H</given-names></string-name></person-group>. <article-title>Incremental joint extraction of entity mentions and relations</article-title>. <source>Proc ACL</source>. <year>2014</year>;<volume>1</volume>:<fpage>402</fpage>&#x2013;<lpage>12</lpage>. doi:<pub-id pub-id-type="doi">10.3115/v1/P14-1</pub-id>.</mixed-citation></ref>
<ref id="ref-12"><label>[12]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Chan</surname> <given-names>YS</given-names></string-name>, <string-name><surname>Roth</surname> <given-names>D</given-names></string-name></person-group>. <article-title>Exploiting syntactico-semantic structures for relation extraction</article-title>. <source>Proc ACL</source>. <year>2011</year>;<volume>1</volume>:<fpage>551</fpage>&#x2013;<lpage>60</lpage>. doi:<pub-id pub-id-type="doi">10.5555/2002472.2002542</pub-id>.</mixed-citation></ref>
<ref id="ref-13"><label>[13]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Dai</surname> <given-names>D</given-names></string-name>, <string-name><surname>Xiao</surname> <given-names>X</given-names></string-name>, <string-name><surname>Lyu</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Dou</surname> <given-names>S</given-names></string-name>, <string-name><surname>She</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>H</given-names></string-name></person-group>. <article-title>Joint extraction of entities and overlapping relations using position-attentive sequence labeling</article-title>. <source>AAAI Conf Artif Intell</source>. <year>2019</year>;<volume>33</volume>:<fpage>6300</fpage>&#x2013;<lpage>8</lpage>. doi:<pub-id pub-id-type="doi">10.1609/aaai.v33i01.3301630</pub-id>.</mixed-citation></ref>
<ref id="ref-14"><label>[14]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Tan</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Zhao</surname> <given-names>X</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>W</given-names></string-name>, <string-name><surname>Xiao</surname> <given-names>W</given-names></string-name></person-group>. <article-title>Jointly extracting multiple triplets with multilayer translation constraints</article-title>. <source>AAAI Conf Artif Intell</source>. <year>2019</year>;<volume>33</volume>(<issue>01</issue>):<fpage>7080</fpage>&#x2013;<lpage>7</lpage>. doi:<pub-id pub-id-type="doi">10.1609/aaai.v33i01.33017080</pub-id>.</mixed-citation></ref>
<ref id="ref-15"><label>[15]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Zhong</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>D</given-names></string-name></person-group>. <article-title>A frustratingly easy approach for joint entity and relation extraction</article-title>. <comment>arXiv:2010.12812</comment>. <year>2020</year>.</mixed-citation></ref>
<ref id="ref-16"><label>[16]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Bekoulis</surname> <given-names>G</given-names></string-name>, <string-name><surname>Deleu</surname> <given-names>J</given-names></string-name>, <string-name><surname>Demeester</surname> <given-names>T</given-names></string-name>, <string-name><surname>Develder</surname> <given-names>C</given-names></string-name></person-group>. <article-title>Joint entity recognition and relation extraction as a multi-head selection problem</article-title>. <source>Expert Syst Appl</source>. <year>2018</year>;<volume>114</volume>:<fpage>34</fpage>&#x2013;<lpage>45</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.eswa.2018.07.032</pub-id>.</mixed-citation></ref>
<ref id="ref-17"><label>[17]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Wei</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Su</surname> <given-names>J</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Tian</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Chang</surname> <given-names>Y</given-names></string-name></person-group>. <article-title>A novel cascade binary tagging framework for relational triple extraction</article-title>. <comment>arXiv:1909.03227</comment>. <year>2019</year>.</mixed-citation></ref>
<ref id="ref-18"><label>[18]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Eberts</surname> <given-names>M</given-names></string-name>, <string-name><surname>Ulges</surname> <given-names>A</given-names></string-name></person-group>. <article-title>Span-based joint entity and relation extraction with transformer pre-training</article-title>. In: ECAI 2020. <publisher-name>IOS Press</publisher-name>; <year>2020</year>. p. <fpage>2006</fpage>&#x2013;<lpage>13</lpage>.</mixed-citation></ref>
<ref id="ref-19"><label>[19]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Sui</surname> <given-names>D</given-names></string-name>, <string-name><surname>Zeng</surname> <given-names>X</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>K</given-names></string-name>, <string-name><surname>Zhao</surname> <given-names>J</given-names></string-name></person-group>. <article-title>Joint entity and relation extraction with set prediction networks</article-title>. <source>IEEE Transacti Neural Networks Learn Systems</source>. <year>2024</year>;<volume>35</volume>(<issue>9</issue>):<fpage>12784</fpage>&#x2013;<lpage>95</lpage>. doi:<pub-id pub-id-type="doi">10.1109/TNNLS.2023.3264735</pub-id>.</mixed-citation></ref>
<ref id="ref-20"><label>[20]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Zeng</surname> <given-names>X</given-names></string-name>, <string-name><surname>Zeng</surname> <given-names>D</given-names></string-name>, <string-name><surname>He</surname> <given-names>S</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>K</given-names></string-name>, <string-name><surname>Zhao</surname> <given-names>J</given-names></string-name></person-group>. <article-title>Extracting relational facts by an end-to-end neural model with copy mechanism</article-title>. In: <conf-name>Proceedings of the Anterior Cruciate Ligament</conf-name>; <year>2018</year>; <publisher-loc>Melbourne, VIC, Australia</publisher-loc>. p. <fpage>506</fpage>&#x2013;<lpage>14</lpage>. doi:<pub-id pub-id-type="doi">10.18653/v1/P18-1047</pub-id>.</mixed-citation></ref>
<ref id="ref-21"><label>[21]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Li</surname> <given-names>X</given-names></string-name>, <string-name><surname>Yin</surname> <given-names>F</given-names></string-name>, <string-name><surname>Sun</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Li</surname> <given-names>X</given-names></string-name>, <string-name><surname>Yuan</surname> <given-names>A</given-names></string-name>, <string-name><surname>Chai</surname> <given-names>D</given-names></string-name></person-group> <article-title>Entity relation extraction as multi-turn question answering</article-title>. <comment>arXiv:1905.05529</comment>. <year>2019</year>.</mixed-citation></ref>
<ref id="ref-22"><label>[22]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Wang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Yu</surname> <given-names>B</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>T</given-names></string-name>, <string-name><surname>Zhu</surname> <given-names>H</given-names></string-name>, <string-name><surname>Sun</surname> <given-names>L</given-names></string-name></person-group>. <article-title>TPLinker: single-stage joint extraction of entities and relations through token pair linking</article-title>. <comment>arXiv:2010.13415</comment>. <year>2020</year>.</mixed-citation></ref>
<ref id="ref-23"><label>[23]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Yuan</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>X</given-names></string-name>, <string-name><surname>Pan</surname> <given-names>S</given-names></string-name>, <string-name><surname>Zhu</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Song</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Guo</surname> <given-names>L</given-names></string-name></person-group>. <article-title>A relation-specific attention network for joint entity and relation extraction</article-title>. In: <conf-name>International Joint Conference on Artificial Intelligence</conf-name>; <year>2021</year>.</mixed-citation></ref>
<ref id="ref-24"><label>[24]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Zheng</surname> <given-names>S</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>F</given-names></string-name>, <string-name><surname>Bao</surname> <given-names>H</given-names></string-name>, <string-name><surname>Hao</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>P</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>B</given-names></string-name></person-group>. <article-title>Joint extraction of entities and relations based on a novel tagging scheme</article-title>. <comment>arXiv:1706.05075</comment>. <year>2017</year>.</mixed-citation></ref>
<ref id="ref-25"><label>[25]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Devlin</surname> <given-names>J</given-names></string-name>, <string-name><surname>Chang</surname> <given-names>MW</given-names></string-name>, <string-name><surname>Lee</surname> <given-names>K</given-names></string-name>, <string-name><surname>Toutanova</surname> <given-names>K</given-names></string-name></person-group>. <article-title>BERT: pre-training of deep bidirectional transformers for language understanding</article-title>. <comment>arXiv:1810.04805</comment>. <year>2018</year>.</mixed-citation></ref>
<ref id="ref-26"><label>[26]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Cui</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Che</surname> <given-names>W</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>T</given-names></string-name>, <string-name><surname>Qin</surname> <given-names>B</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>Z</given-names></string-name></person-group>. <article-title>Pre-training with whole word masking for chinese BERT</article-title>. <source>IEEE/ACM Trans Audio Speech Lang Process</source>. <year>2021</year>;<volume>29</volume>:<fpage>3504</fpage>&#x2013;<lpage>14</lpage>. doi:<pub-id pub-id-type="doi">10.1109/TASLP.2021.3124365</pub-id>.</mixed-citation></ref>
<ref id="ref-27"><label>[27]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Tang</surname> <given-names>X</given-names></string-name>, <string-name><surname>Feng</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Xiao</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>M</given-names></string-name>, <string-name><surname>Ye</surname> <given-names>T</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>Y</given-names></string-name>, <etal>et al.</etal></person-group> <article-title>Construction and application of an ontology-based domain-specific knowledge graph for petroleum exploration and development</article-title>. <source>Geosci Front</source>. <year>2023</year>;<volume>14</volume>(<issue>5</issue>):<fpage>101426</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.gsf.2022.101426</pub-id>.</mixed-citation></ref>
<ref id="ref-28"><label>[28]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Phan</surname> <given-names>MH</given-names></string-name>, <string-name><surname>Ogunbona</surname> <given-names>PO</given-names></string-name></person-group>. <article-title>Modelling context and syntactical features for aspect-based sentiment analysis</article-title>. In: <conf-name>Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics</conf-name>; <year>2020</year>. p. <fpage>3211</fpage>&#x2013;<lpage>20</lpage>. doi:<pub-id pub-id-type="doi">10.18653/v1/2020.acl-main.293</pub-id>.</mixed-citation></ref>
<ref id="ref-29"><label>[29]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Wang</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Jiang</surname> <given-names>T</given-names></string-name>, <string-name><surname>Chang</surname> <given-names>B</given-names></string-name>, <string-name><surname>Sui</surname> <given-names>Z</given-names></string-name></person-group>. <article-title>Chinese semantic role labeling with bidirectional recurrent neural networks</article-title>. In: <conf-name>Proceedings of the Conference Empirical Methods in Natural Language Processing (EMNLP)</conf-name>; <year>2015</year>; <publisher-loc>Lisbon, Portugal</publisher-loc>. p. <fpage>1626</fpage>&#x2013;<lpage>31</lpage>. doi:<pub-id pub-id-type="doi">10.18653/v1/D15-1186</pub-id>.</mixed-citation></ref>
<ref id="ref-30"><label>[30]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Catelli</surname> <given-names>R</given-names></string-name>, <string-name><surname>Casola</surname> <given-names>V</given-names></string-name>, <string-name><surname>Pietro</surname> <given-names>GDe</given-names></string-name>, <string-name><surname>Fujita</surname> <given-names>H</given-names></string-name>, <string-name><surname>Esposito</surname> <given-names>M</given-names></string-name></person-group>. <article-title>Combining contextualized word representation and sub-document level analysis through Bi-LSTM&#x002B;CRF architecture for clinical de-identification</article-title>. <source>Knowl-Based Syst</source>. <year>2021</year>;<volume>213</volume>:<fpage>106649</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.knosys.2020.106649</pub-id>.</mixed-citation></ref>
<ref id="ref-31"><label>[31]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Lample</surname> <given-names>G</given-names></string-name>, <string-name><surname>Ballesteros</surname> <given-names>M</given-names></string-name>, <string-name><surname>Subramanian</surname> <given-names>S</given-names></string-name>, <string-name><surname>Kawakami</surname> <given-names>K</given-names></string-name>, <string-name><surname>Dyer</surname> <given-names>C</given-names></string-name></person-group>. <article-title>Neural architectures for named entity recognition</article-title>. <comment>arXiv:1603.01360</comment>. <year>2016</year>.</mixed-citation></ref>
<ref id="ref-32"><label>[32]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Santosh</surname> <given-names>TYSS</given-names></string-name>, <string-name><surname>Chakraborty</surname> <given-names>P</given-names></string-name>, <string-name><surname>Dutta</surname> <given-names>S</given-names></string-name>, <string-name><surname>Sanyal</surname> <given-names>DK</given-names></string-name>, <string-name><surname>Das</surname> <given-names>PP</given-names></string-name></person-group>. <article-title>Joint entity and relation extraction from scientific documents: role of linguistic information and entity types</article-title>. In: <conf-name>2nd Workshop on Extraction and Evaluation of Knowledge Entities from Scientific Documents (EEKE 2021) Co-Located with JCDL 2021</conf-name>; <year>2021</year>; <publisher-loc>USA</publisher-loc>.</mixed-citation></ref>
<ref id="ref-33"><label>[33]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Bi</surname> <given-names>M</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>X</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>Z</given-names></string-name></person-group>. <article-title>Machine reading comprehension combined with semantic dependency for Chinese zero pronoun resolution</article-title>. <source>Artif Intell Rev</source>. <year>2023</year>;<volume>56</volume>(<issue>8</issue>):<fpage>7597</fpage>&#x2013;<lpage>612</lpage>. doi:<pub-id pub-id-type="doi">10.1007/s10462-022-10364-5</pub-id>.</mixed-citation></ref>
</ref-list>
</back></article>