ass日本风韵熟妇pics男人扒开女人屁屁桶到爽|扒开胸露出奶头亲吻视频|邻居少妇的诱惑|人人妻在线播放|日日摸夜夜摸狠狠摸婷婷|制服 丝袜 人妻|激情熟妇中文字幕|看黄色欧美特一级|日本av人妻系列|高潮对白av,丰满岳妇乱熟妇之荡,日本丰满熟妇乱又伦,日韩欧美一区二区三区在线

基于BERT-CRF模型的生鮮蛋供應(yīng)鏈命名實體識別
CSTR:
作者:
作者單位:

作者簡介:

通訊作者:

中圖分類號:

基金項目:

北京市科委科技計劃項目(Z191100008619007)


Named Entity Recognition of Fresh Egg Supply Chain Based on BERT-CRF Architecture
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 圖/表
  • |
  • 訪問統(tǒng)計
  • |
  • 參考文獻
  • |
  • 相似文獻
  • |
  • 引證文獻
  • |
  • 資源附件
  • |
  • 文章評論
    摘要:

    對于生鮮蛋供應(yīng)鏈知識圖譜構(gòu)建過程中供應(yīng)鏈領(lǐng)域?qū)嶓w名稱多樣,、特征信息提取不充分的問題,,提出了一種基于BERT-CRF模型(Bidirectional encoder representations from transformers-conditional random field)的命名實體識別方法。該方法使用BIO(Begin,、Internal,、Other)標(biāo)記規(guī)則進行序列標(biāo)注,以字向量和位置向量作為輸入,,通過BERT預(yù)訓(xùn)練模型提取輸入序列全局特征,并在模型的末端添加CRF層引入硬約束,,構(gòu)建適合生鮮蛋供應(yīng)鏈領(lǐng)域命名實體識別的模型框架。所提出的模型與其他3種命名實體識別模型在自建數(shù)據(jù)集上進行了對比實驗,,該數(shù)據(jù)集包含12810條文本語料數(shù)據(jù),,5大類21個小類。實驗結(jié)果表明,,本文模型取得了很好的結(jié)果,,準(zhǔn)確率,、召回率和F1值分別達到91.82%、90.44%,、91.01%,,驗證了本文模型優(yōu)于其他3種模型。最后本文模型使用自建的食品領(lǐng)域菜譜數(shù)據(jù)集進行實驗,,結(jié)果表明模型具有一定的泛化能力,。

    Abstract:

    Recognizing named entities from raw text is the first step to construct a fresh egg supply chain knowledge graph and support a variety of downstream natural language processing tasks. This task can sort out the information in the supply chain and provide a basis for food safety traceability. In the raw text of fresh egg supply chain, there were various types of entities, and feature information extraction was inefficient. In order to solve the problem of fast and accurate identification of the named entities which entity types were pre-defined, a bidirectional encoder representations from transformers-conditional random field (BERT-CRF) architecture was proposed to solve the task of named entity recognition (NER) in the area of fresh egg supply chain. In BERT-CRF architecture, begin, internal and other (BIO) labeling rule was used to label the sequence, and the concatenation of character vector and position vector was used as inputs. The pre-training language model (BERT) was used to obtain the global features of input sequence, and the CRF layer was added at the end of the model to introduce hard constraints. A comparative experiment was conducted with other three NER model on the self-constructed dataset that contained five categories and 21 subcategories. The result showed that the BERT-CRF model was superior to the others and reported a state-of-the-art performance. The precision, recall and F1-score were 91.82%, 90.44% and 91.01%, respectively. Finally, through the comparative experiments with other self-constructed dataset (dish dataset), the results showed that the model had a certain generalization ability.

    參考文獻
    相似文獻
    引證文獻
引用本文

劉新亮,張夢琪,谷 情,任延昭,何東彬,高萬林.基于BERT-CRF模型的生鮮蛋供應(yīng)鏈命名實體識別[J].農(nóng)業(yè)機械學(xué)報,2021,52(S0):519-525. LIU Xinliang, ZHANG Mengqi, GU Qing, REN Yanzhao, HE Dongbin, GAO Wanlin. Named Entity Recognition of Fresh Egg Supply Chain Based on BERT-CRF Architecture[J]. Transactions of the Chinese Society for Agricultural Machinery,2021,52(S0):519-525.

復(fù)制
分享
文章指標(biāo)
  • 點擊次數(shù):
  • 下載次數(shù):
  • HTML閱讀次數(shù):
  • 引用次數(shù):
歷史
  • 收稿日期:2021-07-17
  • 最后修改日期:
  • 錄用日期:
  • 在線發(fā)布日期: 2021-11-10
  • 出版日期: 2021-12-10
文章二維碼