<th id="5nh9l"></th><strike id="5nh9l"></strike><th id="5nh9l"><noframes id="5nh9l"><th id="5nh9l"></th><strike id="5nh9l"></strike>
<progress id="5nh9l"><noframes id="5nh9l"><th id="5nh9l"><noframes id="5nh9l">
<th id="5nh9l"></th> <strike id="5nh9l"><noframes id="5nh9l"><span id="5nh9l"></span>
<progress id="5nh9l"><noframes id="5nh9l"><span id="5nh9l"><noframes id="5nh9l"><span id="5nh9l"></span><strike id="5nh9l"><noframes id="5nh9l"><strike id="5nh9l"></strike>
<span id="5nh9l"><noframes id="5nh9l">
<span id="5nh9l"><noframes id="5nh9l">
<span id="5nh9l"></span><span id="5nh9l"><video id="5nh9l"></video></span>
<th id="5nh9l"><noframes id="5nh9l"><th id="5nh9l"></th>
<progress id="5nh9l"><noframes id="5nh9l">

基于BiLSTM的公共安全事件觸發詞識別

Public security event trigger identification based on Bidirectional LSTM

  • 摘要: 提出基于雙向長短期記憶網絡(bidirectional long short-term memory,BiLSTM)和前向神經網絡的融合模型完成公共安全事件的觸發詞識別任務.首先通過BiLSTM提取整段文本的高層語義特征,避免了以往機器學習方法需要人工提取特征的問題,其次采用特征拼接并在前向神經網絡中識別并分類事件觸發詞.實驗結果表明相較于基準模型,本文方法在中文突發事件語料庫(Chinese emergency corpus,CEC)上取得了更為突出的性能,Micro-F1值為78.47%.此外本文討論了不同拼接特征在觸發詞識別任務中的重要性,對文本分析中3類特征(詞性、句法、實體)的重要程度進行了比較和分析,得出句法特征對于事件觸發詞識別任務助益最大的結論.

     

    Abstract: As the internet coverage continues to expand, obtaining valuable information from a large amount of fragmented semi-structured text data has become a huge challenge considering the vast amount of social public information. Event trigger identification technology can effectively mine and refine text information so that the users can quickly and accurately get what they need; thus, it has gradually become an active research area in the field of natural language processing. An event trigger word is generally a word or phrase that marks the occurrence of the event, then trigger word identification has been applied to many aspects and plays an important role in the fields of knowledge base construction, intelligent search engine, automatic question answering robot, and automatic summarization. However, the text data are characterized by high dimensionality and ambiguity. The existing identification methods are mostly based on manual complex feature engineering or only consider the features in a certain text window. In this process, manual analysis and selection of a large number of features are required. Considerable reliance on natural language processing tools leads to the inability of applying the model on a large scale, and there are problems of erroneous cascade communication and complicated feature engineering. This paper proposed a fusion model based on the bidirectional long short-term memory (BiLSTM) and feed-forward neural networks to complete the trigger identification task for public security events. First, the high-level features of the entire text were extracted through BiLSTM to avoid manual feature extraction, which was associated with the existing machine learning methods. Then, contacted features were used to input feed-forward neural networks and identify event triggers. The experimental results show that the proposed method achieves good performance in the Chinese emergency corpus, CEC, and the Micro-F1 is 78.47%. In addition, the importance of different contacted features was also discussed in trigger word recognition tasks, and the importance of three types of features, namely part of speech, syntax, and entity, in text analysis was analyzed. It is concluded that syntactic features are most helpful to the task of event-trigger word recognition.

     

/

返回文章
返回
<th id="5nh9l"></th><strike id="5nh9l"></strike><th id="5nh9l"><noframes id="5nh9l"><th id="5nh9l"></th><strike id="5nh9l"></strike>
<progress id="5nh9l"><noframes id="5nh9l"><th id="5nh9l"><noframes id="5nh9l">
<th id="5nh9l"></th> <strike id="5nh9l"><noframes id="5nh9l"><span id="5nh9l"></span>
<progress id="5nh9l"><noframes id="5nh9l"><span id="5nh9l"><noframes id="5nh9l"><span id="5nh9l"></span><strike id="5nh9l"><noframes id="5nh9l"><strike id="5nh9l"></strike>
<span id="5nh9l"><noframes id="5nh9l">
<span id="5nh9l"><noframes id="5nh9l">
<span id="5nh9l"></span><span id="5nh9l"><video id="5nh9l"></video></span>
<th id="5nh9l"><noframes id="5nh9l"><th id="5nh9l"></th>
<progress id="5nh9l"><noframes id="5nh9l">
259luxu-164