A method for preprocessing an incomplete information table
Abstract: This paper studies three problems in preprocessing an incomplete information table with continuous attributes: filling in incomplete data, reducing redundant attributes, and discretizing continuous attributes. Using rough set theory, the paper defines an addition operation on partition intervals, based on the consistent correspondence between condition attributes and decision attributes in a consistent information table, and uses it to fill in the incomplete data. Based on the concept of classes, a discernibility vector is defined, and discernibility-vector addition is used to delete redundant attributes. Continuous attributes are discretized according to the dependency between condition attributes and decision attributes and the concept of relative information entropy. A numerical example and experimental results indicate that the method is effective and feasible.
Key words:
- incomplete information table
- rough set
- information entropy
- attribute reduction
- discretization
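
The abstract mentions discretizing continuous attributes using information entropy but gives no formulas. As a rough illustration only, the sketch below shows a generic entropy-based cut selection: it picks the single cut point on a continuous condition attribute that minimizes the entropy of the decision attribute within the two resulting intervals. This is a standard textbook technique and not necessarily the relative-entropy procedure of the paper; the function names `entropy`, `conditional_entropy`, and `best_cut`, and the sample data, are assumptions made for this example.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of decision labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())


def conditional_entropy(values, labels, cut):
    """Weighted entropy of the decision labels after splitting at `cut`."""
    left = [d for v, d in zip(values, labels) if v <= cut]
    right = [d for v, d in zip(values, labels) if v > cut]
    n = len(labels)
    return len(left) / n * entropy(left) + len(right) / n * entropy(right)


def best_cut(values, labels):
    """Return the midpoint cut with the lowest conditional entropy.

    Candidate cuts are midpoints between consecutive distinct values.
    Returns None when the attribute has fewer than two distinct values.
    """
    distinct = sorted(set(values))
    candidates = [(a + b) / 2 for a, b in zip(distinct, distinct[1:])]
    if not candidates:
        return None
    return min(candidates, key=lambda c: conditional_entropy(values, labels, c))


# Hypothetical data: one continuous condition attribute, one decision attribute.
values = [1.0, 2.0, 3.0, 8.0, 9.0, 10.0]
labels = ["a", "a", "a", "b", "b", "b"]
cut = best_cut(values, labels)
```

Applying `best_cut` recursively to each resulting interval, with a stopping rule of one's choosing, would yield a multi-interval discretization; the stopping rule is left out here because the abstract does not specify one.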