Abstract
This study proposed early decision heuristics for objectionable content classification using an inverse chi-square classifier. The experimental results indicated that only examining the title plus 10% of a web page's content can cost-effectively achieve an average precision of 93%. More importantly, the F1 measure achieved its best when the title plus 60% of the body was examined. The proposed early decision making heuristics can serve as the trade-off baseline for real-time online filtering.
Original language | English |
---|---|
Pages | 35-39 |
Number of pages | 5 |
DOIs | |
State | Published - 2008 |
Event | IEEE International Conference on Intelligence and Security Informatics, 2008, IEEE ISI 2008 - Taipei, Taiwan Duration: 17 Jun 2008 → 20 Jun 2008 |
Conference
Conference | IEEE International Conference on Intelligence and Security Informatics, 2008, IEEE ISI 2008 |
---|---|
Country/Territory | Taiwan |
City | Taipei |
Period | 17/06/08 → 20/06/08 |