Abstract
In this paper, we describe our approach for addressing Task 1 in the KDD CUP 2002 competition. The approach is based on developing and using an improved automatic feature selection method in conjunction with traditional classifiers. The feature selection method used is based on capturing frequently occurring keyword combinations (or motifs) within short segments of the text of a document and has proved to produce more accurate classification results than approaches relying solely on using keyword-based features.
References
3
Referenced
24
- SVM light http://svmlight.joachims.org/ SVM light http://svmlight.joachims.org/
- Foundations of statistical natural language preprocessing. Christopher D. manning and Hinrich Schutze , 2000 , The MIT Press . Foundations of statistical natural language preprocessing. Christopher D. manning and Hinrich Schutze, 2000, The MIT Press. / manning and Hinrich Schutze by Foundations D. (2000)
- Kensington Discovery Edition http://www.inforsense.com Kensington Discovery Edition http://www.inforsense.com
Dates
Type | When |
---|---|
Created | 18 years, 7 months ago (Jan. 17, 2007, 1:32 p.m.) |
Deposited | 2 months ago (June 18, 2025, 1:43 p.m.) |
Indexed | 2 months ago (June 19, 2025, 12:46 a.m.) |
Issued | 22 years, 8 months ago (Dec. 1, 2002) |
Published | 22 years, 8 months ago (Dec. 1, 2002) |
Published Online | 22 years, 8 months ago (Dec. 1, 2002) |
Published Print | 22 years, 8 months ago (Dec. 1, 2002) |
@article{Ghanem_2002, title={Automatic scientific text classification using local patterns: KDD CUP 2002 (task 1)}, volume={4}, ISSN={1931-0153}, url={http://dx.doi.org/10.1145/772862.772876}, DOI={10.1145/772862.772876}, number={2}, journal={ACM SIGKDD Explorations Newsletter}, publisher={Association for Computing Machinery (ACM)}, author={Ghanem, Moustafa M. and Guo, Yike and Lodhi, Huma and Zhang, Yong}, year={2002}, month=dec, pages={95–96} }