RSP Science Hub

International Research Journal on Advanced Science Hub


Performance Analysis of Feature Selection Techniques for Text Classification

    Hemlata Patel; Dhanraj Verma

International Research Journal on Advanced Science Hub, 2020, Volume 2, Issue Special Issue ICSTM 12S, Pages 44-50
10.47392/irjash.2020.259


Abstract

The Internet is a convenient, highly available, and low-cost publishing medium, so a significant amount of data is hosted and published on websites. Some of this data is directly available to the general public, and some is not publicly distributed. Such data can be used by service providers and administrators for business intelligence and similar applications. In this work, web data analysis (web data mining) is the key area of investigation and experimental study. Web data mining can be divided into three major classes: web content mining, web structure mining, and web usage mining. This work considers web content mining and web usage mining. Web content mining is explored first: a system is developed for a comparative performance study of different content feature selection techniques. In this experiment, the GINI index, Information Gain, DFS, and Odds Ratio are compared using a real-world collection of web pages. To classify the features extracted from the web contents, an SVM (Support Vector Machine) is applied. The comparative study demonstrates that IG (Information Gain) and GI (GINI index) are the suitable feature selection techniques that work well with the SVM classifier.
Keywords:
    Web Data Mining; GINI Index; Information Gain; K-Nearest Neighbour; Support Vector Machine
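
To make the feature selection step in the abstract more concrete, the following is a minimal sketch of how two of the compared measures, Information Gain and the GINI index, might score terms. It is not the authors' implementation: the toy corpus, the function names, and the use of term presence (rather than term frequency) are assumptions made for illustration, and the GINI formula shown is one common formulation from the text-classification literature that may differ from the one used in the paper. The SVM classification step is omitted.

```python
import math
from collections import Counter

# Hypothetical toy corpus; the paper uses a real-world collection of web pages.
DOCS = [
    ("cheap flights book hotel", "travel"),
    ("hotel deals beach flights", "travel"),
    ("football match score goal", "sport"),
    ("goal keeper match hotel", "sport"),
]

def entropy(counts):
    """Shannon entropy (base 2) of a collection of class counts."""
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total) for c in counts if c)

def class_counts(term, docs):
    """Return (documents per class, documents per class that contain term)."""
    labels = Counter(label for _, label in docs)
    with_t = Counter(label for text, label in docs if term in text.split())
    return labels, with_t

def information_gain(term, docs):
    """IG(t) = H(C) - P(t) * H(C | t) - P(not t) * H(C | not t)."""
    labels, with_t = class_counts(term, docs)
    without_t = labels - with_t  # Counter subtraction drops zero counts
    n, n_t = len(docs), sum(with_t.values())
    gain = entropy(labels.values())
    if n_t:
        gain -= (n_t / n) * entropy(with_t.values())
    if n - n_t:
        gain -= ((n - n_t) / n) * entropy(without_t.values())
    return gain

def gini_index(term, docs):
    """Assumed formulation: GI(t) = sum over classes of P(t|c)^2 * P(c|t)^2."""
    labels, with_t = class_counts(term, docs)
    n_t = sum(with_t.values())
    if n_t == 0:
        return 0.0
    return sum(
        (with_t[c] / labels[c]) ** 2 * (with_t[c] / n_t) ** 2 for c in labels
    )

def rank_terms(docs, score, k):
    """Keep the k highest-scoring vocabulary terms under the given measure."""
    vocab = sorted({w for text, _ in docs for w in text.split()})
    return sorted(vocab, key=lambda t: score(t, docs), reverse=True)[:k]

if __name__ == "__main__":
    print(rank_terms(DOCS, information_gain, 3))
    print(rank_terms(DOCS, gini_index, 3))
```

In this toy corpus, a term such as "flights" that appears in only one class scores higher under both measures than "hotel", which appears in both classes. The selected terms would then form the feature vectors passed to a classifier such as an SVM.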
This journal is licensed under a Creative Commons Attribution 4.0 International (CC-BY 4.0)
