Decision Tree, Naïve Bayes and Support Vector Machine Applying on Social Media Usage in NYC / Comparative Analysis
Abstract
Data mining and classification are research approaches widely used by researchers across many topics. This study compares three classification algorithms (Decision Tree, Naïve Bayes, and Support Vector Machine) applied to the NYC social media usage dataset, in order to find the algorithm that best classifies the instances according to their platforms. The results indicate that the Support Vector Machine returned the best result among these techniques.
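The kind of comparison described in the abstract can be illustrated with a minimal sketch: train several classifiers on the same training split and score each on the same held-out split. This is not the study's procedure or data. The feature names, the toy rows, and the helper functions are all hypothetical, the decision stump is a depth-1 stand-in for a full decision tree, and the Support Vector Machine is left out because a faithful version does not fit in a short standard-library example.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(rows, labels):
    """Categorical Naive Bayes with add-one smoothing; returns a predict function."""
    class_counts = Counter(labels)
    feature_counts = Counter()   # (feature index, value, label) -> occurrences
    values = defaultdict(set)    # feature index -> distinct values seen in training
    for row, label in zip(rows, labels):
        for i, value in enumerate(row):
            feature_counts[(i, value, label)] += 1
            values[i].add(value)
    total = len(labels)

    def predict(row):
        best_label, best_score = None, float("-inf")
        for label, count in class_counts.items():
            # log prior plus the sum of smoothed log likelihoods
            score = math.log(count / total)
            for i, value in enumerate(row):
                numerator = feature_counts[(i, value, label)] + 1
                denominator = count + len(values[i])
                score += math.log(numerator / denominator)
            if score > best_score:
                best_label, best_score = label, score
        return best_label

    return predict


def train_decision_stump(rows, labels):
    """Depth-1 decision tree: split on the feature with the fewest training errors."""
    default = Counter(labels).most_common(1)[0][0]
    best_feature, best_rule, best_errors = None, None, len(labels) + 1
    for i in range(len(rows[0])):
        by_value = defaultdict(Counter)
        for row, label in zip(rows, labels):
            by_value[row[i]][label] += 1
        # each feature value predicts its majority label
        rule = {value: c.most_common(1)[0][0] for value, c in by_value.items()}
        errors = sum(rule[row[i]] != label for row, label in zip(rows, labels))
        if errors < best_errors:
            best_feature, best_rule, best_errors = i, rule, errors

    def predict(row):
        # unseen feature values fall back to the overall majority label
        return best_rule.get(row[best_feature], default)

    return predict


def accuracy(predict, rows, labels):
    """Fraction of rows whose predicted label matches the true label."""
    return sum(predict(r) == y for r, y in zip(rows, labels)) / len(labels)


def compare(trainers, train_rows, train_labels, test_rows, test_labels):
    """Train every classifier on the same split and score it on the same test set."""
    return {
        name: accuracy(trainer(train_rows, train_labels), test_rows, test_labels)
        for name, trainer in trainers.items()
    }


# Hypothetical toy data: (content_type, has_video) -> platform. Not the NYC dataset.
train_rows = [("short_text", "no"), ("short_text", "no"), ("photo", "no"),
              ("photo", "no"), ("clip", "yes"), ("clip", "yes")]
train_labels = ["Twitter", "Twitter", "Facebook", "Facebook", "YouTube", "YouTube"]
test_rows = [("short_text", "no"), ("clip", "yes"), ("photo", "no")]
test_labels = ["Twitter", "YouTube", "Facebook"]

scores = compare(
    {"naive_bayes": train_naive_bayes, "decision_stump": train_decision_stump},
    train_rows, train_labels, test_rows, test_labels,
)
print(scores)
```

Scoring every model on the same held-out rows keeps the comparison fair. The toy data is trivially separable, so these numbers say nothing about how the algorithms rank on the real dataset.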
Article Details

This work is licensed under a Creative Commons Attribution 4.0 International License.
Tikrit Journal of Pure Science is licensed under the Creative Commons Attribution 4.0 International License, which allows users to copy, create extracts, abstracts, and new works from the article, alter and revise the article, and make commercial use of the article (including reuse and/or resale of the article by commercial entities), provided the user gives appropriate credit (with a link to the formal publication through the relevant DOI), provides a link to the license, indicates if changes were made, and the licensor is not represented as endorsing the use made of the work. The authors hold the copyright for their published work on the Tikrit J. Pure Sci. website, while Tikrit J. Pure Sci. is responsible for appropriate citation of their work, which is released under CC-BY-4.0, enabling the unrestricted use, distribution, and reproduction of an article in any medium, provided that the original work is properly cited.