Measuring Different Tasks for Unstructured Data and High Speed Data in Data Stream Mining
K. Rajasekhar1 , P. Venkata Maheswara2
Section:Research Paper, Product Type: Journal Paper
Volume-7 ,
Issue-5 , Page no. 582-589, May-2019
CrossRef-DOI: https://doi.org/10.26438/ijcse/v7i5.582589
Online published on May 31, 2019
Copyright © K. Rajasekhar, P. Venkata Maheswara . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
View this paper at Google Scholar | DPI Digital Library
How to Cite this Paper
- IEEE Citation
- MLA Citation
- APA Citation
- BibTex Citation
- RIS Citation
IEEE Style Citation: K. Rajasekhar, P. Venkata Maheswara, “Measuring Different Tasks for Unstructured Data and High Speed Data in Data Stream Mining,” International Journal of Computer Sciences and Engineering, Vol.7, Issue.5, pp.582-589, 2019.
MLA Style Citation: K. Rajasekhar, P. Venkata Maheswara "Measuring Different Tasks for Unstructured Data and High Speed Data in Data Stream Mining." International Journal of Computer Sciences and Engineering 7.5 (2019): 582-589.
APA Style Citation: K. Rajasekhar, P. Venkata Maheswara, (2019). Measuring Different Tasks for Unstructured Data and High Speed Data in Data Stream Mining. International Journal of Computer Sciences and Engineering, 7(5), 582-589.
BibTex Style Citation:
@article{Rajasekhar_2019,
author = {K. Rajasekhar, P. Venkata Maheswara},
title = {Measuring Different Tasks for Unstructured Data and High Speed Data in Data Stream Mining},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {5 2019},
volume = {7},
Issue = {5},
month = {5},
year = {2019},
issn = {2347-2693},
pages = {582-589},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=4284},
doi = {https://doi.org/10.26438/ijcse/v7i5.582589}
publisher = {IJCSE, Indore, INDIA},
}
RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v7i5.582589}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=4284
TI - Measuring Different Tasks for Unstructured Data and High Speed Data in Data Stream Mining
T2 - International Journal of Computer Sciences and Engineering
AU - K. Rajasekhar, P. Venkata Maheswara
PY - 2019
DA - 2019/05/31
PB - IJCSE, Indore, INDIA
SP - 582-589
IS - 5
VL - 7
SN - 2347-2693
ER -
VIEWS | XML | |
250 | 179 downloads | 125 downloads |
Abstract
Data streams are continuous flows of data. Examples of data streams include network traffic, sensor data, call center records and so on. One important problem is mining data streams in extremely large databases (e.g. 100 TB). Satellite and computer network data can easily be of this scale. However, today’s data mining technology is still too slow to handle data of this scale. In addition, data mining should be a continuous, online process, rather than an occasional one-shot process. Organizations that can do this will have a decisive advantage over ones that do not. One particular instance is from high speed network traffic where one hopes to mine information for various purposes, including identifying anomalous events possibly indicating attacks of one kind or another. A technical problem is how to compute models over streaming data, which accommodate changing environments from which the data are drawn. This is the problem of “concept drift” or “environment drift.” This problem is particularly hard in the context of large streaming data. How may one compute models that are accurate and useful very efficiently? For example, one cannot presume to have a great deal of computing power and resources to store a lot of data, or to pass over the data multiple times. Hence, incremental mining and effective model updating to maintain accurate modeling of the current stream are both very hard problems.
Key-Words / Index Term
Data Stream, Data Stream Mining, Concept Drift/Environment Drift
References
[1] Latifur Khan1,Wei Fan, Data Stream Mining and Its Applications, June, 2012.
[2] Chen, S., Wang, H., Zhou, S., Yu, P (2008). Stop chasing trends: Discovering high order models in evolving data, In: Proc. ICDE, pp. 923–932 (2008).
[3] Gaber MM, Zaslavsky A, Krishnaswamy S. Mining data streams: a review. ACM SIGMOD Rec 2005,
[4] Mohamed Medhat Gaber, Advances in data stream mining, Volume 2, Januar y / Februar y 2012.
[5] C. C. Aggarwal, J. Han, J. Wang, and P. S. Yu, "A framework for projected clustering of high dimensional data streams," in Proceedings of the Thirtieth international conference on Very large data bases-Volume 30, 2004, p.863.
[6] Gaber MM, Zaslavsky A, Krishnaswamy S. Mining data streams: a review. ACM SIGMOD Rec 2005,
[7] Nicolás García-Pedrajas • Aida de Haro-GarcíaScaling up data mining algorithms: review and taxonomy,Received: 4 June 2011 / Accepted: 26 September 2011 / Published online: 13 January 2012
[8] Madjid Khalilian, Norwati Mustapha, MD Nasir Suliman, MD Ali Mamat,” A Novel K-Means Based Clustering Algorithm for High Dimensional Data Sets”,Vol.1, IMECS 2010,march 2010.
[9] R.S. Walse , G.D. Kurundkar , P. U. Bhalchandra,”A Review: Design and Development of Novel Techniques for Clustering and Classification of Data” in IJSRCSE, Vol.06 , Special Issue.01 , pp.19-22, Jan-2018.
[10] A.Jenita Jebamalar “Efficiency of Data Mining Algorithms Used In Agnostic Data Analytics Insight Tools” Int. J. Sc. Res. in Network Security and Communication ,Volume-6, Issue-6, December 2018.
[11] Himanshi , Komal Kumar Bhatia, “Prediction Model for Under-Graduating Student’s Salary Using Data Mining Techniques” Int. J. Sc. Res. in Network Security and Communication , Volume-6, Issue-2, April 2018.