tailieunhanh - SVD based dimensionality reduction for efficient web page classification

This study implements word count function of Vector Space Model and singular value decomposition (SVD) technique of feature extraction based on dimensionality reduction. The proposed system presents an effective pre-processing and dimensionality reduction techniques which help the document classify by using naïve byes algorithm. The experimental results show that the proposed method enhances the performance of document classification. | ISSN:2249-5789 Sweety M Patel et al, International Journal of Computer Science & Communication Networks,Vol 6(3),181-186 SVD based Dimensionality Reduction for Efficient Web Page Classification Sweety M. Patel Prof. Dipak C. Patel Department of Computer Engineering U. V. Patel Collage of Engineering (UVPCE), Mehsana Gujarat, India Abstract I. INTRODUCTION Web mining is the use of data mining techniques to Web is a system of inter-connected documents (with automatically discover and extract information from the Web hyperlinks) on one or more Web servers. The web is hyperlink structure, page content, and usage logs of data [1]. perhaps the largest data sources in the world. Web mining We are collect the data from the real world applications contain is the techniques of extracting useful data from web lots of erroneous data. Data pre-processing is an important step servers including website usage logs, files and documents, in data mining to correct the erroneous data present in the multimedia, hyperlinks etc. Based on the primary kinds of dataset [2]. Many data mining applications contain high data used in the mining process, Web mining tasks can be dimensional data. Dimensionality refers to number of terms in classified into three types: Web structure mining, Web a web page. Dimensionality Reduction is about converting data content mining and Web usage mining. Web content of very high dimensional data into much lower dimensional mining is the process of extracting useful information from data such that each of the lower dimensions conveys much the web documents. Data such as images, text, and more information. High dimensionality of web pages causes multimedia are high dimensional in nature. As the problem while classify them. The main objective of reducing dimensionality of data increases query performance dimensionality of web pages is to improve the performance of decreases, demand for .

TỪ KHÓA LIÊN QUAN
TÀI LIỆU MỚI ĐĂNG
11    150    1    28-04-2024
6    93    0    28-04-2024
380    92    0    28-04-2024
337    83    0    28-04-2024
crossorigin="anonymous">
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.