Data Analysis, Machine Learning and Applications Episode 2 Part 6

Joaquin Vanschoren and Hendrik Blockeel (p. 424)

... all their parameters. Also, 86 commonly used classification datasets were taken from the UCI repository and inserted together with their calculated characteristics. Then, to generate a sample of classification experiments that covers a wide range of conditions, while also allowing us to test the performance of some algorithms under very specific conditions, some algorithms were explored more thoroughly than others. First, we ran all experiments with their default parameter settings on all datasets. Second, we defined sensible values for the most important parameters of the algorithms SMO (which trains a support vector machine), MultilayerPerceptron, J48 (a [...] implementation), 1R (a simple rule learner) and Random Forests (an ensemble learner), and varied each of these parameters one by one while keeping all other parameters at their defaults. Finally, we further explored the parameter spaces of J48 and 1R by selecting random parameter settings until we had about 1000 experiments on each dataset. For all randomized algorithms, each experiment was repeated 20 times with different random seeds. All experiments, about 250,000 in total, were evaluated using 10-fold cross-validation, using the same folds for each dataset.

An online interface is available at http dtai expdb for those who want to reuse experiments for their own purposes, together with a full description and code, which may be of use for setting up similar databases, for example to store, analyse and publish the results of large benchmark studies.
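The sampling procedure described above (a default run, one-at-a-time parameter variation, random settings, repeated seeds, and shared cross-validation folds) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' code: the function names, the parameter names `confidence` and `min_leaf`, and all candidate values are hypothetical.

```python
import random


def one_at_a_time(defaults, candidates):
    """Vary each parameter over its candidate values while all others stay at default."""
    settings = [dict(defaults)]  # the all-defaults run comes first
    for name, values in candidates.items():
        for value in values:
            if value == defaults[name]:
                continue  # already covered by the all-defaults run
            setting = dict(defaults)
            setting[name] = value
            settings.append(setting)
    return settings


def random_settings(space, n, rng):
    """Draw n settings, sampling each parameter independently from its candidate list."""
    return [{name: rng.choice(values) for name, values in space.items()}
            for _ in range(n)]


def fixed_folds(n_instances, k=10, seed=0):
    """Assign instances to k folds once, so every algorithm is evaluated on the same splits."""
    indices = list(range(n_instances))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


# Hypothetical defaults and candidate values for a tree learner (illustrative only).
defaults = {"confidence": 0.25, "min_leaf": 2}
candidates = {"confidence": [0.05, 0.25, 0.5], "min_leaf": [1, 2, 5, 10]}

plan = one_at_a_time(defaults, candidates)
plan += random_settings(candidates, n=5, rng=random.Random(42))
seeds = list(range(20))             # a randomized algorithm would be run once per seed
folds = fixed_folds(n_instances=150)  # computed once per dataset and reused
```

Computing the folds once per dataset and reusing them means that differences between two experiments on that dataset cannot be attributed to different train/test splits.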
4 Using the database

We will now illustrate how easy it is to use this experiment database to investigate a wide range of questions on the behavior of learning algorithms, by simply writing the right queries and interpreting the results, or by applying data mining algorithms to model more complex interactions.

Comparing different algorithms

A first question may be: how do all algorithms in this database compare on a specific dataset D? To ...
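As a rough illustration of the kind of query this question calls for, the sketch below uses Python's built-in `sqlite3` module with an in-memory database. The single-table schema `experiment(algorithm, dataset, accuracy)` is an assumption made for this example and is not the schema of the actual experiment database; the accuracy values are placeholders, not results from the paper.

```python
import sqlite3

# Hypothetical, simplified schema: one row per experiment run.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE experiment (algorithm TEXT, dataset TEXT, accuracy REAL)"
)

# Placeholder rows purely for demonstration; these are not measured results.
rows = [
    ("J48", "D", 0.90),
    ("J48", "D", 0.80),
    ("SMO", "D", 0.95),
    ("1R", "D", 0.70),
    ("J48", "other", 0.60),
]
conn.executemany("INSERT INTO experiment VALUES (?, ?, ?)", rows)


def compare_on_dataset(connection, dataset):
    """Return (algorithm, mean accuracy, number of runs) for one dataset, best first."""
    query = """
        SELECT algorithm, AVG(accuracy) AS mean_acc, COUNT(*) AS n_runs
        FROM experiment
        WHERE dataset = ?
        GROUP BY algorithm
        ORDER BY mean_acc DESC
    """
    return connection.execute(query, (dataset,)).fetchall()


ranking = compare_on_dataset(conn, "D")
for algorithm, mean_acc, n_runs in ranking:
    print(f"{algorithm}: mean accuracy {mean_acc:.3f} over {n_runs} run(s)")
```

Averaging over runs collapses the different parameter settings and random seeds of an algorithm into one figure; a query that also grouped by parameter setting would keep them apart.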
