Search the web
Sign In
New User? Sign Up
recursive-partitioning · Recursive Partitioning
? Already a member? Sign in to Yahoo!

Yahoo! Groups Tips

Did you know...
Want to share photos of your group with the world? Add a group photo to Flickr.

Best of Y! Groups

   Check them out and nominate your group.
Having problems with message search? Fill out this form to ensure your group is one of the first to be migrated to the new message search system.

Messages

  Messages Help
Advanced
Methodology development: Open vs. Proprietary   Message List  
Reply | Forward Message #78 of 95 |
In the Data Mining world that is dominated by Computer Scientists, the
methodology behind the software packages sold/licensed in the market
is often proprietary. Take, for example, the classification and
regression trees software package CART(r). The basic idea behind
CART(r) is the algorithm proposed by Breiman, Friedman, Olshen, and
Stone (1984). However, there has been quite a few proprietary
improvement in CART(r) so that you can no longer know for sure what's
going on inside the software package. The same is true for C5.0/See5
(another classification trees software) that supersedes C4.5.

When dealing with proprietary methodology, it's (practically)
impossible to study the properties of the method thoroughly. Personally, I feel
uncomfortable using a method that can't
be evaluated objectively by fellow researchers. It may be OK if the
application has nothing to do with human experimentation (as in
Biostatistics). Since most (if not all) applications of Data Mining
are in commerce, the risk of using unproven methodology that hasn't
been extensively scrutinized may be acceptable.

Perhaps this joke is true after all: when a Statistician gets an idea,
she/he'll write and publish a paper while when a Computer Scientist
gets an idea, she/he'll form a company. :)

Comments?



--
T.S. Lim
tslim@...
www.Recursive-Partitioning.com



------------------------------------------------------------
Get paid to write review! http://recursive-partitioning.epinions.com





Mon Oct 16, 2000 6:00 am

tslim@...
Send Email Send Email

Forward
Message #78 of 95 |
Expand Messages Author Sort by Date

In the Data Mining world that is dominated by Computer Scientists, the methodology behind the software packages sold/licensed in the market is often...
T.S. Lim
tslim@...
Send Email
Oct 16, 2000
5:56 am
Advanced

Copyright © 2009 Yahoo! Inc. All rights reserved.
Privacy Policy - Terms of Service - Guidelines - Help