There is no one tool that is considered superior for purposes of Data Mining. Data Mining means different things to different displines and as a result, many solutions to different kinds of problems exist. A simple working definition of Data Mining is one that uses various tools to uncover structure from large amounts(tens of millions to billions of records) of high dimensional data(100s, 1000s or more variables) obtained as a consequence of natural or human systems under interaction. The explosion of data storage and acquistion over the last 30 years has created datasets from all areas of human investigation. The potential and incentive for understanding these structures presents research and business arbitrage opportunities. Weka is a collection of Machine Learning Algorithms written in Java. An interactive Gui is provided as well as a command line invocation capability for running multiple jobs. The tools offered in the base version of Weka is extensive. Data management, database connectivity, clustering, visualization, network modeling, prediction tools and validation methods are among its many features.
Weka is available at http://www.cs.waikato.ac.nz/ml/weka
-
Archives
- March 2013
- December 2010
- November 2010
- October 2010
- November 2009
- October 2009
- September 2009
- July 2009
- May 2009
- April 2009
- March 2009
- February 2009
- January 2009
- December 2008
- November 2008
- September 2007
- June 2007
- May 2007
- April 2007
- March 2007
- February 2007
- January 2007
- December 2006
- November 2006
-
Meta