Skip to main content

Posts

Showing posts with the label statistics

Featured Post

Data Mining with Weka -Installation

Weka - Data mining Tool W eka is a tool for big data and data mining. It is used to various classification, experiments, and analysis over large data sets. Installation Guide -weka  You can download Weka from   here   and follow the normal installation procedure. After completion you will get following window, here you can begin your classification or experiment on different data sets with Weka.

Introduction to Applied Statistics and It's Software Environment

             Introduction to Applied Statistics Applied Statistics is a branch of applied mathematics concerned with the collection and interpretation of quantitative data and the use of probability theory to estimate population parameters. In applying statistics such as a scientific, industrial, or societal problem, it is necessary to begin with a population or process to be studied. Populations can be diverse topics such as "all persons living in a country" or "every atom composing a crystal". It deals with all aspects of data including the planning of data collection in terms of the design of surveys and experiments.                    Software Environment The S, S plus, Mat lab and R environments are an integrated suite of software facilities for data analysis and graphical display with the feature like an extensive and coherent collection of tools for statistics and data analysis, language for expressing statistical models and tools for using linear an

Data Mining With Weka - Algorithms

Weka - Data Mining Tool  Algorithms: There are a lot of algorithm in weka for various classification and experiments and some the major and widely used are following : Decision tree(J48): NAME: weka.classifiers.trees.J48 SYNOPSIS: Class for generating a pruned or Un pruned  C4.5 decision tree. Naïve Bayes: NAME: weka.classifiers.bayes.NaiveBayes SYNOPSIS: Class for a Naive Bayes classifier using estimator classes. Numeric estimator precision values are chosen based on analysis of the training data. For this reason, the classifier is not an UpdateableClassifier (which in typical usage are initialized with zero training instances) KNN(IBK): NAME: weka.classifiers.lazy.IBk SYNOPSIS: K-nearest neighbours classifier. Can select appropriate value of K based on cross-validation. Can also do distance weighting. SVM(LibSVM):  NAME: weka.classifiers.functions.LibSVM SYNOPSIS: A wrapper class for the libsvm tools (the libsvm classes,