Analytics is a multidimensional field that encompasses mathematics, statistics and machine-learning techniques to find meaningful patterns and knowledge in recorded data. In
today's day and age when all types of data are available, it is imperative that we use this data to make better fact-based decisions. The analytics modules within Sigma Magic
are built on top of the R platform. While R is a very powerful open source platform, it is also relatively hard to understand and use since it requires learning the R Script
and programming expertise. Sigma Magic brings in some of these powerful tools to you in a easy to use manner.
The following templates are available within Analytics:
Clusters - Hierarchical
This template seeks to build a hierarchical cluster for a given data set. Typically used for continuous data where similar observations are grouped into
Clusters - K-Means
K-means aims to cluster observations into K clusters where each observation is assigned to the cluster for which it is the closest. Typically used
for continuous data to group similar observations into one bucket.
Correspondence analysis is a graphical way of displaying the data to determine possible relationships between row and column variables of a contingency
Decision Tree - Conditional
Use significance testing procedures to determine a recursive split of a dependent variable and develop a decision tree. The decision tree can be used to
predict future values of the dependent variable based on the developed tree.
Decision Tree - Recursive
Uses a recursive partitioning technique to arrive at the decision tree. This employs information measures to determine the best split of the tree. The
decision tree can be used to predict future values of the dependent variables.
This template can be used to build a model for different groups (where the groups are known upfront). The objective of the model is to be able to predict which
group a new observation belongs based on certain parameters.
Factor Analysis - Exploratory
This template can be used to create a determine which factors can be combined together in order to express a large number of variables into a parsimonious
set of factors.
Generalized Linear Model
Generalized linear models are a generalization of ordinary least squares approach to build more complex real-world models where the linear relationship between
the independent and dependent variables are no longer valid. It also allows response variables to have error distribution that are not normally distributed.