Monday, 14 December 2015

Machine Learning - Comparison with Rule based

Machine Learning – An Introduction
In general, Machine Learning is a type of artificial intelligence that provide machines with the ability to learn without being explicitly programmed. In order to make the machine robust, some amount of pre-training is required so that the machine can learn new and similar (can be defined in different dimensions) instances of a particular task on its own (without any human intervention).
Why Machine Learning?
Basically, for any task, the approaches can be classified into two broad categories: Rule-based and Machine Learning approaches.
Rule-based approaches are nothing but writing if-then-else possible conditions to achieve the desired outcome. Framing rules is the starting point in understanding the machine learning approaches, because any machine learning algorithms can perform their learning through set of features (properties or characteristics - vary for different domains and tasks).
Rules based approaches have some limitations and disadvantages which are overcome in machine learning approaches. One such limitation is the rules are domain specific. It is difficult to design the rules as domain independent. It leads to the conflict of the existing rules, which results in overfitting and/or producing some false positive and true negative results. It is also hard to design abstract level of rules where the desired outcome could not be reached with the abstract rules. Moreover, it is difficult to capture some specific classes through generic rules.
Thus, to overcome the limitations and barriers, the machine learning algorithms are introduced.
An Example - Rule-based Approach Vs Machine Learning
Let us consider a chess board which have 8X8 matrix, consists of 8 rows and 8 columns. The task is to build the rules as well as machine learning algorithm for this game.

Figure 1
Figure 1 shows a sample chess board. There are certain steps to frame the rules. They are:
1.       In order to frame the rules for this chess board, we need to define rules for each chess pieces or coins with some pre conditions
2.       Rules needs to be framed for each row and column
3.       Rules are also framed as per the opponent move.
4.       Some chess pieces or coins have restrictions which needs to be defined in the rules
Based on these conditions, the rules are framed which are big enough to write when the task is big and the data is too large.
On the other hand, machine learning approaches require a set of probabilities for example, to perform the same actions done by the set of rules.
Here too, a set of preconditions and/or probabilities are required to start with.
1.       Initial probability for all the chess pieces or coins
2.       Transition probability for each move of each coin
3.       Emission probability – the outcome of the each move. If the opponent coin is defeated, then the probability of opponent’s chess piece is zero and the one who defeats is some increased probability from the previous state.



Figure 2 – Moves highlighted for White Horse
From Figure-2, it can be seen that, there are six different ways to move the White horse to save from Black King. For rule-based approach, six different rules are needed. Among six, four are safe and two are not. These are also captured by the rules where more complex rules are needed to reslove this.
In machine learning, the probabilities of each place and each coin should decide on which place is the best to choose to move the white horse. So six different probabilities will be computed and among them, four have some moderate probabilities and two of them have some low proabilities. Based on these values, the correct position will be selected to place the White Horse.
Likewise, the probabilities for other chess pieces are also computed.
Thus machine learning is required for any task which makes the system robust. 

Monday, 15 June 2015

Statistics synonyms - 1

Independent Variable (represents the output or effect) - "predictor variable", "regressor", "controlled variable", "manipulated variable", "explanatory variable", "exposure variable", "risk factor", "feature", and "input variable"

Dependent Variable (represents the input or causes) - "response variable", "regressand", "measured variable", "responding variable", "explained variable", "outcome variable", "experimental variable", and "output variable"

Both these terminologies and its variations may vary depends on the context and application where we use. Say for instance, the term "feature" is an independent variable which is mostly used in machine learning and pattern recognition. the term "risk factor" is used in medical diagnosis.


Tuesday, 19 May 2015

ML Task: 1 Identification of Fraudlent/ Fake User Ids in a Social Network

Identification of Fraudlent/ Fake User Ids in a Social Network

For Example: Facebook

Objective:
To identify fake user ids (anonymous, fake, uncommon names) in a network.

Briefly describe the task.

Choose Machine Learning algorithm on your own, and discuss briefly how it is effectively used in the identification.

Any Questions and Queries kindly let me know




Monday, 18 May 2015

Surprising Machine Learning Applications



Surprising Machine Learning Applications

1.      Medical Diagnosis
2.      Climate Change
3.      Robotics
4.      Automatic car driving
5.      Protecting Animals
6.      Mobile applications
7.      CyberSecurity
8.      detecting fraud at banks
9.      identifying which consumers are most likely to respond favorably to: direct mail, Groupon deals, Facebook advertisements
10.  trading stocks and derivatives
11.  pricing insurance premiums (Google recently funded a startup that helps natural disaster insurance companies determine prices based on weather patterns due to global warming)
12.  identifying human genes that make people more likely to develop cancer
13.  predicting housing prices for real estate companies
14.  predicting wine-tasting ratings
15.  programmatically reading text from a random photograph
16.  programmatically recognizing faces
17.  anti-virus software (packet inspection)
18.  factory maintenance diagnostics
19.  delayed airplane flights
20.  determining which voters to canvass during an election
21.  developing pharmaceutical drugs (combinatorial chemistry)
22.  predicting tastes in music (Pandora)
23.  predicting tastes in movies/shows (Netflix)
24.  search engines (Google)
25.  predicting interests (Facebook)
26.  predicting other books you might like (Amazon)




 Add more applications on any areas including NLP, Text Mining, Sentiment Analysis, E-Learning, etc. There are many tasks inside each application. Say for example,

Medical Diagnosis have
  1.     Predicting Emergency Room Wait Times
  2.   Identifying Heart Failure
  3.    Predicting Strokes and Seizures
  4.   Predicting Hospital Readmissions 
Expecting more applications from you people. Add more to this list. 

Sunday, 17 May 2015

Machine Learning - Five different areas

Machine Learning - Introduction

As we all know the basics of Machine Learning (ML) and thus we directly continue with the five important and different areas of ML. The diagram below shows the five areas of ML.

Five different areas of Machine Learning



Let we discuss each area and its applications (different fields (real time) where each type is popular) in the future posts.