Nand Kishor Contributor

Nand Kishor is the Product Manager of House of Bots. After finishing his studies in computer science, he ideated & re-launched Real Estate Business Intelligence Tool, where he created one of the leading Business Intelligence Tool for property price analysis in 2012. He also writes, research and sharing knowledge about Artificial Intelligence (AI), Machine Learning (ML), Data Science, Big Data, Python Language etc... ...

Full Bio 
Follow on

Nand Kishor is the Product Manager of House of Bots. After finishing his studies in computer science, he ideated & re-launched Real Estate Business Intelligence Tool, where he created one of the leading Business Intelligence Tool for property price analysis in 2012. He also writes, research and sharing knowledge about Artificial Intelligence (AI), Machine Learning (ML), Data Science, Big Data, Python Language etc...

3 Best Programming Languages For Internet of Things Development In 2018
913 days ago

Data science is the big draw in business schools
1086 days ago

7 Effective Methods for Fitting a Liner
1096 days ago

3 Thoughts on Why Deep Learning Works So Well
1096 days ago

3 million at risk from the rise of robots
1096 days ago

Top 10 Hot Artificial Intelligence (AI) Technologies
346782 views

2018 Data Science Interview Questions for Top Tech Companies
98625 views

Want to be a millionaire before you turn 25? Study artificial intelligence or machine learning
93372 views

Here's why so many data scientists are leaving their jobs
90633 views

Google announces scholarship program to train 1.3 lakh Indian developers in emerging technologies
70500 views

5 Things You Need to Know about Sentiment Analysis and Classification

By Nand Kishor |Email | Mar 27, 2018 | 27915 Views

We take a look at the important things you need to know about sentiment analysis, including social media, classification, evaluation metrics and how to visualise the results.

In the last years, Sentiment Analysis has become a hot-trend topic of scientific and market research in the field of Natural Language Processing (NLP) and Machine Learning. Below, you can find 5 useful things you need to know about Sentiment Analysis that are connected to Social Media, Datasets, Machine Learning, Visualizations, and Evaluation Methods applied by researchers and market experts. Letâ??s get started!

1. Social Media are the main resource
 Sentiment Analysis examines the problem of studying texts, like posts and reviews, uploaded by users on microblogging platforms, forums, and electronic businesses, regarding the opinions they have about a product, service, event, person or idea.

Figure 1. 3-Classes Sentiment Analysis [1]

The most common use of Sentiment Analysis is this of classifying a text to a class. Depending on the dataset and the reason, Sentiment Classification can be binary (positive or negative) or multi-class (3 or more classes) problem.

In addition, among researchers and stakeholders, you can find either similar or completely different opinions concerning the relation between emotion detection and sentiment analysis, depending on their perspective. However, regardless the result or approach, they all adopt the same techniques.

2. Before starting the Sentiment Analysis
 Datasets
Many evaluations and labeled sentiment datasets have been created, especially for Twitter posts and Amazon product reviews.

The most popular and widespread are:
Also, anyone using the APIs provided by many platforms and forums can crawl and collect data. The most famous API is that of Twitter.

Pre-processing
An initial step in text and sentiment classification is pre-processing. A significant amount of techniques is applied to data in order to reduce the noise of text, reduce dimensionality, and assist in the improvement of classification effectiveness. The most popular techniques include:

  • Remove numbers
  • Stemming
  • Part of speech tagging
  • Remove punctuation
  • Lowercase
  • Remove stopwords
 
3. How to classify Sentiment?
Machine Learning
This approach, employes a machine-learning technique and diverse features to construct a classifier that can identify text that expresses sentiment. Nowadays, deep-learning methods are popular because they fit on data learning representations.

Lexicon-Based
This method uses a variety of words annotated by polarity score, to decide the general assessment score of a given content. The strongest asset of this technique is that it does not require any training data, while its weakest point is that a large number of words and expressions are not included in sentiment lexicons.

Hybrid
The combination of machine learning and lexicon-based approaches to address Sentiment Analysis is called Hybrid. Though not commonly used, this method usually produces more promising results than the approaches mentioned above.

Figure 2. Sentiment classification techniques [2]

4. Evaluation metrics
 As a classification problem, Sentiment Analysis uses the evaluation metrics of Precision, Recall, F-score, and Accuracy. Also, average measures like macro, micro, and weighted F1-scores are useful for multi-class problems. Depending on the balance of classes of the dataset the most appropriate metric should be used.

Figure 3. Steps-to-Evaluate-Sentiment-Analysis [3]

5. Visualise Results
To visualize the results of Sentiment Analysis, many people employ well-known techniques, such as graphs, histograms, and confusion matrices. Because of present multiple data domains and tasks, visualizations approaches like wordcloud, interactive maps, sparkline-style plots are also very popular.

Figure 4. Sentiment Word Cloud [4]

Source: Kdnugget