No Cover Image

Journal article 761 views 106 downloads

Re-Engineered Word Embeddings for Improved Document-Level Sentiment Analysis

Scott Yang Orcid Logo, Farzin Deravi

Applied Sciences, Volume: 12, Issue: 18, Start page: 9287

Swansea University Author: Scott Yang Orcid Logo

  • 61289.VOR.pdf

    PDF | Version of Record

    Copyright: © 2022 by the authors. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.

    Download (1.77MB)

Check full text

DOI (Published version): 10.3390/app12189287

Abstract

In this paper, a novel re-engineering mechanism for the generation of word embeddings is proposed for document-level sentiment analysis. Current approaches to sentiment analysis often integrate feature engineering with classification, without optimizing the feature vectors explicitly. Engineering fe...

Full description

Published in: Applied Sciences
ISSN: 2076-3417
Published: MDPI AG 2022
Online Access: Check full text

URI: https://cronfa.swan.ac.uk/Record/cronfa61289
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract: In this paper, a novel re-engineering mechanism for the generation of word embeddings is proposed for document-level sentiment analysis. Current approaches to sentiment analysis often integrate feature engineering with classification, without optimizing the feature vectors explicitly. Engineering feature vectors to match the data between the training set and query sample as proposed in this paper could be a promising way for boosting the classification performance in machine learning applications. The proposed mechanism is designed to re-engineer the feature components from a set of embedding vectors for greatly increased between-class separation, hence better leveraging the informative content of the documents. The proposed mechanism was evaluated using four public benchmarking datasets for both two-way and five-way semantic classifications. The resulting embeddings have demonstrated substantially improved performance for a range of sentiment analysis tasks. Tests using all the four datasets achieved by far the best classification results compared with the state-of-the-art.
Keywords: sentiment analysis; semantic classification; feature re-engineering; NLP
College: Faculty of Science and Engineering
Funders: Swansea University
Issue: 18
Start Page: 9287