Crimson Publishers Publish With Us Reprints e-Books Video articles

Abstract

COJ Robotics & Artificial Intelligence

Unpacking Text Representation in NLP: A Comparative Study of Models and Methods

Submission: November 22, 2024;Published: March 10, 2025

ISSN 2639-0612
Volume1 Issue4

Abstract

In recent years, the applications of NLP have seen a boom with the emergence of Large Language Models. Computer Science, Artificial Intelligence, and Machine Learning are essentially powered by mathematics, and to make it practical, the real-world objects and concepts are quantified so the math can be applied to them practically. This works well in the cases where the data can be quantified without losing any information. The same cannot be said about the textual data as there are no direct and straight forward methods of converting text into numbers meaningfully. To make the language processing possible using mathematics techniques effectively, several studies have been carried out over time on the matter of text representation numerically. Some rely on using vectors containing word frequencies while some techniques go for an even simpler approach like one hot encoding. This manuscript is the extract of a comprehensive comparative study of various text representation techniques that have been in use for the majority of NLP history.

Get access to the full text of this article

About Crimson

We at Crimson Publishing are a group of people with a combined passion for science and research, who wants to bring to the world a unified platform where all scientific know-how is available read more...

Leave a comment

Contact Info

  • Crimson Publishers, LLC
  • 260 Madison Ave, 8th Floor
  •     New York, NY 10016, USA
  • +1 (929) 600-8049
  • +1 (929) 447-1137
  • info@crimsonpublishers.com
  • www.crimsonpublishers.com