Επιτομή:
In today's digital era, social media platforms serve as a rich source of valuable
information, offering researchers new opportunities to analyze user sentiments and
opinions. This thesis references the data that overwhelm modern daily life, as well as
the techniques developed to manage it, giving rise to a distinct branch of computer
science solely dedicated to this field.
4
Subsequently, an extensive analysis of the Apache Spark software follows, including
its architecture and functionality, step-by-step installation, and the utilization of
simple algorithms on small-scale data.
Furthermore, the capabilities of Apache Spark are presented in a more complex
scenario through the method known as Sentiment Analysis on a large-scale dataset
collected from the popular social networking platform Twitter.
The aim of this study is to leverage the parallel processing capabilities of Apache
Spark to address the inherent challenges of processing and analyzing a large volume
of datα