Skip to content

buhari15/tweets-nigeria-analysis

Repository files navigation

Nigerian tweets analysis

This portfolio is part of a degree in Data Science at IU, International University of Applied Sciences.

The main task of the portfolio is to extract prevalent topics from Twitter messages, in other words, topic modeling.

The main Tasks as are follows:

  1. Extract relevant tweets of a city or region. Raw data

  2. Preprocess the data. Data preprocessing

  3. The 10 most frequently used hashtags. Most frequent hashtags

  4. The 10 most active users. Most active users

  5. LDA with TFIDF output. Common topics LDA with TFIDF

  6. LDA with CountVectorizer output. Common topics LDA with CountVectorizer

  7. LSA with CountVectorizer. Common topics LSA with CountVectorizer

Author

Buhari Abubakar

License

Copyright © 2023 Buhari Abubakar Released under the MIT license.


Releases

No releases published

Packages

No packages published

Languages