A Fake News Detection Web App built using Python, Scikit-learn, and Streamlit.
It classifies news articles as Real or Fake based on their textual content.
This project uses Natural Language Processing (NLP) and a Machine Learning model trained on real-world news datasets to detect whether a piece of news is genuine or fake.
Users can paste any headline or article text into the app, and it will instantly predict the authenticity with a confidence score.
-
The dataset contains two files:
Fake.csvTrue.csv
-
Data was preprocessed and combined into one dataframe.
-
Text features were extracted using TF-IDF Vectorization.
-
A PassiveAggressiveClassifier was trained to distinguish fake vs. real news.
-
The final trained model and vectorizer were saved as:
model.pklvectorizer.pkl
-
These are used in the Streamlit web app for real-time prediction.
| Technology | Purpose |
|---|---|
| Python | Programming language |
| Scikit-learn | Machine learning algorithms |
| Pandas / NumPy | Data processing |
| TF-IDF Vectorizer | Feature extraction |
| Streamlit | Web app interface |
git clone https://github.com/<your-username>/fake-news-detector.git
cd fake-news-detector
###🌐 Deployment
The project is deployed using Streamlit Cloud.
Once deployed, users can access it directly through a shareable URL.
🧑💻 Author
Aayush Dhote
💼 Aspiring AI Engineer | Data Science & Machine Learning Enthusiast]
❤️ Acknowledgements
Streamlit
Scikit-learn Documentation
Dataset: Kaggle – Fake and True News