Fake News Detection Using Logistic Regression & Decision Tree Classifier | by Almira Chusnul

Earlier than we’re going by way of the code, right here’s some packages libraries that ought to be put in and imported first:

# Putting in Packages!pip set up pandas
!pip isntall seaborn
!pip set up Matplotlib
!pip set up tqdm
!pip set up nltk
!pip set up wordcloud

# Importing Librariesimport pandas as pd 
import seaborn as sns 
import matplotlib.pyplot as plt

Datasets

Then, we have to obtain the dataset on our native repository. The dataset accommodates varied information from many fields, equivalent to politics, well being, and others. On this challenge, we’ll use the well being information, politics information, and all information as knowledge prepare. Listed below are the variations of information train-test that we’ll gather:

All information as knowledge prepare and politics information as knowledge check
All information as knowledge prepare and well being information as knowledge check
Politics information as knowledge prepare and knowledge check
Well being information as knowledge prepare and knowledge check
Politics information as knowledge prepare and well being information as knowledge check
Well being information as knowledge prepare and politics information as knowledge check

Dataset could be downloaded: here

To stop knowledge bias in knowledge coaching, we should always prepare the pretend information knowledge dan actual information knowledge with the identical quantity (we’ll drop the remaining knowledge that’s an excessive amount of).