Skip to content

Commit bbd21e6

Browse files
authored
Update README.md
1 parent 11039d1 commit bbd21e6

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -11,7 +11,7 @@ Predicting Topic of An Advertisement Based on Copy & Deep Dive Analysis
1111
## Obtaining the Data
1212
* Using Beautiful Soup, scraped about 146,000 pages for several different types of categorical and text data of advertisements including the actual copy(headline), industry, type of media, region and full description of the campaign.
1313
* Below is an example of a page and the locations of the information scraped
14-
![image-1](Images/welovead.com_example.PNG)
14+
![image-1](Images/welovead.com_example.PNG=250x)
1515
* Stored all information in a NoSQL MongoDB
1616

1717
## Cleaning
@@ -22,9 +22,9 @@ Predicting Topic of An Advertisement Based on Copy & Deep Dive Analysis
2222

2323
## Exploratory Analysis
2424
* Basic EDA showed that the U.S., Volkswagen, print media, and the Public Interest & Non-Profit Sector dominated this data set as you can see in the charts below:
25-
2625
![image-2](Images/ads_by_company_countplot.png)
2726
![image-3](Images/ads_by_industry_countplot.png)
27+
2828
* Most common words, included 'new,' 'world', 'life' and 'help', which makes sense because so many ads are advertising a new product and the most common industry is non-profit
2929
* Most common bigrams included 'don't let,' 'help us' and 'domestic violence' once again demonstrating the dominance of public interest/non-profit ads
3030
* Examined common words in descriptions as well, but it seemed to be too contextual about the campaigns to be interesting

0 commit comments

Comments
 (0)