Scrape ER Wait Times in London, Ontario

Figure: Graph plotted from data in ER-wait-times.csv.

Overview

This project scrapes emergency room wait times from https://www.lhsc.on.ca/adult-ed/emergency-department-wait-times, processes the data, and logs it to CSV files. It includes functions for web scraping with several libraries (requests, cloudscraper, and requests_html), string manipulation, and logging.

Usage

To run the scraping and logging process, run app.py. The script loops indefinitely and scrapes the page at a fixed interval (every 15 minutes by default).

python app.py

The script needs the following libraries, which you can install with:

pip install requests beautifulsoup4 cloudscraper requests_html pandas

Configuration

Modify the default file names for data and log storage in the main function of app.py.
Adjust the scrape_interval variable in app.py to change the frequency of scraping.
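
As a rough illustration of how an interval-driven loop like the one in app.py might be organized, here is a minimal sketch. Only scrape_interval comes from this README; run_scrape_loop, scrape_once, max_runs, and the sleep parameter are hypothetical names introduced for the example, and the actual script may be structured differently.

```python
import time
from typing import Callable, Optional


def run_scrape_loop(
    scrape_once: Callable[[], None],
    scrape_interval: float = 15 * 60,  # seconds; 15 minutes, matching the README default
    max_runs: Optional[int] = None,    # hypothetical: None means loop forever
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Call scrape_once repeatedly, pausing scrape_interval seconds between calls.

    Returns the number of completed calls (only reachable when max_runs is set).
    """
    runs = 0
    while max_runs is None or runs < max_runs:
        scrape_once()
        runs += 1
        if max_runs is not None and runs >= max_runs:
            break  # skip the final pause once the run budget is used up
        sleep(scrape_interval)
    return runs


if __name__ == "__main__":
    # Two quick demonstration runs, one second apart.
    run_scrape_loop(lambda: print("scraping..."), scrape_interval=1, max_runs=2)
```

Passing the sleep function in as a parameter is a design choice for the sketch: it lets the loop be exercised without actually waiting 15 minutes.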

File Structure

project/
│
├── request_soup.py      # Contains various functions for web scraping using requests, cloudscraper, and requests_html.
├── helper_fns.py        # Contains utility functions for string manipulation and HTTP request status formatting.
├── file_builders.py     # Contains a function to log data to a CSV file, either creating a new file or appending to an existing one.
├── app.py               # The main script that orchestrates the scraping, processing, and logging of emergency room wait times.
├── plot_csv.py          # Plots the data from the CSV file generated by app.py.
└── README.md            # Project readme file.

Files and Functions

request_soup.py

linkToSoup_scrapingAnt: Uses the ScrapingAnt API to fetch and parse a webpage, optionally using a proxy country and CSS selector.
linkToSoup: Fetches and parses a webpage using the requests library, with optional configurations for headers, cookies, etc.
linkToSoup_h: Fetches and parses a webpage using the requests_html library.
linkToSoup_c: Fetches and parses a webpage using the cloudscraper library to bypass anti-bot measures.

helper_fns.py

stripStr: Removes extra whitespace from a string.
truncateIfLong: Truncates a string to a specified maximum length, adding '...' if the string is too long.
miniStr: Converts an object to a string, removes extra whitespace, joins lines and words with specified separators, and optionally truncates the string.
reqStatus: Formats the status of an HTTP request as a string, including status code, reason, elapsed time, and URL.
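
The descriptions above could be realized roughly as follows. This is a sketch under stated assumptions, not the code in helper_fns.py: the function names come from this README, but every parameter name, default value, and output format here is a guess. reqStatus relies only on the status_code, reason, elapsed, and url attributes that a requests response object exposes.

```python
def stripStr(text: str) -> str:
    """Collapse each run of whitespace to one space and trim both ends."""
    return " ".join(text.split())


def truncateIfLong(text: str, max_len: int = 50) -> str:
    """Return text unchanged if it fits, otherwise cut it and append '...'.

    Assumption: the '...' counts toward max_len (for max_len >= 3).
    """
    if len(text) <= max_len:
        return text
    return text[: max(max_len - 3, 0)] + "..."


def miniStr(obj, line_sep: str = " ", word_sep: str = " ", max_len=None) -> str:
    """Stringify obj, normalize whitespace, join words and lines, optionally truncate."""
    lines = [word_sep.join(line.split()) for line in str(obj).splitlines()]
    joined = line_sep.join(line for line in lines if line)  # drop empty lines
    return joined if max_len is None else truncateIfLong(joined, max_len)


def reqStatus(response) -> str:
    """Summarize a response: status code, reason, elapsed seconds, and URL."""
    elapsed = response.elapsed.total_seconds()
    return f"<{response.status_code} {response.reason}> in {elapsed:.2f}s from {response.url}"
```

For example, under these assumptions miniStr("a  b\n\n c ", line_sep=" | ") returns "a b | c".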

file_builders.py

log_data: Logs data to a CSV file. Creates a new file if necessary, or appends to an existing file.
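
A create-or-append logger of this kind can be sketched with the standard library's csv module. Note the swap: the project lists pandas as a dependency, so the real log_data may well use it instead. The signature below (a dict per row plus a file name) and the column names in the usage example are assumptions made for illustration.

```python
import csv
import os


def log_data(row: dict, file_name: str) -> None:
    """Append one row to file_name, writing a header first if the file is new or empty."""
    is_new = not os.path.exists(file_name) or os.path.getsize(file_name) == 0
    # newline="" is what the csv module expects, so it controls line endings itself.
    with open(file_name, "a", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=list(row.keys()))
        if is_new:
            writer.writeheader()
        writer.writerow(row)


if __name__ == "__main__":
    # Hypothetical column names; the real CSV layout may differ.
    log_data({"timestamp": "2024-01-01 00:00", "wait_time": "2.5"}, "example-log.csv")
```

This version assumes every call passes the same keys in the same order; a more defensive variant would read the existing header and validate against it.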

app.py

main: Scrapes emergency room wait times from a specified URL and logs the data. Handles scenarios including successful data retrieval, warnings for unexpected data formats, and errors during the scraping process.
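
The three outcomes that main distinguishes (success, unexpected format, scraping error) might be separated along these lines. Everything here is hypothetical: scrape_outcome, fetch_text, the returned dictionary keys, and especially WAIT_PATTERN, which assumes the page reports the wait as a number followed by "hours". The actual page format and the logic in app.py may differ.

```python
import re
from typing import Callable, Dict

# Assumed format: a number followed by "hour(s)" or "hr(s)", e.g. "3.5 hours".
WAIT_PATTERN = re.compile(r"(\d+(?:\.\d+)?)\s*(?:hours?|hrs?)", re.IGNORECASE)


def scrape_outcome(fetch_text: Callable[[], str]) -> Dict[str, str]:
    """Fetch the wait-time text and classify the result as ok, warning, or error."""
    try:
        text = fetch_text()
    except Exception as exc:  # network failure, timeout, parsing crash, ...
        return {"status": "error", "wait_hours": "", "detail": f"{type(exc).__name__}: {exc}"}

    match = WAIT_PATTERN.search(text)
    if match is None:
        # The page loaded but did not contain a recognizable wait time.
        return {"status": "warning", "wait_hours": "", "detail": f"unexpected format: {text[:60]!r}"}

    return {"status": "ok", "wait_hours": match.group(1), "detail": ""}


if __name__ == "__main__":
    print(scrape_outcome(lambda: "Current wait: 3.5 hours"))
```

Returning a status alongside the value means every attempt, including failures, can be written to the scrape log as a row.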

License

This project is licensed under the MIT License. See the LICENSE file for details.

Repository: fwSara95h/london-ON-ER-wait-times-scraper

Folders and files

Latest commit

History

Repository files navigation

Scrape ER Wait Times in London, Ontario

Overview

Usage

Configuration

File Structure

Files and Functions

request_soup.py

helper_fns.py

file_builders.py

app.py

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages