I created this program to extract data from the website sushiscan.su (a French online manga reading site), load it into a MongoDB database, and then serve it through the MongoDB Atlas Data API.
The MongoDB Atlas Data API lets you read and write data in Atlas with standard HTTPS requests. To use the Data API, all you need is an HTTPS client and a valid API key.
The final goal is to provide daily updated data to one of my projects, Get-mangas.
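As a rough illustration of what "an HTTPS client and a valid API key" means in practice, here is a minimal sketch that builds a Data API `findOne` request with the Python standard library. The URL pattern, the App ID, the cluster name, and the database, collection, and field names are all assumptions for illustration, not values taken from this project. Check the Atlas UI for the exact endpoint of your own app.

```python
import json
import urllib.request

# Assumed URL pattern; your app may use a region-specific host instead.
BASE_URL = "https://data.mongodb-api.com/app/{app_id}/endpoint/data/v1/action/{action}"


def build_find_one_request(app_id, api_key, data_source, database, collection, query):
    """Build (but do not send) a Data API `findOne` request."""
    payload = {
        "dataSource": data_source,  # name of the Atlas cluster
        "database": database,
        "collection": collection,
        "filter": query,
    }
    return urllib.request.Request(
        url=BASE_URL.format(app_id=app_id, action="findOne"),
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json", "api-key": api_key},
        method="POST",
    )


def send(request):
    """Send a prepared request and return the decoded JSON response."""
    with urllib.request.urlopen(request, timeout=10) as response:
        return json.load(response)


# Hypothetical values, for illustration only.
example = build_find_one_request(
    app_id="my-app-id",
    api_key="my-api-key",
    data_source="Cluster0",
    database="mangas_db",
    collection="mangas",
    query={"title": "One Piece"},
)
```

Calling `send(example)` would perform the actual HTTPS call. It is kept separate so the request can be inspected without network access or real credentials.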
Access to the MongoDB Charts dashboard: link
Apache Airflow : 2.3.4
OS : Xubuntu 22.04 LTS
sudo apt-get install libmysqlclient-dev
sudo apt-get install libssl-dev
sudo apt-get install libkrb5-dev
#
pip3 install apache-airflow
#
pip3 install typing_extensions
#
pip3 install apache-airflow-providers-mongo
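To confirm that the packages installed above are visible to your Python interpreter, a small check like the following can help. It is only a sketch that uses the standard library's `importlib.metadata`. The helper name `installed_versions` is made up for this example.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_versions(packages):
    """Map each distribution name to its installed version, or None if missing."""
    found = {}
    for name in packages:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            found[name] = None
    return found


if __name__ == "__main__":
    wanted = ["apache-airflow", "typing_extensions", "apache-airflow-providers-mongo"]
    for name, ver in installed_versions(wanted).items():
        print(f"{name}: {ver or 'NOT INSTALLED'}")
```

If `apache-airflow` is reported with a version other than 2.3.4, the commands in this README may behave differently.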
Run this command :
# this will create an airflow.cfg file in the ~/airflow folder
airflow db init
Then, run the following to create a user :
airflow users create --role Admin --username admin --email admin --firstname admin --lastname admin --password admin
Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.
- Manga are collected from Sushi-Scan
requirements.txt was generated with the pipreqs library
Please support the publishers and authors by purchasing their works
