A simple set of tools to automatically generate HSK worksheets as CSVs, Mochi flashcards and PDF files.
This application uses the excellent AllSet Learning Chinese Vocabulary Wiki as its default crawler data source. Other crawlers can be implemented as long as they generate the following fields for each scraped item (see the sketch after the table below):
| Name | Type | Description |
|---|---|---|
| id | int | Sequential ID of each item in a given category |
| category | str | Name of the item category |
| chinese | str | Chinese word |
| pinyin | str | Pinyin representation |
| english | str | English translation |
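Below is a minimal, hypothetical sketch of what an alternative Scrapy spider could look like; the spider name, start URL, and CSS selectors are illustrative assumptions, not part of this project:

```python
import scrapy


class ExampleVocabSpider(scrapy.Spider):
    """Hypothetical crawler showing the fields every scraped item must provide."""

    name = "ExampleVocabSource"
    # Placeholder URL; point this at your actual data source.
    start_urls = ["https://example.com/hsk-vocabulary"]

    def parse(self, response):
        # The table layout and CSS selectors below are illustrative only;
        # adapt them to the structure of the page you are scraping.
        for index, row in enumerate(response.css("table.vocab tr"), start=1):
            yield {
                "id": index,                                   # sequential ID within the category
                "category": "HSK 1",                           # name of the item category
                "chinese": row.css("td.chinese::text").get(),  # Chinese word
                "pinyin": row.css("td.pinyin::text").get(),    # pinyin representation
                "english": row.css("td.english::text").get(),  # English translation
            }
```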
Install all Python dependencies with:

```bash
poetry install
```

You will also need Typst installed and available on PATH to generate PDFs.
All commands listed below should be issued from this project's root folder unless otherwise stated.
To extract HSK 3 vocabulary to a CSV file at ./output/hsk_3.csv, you should run:
```bash
scrapy crawl AllSetLearning -a hsk=3 -O ./output/hsk_3.csv
```
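If you want to post-process the exported vocabulary in Python, the CSV can be read back with the standard library. This assumes Scrapy's default CSV export, which writes a header row containing the field names from the table above:

```python
import csv

# Read the exported vocabulary back into dictionaries keyed by the
# field names listed above (id, category, chinese, pinyin, english).
with open("./output/hsk_3.csv", newline="", encoding="utf-8") as f:
    for row in csv.DictReader(f):
        print(row["chinese"], row["pinyin"], row["english"])
```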
To extract HSK 2 vocabulary to Mochi flashcards at ./output/hsk_2.mochi, you should run:

```bash
scrapy crawl AllSetLearning -a hsk=2 -O ./output/hsk_2.mochi
```

To generate a PDF HSK 1 worksheet at ./output/hsk_1.pdf from a given CSV vocabulary file located at ./output/hsk_1.csv, you should run:
```bash
typst compile template/main.typ output/hsk_1.pdf \
  --root . \
  --font-path font \
  --input hsk="1" \
  --input csv_file_path="../output/hsk_1.csv"
```

The resulting file should look like this:
