Automated pipeline to extract data and build DBpedia Spotlight models through Jenkins.
Wikipedia Stats Extractor - Tool to extract Wikipedia counts/statistics for Named Entity Linking - wikistats-extractor/Jenkinsfile
DBpedia Spotlight Model Builder - Tool to build DBpedia Spotlight models - spotlight-models/Jenkinsfile
Jenkins build parameters:

| Language name | VERSION | LANGUAGE | HOME_HOST_PATH | LOCALE | LANGUAGE_STEMMER |
|---|---|---|---|---|---|
| Chinese | 2016-10 | zh | a path | zh_CN | None |
| Czech | 2016-10 | cs | a path | cs_CZ | None |
| English | 2016-10 | en | a path | en_US | English |
| Danish | 2016-10 | da | a path | da_DK | Danish |
| Dutch | 2016-10 | nl | a path | nl_NL | Dutch |
| French | 2016-10 | fr | a path | fr_FR | French |
| German | 2016-10 | de | a path | de_DE | German |
| Hungarian | 2016-10 | hu | a path | hu_HU | Hungarian |
| Italian | 2016-10 | it | a path | it_IT | Italian |
| Japanese | 2016-10 | ja | a path | ja_JP | None |
| Norwegian | 2016-10 | no | a path | no_NO | Norwegian |
| Portuguese | 2016-10 | pt | a path | pt_BR | Portuguese |
| Russian | 2016-10 | ru | a path | ru_RU | Russian |
| Spanish | 2016-10 | es | a path | es_ES | Spanish |
| Swedish | 2016-10 | sv | a path | sv_SE | Swedish |
| Turkish | 2016-10 | tr | a path | tr_TR | Turkish |
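
As a rough illustration of how one row of the table maps onto a parameterized Jenkins build, the sketch below assembles the parameter set for a language and formats a URL for Jenkins' standard `buildWithParameters` endpoint. The host `jenkins.example.org`, the job name `spotlight-models`, the path `/data/spotlight`, and the helper names are hypothetical, and passing the stemmer value `None` as a literal string is an assumption rather than something this README confirms.

```python
from urllib.parse import urlencode

# A few rows copied from the table above: LANGUAGE -> (LOCALE, LANGUAGE_STEMMER).
ROWS = {
    "en": ("en_US", "English"),
    "de": ("de_DE", "German"),
    "ja": ("ja_JP", "None"),  # assumption: "None" is passed as a literal string
}


def build_parameters(language, version, home_host_path):
    """Return the Jenkins build parameters for one language row (hypothetical helper)."""
    locale, stemmer = ROWS[language]
    return {
        "VERSION": version,
        "LANGUAGE": language,
        "HOME_HOST_PATH": home_host_path,
        "LOCALE": locale,
        "LANGUAGE_STEMMER": stemmer,
    }


def trigger_url(jenkins_url, job_name, params):
    """Format a URL for Jenkins' buildWithParameters endpoint (job name is hypothetical)."""
    base = jenkins_url.rstrip("/")
    return f"{base}/job/{job_name}/buildWithParameters?{urlencode(params)}"


params = build_parameters("de", "2016-10", "/data/spotlight")
url = trigger_url("https://jenkins.example.org/", "spotlight-models", params)
print(url)
```

The resulting URL would normally be sent as an authenticated POST request; the same values can also be entered by hand in the Jenkins "Build with Parameters" form.
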

All the original code produced for DBpedia Spotlight Model is licensed under the Apache License, Version 2.0.
More documentation is available from the DBpedia Spotlight wiki.
See also the FAQ.
