Extract lists and tables from Wikipedia and add their information to DBpedia.
Make sure you have Java 1.8, Scala, and sbt installed.
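
A quick way to confirm the prerequisites are on your `PATH` is a small check like the one below. This is only a sketch: the helper name `missing_tools` is made up for illustration, the command names `java`, `scala`, and `sbt` are assumed to match your installation, and the check does not verify that the Java version is 1.8.

```shell
#!/bin/sh
# Hypothetical helper (not part of this repo): print each given command
# that cannot be found on PATH, one per line.
missing_tools() {
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
  done
}

# Assumed command names for the prerequisites listed above.
missing=$(missing_tools java scala sbt)
if [ -n "$missing" ]; then
  echo "Missing prerequisites:" $missing
else
  echo "All prerequisites found."
fi
```
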
- Clone the repo.
- Install the Jena CLI:
  - on OS X you can run `brew install jena`
  - on other platforms, install it as described here
- Then run `scripts/loadDumps.sh`. Optionally, you can update the preloaded typeCounts with `scripts/typeCount.sh`.
- Download or create a wiki-markup XML dump. Downloads from Special:Export work just fine.
- Convert it to a JSON dump with `scripts/convert.sh`.
- Copy `src/main/resources/application.conf-default` to `src/main/resources/application.conf`. In the copy, change the input filename to match your generated dump file; you can also adjust the parameters of the algorithm there.
- To start the application, run `sbt run` and choose `GenerateTypes` as the main class.
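
The configuration step above can be scripted so that re-running the setup never overwrites a configuration you have already edited. The following is a minimal sketch, assuming it is run from the repository root; the function name `init_config` is hypothetical and not part of this project.

```shell
#!/bin/sh
# Hypothetical helper (not part of this repo): copy the default config
# to the target path only when the target does not exist yet.
init_config() {
  src=$1
  dst=$2
  if [ -e "$dst" ]; then
    echo "Keeping existing $dst"
  else
    cp "$src" "$dst" && echo "Created $dst from $src"
  fi
}

# Paths taken from the steps above; the copy is skipped when the
# default file is not present (e.g. outside the repository root).
conf_default=src/main/resources/application.conf-default
conf_target=src/main/resources/application.conf
if [ -f "$conf_default" ]; then
  init_config "$conf_default" "$conf_target"
fi
```

You still need to open `src/main/resources/application.conf` afterwards and set the input filename by hand.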