Liblinear-java 20 Newsgroups Example

An example showing how to use liblinear-java for text classification in Scala with the 20 newsgroups data set. Additionally, this example shows how Apache Lucene can be leveraged for tokenizing, shingling, and more.
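To make the tokenizing and shingling steps concrete, here is a minimal sketch in plain Scala. It is not the project's code and does not call Lucene or liblinear-java: the object `ShingleSketch` and its methods `tokenize`, `shingles`, and `featurize` are hypothetical names, and the regex tokenizer is a simplified stand-in for a Lucene analyzer with a shingle filter. It only illustrates how text can become the sparse, ascending, 1-based (index, value) pairs that LIBLINEAR-style feature vectors are built from.

```scala
import scala.collection.mutable

// Hypothetical helper object for illustration only; not part of this project.
object ShingleSketch {

  /** Lowercase the text and split on runs of characters outside a-z and 0-9. */
  def tokenize(text: String): Vector[String] =
    text.toLowerCase.split("[^a-z0-9]+").filter(_.nonEmpty).toVector

  /** Emit every word n-gram with minSize <= n <= maxSize, joined by one space. */
  def shingles(tokens: Vector[String], minSize: Int = 1, maxSize: Int = 2): Vector[String] =
    (minSize to maxSize).toVector.flatMap { n =>
      // sliding(n) yields one short window when there are fewer than n tokens, so drop it
      tokens.sliding(n).filter(_.size == n).map(_.mkString(" ")).toVector
    }

  /** Count terms, assigning unseen terms the next 1-based index in `vocab`.
    * Returns (index, count) pairs sorted by ascending index.
    */
  def featurize(terms: Seq[String], vocab: mutable.Map[String, Int]): Vector[(Int, Double)] = {
    val counts = mutable.Map.empty[Int, Double].withDefaultValue(0.0)
    terms.foreach { term =>
      val index = vocab.getOrElseUpdate(term, vocab.size + 1)
      counts(index) += 1.0
    }
    counts.toVector.sortBy(_._1)
  }
}
```

For example, `ShingleSketch.shingles(ShingleSketch.tokenize("Hello, World hello"))` yields the unigrams `hello`, `world`, `hello` followed by the bigrams `hello world` and `world hello`. Sharing one `vocab` map across all documents keeps feature indices consistent between the training and test sets.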

## Downloading the Data Set

The 20 newsgroups data set is available here. This example requires the "bydate" version.

Unzip the archive and place the resulting directory in the root of this project.

## Building & Running the Example

Requires Java and sbt.

Building and running:

$ sbt run