README

The Social Knowledge Extractor (SKE) is a software tool for discovering new entities using Twitter.

INSTALLATION GUIDE

Requirements

Python (>3.4.0) and pip
MongoDB
MySQL
A Dandelion Account
A Twitter Application

Setting up

Download the repository.
Create four configuration files:
- addressMongo.json : your MongoDB connection settings (port, host and database name): {"port_local": "", "adress_local": "", "name_db": "ske_2"}
- addressMySQL.json : your MySQL connection settings (password, user, host and database): {"password": "", "user": "", "host": "", "database": "ske_2"}
- credentialsDandelion.json : the app_key and app_id of your Dandelion account: {"app_key" : "", "app_id" : ""}
- credentialsTwitter.json : the credentials of your Twitter application: {"consumer_key": "", "access_token_secret": "", "access_token": "", "consumer_secret": ""}
Create a CSV file with the account names of your seeds, one seed name per row.
In pipeline.sh, set the id of your experiment, the number of tweets to get for each user and the name of your seed file.
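
The four configuration files can be created and checked with a small helper. The following is only a sketch: the function names write_templates and check_configs are hypothetical and not part of SKE, and it assumes the files are read from the directory you pass in (the README does not say where the scripts look for them). The key names are copied from the templates above.

```python
import json
from pathlib import Path

# Templates copied from the list above; the key spelling "adress_local"
# is kept exactly as the README gives it.
TEMPLATES = {
    "addressMongo.json": {"port_local": "", "adress_local": "", "name_db": "ske_2"},
    "addressMySQL.json": {"password": "", "user": "", "host": "", "database": "ske_2"},
    "credentialsDandelion.json": {"app_key": "", "app_id": ""},
    "credentialsTwitter.json": {
        "consumer_key": "",
        "access_token_secret": "",
        "access_token": "",
        "consumer_secret": "",
    },
}


def write_templates(directory):
    """Create any configuration file that does not exist yet (hypothetical helper)."""
    directory = Path(directory)
    for name, template in TEMPLATES.items():
        path = directory / name
        if not path.exists():  # never overwrite a file that is already filled in
            path.write_text(json.dumps(template, indent=2))


def check_configs(directory):
    """Return {file name: [problems]} for missing files, missing keys and empty values."""
    problems = {}
    for name, template in TEMPLATES.items():
        path = Path(directory) / name
        if not path.exists():
            problems[name] = ["file not found"]
            continue
        config = json.loads(path.read_text())
        issues = [f"missing key: {key}" for key in template if key not in config]
        issues += [f"empty value: {key}" for key in template if config.get(key) == ""]
        if issues:
            problems[name] = issues
    return problems
```

Calling write_templates(".") and then filling in the empty values by hand leaves check_configs(".") returning an empty dictionary once every field is set.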

Run

From the terminal, run pipeline.sh: bash pipeline.sh

Legend

storeSeed.py

takes as input a CSV file of seed names and an experiment id
writes "seeds" to the SQL db
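
As a rough illustration of this step (not the actual code of storeSeed.py), the sketch below turns a seed CSV into rows for the "seeds" table. The function name seed_rows, the sample account names and the experiment id "exp1" are hypothetical. The INSERT statement is an assumption built from the column names listed under Databases, using the %s placeholder style common to Python MySQL drivers; it is shown only as a string and is not executed.

```python
import csv
import io

# Assumed statement, derived from the "seeds" columns (screen_name, id_experiment).
INSERT_SEED = "INSERT INTO seeds (screen_name, id_experiment) VALUES (%s, %s)"


def seed_rows(csv_file, id_experiment):
    """Read one seed name per row and pair each with the experiment id."""
    rows = []
    for record in csv.reader(csv_file):
        if not record:  # csv.reader yields an empty list for a blank line
            continue
        name = record[0].strip()
        if name:
            rows.append((name, id_experiment))
    return rows


# Hypothetical seed file with three names and one blank line.
example = io.StringIO("nasa\nesa\n\nCERN\n")
print(seed_rows(example, "exp1"))
# → [('nasa', 'exp1'), ('esa', 'exp1'), ('CERN', 'exp1')]
```

With a real database connection, each tuple could be passed with INSERT_SEED to the cursor's executemany method of a DB-API driver.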

twitter.py

takes as input the parameters (N or dates), the experiment id and the type (seeds or candidates)
writes "tweets" and "users" to the Mongo db

myDandelion.py

takes as input the experiment id
writes annotations to "tweets" in the Mongo db

createFeatureVector.py

takes as input the experiment id and the type
writes features to "users" in the Mongo db

listCandidates.py

takes as input the experiment id
writes "candidates" to the SQL db

Databases (both called ske_2):

SQL

"seeds": screen_name, id_experiment
"candidates": screen_name, id_experiment, score

Mongo

"tweets": id_user, text, lang, favourite_count, retweet_count, create_at, mentions, id_tweet, id_experiment, coordinates, annotations
"users": id_user, screen_name, id_experiment, type, features



DataSciencePolimi/social-knowledge-extractor-2
