% An overview of dissemin
% The dissemin team (presentation: Thomas Bourgeat)
% May 12, 2015

# The main idea
- Dissemin is institution-centric: it helps institutions and their researchers develop an open access policy.
- Input:
    * list of departments
    * list of researchers
- Output:
    * list of annotated publications: availability, publisher policy, ...
# Harvesting: the sources
- Primary sources (each with its own API):
    * CrossRef.org
    * Bielefeld Academic Search Engine (BASE)
    * SHERPA/RoMEO
    * CORE
- Repositories harvested via OAI-PMH:
    * arXiv
    * HAL
    * PubMed Central
    * DOAJ, Persée, Cairn.info, Numdam, ...
# Updating the harvest
- OAI-PMH works much like an RSS (or Atom) feed.
- So we only need to keep track of new entries.
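
A minimal sketch of such an incremental harvest, assuming a hypothetical endpoint `https://repo.example.org/oai` and the standard `oai_dc` metadata prefix: the harvester remembers the latest datestamp it has seen and passes it as the `from` argument of the next `ListRecords` request. The function names are illustrative, not dissemin's actual code, and resumption tokens are left out for brevity.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# XML namespace used by every OAI-PMH response.
OAI_NS = {"oai": "http://www.openarchives.org/OAI/2.0/"}


def list_records_url(base_url, since, metadata_prefix="oai_dc"):
    """Build a ListRecords request limited to records changed since `since`."""
    query = urlencode({
        "verb": "ListRecords",
        "metadataPrefix": metadata_prefix,
        "from": since,  # ISO date, e.g. "2015-05-01"
    })
    return f"{base_url}?{query}"


def parse_headers(xml_text):
    """Return (identifier, datestamp) pairs found in a ListRecords response."""
    root = ET.fromstring(xml_text)
    pairs = []
    for header in root.iterfind(".//oai:record/oai:header", OAI_NS):
        identifier = header.findtext("oai:identifier", namespaces=OAI_NS)
        datestamp = header.findtext("oai:datestamp", namespaces=OAI_NS)
        pairs.append((identifier, datestamp))
    return pairs


def next_checkpoint(pairs, previous):
    """Latest datestamp seen so far; ISO dates compare correctly as strings."""
    return max([stamp for _, stamp in pairs] + [previous])


# Hand-written sample response (hypothetical repository and identifiers).
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record><header>
      <identifier>oai:example.org:1</identifier>
      <datestamp>2015-05-02</datestamp>
    </header></record>
    <record><header>
      <identifier>oai:example.org:2</identifier>
      <datestamp>2015-05-10</datestamp>
    </header></record>
  </ListRecords>
</OAI-PMH>"""
```

Calling `next_checkpoint(parse_headers(SAMPLE_RESPONSE), "2015-05-01")` gives the date to store and reuse as `from` on the next run.
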
# Disambiguation
- The same paper can be harvested several times: we need to recognize these records as one paper.
- We need to identify its authors.
- TODO classifier
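
One common way to recognize duplicates is to reduce each record to a fingerprint and group the records that share it. The sketch below assumes records are dictionaries with `title`, `authors` (last names), `year` and `source` keys. These names and the fingerprint recipe are illustrative assumptions, not a description of dissemin's classifier.

```python
import re
import unicodedata


def normalize(text):
    """Lowercase, strip accents, and collapse punctuation and spaces to '-'."""
    decomposed = unicodedata.normalize("NFKD", text)
    no_accents = "".join(c for c in decomposed if not unicodedata.combining(c))
    return re.sub(r"[^a-z0-9]+", "-", no_accents.lower()).strip("-")


def fingerprint(title, author_last_names, year):
    """Key that should be identical for two records of the same paper."""
    authors = "-".join(sorted(normalize(name) for name in author_last_names))
    return f"{normalize(title)}/{authors}/{year}"


def merge_duplicates(records):
    """Group records by fingerprint; each value lists the sources that agree."""
    merged = {}
    for record in records:
        key = fingerprint(record["title"], record["authors"], record["year"])
        merged.setdefault(key, []).append(record["source"])
    return merged


# Two hypothetical records of the same paper, harvested from two sources.
RECORDS = [
    {"title": "Open Access: A Survey", "authors": ["Müller", "Dupont"],
     "year": 2014, "source": "crossref"},
    {"title": "open access - a survey", "authors": ["Dupont", "Muller"],
     "year": 2014, "source": "hal"},
]
```

A fingerprint like this only catches near-identical metadata; identifying authors with common names still needs a classifier and human feedback.
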
# Human feedback for disambiguation
- We added a way for authenticated users to give feedback on the authorship of articles.
- Correcting wrong classifications by hand improves the classification as a whole.
# Uploading full texts: improving the picture
- As a researcher, to deposit my paper in a repository I usually need:
    + the metadata
    + the consent of the authors and the publisher
    + ideally the full text (the TeX file or the document itself)
- In dissemin, we already know the publisher's policy and we have the metadata.
  So we only need the consent of the authors and the document itself.
# Protocol
- SWORD: basic HTTP.
- HAL (a large French repository) uses this protocol.
- It is widely used, but the difficulty is in the details and in the range of use cases!
- There are alternatives that we should also support (Zenodo, for example).
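
To give an idea of the details involved, here is a rough sketch of a SWORD v2 binary deposit prepared with Python's standard library. The collection URL and file name are hypothetical, the header set follows the SWORD v2 profile as far as we understand it, and a real repository such as HAL may expect its own packaging format and authentication.

```python
from urllib.request import Request

# Packaging identifier for a plain zip archive in the SWORD v2 profile.
SIMPLE_ZIP = "http://purl.org/net/sword/package/SimpleZip"


def deposit_headers(filename, in_progress=False):
    """HTTP headers describing a zipped deposit."""
    return {
        "Content-Type": "application/zip",
        "Content-Disposition": f"attachment; filename={filename}",
        "Packaging": SIMPLE_ZIP,
        # "true" tells the server that more content will follow later.
        "In-Progress": "true" if in_progress else "false",
    }


def build_deposit(collection_url, filename, payload, in_progress=False):
    """Prepare (but do not send) the POST request creating the deposit."""
    return Request(
        collection_url,
        data=payload,
        headers=deposit_headers(filename, in_progress),
        method="POST",
    )
```

Sending the request (for example with `urllib.request.urlopen`) is left out on purpose, since authentication and accepted packaging differ from one repository to another.
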
# Email
- A feature to ask the authors for the full text.
- Available only:
    + if you are authenticated
    + if the authors have not received such an email in the last 15 days
    + if the publisher's policy allows it
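
The three conditions above can be sketched as a single check. The function and parameter names are hypothetical, and treating "exactly 15 days ago" as old enough is our assumption.

```python
from datetime import date, timedelta

# Minimum delay between two requests sent to the same authors.
MIN_DELAY = timedelta(days=15)


def can_request_fulltext(authenticated, last_email, policy_allows, today):
    """Return True if a full-text request may be emailed to the authors.

    `last_email` is the date of the previous request, or None if there
    was none.
    """
    if not authenticated or not policy_allows:
        return False
    # Assumption: an email sent exactly 15 days ago no longer blocks a new one.
    return last_email is None or today - last_email >= MIN_DELAY
```

For example, `can_request_fulltext(True, date(2015, 5, 1), True, date(2015, 5, 12))` is refused because only 11 days have passed.
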
# CAS authentication
- Central Authentication Service (CAS) servers are widely deployed in universities.
- Dissemin creates an authenticated account for each researcher.
- It is not possible to create fake accounts.
# Future work and ideas
- letting a user add an entry that dissemin did not find
- a search function
- an output API
- a Docker version (a VM version already exists)
- more APIs (Zenodo, for example)
- a robust SWORD implementation
- a meta-dissemin?

And, more importantly:

- testing the system in real production: hopefully in October 2015 at the ENS.