README.md: 11 additions & 0 deletions
@@ -40,6 +40,17 @@ It is also possible to ingest the daily update files provided by MEDLINE
 (`ftp://ftp.ncbi.nlm.nih.gov/pubmed/updatefiles/`). **BY DEFAULT, ALL UPDATE
 FILES WILL BE APPLIED IN THIS MODE**
 
+## Abbreviation expansion
+Abbreviation expansion is done via the ALLIE (http://allie.dbcls.jp) database.
+By default, abbreviations are kept as-is from PubMed. To enable expansion, change the setting in `.env`
+to
+
+```
+EXPAND_ABBREVIATIONS=1
+```
+
+The ALLIE database will then be downloaded and installed into a PostgreSQL table. As the PubMed abstracts are ingested, the table is queried and any abbreviations found within the abstract are replaced with their long forms, and the result is stored in the `abstract_long_form` field.
+
 ## Caveats
 - The intended use is for testing of query logic, and the JVM options set for
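The expansion step described in the added README section can be sketched roughly as follows. This is only an illustration, not the project's implementation: the README says the lookup is a query against a PostgreSQL table, whereas this sketch uses a small in-memory dictionary (`ALLIE_SAMPLE`, a hypothetical stand-in with made-up entries). The function name `expand_abbreviations` and the whole-word, case-sensitive matching rule are also assumptions, since the diff does not say how abbreviations are matched.

```python
import re

# Hypothetical stand-in for the ALLIE lookup table (short form -> long form).
# The real project stores this data in a PostgreSQL table.
ALLIE_SAMPLE = {
    "MRI": "magnetic resonance imaging",
    "CT": "computed tomography",
}


def expand_abbreviations(abstract, lookup):
    """Return the abstract with each known abbreviation replaced by its long form."""
    if not lookup:
        return abstract
    # Try longer abbreviations first so they win over shorter ones.
    keys = sorted(lookup, key=len, reverse=True)
    # Assumption: match whole words only, case-sensitively.
    pattern = re.compile(r"\b(?:" + "|".join(re.escape(k) for k in keys) + r")\b")
    return pattern.sub(lambda m: lookup[m.group(0)], abstract)


# The original abstract is kept; the expanded text goes into a separate field,
# mirroring the `abstract_long_form` field named in the README.
record = {"abstract": "MRI and CT were compared."}
record["abstract_long_form"] = expand_abbreviations(record["abstract"], ALLIE_SAMPLE)
print(record["abstract_long_form"])
```

Keeping the original abstract alongside the expanded one means queries can be tested against either form, which fits the README's stated purpose of testing query logic.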