Skip to content

Commit a6b06b6

Browse files
committed
Ported two posts
1 parent 7d9391c commit a6b06b6

3 files changed

Lines changed: 76 additions & 0 deletions

File tree

Lines changed: 37 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,37 @@
1+
---
2+
layout: post
3+
title: "Bioclipse beta5: really the last one now"
4+
date: 2009-05-21
5+
blogger-link: https://chem-bla-ics.blogspot.com/2009/05/bioclipse-beta5-really-last-one-now.html
6+
doi: 10.59350/6xj82-qag38
7+
tags: chembl bioclipse cdk
8+
image: /blog/assets/images/starlite.png
9+
---
10+
11+
[Bioclipse beta 5](http://bioclipse.blogspot.com/2009/05/bioclipse-20-beta5-released.html) was just released by Ola, and the team had
12+
some bad days over an [problem](http://chemicalrcp.blogspot.com/2009/05/eclipse-spring-export-problem-uses.html) that happened after
13+
a merge of [an important branch](http://jonalv.blogspot.com/2009/04/i-just-came-up-with-yet-another-way-of.html) regarding the
14+
managers we are using to allow scripting of Bioclipse.
15+
16+
![](/blog/assets/images/starlite.png)
17+
18+
In the end, [Jonathan](http://jonalv.blogspot.com/) found a workaround for the problem, even though we still have no clue what was
19+
the exact cause. Additionally, Arvid implemented one of the last missing features of the JChemPaint editor, being the ability to
20+
draw bonds in any arbitrary direction, and the ability to create a new bond to an already existing atom. This really seems to be the
21+
last beta before the 2.0 release candidate. So, head over to [SourceForge](http://sourceforge.net/projects/bioclipse) as it is
22+
now time to report this smaller things you like to see improved.
23+
24+
The beta has many really nice features, and we will have much to write about in later blogs. One thing I particularly like, is the
25+
support for (really) large SD files; the above screenshot is a 800MB file with [StarLite structures](http://chembl.blogspot.com/),
26+
though we also tried files larger than 1GB. There is a *2D-Structure* tab, which will zoom in on the structure in a regular
27+
JChemPaint editor.
28+
29+
For the Bioclipse scripting, I can just encourage you to browse this blog for example scripts.
30+
31+
There are many extensions currently being developed, around the globe, which will extend the basic Bioclipse workbench towards
32+
particular use cases. While surely these will get blogged about in detail later, I do want to briefly mention them. In the works
33+
are features for: QSAR, Decision Support, Speclipse (NMR and MS spectrum handling), Resource Description Framework, a StructureDatabase,
34+
Metabolomics, Medea (MS spectrum and fragmentation prediction), XMPP, and much more.
35+
36+
Focus of Bioclipse 2.1 will be towards bioinformatics: sequence handling, BLAST, better PDB/CIF support for protein structures,
37+
and who knows.
Lines changed: 39 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,39 @@
1+
---
2+
layout: post
3+
title: "No, PDFs really do suck!"
4+
date: 2009-06-17
5+
blogger-link: https://chem-bla-ics.blogspot.com/2009/06/no-pdfs-really-do-suck.html
6+
doi: 10.59350/dv8xh-5dk63
7+
tags: publishing
8+
---
9+
10+
A typical blog by Peter MR made (again), [The ICE-man: Scholary HTML not PDF](http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=2102),
11+
the point of why PDF is to data what a hamburger is to a cow, in reply to a blog by Peter SF, [Scholarly HTML](http://ptsefton.com/2009/06/11/trip-report-visit-to-microsoft.htm#id3).
12+
13+
This lead to a [discussion on FriendFeed](http://friendfeed.com/petermr/767254d7/ice-man-scholary-html-not-pdf).
14+
A couple of misconceptions:
15+
16+
**"But how are we going to cite without paaaaaaaaaaaage nuuuuuuuuuuumbers?"**<br />
17+
We don't. Many online-only journals can do without; there is DOI. And if that is not enough, the legal business has means of
18+
identifying paragraphs, etc, which should provide us with all the methods we could possibly need in science.
19+
20+
**Typesetting of PDFs, in most journals, is superior than HTML, which is why I prefer to read a PDF version if it is available. It is nicer to the eyes.**<br />
21+
Ummm... this is supposed to be Science, not a California Glossy. It seems that
22+
[pretty looks is causing major body count](http://shirleywho.wordpress.com/2009/05/11/an-open-letter-to-oprah/) in
23+
the States. Otherwise, HTML+CSS can likely beat any pretty looks of PDF, or at least match it.
24+
25+
**As I seem to be the only physicist/mathematician who comments on these sort of things, I feel like a broken record,
26+
but math support in browsers currently sucks extremely badly and this is a primary reason why we will continue to use
27+
PDF for quite some time.**<br />
28+
HTML+[MathML](http://www.w3.org/Math/) is well established, and default FireFox browsers have no problem showing mathematical
29+
equations. For years, the [Blue Obelisk](http://en.wikipedia.org/wiki/Blue_Obelisk) [QSAR descriptor ontology](http://qsar.sourceforge.net/dicts/qsar-descriptors/index.xhtml)
30+
has been using such a set up for years. If you use TeX to author your equations, you can
31+
[convert it to HTML](http://silas.psfc.mit.edu/mathmltalk/) too.
32+
33+
**We can mine the data from the PDF text.** Theoretically, yes. Practically, it is money down the drain. PDF is particularly
34+
nasty here, as it breaks words at the end of a line, and even can make words consist of unlinked series of characters
35+
positioned at (x,y). PDF, however, can contains a lot of metadata, but that is merely a hack, and unneeded workaround.
36+
Worse, hardly used regarding chemistry. PDF can contain PNG images which can contain CML; the tools are there, but not
37+
used, and there are more efficient technologies anyway.
38+
39+
I, for one, agree with Peter on PDF: it really suck as scientific communication medium.

assets/images/starlite.png

90.4 KB
Loading

0 commit comments

Comments
 (0)