The journey is the reward

The analysis of a large dataset can be done in many different ways. At least in my experience documentation is getting confusing after many weeks of work – what has been done at which time? Can I reproduce my earlier findings? Where are the latest figures? What remains to be done? …

This is an ever increasing problem. Some information is documented in lab books, other in clinical journals, or even sitting on a server database where it may change over time. I have therefore described my own documentation procedure in a PDF paper that is online at this site.

With a quick view at the figure you will see that I am (mis-)using spreadsheets for documentation Please note (1) the drop down arrows at date and task that can be used for selecting lines even of long lists and (2) that tabs can used for fast switching between results (3) one tab contains all analysis scripts (4) and one tab all links.

spread.png


CC-BY-NC

Best of two worlds

Finally, linkage and association data can be used together after downloading new software using genotype inference.

It reduces the number of genotyping reactions and increases the power of genome-wide association studies. Our method combines sparse marker data from a linkage scan and high-resolution SNP genotypes for several individuals to infer genotypes for related individuals.

Sure, we

Geo IP identification

The Geo IP database is available at Maxmind and allows to trace your home city from IP addresses. Here is a quick and dirty script to upload the Geo IP data into MySQL:

|wj_geo.cmd|

I would also put an index on loc_id. Finally the database should be available as

SELECT city
FROM GeoLiteCity INNER JOIN GeoLiteCityBlocks ON GeoLiteCity.locID = GeoLiteCityBlocks.locID
WHERE $myIP >= startIpNum AND $myIP <= endIpNum;

where $myIP is calculated as

substr($_SERVER['REMOTE_ADDR'],0,3) * 16777216 +
substr($_SERVER['REMOTE_ADDR'],4,3) * 65536 +
substr($_SERVER['REMOTE_ADDR'],8,3) * 256 +
substr($_SERVER['REMOTE_ADDR'],12,3)


CC-BY-NC

A low-cost system for a PDF literature archiv I

Getting a scientific paper on your harddisk is quite simple. I am using a Fujitsu Scan Snap that can process a single page in a few seconds. The resulting PDF needs to be further tweaked by OCR recognition like ABBY FineReader (I couldn’t find any good open source alternative). FR will leave your PDF intact while adding recognized text as an overlay (or “underlay”). Unfortunately FR does not support batch processing but your OS will do by using a windows scripting engine like CLRscript. We also need a tool to extract a text file from the modified PDF. A good choice is pdftotext — look at the sourcecode and the DRM discussion before compiling it with a compiler like Cygwin. The following perl script doesn´t do anything than traversing your target directory and creating a batch file. As filenames offered by publishers are rather strange, I would first start to create some clean file names by replacing all spaces and brackets with something innocent like underscores.
perl.exe ocr.pl rename h:\pdf\2008\*.*
Now we create text files from the PDFs (usually done better by XPDF than directly by GDS).
perl.exe ocr.pl extract h:\pdf\2008\*.pdf
The resulting textfiles may be inspected: very small file sizes usually indicate no valid extraction and should be deleted before starting the OCR step as OCR is only done when text files are missing.
perl.exe ocr.pl ocr h:\pdf\2008\*.pdf
In the last step you may want to repeat the extract step.

ocr.zip
|wj_ocr.txt|


CC-BY-NC

Unripe

A key event in the vitamin D hypothesis of allergy induction is the immature state of dendritic cells. So far, maturity has been mainly described in terms of reduced expression of cell surface marker like CD80. A new study in Nature now further unravels how the capacity of DC to present antigen may be disturbed. Ubiquitination – the covalent attachment of ubiquitin polymers – of the MHC II ß chain ceases on maturation allowing the transport from endosomal compartments to the plasma membrane. Immature cells seem to be capable to some level of peptide-MHC interaction (at least for some selected antigens) although this process is greatly enhanced by maturation of DCs. Semi-maturity is believed to be an important inetrim stage where at least an earlier review argued

we propose a model in which steady-state migration and partial maturation (semi-maturation) of DCs is embedded as a major component within immune homeostasis, established for permanent and active tolerance induction against self-antigens derived from peripheral tissues by inducing antigen-specific CD4+ Tr cells. Semi-maturation induced by proinflammatory cytokines, such as TNF-alpha, seems to represent a unique developmental tolerogenic stage for DCs, which is based on the absence of proinflammatory cytokine production, despite high expression of MHC II and costimulatory molecules.

Another interesting study in the J Immunol – coined “alternatively activated dendritic cells” the authors probably talk about the same immature cells (compare with my cartoon summarizing a 2002 paper in Trend Mol Med). These immature DCs secrete high levels of IL10 (a paradox discussed in my most recent paper). In addition these DCs produce low amounts of IL12p70, TLR4 and CCR7. What was new to me, was an impressive list of pharmacological agents that suppress DC development: aspirin (also paracetamol?), corticosteroids, cyclosporine A, rapamycin (also other antibiotics?), and finally mycophenolate mofetil.

There is also an update of the IL10 paradox: Allergic sensitization may be down regulated by CD40 AGONISTs independent of IL10!

Finally, I would like to understand what immature really do after encountering allergen exposure. A new paper in nature immunology says that

immature DC are also thought to carry antigen to lymph nodes and to interact with naive T cells but without a previous maturation stimulus, those interactions result in abortive activation of the T cells, which can be eliminated, rendered unresponsive or induced to differentiate into regulatory T cells.

which still does not answer my question.

Vitamin D actions
vd_immun.png

Yea, yea.


CC-BY-NC

Some are hybrids, some are not

Microchimerism is an interesting phenomenon that describes the hosting of foreign cells in an individuum – the prefix micro relates to the rather low counts of foreign cells (see the self discussion).
It is believed (but unproven) that most cases of microchimerism relate to the persistence of fetal cells in the maternal organism. The background of microchimerism is extremely complicated as highlighted in a recent review about the immunology of placentation in mammals. This paper has some nice cartoons about the types of placentation (epitheliochorial, endothelichorial and haemochorial) where the invasive potential of fetal trophoblast cells is the culprit of reciprocal (?) cell traffic between mother and fetus. The highest risk is found in women with induced abortion; cell count is ranging from 0 to 21 male cells per 100,000 female cells in peripheral blood; transfer may occur from mother <-> child, twin <-> twin, or sib <-> mother <-> sib.
Microchimerism has been examined in transplantation medicine (where the recipient replaces the outer donor organ epithelium), in blood transfusion and HCT, as well as in some autoimmune diseases (systemic sclerosis, SLE, thyroiditis, PBC). A clinical review reports that fetal cells have been found to persist for many years, probably for a lifetime.
I have doubts if that is true as I am not aware of any quantitative long-term study. Nearly all studies identified only male cells in women although now genomic studies of single cells are possible allowing a much better identification of foreign cells. If you are looking for a PhD thesis, microchimerism could be your field!
I already wondered if microchimerism could lead to genotyping errors, a question that can now easily be tested on the garbage of genotyping labs: We usually have genotyping errors in the 1-10 o/oo range; sometimes we see also triallelic SNPs. As far as I can renember, microchimerism has never been analyzed in the allergy field, although allergy can transplanted as well as asthma. Yea, yea.


CC-BY-NC

Not just a face-lift

The redesign of the PLoS journals today is not just a facelift. They continue to have fresh ideas and unconventional approaches. A recent email said that

we encourage authors who are fluent in languages other than English to submit translated versions of an article summary … in other languages. Translations should be submitted as Supporting Information files labeled “Translation of the article summary (or entire article) into language XXX by author YYY”.

That is a nice idea, showing another benefit of online journals, yea, yea.

plosone.png


CC-BY-NC

Are genes becoming more important with increasing age?

I found an apparent paradox between two studies. John Whitfield did a twin study in Australia and concluded that shared environmental effects decreased with age (from about 50% to 10%) while additive genetic effects increased. The new Sardinian study found higher heritabilities among younger individuals and explained that by an increase of environmental insults with age. Nice said, but who is right?


CC-BY-NC

Born to be write

A decade ago Karen Hunter (at that time Senior Vice President of Elsevier) did a brilliant analysis why scientists publish:

For academic scientists, the research paradigm is the experiment and the publication output is a journal article. Academic science researchers publish to establish their claim at a specific time to a specific result. They publish to gain other forms of recognition (such as promotion and tenure) that require publication. They publish in order to have independent certification of the results and to have those certified (refereed) results archived in perpetuity. Finally, they publish to communicate with those who may be interested in their works today …

She continues with another important aspect

… not the circle of cognoscenti (who do not need publication to be informed) but researchers in related fields, researchers in less well-connected institutions and students working their way into the inner ring.

In my opinion, the “claim at a specific time to a specific result” is probably the most relevant motivation for a scientist. Nevertheless having claims on ideas presented in a printed paper seems to be still a habit of the pre 1995 stone age of scientific publishing. Databases will be certainly as reliable in the future as printed paper. I guess that in 50 years the access to current electronic documents will be even better than to any printed paper. Yea, yea.


CC-BY-NC

On the limits of science

Have you ever heard of the Wikipedia Knowledge Dump? With the headline “WikiDumper: The Official Appreciation Page for the Best of the Wikipedia Rejects. One man’s trash is another man’s treasure” Dr. Cliff Pickover collects the best entries. For example you can read about the Beard Theorem that suggests that the size of one’s beard has a direct correlation to the radicality of a person’s socialist views. The site is as good as the Ig Noble, yea, yea.


CC-BY-NC

Can anybody become a Nobel Prize winner?

Anders Sandberg attended a seminar in Stockholm and has written an interesting report. I have doubts if we have so much need for heroes. There are so many prizes that you can apply for – an impressive list of prizes and honours that you can find at the CVs of some laureates: Albert Lasker Award, Paul Ehrlich Prize, and many, many more. Google returns 60.100.000 hits if you search for “science prize” and there is now even the European database of science prizes that will find a prize for every scientist. Yea, yea.


CC-BY-NC

WordPress as CMS

I have read many useful (and also some less useful) comments how to squeeze WordPress to work as a CMS.

I did not want to make any major changes to scripts that would be lost after an upgrade. I did not want to have extra plugins to change home (for example by creating an overriding home.cfm). I did not want to have any new categories. I did not want to change permalink structure. I still need my directory plugin to work, I still need the blog (some redirects even loose the blog address!) and I wanted to keep the RSS feed.

After several hours I came up with an very simple solution: Take a standard page and rename its title and slug to “Home” – assign a special “Home” template – redirect htaccess to this page. The only trick is to make the “Home” template work: it is basically a copy of the index.cfm in your WordPress theme directory where the line calling loop.php is being replaced with a slightly modifed loop code.

myblog.php
|wj_myblog.php|


CC-BY-NC

HUGO Changing Offensive Gene Names

Hsien Hsien Lei has a good comment on gene names approved by Human Genome Organisation (HUGO) Gene Nomenclature Committee which are nevertheless offending . Some of the inappropriate names are LFNG – lunatic fringe homolog (Drosophila), MFNG manic fringe homolog (Drosophila) as well as SHH sonic hedgehog homolog (Drosophila). There are many more names that arise only from a particular culture (like death executioner Bcl-2); it seems also a particular kind of humour to call a deaf mouse Beethoven. Yea, yea.


CC-BY-NC