Category Archives: Genetics

3D LD

While waiting for genomewide SNP data to be re-partioned into LD blocks I found this page with some neat progamming tricks. It is part of the dissertation of Ben Fry / MIT about computational information design. Page 74 ff has a history of redesigning the widely used haploview pogram.

The design of these diagrams was first developed manually to work out
the details, but in interest of seeing them implemented, it was clear that
HaploView needed to be modified directly in order to demonstrate the
improvements in practice. Images of the redesigned version are seen
on this page and the page following. The redesigned version was even-
tually used as the base for a subsequence ‘version 2.0’ of the program,
which has since been released to the public and is distributed as one of
the analysis tools for the HapMap [www.hapmap.org] project.

3d-ld.png

 

CC-BY-NC Science Surf accessed 05.11.2025

How can we know?

A recent paper in Nature reported

Tissue samples were obtained from one of the following sources: Asterand, Pathlore, Tissue Transformation Technologies, Northwest Andrology, National Disease Research Interchange and Biocat. Only anonymized samples were used, and ethical approval was obtained for the study from Ärztekammer Berlin and the Cambridge Local Research Ethics Committee. […] Human primary cells were obtained from Cascade Biologics, Cell Applications, Analytical Biological Services, Cambrex Bio Science and the Deutsches Institut für Zell- und Gewebeersatz.

How did “Ärztekammer Berlin” or “Cambridge LREC” evaluate ethical performance of these companies? Or did anonymity automatically guarantee ethical research? Or is it just a formal requirement to mention ethics? Or …?

 

CC-BY-NC Science Surf accessed 05.11.2025

Evolution in fast motion

Nature genetics as an advance online publication about comparative genome sequencing of E. coli where 13 de novo mutations in 5 strains were monitored over 44 d (or ~660 generations). It is a great study – not only because the author list includes one of my previous coauthors – but for giving a first insight about development of a mutation and fixing its allele frequency. Unfortunately, there is no flowchart and the methods are somewhat vague, what has been sequenced (or resequenced) in which strain at what time . In other words who are the winners? Did they manage that by their own strength or with a little help of some friends? Why rises the allele frequency always to 100% and what about some discrepancy of allele frequency and fitness? We will hopefully see more of these studies, yea, yea.

 

CC-BY-NC Science Surf accessed 05.11.2025

Pathway to nowhere

I love pathway diagrams since I mounted the famous Biochemical Pathways of Boehringer in my bachelor flat. As far as complex disease genetics is concerned with many disease genes, an integration into a pathway context becomes critical. There are many attempts to extract this information from the literature and many companies that offer highly curated information (Biomax, Ariadne Genomics, Genomatix to name a few). Academia relies mainly on KEGG, the Kyoto encyclopedia of genes and genomes or Biocarta. Last week another pathway server appeared that is curated by the NCI and Nature magazine. Let’s have a look – I am currently working on Affymetrix 500K SNP annotation, yea, yea.

 

CC-BY-NC Science Surf accessed 05.11.2025

How to detect your own CNVs

How to detect copy number variation (CNV) in your own genotype chip data, can be found in a companion paper of the recent Nature publication.
In the previous Nature paper the authors explained their algorithm to be based on k-means and PAM (partitioning around medoid) clustering, but it seems quite different. They call genotypes with DM (which seems to be already obsolete by the BRLMM, see a comparison at Broad and the AFFX whitepaper), then adjust heterocygote ratios by Gaussian mixture clustering, normalize and reduce noise before! merging NspI and StyI arrays. The software is at Genome Science, Tokyo. Yea, yea.

 

CC-BY-NC Science Surf accessed 05.11.2025

On mice and men with asthma

A new series of pro- and con editorials in the Am J Res Crit Care Med discusses the question why in some instances mouse models have “misdirected resources and thinking”. You may have noticed that I have only rarely used animals for research; the authors of this editorial have collected empirical data on the exploding use of murine models. Despite their attractiveness from a technological point, they are often useless because

  • mice do not have asthma as even the most hyperresponsive strain does not show spontaneous symptoms
  • mice do not have allergy – although sensitization can be manipulated by high intraperitoneal allergen/adjuvant injection, this does not involve immediate and late airway obstruction.
  • immune reaction in mice is quite different – the interfering of some substances like vitamin D cannot be reliable tested, there is no pure Th1 and Th2 reaction in human and less stronger IL-13 response
  • mice typically can not be challenged with the complex (and interacting) human exposure – oxidant stress, viral infection, obesity, diet, smoke, pollutants, ….
  • time course is difficult to mimicking in the mouse, there is no longterm model
  • structure of mouse airways is different – there are fewer airway generations, much less hypertrophy of smooth muscle
  • inflammation in mouse is parenchymal rather than restricted
  • humans are outbred, mice are inbred
  • early microbial environment is different
  • many promising interventions of mice pathways failed in humans (VLA-4, IL4, IL5, bradykinine, PAF,…)

I am sure there are even more arguments – I suggest that the authors deserve the Felix-Wankel price.

Addendum

15 Dec 2006: The BMJ has 6 more examples about the discordance between animal and human studies: steroids in acute head injury, antifibrinolytics in haemorrhage, thrombolysis or tirilazad treatment in acure ischaemic stroke, antenatal steroids to prevent RDS and biphosphonate to treat osteoporosis.

19 Dec 2006: Another pitfall paper

31 Dec 2006: A blog on animal welfare

25 Apr 2007: Call for better mouse models

15 Jul 2018: Of Mice, Dirty Mice, and Men: Using Mice to Understand Human Immunology

16 Oct 2019 Allergy protection on farms – why also studies in mice could have failed

18 Feb 2023 The case for free running lab mice, see
Cell and Science paper

7 Jan  2023: A review concluding that  The vitamin D system in humans and mice: Similar but not the same

12 March 2024: Limitations of the mouse model

  • Lack of clinical relevance – The OVA model often fails to accurately replicate human asthma and allergic diseases. OVA is not a common human allergen, unlike house dust mites, pollen, and pet dander.
  • Artificial sensitization – The model requires sensitization with alum, a strong Th2-skewing adjuvant, which does not reflect natural allergen exposure in humans.
  • Simplistic immune response – OVA exposure leads to a predominantly Th2 immune response, whereas human asthma involves a complex interplay of Th1, Th2, Th17, and innate immune responses.
  • Limited chronicity – Most OVA models induce short-term allergic inflammation but fail to reproduce the chronic airway remodeling, fibrosis, and structural changes seen in human asthma.
  • Non-physiological exposure – OVA is often administered at high doses via intraperitoneal injection or repeated aerosol exposure, which does not mimic real-life human allergen exposure.
  • Strain-specific bias – Different mouse strains respond variably to OVA, making it difficult to generalize findings.
  • Lack of environmental and genetic factors – The model does not account for genetic predisposition, pollution, infections, or microbiome variations that influence human allergic disease development.
  • Poor translational value – Many drugs that show promise in OVA models fail in human clinical trials, highlighting the model’s limited predictive power.

 

CC-BY-NC Science Surf accessed 05.11.2025

Why dog and cat can’t marry

or in more scientific terms: Why are F1 hybrids so often sterile or lethal?

The Dobzhansky-Muller theory says that there is an incompatibility between genes with reduced fitness that have diverged between species. So far nobody has ever observed a D-M gene but a new Science paper describes two genes that separate D. simulans and D. melanogaster: lhr (lethal hybrid rescue) and hmr (hybrid male rescue).

Well, dogs have 78 chromosomes, arranged in 39 pairs, while cats have 38 chromosomes, in 19 pairs – so no viable embryo can be produced.

 

Yea, Yea.

 

CC-BY-NC Science Surf accessed 05.11.2025

For the first time two human genomes compared

Another “first discovery” in this nature genetics preprint although the analysis could have already been done some years earlier. The CNV specialists from Toronto now compare the Human Genome Project sequence with the Celera sequence – the gap between the two compilations was obviously bigger than the intra-sequence gaps. Of course both sequences are still mosaics from several individuals but the analysis nicely exemplifies how difficult it will be to compare the genome of two different human beings.
The authors employ a whole battery of alignment tools BLAT, MEGABLAST, GCA and A2Amapper. Of course results depend on the strategy, definition and implementation. As show by FISH analysis most of the discrepancies are true and can be classified into a few categories – insertions or deletions if seen from the second genome (has somebody ever thought about a minimal human genome?), mismatches and inversions. We are getting here a preview of the diagnostic workup in a patient in 2026. This blog contains forward looking statements while the responsibility rests solely with the reader. Yea, yea.

 

CC-BY-NC Science Surf accessed 05.11.2025

Headline: For the first time 2 human genomes compared

What sounds brand new in a current nature genetics preprint, could have even been done already a couple of years before. The CNV specialists from Toronto compare now the Human Genome Project sequence with the Celera sequence. Was the gap between these two compilations bigger than the intra sequence holes? Both sequences are of course mosaics from several individuals but the analysis nevertheless exemplifies how difficult it may be to compare even the genome of two related individuals. The authors employ a whole battery of alignment tools BLAT, MEGABLAST, A2Axxx, GCA. Of course results are different depending on strategy, definition and implementation used. As show nicely by FISH analysis most of the discrepancies are true and can be classified into a few categories – insertions with and without corresponding fragment or deletions if seen from the second genome (has somebody ever thought about a minimal human genome?), mismatches and inversions. We are getting here a preview of the diagnostic workup in a patient in 2026. This blog contains forward looking statements while the responsibility rests solely with the reader. Yea, yea.

 

CC-BY-NC Science Surf accessed 05.11.2025

New LD measure

There is a new way to calculate LD that may overcome the limitations of D’ and R^2 that are not easily generalizable to multiallelic markers (or haplotypes) and depend on the distribution of SNPs (or haplotypes).
The paper is at BMC, the sources at the authors’ website. I have slightly modified the program to allow input and output file names on the command line before compiling it. Use at your own risk, yea, yea.

 

CC-BY-NC Science Surf accessed 05.11.2025

Tit-for-tat or altruism in science

No, this essay will not deal with altruism in science but with the science of altruism. There are two new papers from the Fehr group (one in Science on Nov, 3 about diminished reciprocal fairness after magnetic stimulation of the right prefrontal cortex and a second in Nature on Aug, 24 about altruism in two indigenous groups in Papua New Guinea). I was, however, much more impressed by their recent review of human altruism.
Cooperation between genetically unrelated groups is a typical human behaviour (otherwise seen only in ants, bees and the naked mole rat) where there seems a strong reciprocity between selfishness and altruism. Cooperation is rarely stable and may deteriorate under worse conditions. Altruistic rewarding and reputation seeking seem to be the most powerful determinants of future donors’ behaviour where effects of punishing behaviour seem to be underestimated: Cooperation in larger group continues only if punishment of defectors and non-punishers is possible.

punish480.png

 

CC-BY-NC Science Surf accessed 05.11.2025

An utter refutation

I am slow in commenting on a paper that has already been published earlier this year – Joe Terwillingers vivid refutation of the fundamental theorem of the hapmap proponents that

if a marker is in tight LD with a polymorphism that directly impacts disease risk, as measured by the metric r^2, then one would be able to detect an association between the marker and disease with sample size that was increased by a factor of 1/r^2 over that needed to detect the effect of the functional variant directly

I cannot comment on the statistical proof but fear from my recent experience with Crohn and asthma tags that he may be right with his assumption: Even marker in high LD with the functional variant may not show any association at all. These may be bad news for all those currently running large screening programs with hapmap based variants believing that P(A|BC)=P(A|Bc)=P(A|B), yea, yea.

Addendum

Tag SNPs also do not work with CNVs

 

CC-BY-NC Science Surf accessed 05.11.2025

Who will survive?

When looking at gene variants in a population we may forget that even having a perfect sampling scheme this will not be an unbiased view of the human genome. Earlier studies suggested that up to 75% of conceptions are lost during early development; a further indicator of an biased view are unexplained cases of departure from Hardy-Weinberg equilibrium.
Selective survival during early pregnancy is still a terra incognita and except of studies in the Hutterites I am not aware of any (modern) study that looked at selective survival.
A study of Grant Montgomery now shows fresh data on genomewide allele sharing in 1,592 DZ twins from Australia and 336 DZ pairs from the Netherlands.
It is somewhat disappointing that there is no excess allele sharing in the HLA region nor somewhere else in the genome. Maybe further studies can do that a high resolution than with just 359 microsatellite marker?

 

CC-BY-NC Science Surf accessed 05.11.2025

Escaping from a swamp

The November AJHG has an excellent re-analysis of the dysbindin-schizophrenia association using new methodology that surpasses all previous meta-analysis techniques. As the single SNP association results from the previous 6 studies cannot be directly compared, they construct a European super-hap map from all tag SNPs in that region, place them in a phylogenetic tree before finally mapping all single associations on these haplotypes. Their Fig.1B show the main results; as the circles in Fig.1B are somewhat confusing, I have withdrawn their results – adding the haplotype frequencies and ordering the studies by year of publication.

pc010002-2.JPG

We may think of a triple-blind study – neither patients, nor PIs, nor we did know anything before. The results are alarming. I do not understand how the Kirov set could have included all haplotypes and why the Schwab/Williams set is in opposition to the Straub/Bogaert/Funke set.
What could have gone wrong? The authors of the current re-analysis believe that population differences are an unlikely reason for the inconsistency as the allele frequencies match between studies. Good news that genotyping errors may be largely excluded.
Unfortunately the authors remain vague why there is no common causal variant. Have there been different sampling schemes, different diagnostic thresholds, different environmental exposures in the previous studies? Is dysbindin at all a schizophrenia gene, or only under a certain genetic background? It seems possible that studies of one branch are false positives. Or is the haplotype reconstruction in the re-analysis erroneous for whatever reasons?
Von Münchhausen is well know for escaping from a swamp by pulling himself up by his own hair. I would like I could do that too.

 

CC-BY-NC Science Surf accessed 05.11.2025