H: An Introduction[The mtDNA H Haplogroup Project] 


H: An Introduction

Nomenclature

Mitochondria

Mitochondria are semi-independent entities found in human cells. They work to produce energy for the cell.

Mitochondrial DNA

Mitochondrial DNA is inherited only from our mothers. Every child inherits a colony of mitochondrial DNA from their mother in her egg cell. The male sperm cell has its own mtDNA. In humans, it does not survive to be passed on.

Mitochondrial DNA is made up of a control region and a coding region. The control region is further divided into two segments. Because the control region has a higher mutation rate than the coding region the segments are called the hypervariable regions. In most journal articles hypervariable region will be abbreviated to HVR. Thus they are HVR1 and HVR2.

RegionPositions
Hypervariable Region 2 00001-00574
Coding Region 00575-16000
Hypervariable Region 1 16001-16569


Mitochondrial DNA is abbreviated to mtDNA.

Mutations

A mutation is a change in the mtDNA sequence. Mutations are described in terms of the first human mtDNA to be fully sequenced. Because it was sequenced at Cambridge University it is called the Cambridge Reference Sequence, CRS.

This is an example of a change at the third place in the mtDNA. The change in the sample is shown in bold.

Location 001002003004005006007008009010
CRS G A T C A C A G G T
Sample G A C C A C A G G T


There are are multiple ways to write this. The first is to give the position and then the change: 003C. The second is to give the normal value for CRS, the position, and then the change: T003C. When talking about a single change the second method is clearer. When describing all of the changes found in a sample though it is quicker to use the first method. Some papers place both the normal CRS value and the mutation value after the position. They then indicate the change with an arrow: 003T→C.

Most mutations to mtDNA are insertions, deletions, transitions, and transpositions.

Insertions

An insertion is when an additional segment is added to the DNA sequence. Insertions are described by writing the position, a period, the insertion point and the name of the new segment. Here is an example of the common insertion 315.1C.

Location310311312313314315 316317318319
CRS T C C C C C G C T T
Sample T C C C C C C G C T T


Insertions are highly variable. One person may have some mtDNA with an insertion and some without it.

Deletions

A deletion is when a single segment is removed from the DNA sequence. Deletions are described by writing the position and a minus sign, ”-.” Here is an example of the common pair of deletions at 522 and 523.

Location520521522523524525526527528529
CRS C A C A C C G C T G
Sample C A C C G C T G


Deletions are highly variable. Like insertions, one person may have some mtDNA with the deletion and some without it.

Transitions

A transition is when one segment is changed to another of the same type.

A→G, G→A, T→C, and C→T are transitions. Here is an example of the transition A263G.

Location260261262263264265266267268269
CRS G C C A C T T T C C
Sample G C C G C T T T C C


Transitions do not happen that often but they are more frequent than transpositions.

Tranversion

A transversion is when a segment changed to another of the opposite type.

A→T, T→A, C→G, and G→C are transpositions. Here is an example of the transversion A227T.

Location220221222223224225226227228229
CRS T G C T T G T A G G
Sample T G C T T G T T G G


Transversions are uncommon.

Haplogroups

A Haplogroup is a collection of related samples. MtDNA Haplogroups are usually defined by changes to the slower coding region. Haplogroup H is defined by two mutations, A2706A and C7028C. Because the CRS was an H, these two defining mutations will not show up on a list of coding region mutations for Hs.

Here is a typical list of coding region mutations for an H1.

A750G A1438G G3010A A4769G A8860G A15326G

Location750751752753754755756757758759
CRS A A C A A G C A T C
H1 G A C A A G C A T C


Location1430143114321433143414351436143714381439
CRS A G T A A A C T A A
H1 A G T A A A C T G A


Location3010301130123013301430153016301730183019
CRS G A T G G T G C A G
H1 A A T G G T G C A G


Location4760476147624763476447654766476747684769
CRS A A T C A T A A T A
H1 A A T C A T A A T G


Location8860886188628863886488658866886788688869
CRS A C A G T G A T T A
H1 G C A G T G A T T A


Location15320153211532215323153241532515326153271532815329
CRS C T A G C A A C A C
H1 C T A G C A G C A C


This will often be written as 750G,1438G,3010A,4769G,8860G,15326G.

Haplotypes

A haplotype is a list of the mutations for an individual in relation the the CRS. The haplotype may be the HVR regions or the entire mtDNA sequence. This is a full sequence result from GenBank, ID EF660933.

A263G 315.1C A750G A1346G A1438G G3010A G4084A A4769G G5460A T6044C A8860G T9110C A15326G C16256T T16519C

If someone were to have this sequence and test HVR1 then their haplotype would be HVR1: 16256T, 16519C.

Haplogroup H
HVR116256T16519C


OR

Haplogroup H
HVR1 (16000+)256T519C


If they were to test both HVR1 and HVR2 then their haplotype would be HVR1: 16256T, 16519C; HVR2: 263G, 315.1C.

Haplogroup H
HVR116256T16519C
HVR2263G315.1C


OR

Haplogroup H
HVR1 (16000+)256T519C
HVR2263G315.1C


If they were to test their full sequence then their haplotype would be HVR1: 16256T, 16519C; HVR2: 263G, 315.1C; CR: 750G, 1346G, 1438G, 3010A, 4084A, 4769G, 5460A, 6044C, 8860G, 9110C, 15326G.

Haplogroup H1
HVR116256T16519C
HVR2263G315.1C
CR750G1346G1438G3010A4084A4769G5460A6044C8860G9110C15326G


OR

Haplogroup H1
HVR1 (16000+)256T519C
HVR2263G315.1C
CR750G1346G1438G3010A4084A4769G5460A6044C8860G9110C15326G


Current Trees

All Haplogroups interconnect forming a single tree of descent from our common foremother.

L → L1 → L3 → N → R.

R is a Macro-Haplogroup whose daughter populations span much of the Northern Hemisphere.

H in the R Tree

Based on the tree from the National Genographic Project public partition database, Behar 2007, this is a basic outline of the R Macro-Haplogroup tree.

  • R {C12705C}
    • R*
    • R0* {G11719G} [aka pre-HV]
      • HV* {C14766T}
        • H {C7028C}
        • V {G4580A}
    • B {8280-}
    • R9 {3970}
    • TJ
      • T {G13368A}
      • J {A12612G}
    • U {A11467G}
      • K {A1811G,G9055A,A12308G,G12372A}

results/helena.txt · Last modified: 12/03/2008 03:18 by goddess