Haplogroup G (Y-DNA): Wikis


Note: Many of our articles have direct quotes from sources you can cite, within the Wikipedia article! This article doesn't yet, but we're working on it! See more info or our list of citable articles.


From Wikipedia, the free encyclopedia

Haplogroup G
Time of origin 9,500-20,000 years BP
Place of origin Middle East or Southwest Asia
Ancestor F
Descendants G1, G2 and their subgroups
Defining mutations M201 (G), M285 (G1), P287 (G2) with common subgroups: P16 (G2a1), M406 (G2a3a), G2a3b1 (P303), M377 (G2c)

In human genetics, Haplogroup G (M201) is a Y-chromosome haplogroup. It is a branch of Haplogroup F (M89). Haplogroup G has an overall low frequency in most populations but is widely distributed within many ethnic groups of the Old World in Europe, northern and western Asia, northern Africa, the Middle East, India, Sri Lanka and Malaysia.



Various estimated dates and locations have been proposed for the origin of Haplogroup G. The National Geographic Society places haplogroup G origins in the Middle East 10-20,000 years ago and presumes that people carrying the haplogroup took part in the spread of the Neolithic [1] Two scholarly papers have also suggested an origin in the Middle East, while differing on the date. Semino et al. suggested 17,000 years ago.[2] Cinnioglu et al. suggested the mutation took place 9,500 years ago.[3] The oldest skeletons confirmed by ancient DNA testing as carrying haplogroup G date only from the 7th century C.E. and were found in present-day Bavaria, Germany.[4]

Genetic Findings


M201 SNP Mutation

The M201 SNP mutation that characterizes haplogroup G was identified at Stanford University and was first reported in 2001.[5] The technical specifications of M201 are given as: refSNPid is rs2032636.....Y chromosome location of 13536923....forward primer is tatgcatttgttgagtatatgtc.....reverse primer is gttctgaatgaaagttcaaacg.....the mutation involves a change from G to T.

Equivalent SNP Mutations

A number of SNPs have been identified with seemingly the same coverage in the population as M201. Because M201 was identified first, it is the standard SNP test used when testing for G persons. In order to determine if one of these alternative SNPs represents a subgroup of M201, the alternative SNPs must be tested in G persons who are negative for the known subgroups of G. There are only a tiny number of persons in such a category, and only a tiny number of persons have been tested for G equivalent SNPs other than M201.

The following SNPs are so far are identified as M201 equivents: P257, L240, U2, U3, U6, U7, U12, U17, U20, U21, U23 and U33. P257 was first reported in 2008.[6] L240 was identified in 2009.[7] The "U" SNPs were identified in 2006 but not published until 2009.[8]

In addition, there are multiple other SNPs thought to have the same coverage as M201. These are found at: rs9786910, rs9786537, rs2713254, rs35567891 and rs34621155 on the Y chromosome. No labs have yet assigned them shorthand names.

Subgroup Tree Listing


  • G1 (M285, M342)
    • G1* -
      • G1a (P20.1, P20.2, P20.3)
      • G1b (P76)
  • G2 (P287)
    • G2* -
    • G2a (P15, L31/S149/U5)
      • G2a* -
        • G2a1 (P16.1, P16.2)
          • G2a1* -
            • G2a1a (P18.1, P18.2, P18.3)
        • G2a2 (M286)
        • G2a3 (L30/S126, L32/S148)
          • G2a3* -
            • G2a3a (M406)
              • G2a3a* -
                • G2a3a1 (L14/S130/U16, S133)
                • G2a3a2 (L184#, L185#)
            • G2a3b (L141#)
              • G2a3b* -
                • G2a3b1 (P303/S135)
                  • G2a3b1* -
                    • G2a3b1a (U1)
                      • G2a3b1a* -
                        • G2a3b1a1 (L13/S131/U13, L78)
                      • G2a3b1b (L43/S147)
                        • G2a3b1b* -
                          • G2a3b1b1 (L42/S146)
                        • G2a3b1c (L139#)
                • G2a3b2 (L177.1#, L177.2#, L177.3#)
        • G2a4 (L91#)
    • G2b (M287)
    • G2c (M377, L72, L183#)
      • G2c* -
        • G2c1 (M283)

The # indicates newly identified in 2009

The five most commonly occurring subgroups of haplogroup G with their corresponding SNPs in parentheses are: G1 (M285), G2a1 (P16) G2a3a (M406), G2a3b1 (P303) and G2c (M377).

Characteristics of Haplogroup G Subgroups

The International Society of Genetic Genealogy (ISOGG)[9] maintains the most up-to-date version of haplogroup categories. These classifications are based on shared SNP mutations. The discovery of new SNPs can result in assignment of new names to haplogroup categories. There were only a few G categories until 2008 when major revisions to categories were made. Even more G SNPs were identified in 2009 leading to more changes. Until 2008, new G SNPs were reported from labs at the University of Arizona (P designations), Stanford University (M designations) or the University of Central Florida (U designations). Beginning in 2008, additional G SNPs were identified at Family Tree DNA (L designations) and Ethnoancestry (S designations). These latter labs also made use of raw data results reported by individuals tested for about 2,000 SNPs at 23andMe to provide new L or S-designated SNP tests. In 2009, Family Tree DNA's Walk through the Y Project, sequencing certain Y-chromosome segments, provided a number of new G SNPs with the L designation.

Because SNPs provide the most reliable method of categorization, each is allowed to represent an official G category. Categories have alternating letters and numbers. But unusual values or unusual value combinations found at short tandem repeat markers (STRs) can also provide the basis of additional subgrouping. The identification of a new SNP can necessitate renaming of one or more categories.

G1 (M285+)

Almost all haplogroup G1 persons have the value of 12 at short tandem repeat (STR) marker DYS392 and all will have the M285 SNP mutation which characterizes this group. This value of 12 is uncommon in other haplogroups.

Subgroups of G1a and G1b exist with several new SNPs within G1 under investigation.

The highest reported concentration of G1 and its subgroups in a single country is in Iran, with next most frequent concentrations in neighboring countries to the west.

There are distinctive Ashkenazi Jewish and Kazakh G1 subgroups based on STR marker value combinations. These subgroups do not have SNP mutations to identify them.

G2a1 (P16+)

Haplogroup G2a1 and its one subgroup G2a1a represent the majority of haplogroup G samples in some parts of the Caucasus Mountains area. G2a1 is found only in tiny numbers elsewhere. Almost all G2a1 persons have a value of 10 at STR marker DYS392. They also typically have higher values for DYS385b, such as 16, 17 or 18, than seen in most G persons.

The North Ossetians in the mid northern Caucasus area of Russia belong overwhelmingly to the G2a1 subgroup based on available samples.

The presence of the SNP P18 mutation characterizes G2a1's only subgroup, G2a1a. The reliability of both P16 and P18 in identifying everyone in each of these categories has been questioned and individual components of the SNP have to be examined.

Ashkenazi Jewish G2a1 men with northeastern European ancestry form a distinct cluster based on STR marker values. Men from the Caucasus and men from eastern Europe also form distinctive STR clusters.

G2a2 (M286+)

This group is tiny. Samples indicating British Isles and Turkish ancestry have been identified. The British samples have distinctive double values for STR marker DYS19.

G2a3 (L30+, S126+, U8+)

Men who belong to this group but are negative for all its subgroups represent a small number of men.

G2a3a (M406+)

G2a3a and its several subgroups seem most commonly found in Turkey and Greece and the costal areas of the eastern Mediterranean where it can constitute up to 50% of haplogroup G samples. G2a3a is more common in southern Europe than northern Europe. In Europe—except in Greece and Italy -- G2a3a constitutes less than 20% of G samples. G2a3a so far has seldom surfaced in northern Africa or southern Asia, but represents a small percentage of the G population in the Caucasus Mountains region and in Iran.

A relatively high percentage of G2a3a persons have a value of 21 at STR marker DYS390. The DYS391 marker has mostly a value of 10, but sometimes 11, in G2a3a persons, and DYS392 is almost always 11. If a sample meets the criteria indicated for these three markers, it is likely the sample is G2a3a.

G2a3a has two known subgroups. G2a3a1 is relatively common, but the frequency of G2a3a2 which was first identified in 2009 is unknown.

G2a3b (L141+)

The SNP that defines this group was identified only in mid-2009 at Family Tree DNA, handicapping the ability to describe the features of this group. Initial testing seems to indicate this is a significant G group. Samples from persons with British Isles and Turkish ancestry have been identified. A high percentage of L141+ persons who do not belong to any L141 subgroup have the value of 11 at STR marker DYS490-- a finding rare in other G categories.

G2a3b1 (P303+)

The G2a3b1 definable subgroups are heavily concentrated throughout Europe west of the Black Sea and Russia where G2a3b1 is often in the majority among G persons. Small percentages of G2a3b1 are found primarily in the area encompassed by Turkey, the southern Caucasus countries, Iran and the Middle East where the G2a3b1 SNP may have originated. G2a3b1 is also found in India.

The largest G2a3b1 subgroup based on available samples is one in which almost all persons have the value of 13 at STR marker DYS388. There are additional subgroups of DYS388=13 men characterized by the presence of specific SNPs or uncommon STR marker oddities.

The next largest G2a3b1 subgroup is characterized by the presence of the L13/S13 SNP. This subgroup is most common in north central Europe.

The final major subgroup is characterized in a high percentage of cases by the values of 9 at marker DYS568 and 20,21 at marker YCA. This latter DYS568=9 subgroup contains a further large subgroup consisting overwhelmingly of Ashkenazi Jews.

The highest percentage of G2a3b1 persons in a discrete population so far described is in the island of Ibiza off the eastern Spanish coast.

G2a4 (L91+)

The SNP that characterizes this group (L91) was identified only in spring 2009 at Family Tree DNA. G2a4 would seem to encompass a significant group of G persons, but the recent identification and the paucity of samples are obstacles to describing characteristics of this group. Included within G2a4 are some men with double values for STR marker DYS19, but there are also G2a men with this finding who are not G2a4. The double 19 value situation is not seen in the G2a1 and G2a3 subgroups.

G2c (M377+)

A clade of closely related Ashkenazi Jews represent virtually all G2c persons, with just three other G2c haplotypes having been reported so far: one Turk from Kars in northeast Turkey near Armenia, one Pashtun, and one Burusho in Pakistan. The extreme rarity of G2c in northern Pakistan could indicate that G2c in this area originates outside the region and was brought there in the historic period, perhaps from further west (Pakistan was part of both the Achaemenid Persian Empire, conquered by Alexander the Great, and then formed a part of the Greco-Bactrian Kingdom). These two reported Pakistani G2c haplotypes are quite divergent from the Ashkenazi Jewish clade, and therefore do not at all indicate a recent common origin. The Turkish G2c is somewhat closer, but not identical. It remains to be seen if testing will reveal G2c haplotypes in other populations — this is some indication that G2c occurs at low levels in the Near East. Early reports that Ashkenazi G men were all G1 are now proven incorrect. There are also Jewish genetic clades within G2 and G1 whose members are not closely related to the G2c men. All G2c men tested so far have a rare null value for the DYS425 marker, (a missing "T" allele of the DYS371 palindromic STR), the result of a RecLOH event, a finding not yet seen among most other G haplotypes. Among Jews in Israel drawn from many areas of the world, G2c constituted 3.7% in one study.[10]

Geographical Distribution

Knowing the distribution of haplogroup G in general is not as useful as that of the distribution of its subgroups. The subgroups likely spread to new areas of the world in different time periods and to different locations. All available G samples derive from studies or collections that do not meet criteria for random sampling, and conclusions based on them are only rough approximations of what is seen in populations.

In Europe west of the Black Sea Haplogroup G is found at about 5% of the population on average throughout most of the continent.[11] The concentration of G falls below this average in Scandinavia, the westernmost former Soviet republics and Poland, as well as in Iceland and the British Isles. There are seeming pockets of unusual concentrations within Europe. In Wales, a distinctive G2a3b1 type (DYS388=13 and DYS592=12) dominates there and pushes the G percentage of the population higher than in England. In western Austria, in the Tirol (Tyrol) the G percentage can reach 8% or more. In the northern and highland areas of the island of Sardinia off western Italy), G percentages reach 11% of the population in one study[12] and reached 21% in the town of Tempio in another study. In the Greek island of Crete, approximately 7%[13] to 11%[14] of males belong to haplogroup G. In north-eastern Croatia, in the town of Osijek, G was found in 14% of the males.[15] The city is on the banks of the river Drava, which notably begins in the Tirol/Tyrol region of the Alps, another haplogroup G focus area in Europe.

In Russia, the Ukraine and central Asia, the G percentage is around 1% or less. The northern slope of the Caucasus Mountains represents a major exception where concentrations in the Kabardinian and Ossetian populations are noted. In Digora, North Ossetia the highest known concentration of G in a single town in the world is reached where 74% of the tested men were G.[16] The Madyars of central Kazakhstan, a Kazakh sub-ethnic group, were found to be 87% G1. Haplogroup G is found as far east as northern China in small percentages where G can reach more substantial percentages in minority groups such as the Uyghers.

In Turkey, the southern Caucasus region and Iran, haplogroup G reaches the highest percentage of a regional population worldwide. Among Turkish males 11% of the population is G.[3] In Iran, Haplogroup G reaches 13 to 15% of the population in various parts of the country. While it is found in percentages higher than 10% among the Bakhtiari, Gilaki and Mazandarani, it is closer to 5% among the Iranian Arabs and in some large cities.[17] Among the samples in the YHRD database from the southern Caucasus countries, 29% of the samples from Abazinia, 31% from Georgia, 18% from Azerbaijan and 11% from Armenia appear to be G samples.

In southern Asia, haplogroup G is found in concentrations of approximately 18%[18] to 20%[19] of Kalash, approximately 16% of Brahui,[19] and approximately 11.5% of sampled Pashtun,[18] but in only about 3% of the general Pakistani population.[18] The many groups in India and Bangla Desh have not been well studied. About 6% of the samples from Sri Lanka and Malaysia were reported as haplogroup G, but none were found in the other coastal lands of the Indian Ocean or Pacific Ocean in Asia.[20]

In the Middle East, haplogroup G accounts for about 3% of the population in almost all areas.[21][22] Among the Druze mostly residents of Israel 10% were found to be haplogroup G.[23]

In Africa, haplogroup G is rarely found in sub-Saharan Africa or south of the horn of Africa among native populations. In Egypt, studies have provided information that pegs the G percentage there to be between 6%[24] and 9%.[25] Three percent of North African Berbers were found to be haplogroup G.[26] Two percent of Arab Moroccans and 8% of Berber Moroccans were likewise found to be G.[27]

In the Americas, the percentage of haplogroup G corresponds to the numbers of persons from Old World countries who emigrated. It is not found among Native Americans except where intermarriage with non-native persons has occurred.

Around 10% of Jewish males are Haplogroup G.

Migration Sources in the Historical Period

Haplogroup G seems to have expanded from southwestern Asia or the Middle East to Europe west of the Black Sea and to the western Mediterranean and to central Asia primarily about 1,700 to 3,000 years ago. The time period is based on time calculations derived by comparing the number of STR marker mutations among men in these three areas and the presumed region of origin. This region of origin for the major G groups of today was most likely somewhere in Turkey, the Caucasus Mountains region, Iran or the eastern Mediterranean where G is found both in significant numbers and with the diversity expected with a long presence.

The principal spread of G from the mentioned region was to Europe west of the Black Sea. As to the most common G groups there, their (a) presence in 4% or more of the European population plus (b) seeming ancestral splits (based on mutations in 67-marker STR samples) between persons in southern and northern Europe no more recently than the early Middle Ages suggest the G ancestors came as part of identifiable groups (or a group) rather than as isolated movements of individuals. There are Jewish components of the major G groups which require explanation when advocating any particular ancestral group for these major G groups now in Europe.

The likeliest geographical sources for spread of each of the following principal G subgroups is: G1—Iran or eastern Middle East...... G2a1—the northern or southern Caucasus region..... G2a3a—Turkey or the eastern Mediterrean...... G2a3b1 (Caucasus type) -- the northern or southern Caucasus..... G2a3b1 (DYS388=13 type) -- perhaps southern Caucasus region or developed in Europe..... G2a3b1 (DYS568=9 type) -— unknown..... G2a3b1a1—Iran or southern Caucasus region or developed in Europe...... G2c—northern Middle East

North of the Caucasus Mountains

If G migrations originated in the area north of the Caucasus Mountains, surviving traces of an original G population in today's Ukraine and almost all of Russia are few. The exceptions today are in Russia along the north slope of the Caucasus where G2a3b1 is common among the Kabardinians, G2a1 is common among the Ossetians, and an unknown G type is found in Dagestan Other G types are found in small numbers in these three areas. The variety of G types there could indicate a long G presence in the northern Caucasus if G did not arrive piecemeal from southern locations a long time after the formation of each of these G types.

There were two major groups immediately north of the Caucasus area in the period of interest who engaged in migrations: the Sarmatians and the Huns. Both groups have cultural similarities, such as manipulating infant head shapes. One after another, the two moved into the area north of the Caucasus from the east when a westward movement provided better options than resisting groups to the east. The Sarmatians subsumed an existing population of Scythians during their occupation of this area. Like the Scythians, the Sarmatians spoke an Iranian language. The more central portion of the Sarmatians was identified as Alan Sarmatians. Strabo claimed these groups could assemble hundreds of thousands of warriors.[28] Being nomadic with a history of moving en masse, it is conceivable that the Sarmatians when they began the Great Migration westward in the 1st century B.C.E. took with them a high percentage of their population. The Huns were also nomadic and had been as far east as central Asia. Their language is not well defined. Complicating the discovery of the genetic make-up of these groups, scholars have described both Sarmatians and Huns as confederations of groups rather than solitary ethnic groups.

Greek historian Diodorus Siculus related that the Scythians had carried Medes to the River Don north of the Caucasus presumably from today's northwestern Iran, this event giving rise to their name Sauromatians.[29] Scythians had occupied Media 653-625 B.C.E. Pliny also suggested Sarmatians descended from Medes.[30] But historians have difficulty explaining how the Sarmatians instead seemed to come from central Asia off to the east in their occupation of Scythian lands around the 5th century B.C.E. The Huns in their westward migration had reached today's Ukraine by the 100s C.E. replacing the Sarmatians who left, and they moved dramatically into more western lands in the 400s. Under Hun rule, some Alans who had not migrated remained in the area north of the Caucasus. In the Middle Ages, invading Mongols reportedly forced them down to the Caucasus area. And there are Alan remnants which appeared in multiple locations. A group of 30,000 Alans, for example, formed the royal guard of the Yuan court in Dadu (Beijing, China), and a group of Alans arrived from unknown locations in Medieval Hungary.

While it is conceivable that many of the G persons in Europe proper today had ancestors part of the Great Migration that carried large populations out of southwestern Russia, they left few close relatives in the area they vacated if so. G2a1 which predominates among the North Ossetians is rare in Europe proper. The Caucasus G2a3b1 common among the Kabardinians (DYS390=23, DYS388=12, DYS385=13,14), for example, is seen only rarely in Europe proper where the DYS388=13 type (DYS390=22, DYS385=14,15) predominates. In addition, none of the ancient DNA recovered so far from skeletons in the Scythian/Sarmatian area of occupation has been haplogroup G.

While these mentioned northern Caucasus types of G are uncommon in lands to the north or in Europe proper, they have significant equivalents in the southeastern Caucasus, particularly in Georgia, including South Ossetia which is often included with Georgia.

If these persons in the north Caucasus were originally Medes who became Sauromatians (later Sarmatians and Alans) when carried to the north Caucasus by the Scythians, the G groups in the land of the Kingdom of the Medes in northwestern Iran, and also the rest of Iran, are different from what is seen in the northern Caucasus. G1, for example, is common in Iran, but rare in the north Caucasus.

South of the Caucasus Mountains

In contrast to the regional types of G seen north of the Caucasus Mountains, the three countries of Georgia, Armenia and Azerbaijan provide STR samples consistent with all types of haplogroup G seen today in Europe west of the Black Sea with the exception of G1 and G2c. And the regional STR types north of the Caucasus are seen in smaller numbers south of the Caucasus. Because of the rich diversity of these G types, this area of the world is a leading candidate for the starting point for most G migrations especially those to western and central Europe.

The STR marker samples in the YHRD database[31] from these three countries obtained from the Nasidze study,[16] as well as samples in the Haplogroup G Project,[32] contain examples of (1) G2a1, (2) G2a3a, (3) G2a3b1 (DYS388=13 type), and (4) G2a3b1a1. These four G types represent over 90% of G types found in western and central Europe. The three countries also have the north Caucasus type of G2a3b1* seen especially among the Kabardinians (DYS390=23, DYS388=12, DYS385=13,14). However, the presence of the DYS388=13 type of G2a3b1 and G2a3b1b1 have yet to be confirmed by SNP testing.

Both the Babylonian Empire and Persian Empire once included the southern Caucasus and stretched to the Mediterranean, but there is no seeming report of groups from the Caucasus being relocated to the western limits of these empires. But the Assyrians deported Jews to Armenia, and 10,000 Jews moved there on their own. The commonly accepted view is that the Mountain Jews of Azerbaijan were early medieval immigrants from southwestern Persia and this immigration continued throughout the Middle Ages. More recently a high percentage of this community emigrated to Israel. The seemingly related non-Jewish Tats of Azerbaijan and southern Dagestan are descended from Persians who in the time of the dynasty of Sassanids (III-VII C.E.) built cities and founded military garrisons to strengthen their positions in the north. Kurds from Turkey and Iran settled in Georgia and Armenia beginning the late 1800s, but a Nasidze study did not find haplogroup G among them.[33]

One possible source of a G migration from the Caucasus would be men recruited into military units during the Roman Empire. Armenia which encompassed nearly all the southern Caucasus region was occupied for a while in the 100s C.E. None of the Roman legions bore the name of locations in the southern Caucasus. Other possible sources of southern Caucasus G migrations were the Jewish communities. There are significant Jewish subgroups within G2a3a, G2a1 and certain types of G2a3b1. The Jewish Radhanites had significant presences in western and central Europe in the early Medieval period with trade routes that included the eastern shore of the Caucasus together with some known Jewish communities in the Caucasus region.

There was also a significant outflow of persons -- perhaps more than 100,000 persons -- from Armenia during the Byzantine Empire due to various emporers deporting persons to parts of modern-day Turkey or Bulgaria or conscripting soldiers.[34][35][36][37][38][39] In addition, 10,000 Armenians were said to have settled in Cyprus in 578 C.E.[40]

One problem with identifying the southern Caucasus as the source of most of the G in western and central Europe is the minority status there of the DYS388=13 type of G2a3b1 and the inability so far to obtain SNP confirmation of its presence. This DYS388=13 group is dominant in Europe but its rarity elsewhere actually makes no area the most logical location as a source for its migration.


In Iran G1 is found in significant numbers in all tested areas. The Persian Empire once extended to the Mediterranean, and Persian military forces may have carried G persons into the southern Caucasus as well as to the shores of the Mediterranean. In fact, the highest percentages of G1 outside of Iran are in the countries just to the west of southern Iran that were once part of the Persian Empire. If the G movement occurred instead before the Persian Empire, G1 could have originated either in the Middle East or Iran.

While G2a3a occurs in Iran, it is apparently too rare there to account for any source group for G2a3a persons who migrated in numbers elsewhere. The same situation applies to G2a1 there. Seeming G2a3b1 Iranian samples are available, especially G2a3b1a1, (though not confirmed by SNP testing) and several unclassified types of G2a and G2a3 are noted. Neither G2c nor the Caucasus type of G2a3b1 has surfaced among Iranian samples.

There are significant Jewish components to G1 groups. The Jewish communities in the Holy Land contributed substantially to the Persian (now Iranian) population beginning in 727 and 721 B.C.E. with the deportation of Jews to Media and Persia. This was followed by the deportation of over 27,000 Jews to southwestern Persia. In 680 B.C.E. to escape another bad situation there was a Jewish emigration primarily into the Persian city of Isfahan. In the 500s B.C.E. at the end of the Babylonian Captivity, Cyrus allowed additional Jews into Persia. In 135 C.E., there was yet another wave of emigration into Persia by Jews fleeing the Roman persecutions. At the time of the original emigrations, Jews intermarried with the local population.[41] This practice soon stopped, but Jews were not forced into ghettos until Islamic times. The Jewish Radhanites could also have spread G samples to Europe in their travels in the early Middle Ages. They maintained a significant presence on the trade routes that connected Iran to the rest of the world. A major gap in genetic knowledge of Iran involves the Kurdish areas. Jews were once commonly found there, but moved in large numbers to Israel after its founding.

Turkey and the eastern Mediterranean

This region has unusual percentages of G2a3a. Beginning with the Phoenicians, any of the trading and conquering governments of the Mediterranean could have spread G2a3a to the western Mediterranean where it is more common than in Europe to the north. G2a3a also has a Jewish-Italian subgroup, and any of the Jewish diaspora movements may have contributed to the spread. The percentages of G2a1 and the DYS388=13 type of G2a3b1 are so tiny in this area, that it is unlikely Turkey was the starting point for a migration of these G groups.

G1 is common in the northern Middle East, and that area may have been the source of G movements to Iran or vice versa. G2c is found in small numbers in the northern Middle East, and the Middle East could have been the source from which European G2c and Pakistani G2c1 derived.


Within the Old World, due to the rarity of samples it is currently unclear how G2a and G2c spread into Uzbekistan, Kazakhstan, Pakistan, southern India, Sri Lanka, Malaysia and northern China. Haplogroup G was carried to the New World beginning in the 1500s by individual migrations principally from the British Isles, Germany, Iberia and Italy.

Famous members

Joseph Stalin, from a genetic test on his grandson (his son Vasily's son; Alexander Burdonsky) shows his Y-DNA haplogroup to be G2a1a [1]

DYS 393 390 19 391 385A 385B 426 388 439 389I 392 389II 458 459A 459B 455 454 447 437 448 449 464A 464B 464C 464D
Alleles 14 23 15 9 15 16 11 12 11 11 10 28 17 9 9 11 11 25 16 21 28 13 13 14 14

See also


  1. ^ https://genographic.nationalgeographic.com/genographic/lan/en/atlas.html Atlas of the Human Journey: Haplogroup G (M201)
  2. ^ Semino O, Passarino G, Oefner PJ, et al. (November 2000). "The genetic legacy of Paleolithic Homo sapiens sapiens in extant Europeans: a Y chromosome perspective". Science 290 (5494): 1155–9. doi:10.1126/science.290.5494.1155. PMID 11073453. http://www.sciencemag.org/cgi/pmidlookup?view=long&pmid=11073453. 
  3. ^ a b Cinnioğlu C, King R, Kivisild T, et al. (January 2004). "Excavating Y-chromosome haplotype strata in Anatolia". Hum. Genet. 114 (2): 127–48. doi:10.1007/s00439-003-1031-4. PMID 14586639. 
  4. ^ Vanek, D, "et al." (2009). "Kinship and Y-Chromosome Analysis of 7th Century Human Remains: Novel DNA Extraction and Typing Procedure for Ancient Material". Croatian Med J 50 (3): 286–95. doi:10.3325/cmj.2009.50.286. PMID 19480023.& PMC 2702742. http://www.http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2702742/?tool=pubmed. 
  5. ^ Underhill, P., et al. (2001). "The phylogeography of Y chromosome binary haplotypes and the origins of modern human populations". Annals of Human Genetics 65 (Pt1): 43–62. doi:10.1046/j.1469-1809.2001.6510043.x. PMID 11415522. 
  6. ^ Karafat, T., et al. (2008). "New binary polymorphisms reshape and increase resolution of the human Y chromosomal haplogroup tree". Genome Research 18 (5): 830–38. doi:10.1101/gr.7172008. PMID 18385274.& PMC 2336805. http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2336805/?tool=pubmed. 
  7. ^ |url=http://ymap.ftdna.com/cgi-bin/gbrowse_details/hs_chrY?name=L240;class=Sequence;ref=ChrY;start=3191153;end=3191153;feature_id=40369
  8. ^ Sims, L.,et al. (2009). "Improved Resolution Haplogroup G Phylogeny in the Y Chromosome, Revealed by a Set of Newly Characterized SNPs". PLoSOne 4 (6): 1–5. doi:10.1371/journal.pone.0005792. PMID 19495413.& PMC 2686153. http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2686153/?tool=pubmed. 
  9. ^ |url=http://www.isogg.org/tree/index.html
  10. ^ Hammer, M. et al. (2009). "Extended Y chromosome haplotypes resolve multiple and unique lineages of the Jewish priesthood.". Human Genetics 126 (5): 707–17. doi:10.1007/s00439-009-0727-5. PMID 19669163.& PMC 2771134. http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2771134/?tool=pubmed. 
  11. ^ |url=http://www.members.cox.net/morebanks/MoreG2.html Ray Banks, Haplogroup G, region by region and country by country.
  12. ^ Zei, G. et al. (2003). "From surnames to the history of Y chromosomes: the Sardinian population as a paradign.". Eur J of Human Genetics 11 (10): 802–07. doi:10.1038/sj.ejhg.5201040. PMID 14512971. 
  13. ^ Martinez L. et al. (April 2007). "Paleolithic Y-haplogroup heritage predominates in a Cretan highland plateau". Eur. J. Hum. Genet. 15 (4): 485–93. doi:10.1038/sj.ejhg.5201769. PMID 17264870. 
  14. ^ King R, et al. (March 2008). "Differential Y-chromosome Anatolian influences on the Greek and Cretan Neolithic". Ann. Hum. Genet. 72 (Pt 2): 205–14. doi:10.1111/j.1469-1809.2007.00414.x. PMID 18269686. 
  15. ^ Battaglia V. et al. (2009). "Y-chromosomal evidence of the cultural diffusion of agriculture in southeast Europe". Eur J of Human Genetics 17 (6): 820–30. doi:10.1038/ejhg.2008.249. PMID 19107149. http://www.unipv.eu/on-line/Home/AreaStampa/documento2986.html. 
  16. ^ a b Nasidze, I, et al (2003). "Testing Hypotheses of Language Replacement in the Caucasus: Evidence from the Y Chromosome". Human Genetics 112 (3): 255–61. doi:10.1007/s00439-002-0874-4. PMID 12596050. http://www.familytreedna.com/pdf/Nasidze_2003.pdf. 
  17. ^ Nasidze, I. et al. (2008). "Close Genetic Relationship Between Semitic-speaking and Indo-European-speaking Groups in Iran". Annals of Human Genetics 72 (Pt 2): 241–52. doi:10.1111/j.1469-1809.2007.00413.x. PMID 18205892. 
  18. ^ a b c Firasat, F., et al. (2006). "Y Chromosomal Evidence for a Limited Greek Contribution to the Pathan Population of Pakistan". Eur J of Human Genetics 15 (1): 121–26. doi:10.1038/sj.ejhg.5201726. PMID 17047675.& PMC 2588664. http://www.nature.com/ejhg/journal/v15/n1/full/5201726a.html. 
  19. ^ a b Sengupta S. et al. (February 2006). "Polarity and temporality of high-resolution y-chromosome distributions in India identify both indigenous and exogenous expansions and reveal minor genetic influence of Central Asian pastoralists". Am. J. Hum. Genet. 78 (2): 202–21. doi:10.1086/499411. PMID 16400607.& PMC 1380230. http://linkinghub.elsevier.com/retrieve/pii/S0002-9297(07)62353-2. 
  20. ^ Hammer, M. et al. (2006). "Dual Origins of the Japanese: Common Ground for Hunter-Gatherer and Farmer Y-Chromosomes". Journal of Human Genetics 51 (1): 47–58. doi:10.1007/s10038-005-0322-0. PMID 16328082. 
  21. ^ Flores, C. et al. (2005). [http://wysinger.homestead.com/jordan.pdf "Isolates in a corridor of migrations: a high-resolution analysis of Y-chromosome variation in Jordan"]. Journal of Human Genetics 50 (9): 435–41. doi:10.1007/s10038-005-0274-4. PMID 16142507. http://wysinger.homestead.com/jordan.pdf. 
  22. ^ Cadenas, A. et al. (2008). "Chromosome Diversity Characterizes the Gulf of Oman". Eur Journal of Genetics 16 (3): 374–86. doi:10.1038/sj.ejhg.5201934. PMID 17928816. 
  23. ^ Shlush, L. et al. (2008). "The Druze: A Population Genetic Refugium of the Near East". PLos one 3 (5): e2105. doi:10.1371/journal.pone.0002105. PMID 18461126.& PMC 2324201. http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2324201/?tool=pubmed. 
  24. ^ El-Sibai,M. et al.; Platt, DE; Haber, M; Xue, Y; Youhanna, SC; Wells, RS; Izaabel, H; Sanyoura, MF et al. (2009). "Geographical Structure of the Y-Chromosomal Genetic Landscape of the Levant: A Coastal-Inland Contrast". Annals of Human Genetics 73 (Pt6): 561–81. doi:10.1111/j.1469-1809.2009.00538.x. PMID 19686289. 
  25. ^ Luis, J. et al. (2004). "The Levant versus the Horn of Africa: Evidence for Bidirectional Corridors of Human Migrations". Amer J of Human Genetics 74 (3): 532–44. doi:10.1086/382286. PMID 14973781.& PMC 1182266. http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1182266/pdf/AJHGv74p532.pdf. 
  26. ^ Alsonso, S. et al. (2005). [http://www.nature.com/ejhg/journal/v13/n12/pdf/5201482a.pdf "The place of the Basques in the European Y-chromosome diversity landscape"]. Eur J of Human Genetics 13 (12): 1293–1302. doi:10.1038/sj.ejhg.5201482. PMID 16094307. http://www.nature.com/ejhg/journal/v13/n12/pdf/5201482a.pdf. 
  27. ^ Cruciani, F. et al. (2002). "A Back Migration from Asia to Sub-Saharan Africa Is Supported by High-Resolution Analysis of Human Y-Chromosome Haplotypes". Eur J of Human Genetics 70 (5): 1197–1214. doi:10.1086/340257. PMID 11910562.& PMC 447595. http://www.ncbi.nlm.nih.gov/pmc/articles/PMC447595/?tool=pubmed. 
  28. ^ Strabo, Geography (XXIII, 11)
  29. ^ Diodorus Siculus, Bibliotheca Historica, Lib. II, 43, p 29. |url=http://penelope.uchicago.edu/Thayer/E/Roman/Texts/Diodorus_Siculus/2B*.html
  30. ^ Pliny, Natural History, VI, ch 7, W. H. Jones, transl., Wm. Heinemann, London, 1949-54)
  31. ^ |url=http://www.yhrd.org/Search
  32. ^ |url=http://www.members.cox.net/morebanks/Diagram.html
  33. ^ Nasidze, I. et al. (2005). "MtDNA and Y-chromosome Variation in Kurdish Groups.". Annals of Human Genetics 69 (Pt 4): 401–12. doi:10.1046/j.1529-8817.2005.00174.x. PMID 15996169. 
  34. ^ Ghevond, Histoire des guerres et des Conquetes des Arabes en Armenie. (trl. from Armenian, Paris, 1856)
  35. ^ Strabo(12,3,6)
  36. ^ C. de Boor, ed., Nicocephoro, Opuscula Historica (Leipzig, 1890), p 518-23.
  37. ^ Sebo, Histoire d'Heraclius. trl. from Armenian. (F. Macler, Paris, 1904), p 30-31.
  38. ^ Theopanes, I: 451-2
  39. ^ Procopius, De bello persico, II, 21, 2.
  40. ^ C. De Boor, trl., Theophylactus Simocatta, Historiae. (Leipzig, 1887, p 143)
  41. ^ Price, Massoume, A Brief History of Iranian Jews. |url=http://www.irol.com/History/jews-history/index.html

External Links

Human Y-chromosome DNA (Y-DNA) haplogroups (by ethnic groups · famous haplotypes)

most recent common Y-ancestor
| |
| |
| |


Up to date as of February 01, 2010

From Familypedia

Haplogroup G (Y-DNA) is most frequent in the Caucasus (found at over 60% in ethnic North Ossetian males).

In human genetics, Haplogroup G (M201) is a Y-chromosome haplogroup. It is a branch of Haplogroup F (M89), and is theorized to have originated, according to the latest thinking, in the Near East or Southern Asia, likely in the region that is now northern India, Pakistan, and Afghanistan. The haplogroup began to spread with the Neolithic Agricultural Revolution, perhaps with the appearance of the early horse nomads of the Eurasian steppe.



Haplogroup G has an overall low frequency in most populations but is widely distributed within the Old World in Europe, Western Asia, northeastern Africa, Central Asia, South Asia, and Southeast Asia (including parts of China and the Malay Archipelago). It is most frequent in the Caucasus (found at over 60% in ethnic North Ossetian males and ~30% in Georgian males). In Europe, haplogroup G is found at ~5% in central and southern sections of the continent. It has relatively high concentrations in northern Sardinia (over 25%) and the Tyrol region of Austria (about 15%). In the British Isles, Scandinavia, and the Baltic countries it is uncommon; Britain and Norway for example at 1–2%.

In Southern Asia, haplogroup G is found at a rate of 10% to 20% among Iranians, Pashtuns (ethnic Afghans), and Kalash, and at a lesser percentage among some other populations in Pakistan, India, and Sri Lanka, including the Tamils. In Central Asia, G is found in small percentages in a belt extending from the Caucasus through the Central Asian steppes out to the Uyghurs of Xinjiang Province in western China.

The initial distribution of haplogroup G in Europe may reflect a migration of agriculture-bringing Anatolian people into the Mediterranean Basin. The haplogroup may also have been brought by invading Sarmatians, Alans and Jasz (all descendant groups of the 'Iranian' Scythians), which is a good fit with the historically attested spread of these peoples across the Central Asian steppe, from Xinjiang in the east to Iberia and Tunisia in the west, with a branch (the Sakas) entering the northwest of the Indian subcontinent at the start of the first millennium. Around 10% of Ashkenazi Jewish males have haplogroup G, and the Jewish diaspora to Europe from the Middle East and the Arab Moor occupation of Spain are two other probable routes into Europe for certain types of G.

Three commonly occurring subgroups of Haplogroup G have been identified so far: G1 (M285), G2a (P15) and G2c (M377). G3 (P287) exists but is rarely found in the G population. The highest reported concentration of G1 is in Iran, with next most frequent concentrations in neighboring countries. G2 represents the majority of haplogroup G Y-chromosomes in all countries, and a recently discovered subcategory (likely to be called G2a3) accounts for a high percentage of G in all sampled countries.

A clade of closely related Ashkenazi Jews represent virtually all G2c persons, with just three other G2c haplotypes having been reported so far: one Turk from Kars in northeast Turkey near Armenia, one Pashtun, and one Burusho in Pakistan. The extreme rarity of G2c in northeast Pakistan could indicate that G2c in this area originates outside the region and was brought there in the historic period, perhaps from further west (this area was part of both the Achaemenid Persian Empire, conquered by Alexander the Great, and then formed a part of the Greco-Bactrian Kingdom). These two reported Pakistani G2c haplotypes are quite divergent from the Ashkenazi Jewish clade, and therefore do not at all indicate a recent common origin. The Turkish G2c is somewhat closer, but not identical. It remains to be seen if testing will reveal G2c haplotypes in other populations — this is some indication that G2c occurs at low levels in the Near East. Early reports that Ashkenazi G men were all G1 are now proven incorrect. There are also Jewish genetic clades within G2 and G1 whose members are not closely related to the G2c men. All G2c men tested so far have a rare null value for the DYS425 marker, (a missing "T" allele of the DYS371 palindromic STR), the result of a RecLOH event, a finding not yet seen among most other G haplotypes. Among Jews worldwide, haplogroup G comprises between 10–20% of the population. Though forming some recognizable clades, Jews today comprise a small percentage of the total number of G men worldwide.


This is a synthesis of the data about the internal phylogeny of haplogroup G from the latest upcoming studies:

(Temporarily embargoed till publication)

G2 Haplogroup distributions in Turkey

A Stanford study surveyed distributions of Haplogroup G in Turkey. Page two of the study shows a map of the regions.

Tabular results for subgroup G2:

See also


  • Haplogroup G SNP project
  • Some Information and Theories on Haplogroup G
  • Cinnioglu, Cengiz, et al., "Excavating Y-Chromosome Haplotype Strata in Anatolia," Human Genetics,2004, vol. 114, pp. 127–48.
  • DiGiacomo, F. et al. "Clinal Patterns of Human Y Chromosomal Diversity in Continental Italy and Greece 2003, vol 23, pp. 387–95. [Lists in table 1 G2 percentages in small samples in various towns]
  • Firasat, Sadaf et al., "Y-chromosomal evidence for a limited Greek contribution to the Pathan population of Pakistan," European Journal of Human Genetics (2007) Vol. 15, p. 121–126. http://www.nature.com/ejhg/journal/v15/n1/full/5201726a.html
  • Nasidze, Ivan et al., "Testing Hypotheses of Language Replacement in the Caucasus: Evidence from the Y Chromosome," Human Genetics, 2003, vol. 112, pp. 255–61.
  • Nasidze, Ivan et al., "Concomitant Replacement of Language and mtDNA in South Caspian Populations of Iran," Current Biology, 2006, vol. 16, pp. 668–73.
  • Nasidze, Ivan et al., "Genetic Evidence concerning the Origins of South and North Ossetians," Annals of Human Genetics, 2004, vol. 68, pp. 588–99.
  • Nasidze, Ivan et al., "Mitochondrial DNA and Y-Chromsome Variation in the Caucasus," Annals of Human Genetics, 2004, vol. 68, pp 204–21.
  • Nasidze, Ivan et al., "MtDNA and Y-Chromosome Variation in Kurdish Groups," Annals of Human Genetics, 2005, vol. 69, pp. 401–12.
  • Qamar, Raheel, "Y Chromosomal DNA Variation in Pakistan," American Journal of Human Genetics, 2002, vol. 70(5), pp. 1107–24.
  • Regueiro, M., et al., "Iran: Tricontinental Nexus for Y-Chromosome Driven Migration," Human Heredity,2006, vol. 61, pp. 132–43.
  • Sahoo, Sanghamitra, "A Prehistory of Indian Y Chromosomes: Evaluating Demic Diffusion Scenarios," Proceedings of the National Academy of Sciences, U.S.A., 2006, vol. 103(4), pp. 843–48.
  • Sengupta, Sanghamitra, "Polarity and Temporality of High-Resolution Y-Chromosome Distributions in India Identify Both Indigenous and Exogenous Expansions and Reveal Minor Genetic Influence of Central Asian Pastoralists," American Journal of Human Genetics, 2006, vol. 78(2), pp. 202–21.
  • Zei, Gianna, et al., "From Surnames to the History of Y Chromosomes: the Sardinian Population as a Paradigm," European Journal of Human Genetics, 2003, vol. 11, pp. 802-07.

External links

Human Y-chromosome DNA (Y-DNA) haplogroups

Y-most recent common ancestor

This page uses content from the English language Wikipedia. The original content was at Haplogroup G (Y-DNA). The list of authors can be seen in the page history. As with this Familypedia wiki, the content of Wikipedia is available under the Creative Commons License.
Facts about Haplogroup G (Y-DNA)RDF feed

This article uses material from the "Haplogroup G (Y-DNA)" article on the Genealogy wiki at Wikia and is licensed under the Creative Commons Attribution-Share Alike License.


Got something to say? Make a comment.
Your name
Your email address