Mitochondrial Genome Variation in Eastern Asia and the Peopling of Japan

doi:10.1101/gr.2286304

QUICK SEARCH:

[advanced]

	Author:		Keyword(s):
Year:		Vol:		Page:

Genome Res. 14:1832-1850, 2004
©2004 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/04 $5.00

This Article

Abstract

Full Text (PDF)

Supplemental Research Data

Alert me when this article is cited

Alert me if a correction is posted

Citation Map

Services

Email this article to a friend

Similar articles in this journal

Similar articles in PubMed

Alert me to new issues of the journal

Download to citation manager

Cited by other online articles

Google Scholar

Articles by Tanaka, M.

Articles by Shimodaira, H.

Articles citing this Article

Search for Related Content

PubMed

PubMed Citation

Articles by Tanaka, M.

Articles by Shimodaira, H.

Pubmed/NCBI databases

Gene	GEO Profiles
Nucleotide	Protein

Letter

Mitochondrial Genome Variation in Eastern Asia and the Peopling of Japan

Masashi Tanaka¹^,15, Vicente M. Cabrera², Ana M. González², José M. Larruga², Takeshi Takeyasu¹^,3, Noriyuki Fuku¹^,4, Li-Jun Guo¹^,3, Raita Hirose¹, Yasunori Fujita¹, Miyuki Kurata¹, Ken-ichi Shinoda⁵, Kazuo Umetsu⁶, Yoshiji Yamada⁷^,1, Yoshiharu Oshida³, Yuzo Sato³, Nobutaka Hattori⁸, Yoshikuni Mizuno⁸, Yasumichi Arai¹⁰, Nobuyoshi Hirose¹⁰, Shigeo Ohta¹¹, Osamu Ogawa⁹, Yasushi Tanaka⁹, Ryuzo Kawamori⁹, Masayo Shamoto-Nagai¹^,4^,12, Wakako Maruyama¹², Hiroshi Shimokata¹³, Ryota Suzuki¹⁴ and Hidetoshi Shimodaira¹⁴

¹ Department of Gene Therapy, Gifu International Institute of Biotechnology, Kakamigahara, Gifu 504-0838, Japan , ² Department of Genetics, Faculty of Biology, University of La Laguna, Tenerife 38271, Spain , ³ Department of Sports Medicine, Graduate School of Medicine, Nagoya University, Nagoya 464-8601, Japan , ⁴ Japan Science and Technology Agency, Kawaguchi, Saitama 332-0012, Japan , ⁵ Department of Anthropology, National Science Museum, Tokyo 169-0073, Japan , ⁶ Department of Forensic Medicine, Yamagata University School of Medicine, Yamagata 990-9585, Japan , ⁷ Department of Human Functional Genomics, Life Science Research Center, Mie University, Tu-shi, Mie 514-8507, Japan , ⁸ Department of Neurology, Metabolism and Endocrinology, Juntendo University School of Medicine, Tokyo 113-8421, Japan , ⁹ Department of Medicine, Metabolism and Endocrinology, Juntendo University School of Medicine, Tokyo 113-8421, Japan , ¹⁰ Department of Geriatric Medicine, Keio University School of Medicine, Tokyo 160-8582, Japan , ¹¹ Department of Biochemistry and Cell Biology, Institute of Gerontology, Nihon Medical School, Kawasaki 211-8533, Japan , ¹² Laboratory of Biochemistry and Metabolism, Department of Basic Gerontology, National Institute for Longevity Sciences, Obu 474-8522, Japan , ¹³ Department of Epidemiology, National Institute for Longevity Sciences, Obu 474-8522, Japan , ¹⁴ Department of Mathematical and Computing Sciences, Tokyo Institute of Technology, Tokyo 152-8552, Japan

ABSTRACT

Top
ABSTRACT
RESULTS
DISCUSSION
METHODS
REFERENCES
WEB SITE REFERENCES

To construct an East Asia mitochondrial DNA (mtDNA) phylogeny,we sequenced the complete mitochondrial genomes of 672 Japaneseindividuals (http://www.giib.or.jp/mtsnp/index_e.html). Thisallowed us to perform a phylogenetic analysis with a pool of942 Asiatic sequences. New clades and subclades emerged fromthe Japanese data. On the basis of this unequivocal phylogeny,we classified 4713 Asian partial mitochondrial sequences, with<10% ambiguity. Applying population and phylogeographic methods,we used these sequences to shed light on the controversial issueof the peopling of Japan. Population-based comparisons confirmedthat present-day Japanese have their closest genetic affinityto northern Asian populations, especially to Koreans, whichfinding is congruent with the proposed Continental gene flowto Japan after the Yayoi period. This phylogeographic approachunraveled a high degree of differentiation in Paleolithic Japanese.Ancient southern and northern migrations were detected basedon the existence of basic M and N lineages in Ryukyuans andAinu. Direct connections with Tibet, parallel to those foundfor the Y-chromosome, were also apparent. Furthermore, the highestdiversity found in Japan for some derived clades suggests thatJapan could be included in an area of migratory expansion toContinental Asia. All the theories that have been proposed upto now to explain the peopling of Japan seem insufficient toaccommodate fully this complex picture.

Recent analysis of global mitochondrial DNA diversity in humansbased on complete mtDNA sequences has provided compelling evidenceof a human mtDNA origin in Africa (Ingman et al. 2000

). Lessthan 100,000 years ago, at least two mtDNA human lineages beganto rapidly spread from Africa to the Old World (Maca-Meyer etal. 2001

). The archaeological records attest that humans reachedJapan, at the eastern edge of Asia, around 30,000 years ago(Glover 1980

). At that time, Japan was connected to the Continentby both northern and southern land bridges, enabling two migratoryroutes. As early as 13,000 years ago, pottery appeared in Japanand Siberia for the first time in the world (Shiraishi 2002

).Subsequent technical improvements gave rise to the JapaneseNeolithic period known as the Jomon period, in which the populationgrowth was considerable. Later, Continental people arrived inJapan from the Korean peninsula, initiating the Yayoi period,with this migration reaching its maximum at the beginning ofthe first millennium.

With this archaeological framework in mind, it was of anthropologicalinterest to us to know whether the modern Japanese are the resultof an admixture between the Paleolithic-Neolithic aboriginesand more recent immigrant populations, whether the indigenouspopulation gradually evolved to give rise to the modern Japanese,with subsequent colonizations having strong cultural influencesbut only minor demographic impact, or even whether the lateNeolithic waves entirely replaced the indigenous residents.Morphometric data obtained from the remains of Japanese Paleolithicpeople are more in accordance with a southern origin for thesefirst immigrants. Subsequent morphological studies on modernindigenous (northern Ainu and southern Ryukyuans) and mainlandJapanese favored an admixture model in which the former wouldbe descendants of the Paleolithic Japanese and the latter derivedfrom the Continental immigrants who gave rise to the Yayoi period(Hanihara 1991). Genetic analysis using classical markers assigneda definitive northern origin to the Upper Paleolithic inhabitantsof Japan; but whereas some authors favored a homogeneous backgroundfor all modern Japanese (Nei 1995), others claimed that althoughUpper Paleolithic and Yayoi period immigrants had probably anorthern Asian origin, they were genetically differentiated(Omoto and Saitou 1997). The application of molecular markersto define maternal and paternal lineages to the peopling ofJapan confirmed the dual admixture model but added some interestingnovelties. For example, the study of Y-chromosome markers ledto the discovery of remarkable Korean and Tibetan influenceson the Japanese population (Hammer and Horai 1995); and mtDNAHVS-I sequences also confirmed the Korean input (Horai et al.1996) and closer affinities of the Japanese to Tibetans thanto southern Asians (Qian et al. 2001). In quantitative estimationsof maternal admixture, it was found that 65% of the mainlandJapanese gene pool was derived from Continental gene flow afterthe Yayoi period. However, the indigenous Ainu from the northernisland of Hokkaido and the Ryukyuans from southern Okinawa showed<20% Continental specificity, pointing to them as the mostprobable descendants of the Jomon people. The fact that theseindigenous groups were, in turn, genetically well differentiatedindicated a notable degree of heterogeneity and/or isolationamong the early Japanese immigrants (Horai et al. 1996). However,two handicaps of these studies are the incomplete representationof Asian populations and the relatively small sample size ofthose analyzed, which weakens the reliance on the relative affinitiesfound by genetic distance methods (Helgason et al. 2001). FormtDNA there are currently enough HVI/HVII data from easternAsia, including Japan, to test the validity of the above-mentionedresults. However, these sequences have been assorted into differentclades following different insufficient criteria or even havenot been classified at all. Furthermore, the phylogenetic confidenceof results based only on sequences from the noncoding region(HVI, HVII) has been recently questioned (Bandelt et al. 2000).This is mainly due to the frequent occurrence of parallel mutationsin independent lineages that confuse the correct classification,a source of error that is increased because the basal motifin the noncoding region for the two macrolineages that expandedthroughout Asia is the same (16223). In addition, as the noncodingregion has not evolved at a constant rate across all human lineages,it is considered inappropriate to use this region for datingevolutionary events (Ingman et al. 2000; Finnilä et al.2001).

To make reliable use of this important source of available dataon the mtDNA noncoding region to contrast the maternal structureand to determine the most probable origin of the modern Japanese,we have undertaken the following approach: First, we used aset of complete mtDNA sequences of 672 Japanese individualsto create a phylogenetic network (Bandelt et al. 1999) thatrelated them to other complete sequences, already published,belonging to the major haplogroups proposed by others (Torroniet al. 1992, 1996; Macaulay et al. 1999; Yao et al. 2002a).Discriminative positions in the noncoding region, defining additionalAsian subhaplogroups, were then used to further classify 766previously published Japanese partial sequences. For this purposewe also included other unambiguously assorted sequence datareported by other research groups (Derbeneva et al. 2002b; Yaoet al. 2002a). These HVI sequences thus pooled were then comparedwith other published Asian sequences. Finally, using all ofthese classified sequences, we tested the relative affinitiesof modern Japanese and Continental Asians using global distancemethods and phylogeographic approaches framed at different agelevels.

RESULTS

Top
ABSTRACT
RESULTS
DISCUSSION
METHODS
REFERENCES
WEB SITE REFERENCES

Eastern Asia Phylogeny Based on Complete mtDNA Sequences
The phylogenetic network constructed with the complete mtDNAsequences fully coincides with those previously published atworldwide (Maca-Meyer et al. 2001; Herrnstadt et al. 2002) orregional scale (Kong et al. 2003). Moreover, their main branchesare well supported by high bootstrap values on a neighbor-joiningtree (Supplemental material, condensed by more than 40% bootstrapvalues).

From the L3 African trunk, two early branches came out of Africaand radiated extensively, originating superhaplogroups M andN, which were defined by the basic mutations depicted in Figures1A and 2, respectively. Representatives of both superhaplogroupsreached Japan. The construction of these phylogenetic treesby using our Japanese complete sequences and other publishedAsian sequences (Table 1) resulted in a better definition ofthe known haplogroups and in the identification of new cladesat different phylogenetic levels. Characteristic HVI motifsand diagnostic RFLPs in the coding region, and coalescence agesfor these haplogroups and subhaplogroups are given in SupplementalTables A and B. To contribute to the unification of the mitochondrialnomenclature, we revised the previously proposed haplogroupsby adding the following new information.

View larger version (79K):
[in this window]
[in a new window]

Figure 1

Phylogenetic tree, based on complete mtDNA sequences, for macrohaplogroup M in general (A) and for subhaplogroup D (B) in particular. Subject origins are given in Table 1. The numbers along the links refer to nucleotide positions, arbitrarily written in ascending order. Open boxes are nodes from which other (not shown) sequences branch. A, C, G, and T indicate transversions; whereas "d" indicates deletions and "i" insertions. Nonrecurrent mutations are underlined.

View larger version (49K):
[in this window]
[in a new window]

Figure 2

Phylogenetic tree, based on complete mtDNA sequences, for macrohaplogroup N. Origins of subjects are explained in Table 1. The numbers along the links refer to nucleotide positions, arbitrarily written in ascending order. Open boxes are nodes from which other (not shown) sequences branch. A, C, G, and T indicate transversions; whereas "d" indicates deletions and "i" insertions. Nonrecurrent mutations are underlined.

View this table:
[in this window]
[in a new window]

Table 1.

List of Individuals Used to Build Up the Networks Shown in Figures 1 and 2

Subdivisions Within Macrohaplogroup M
Haplogroup D
Haplogroup D has been defined by the specific RFLP -5176 AluI(Torroni et al. 1992

). Studies on Native American HVI sequencespermitted further subdivision of D into subgroups D1 by mutation16325 and D2 by mutation 16271 (Forster et al. 1996

). Additionalsubdivisions into subhaplogroups D4 and D5 have been proposedfor Asian lineages (Yao et al. 2002a

). These investigators characterizedD4 by position 3010. Two additional mutations, 8414 and 14668,have been proposed to define D4 (Fig. 1B; Kivisild et al. 2002

).Whereas these two latter mutations seem to be rare events, 3010has also been independently detected in haplogroups H and J.A new branch at the same phylogenetic level as D4 and D5 hasbeen detected in Japan (Fig. 1B). It is characterized by mutations709, 1719, 3714, and 12654 and was named D6. The subdivisionof D4 into subgroups D4a and D4b was proposed on the basis ofthe distinctive mutational motif 152, 3206, 14979, and 16129for the first and 10181 and 16319 for the second (Kivisild etal. 2002

). Both subclades have been detected in our Japanesesample. From our data it can be deduced that mutation 8473 isalso basal for D4a. In relation to D4b it seems that its ancestralbranch is defined by the 8020 substitution (Fig. 1B). Consequently,the D4b subgroup proposed by Yao et al. (2002a

) should be renamedD4b1 harboring 15440 and 15951 as additional basic mutations.A new subgroup characterized by 1382C, 8964, and 9824A mutationsand named D4b2, is represented by lineages GC20 and KA83 inFigure 1B. Furthermore, 12 new branches at the same phylogeneticlevel as subhaplogroups D4a and D4b can be identified in thenetwork. Accordingly, they have been successively named fromD4c to D4n. On the other hand, D5 was defined by mutations 150,10397, and 16189 (Yao et al. 2002a

); however, 16189 is not presentin all D5 lineages. We have named D5a and D5b those lineagesthat share this mutation and 9180 and D5c those lacking them.Consequently, we propose to rename D5a of Yao et al. (2002a

)as D5a1. Additional mutations (1107 and 5301) define D5 (Fig. 1B),as has been recently confirmed (Kong et al. 2003

). Of thefour mutations at the basal branch of this group, 10397 seemsto be a unique event; and the group can be diagnosed by theRFLP polymorphism +10396 BsrI. Recently, the phylogeny of haplogroupD has been revised in the light of complete sequences from Aleuts(Derbeneva et al. 2002b

). By comparing their nomenclature toours, it is possible to equate their D2 lineage to our D4e1and their D3 lineage to our D4b1. As a total, D is the mostabundant haplogroup in people of central and eastern Asia includingmainland Japanese but not in the Ainu and Ryukyuans. However,the geographic distributions of some subhaplogroups are peculiar.For example, D5 is prevalent in southern areas. D4a is abundantin Chukchi of northeast Siberia, but D4a1 has its highest frequencyin the Ryukyuans and clade D4n in the Ainu (Table 2).

View this table:
[in this window]
[in a new window]

Table 2.

Frequency (in Percentage) of Each Haplogroup in Each Group of Populations

Haplogroup M9
It is confirmed that haplogroup M9 is characterized by mutation4491 (Fig. 1A), as recently proposed (Kong et al. 2003

). SubhaplogroupM9a, as redefined by Kong et al. (2003

), was identified by positions153, 3394, 14308, 16234, and 16316 (Yao et al. 2002a

). Nevertheless,not all lineages have 153. Although M9 could be RFLP-diagnosedby +1038 NlaIII and +3391 HaeIII polymorphisms, the latter oneshould be avoided; as 3391 is also present in some D4d1 lineages(Fig. 1B) and thus could produce misclassification. We havegrouped lineages with 11963 as M9a1 and those with 153 as M9a2.M9 has a central and eastern Asian geographic distribution,and it reaches its greatest frequency (11%) and diversity (87%)in Tibet. In Japan, in addition to mainland Japanese it hasbeen detected in the indigenous Ainu and Ryukyuans (Horai etal. 1996

Haplogroup G
This haplogroup was first detected by Ballinger et al. (1992)and later named G by Torroni et al. (1994). It was defined bythe presence of the combined RFLP polymorphism +4830 HaeII/+4831HhaI. In addition, the basal branch has mutations 709, 5108,and 14569 (Fig. 1; Kivisild et al. 2002). Subhaplogroup G1 wasdefined by transition 16017 (Schurr et al. 1999) and G2 by mutations7600 and 16278 (Yao et al. 2002a). Recently, mutations 8200,15323, and 15497 have been used for G1 status (Kong et al. 2003).This is confirmed with our Japanese sequences; consequently,we have defined G1a by 7867 (Fig. 1A). To avoid repetitions,the G1 group of Schurr et al. (1999) has been provisionallyrenamed as G5 (Table 2). At least two mutations (5601 and 13563)characterize G2; and five more, G2a (Fig. 1A; Kong et al. 2003).We have defined subclade G2a1 by the presence of 16189 and thederivative G2a1a by the addition of 16227, whereas 16051 and16150 identify G2a2 lineages. Furthermore, two new subclades,G3 and G4, are also apparent in Japanese (Fig. 1A). SubgroupG5 is dominant in northeastern Siberia, but we have not detectedit in our set of Japanese complete sequences. However, G1a1has its highest frequencies in a cluster embracing Japanese,Ainu, Ryukyuan, and Koreans. On the contrary, G2 is relativelyabundant in northern China and central Asia, reaching notablefrequencies in the Mansi and in Tuvinians at the respectivewest and east ends of South Siberia (Table 2).

Haplogroup E
Haplogroup E was first RFLP-defined as having +16389 HinfI and-7598 HhaI by Ballinger et al. (1992), who named it G, and thenlater it was renamed E by Torroni et al. (1994). As a loss ofrestriction sites can be produced by different nucleotide mutationswithin the recognition sequence, since the beginning, some G2sequences characterized by the 7600 transition were erroneouslyclassified as belonging to haplogroup E. Recently, based onthe complete sequences of coding regions, Herrnstadt et al.(2002) defined three Asiatic lineages as E, although only one(sequence 214) seems to be a genuine representative. It possessestransition 7598, which, similar to 7600, is also detectablewith HhaI as a site loss; and it also harbors mutations 10834and 869, which were found by Ballinger et al. (1992) as -10830HinfI and +868 DdeI in all and some individuals respectivelyclassified as E. However, the inclusion of a Philippine completesequence (Ingman and Gyllensten 2003) in our global tree clearlydemonstrates that the last two mutations might only define abranch of E, as the Philippine sequence lacks both of them.On the contrary, in addition to 7598 and 16390, some of thefour E mutations represented in Figure 1A before the branchingpoint might be basic mutations. In Herrnstadt et al. (2002),sequence 169 belongs to Haplogroup M9 because it has all coding-regionpositions defining this haplogroup; and sequence 287 to M1 becauseit has 6446 and 6680, the coding-region mutations that definethe basic branch of M1 (Fig. 1). It must be mentioned that theambiguous Korean lineage classified as E/G by Schurr et al.(1999), because it had both the -7598 HhaI characteristic Esite and the +4830 HhaI characteristic G site, has been recentlyfound again in a Korean sample (Snäll et al. 2002). Allof them are, in fact, members of subhaplogroup G2. It seemsthat haplogroup E has a southern Asia distribution. Until nowit has been detected in the Malay peninsula populations andin the Sabah of Borneo (Ballinger et al. 1992); and it is alsopresent in coastal Papua New Guinea (Stoneking et al. 1990)as well as in some Pacific islands such as Guam (Herrnstadtet al. 2002) and the Philippines (Ingman and Gyllensten 2003).However, until now, it has not been detected in more northernContinental populations or islands such as the Japanese archipelago.

Haplogroup M8
A monophyletic clade (Fig. 1A) groups M8a, C, and Z lineages.Mutations 4715, 15487T, and 16298 have been proposed as diagnosticfor this clade (Yao et al. 2002a). The transversion 7196A andthe transition 8584 should also be included in its definition(Fig. 1A; Kivisild et al. 2002). However, as the 248d is alsoshared by all Z and C lineages (Fig. 1A), a basal node definedby this deletion and named CZ has been recently proposed (Konget al. 2003). Subhaplogroup C was RFLP-defined by Torroni etal. (1992) by +13262 AluI. Yao et al. (2002a) added 248d, 14318,and 16327 as characteristic of C. In addition, positions 3552A,9545, and 11914 are also diagnostic of this clade (Fig. 1A;Kivisild et al. 2002). The Japanese TC52 has the C1 status andthe Buryat 6970 and the Evenky 6979 have the C4 status proposedby Kong et al. (2003). Subhaplogroup Z was defined by Schurret al. (1999) by the presence of the following noncoding motifs:16185, 16223, 16224, 16260, and 16298. Recently, it was consideredthat only 16185 and 16260 mutations should be counted as basicfor the group (Yao et al. 2002a). However, in full agreementwith the characterization proposed on the basis of completeChinese Z sequences (Kong et al. 2003), three additional mutations(6752, 9090, and 15784) have been placed on the basal branchof Z (Fig. 1A). We detected four Japanese Z clades that, inaddition, shared mutation 152 and another without it. Tentatively,they have been named from Z1 to Z5 (Fig. 1A). Yao et al. (2002a)defined M8a by 14470, 16184, and 16319 transitions. Two moremutations (6179 and 8684) are also characteristic of this subhaplogroup(Kong et al. 2003). In Japanese we have found that 16184 isnot harbored by all M8a members. Consequently, lineages withthis mutation have M8a2 status and those lacking it M8a1 status(Fig. 1A). The largest diversities for C are in Korea (100%),central Asia (86%), and northern China (78%-74%). Therefore,C can be considered a clade with a Northeast Asian radiation.Representatives of subhaplogroup Z extend from the Saami (Finniläet al. 2001) and Russians (Malyarchuk and Derenko 2001) of westEurasia to the people of the eastern peninsula of Kamchatka(Schurr et al. 1999). Its largest diversities are found in Koreans(88%), northern China (73%), and central Asia (67%), compatiblewith a central-East Asian origin of radiation for this group.Finally, M8a has its highest diversity in Koreans (100%), andsouthern (100%) and eastern Chinese, including Taiwanese (73%).Thus, southeastern China was a potential focus of radiationof this group. All these subhaplogroups are present in mainlandJapanese but neither in Ryukyuans nor in Ainu.

Haplogroup M7
This haplogroup was defined by Bamshad et al. (2001) as havingtwo branches, M7a characterized by 16209 and M7b by 16297 transitions.Yao et al. (2002a) assigned mutations 199 and 9824 as basicfor M7. However, our phylogenetic tree points to 6455 and 9824as the basal mutations for this group, whereas 199 is only commonto the M7b and M7c subgroups (Fig. 1A), which coincides withthe phylogeny proposed by Kivisild et al. (2002). M7 can beRFLP-diagnosed by the lack of the 6451 MboII restriction site.The M7a subgroup can be defined by several codingregion positions(Fig. 1A; Kivisild et al. 2002). The M7b classification remainsas proposed in Kivisild et al. (2002); but M7c has, in additionto 146 and 16295, three more coding-region substitutions (4850,5442, and 12091) in its basal branch (Fig. 1A). At this point,it is worthwhile pointing out that the ambiguously assignedsequence 536 in Herrnstadt et al. (2002) belongs to M7c, asit has the five identifying coding-region mutations distinctiveof this subhaplogroup. As for the geographic distribution, M7a1has its highest frequencies (14%) and diversities (86%) in theRyukyuans, and it is also very common in the whole of China,with a mean diversity of 76%. But, curiously, it has not beendetected in Koreans or in Ainu, and is rare in mainland Japanese.In a similar way, M7a has its highest diversity in Ryukyuans(83%). Both groups are rather common in the Philippines. AlthoughM7b has its greatest diversity in northern China (75%-62%),its derivative M7b2, has it again in Ryukyuans (100%), Koreans(53%), and mainland Japanese (45%). On the contrary, M7c isabsent in Ainu and rare in mainland Japanese but very commonin Sabah and the Philippines, although its highest diversityis in the whole of China (76% ± 11%).

Haplogroup M10
This haplogroup has been defined by substitutions 10646 and16311 (Yao et al. 2002a). In addition, Kong et al. (2003) havefound several new mutations in its basal branch that we confirmhere (Fig. 1A). Minor modifications are that a new Japaneselineage shares with M10 only the 8793 mutation, and that a newmutation, 13152, seems to be basal for our M10 Japanese lineages.Although its highest frequency is in Tibetans (8%), the largestdiversities are found in China. It is present in Koreans andmainland Japanese but has not been detected in either Ainu orRyukyuans (Table 2).

Haplogroup M11
This haplogroup has been defined by Kong et al. (2003) by sevencoding-region mutations (1095, 6531, 7642, 8108, 9950, 11969,and 13074) and four mutations in HVS-II (146, 215, 318, and326). We confirm the same characterization for our M11 Japaneselineages. A subclade defined by mutation 14340 was found inChinese (Kong et al. 2003), but it has not been detected inJapanese. In turn, Japanese have a new subclade characterizedby mutation 14790. Finally, our data suggest that mutation 15924is at the root of M11 and the new clade M12.

Haplogroup M12
This haplogroup has been defined in the present study. It harborsa characteristic motif (16145-16188-16189-16223-16381) in itsnoncoding region and several unique mutations in its codingregion (Fig. 1A). Overall, it is a rare haplogroup, being detectedonly in mainland Japanese, Koreans, and Tibetans, the lastmentionedsample showing its highest frequency (8%) and diversity (50%).

Haplogroup M1
Although not present in eastern Asia, this haplogroup has beenincluded in the phylogenetic tree of macrohaplogroup M to ascertainits hierarchical level with respect to other M clades. It wasfirst detected in Ethiopia (Quintana-Murci et al. 1999) anddefined by four transitions in the HVSI region (16129, 16189,16249, and 16311). After this, M1 was also detected in the Mediterraneanbasin including Jordan (Maca-Meyer et al. 2001). Several mutationsin the coding region are distinctive of this haplogroup (Fig. 1A).Its RFLP diagnosis is possible by an MnlI site loss atposition 12401.

Subdivisions Within Macrohaplogroup N
Representatives of two major superhaplogroup N migratory branchesare present in Japan. Two main clades, that directly sproutfrom the basal N trunk (A and N9), have a prevailing northernAsia dispersion, whereas the other two (B and F), having a southernradiation focus, belong to the derivative R clade, characterizedby the loss of 16223 and 12705 mutations. Although not detectedin Japan, to compare their hierarchical levels with those ofthe Asian branches, we have included the rCRS sequence and aN1b sequence (Kivisild et al. 1999) as representatives of thewestern Eurasian R and N clades, respectively.

Haplogroup A
This haplogroup was defined by an HaeIII site gain at 663 (Torroniet al. 1992). It was subdivided on the basis of HVSI motifsin A1 (16223-16290-16319) and A2 (16111-16223-16290-16319) byForster et al. (1996). In our Japanese sample, we have detectedseveral A1 representatives characterized by two substitutions(8563, 11536). Two of these lineages (ON67 and ND218) have beenascribed to the A1a subgroup that is defined by 4655, 11647,and 16187 substitutions. Two additional A1 Japanese clusters(A1b and A1c) have also been phylogenetically defined (Fig. 2).The A2 subgroup is represented in the tree by a Chukchi(6971) and two (KA21 and ON125) Japanese lineages, all sharingthe 16362 mutation. As the Chukchi harbors the 16111 and 16265mutations, it has been labeled as an A2a representative, astentatively proposed by Saillard et al. (2000), having fouradditional mutations (152, 153, 8027, and 12007) in its basalbranch. Owing to their phylogenetic position, three more Japaneselineages (ND28, TC48, and J42) should be classified as representativesof three new A subhaplogroups, respectively named A3, A4, andA5 (Fig. 2). Geographically, whereas A1 has a wide northernand central Asian distribution, subclade A1a is confined toKorea and mainland Japan. The greatest diversity for A1 is incentral Asia (79%). In Japan it is present in both mainlandand indigenous populations. Subhaplogroup A2 is mainly presentin northeast Siberia including the Kamchatka peninsula, althougha lineage has also been detected in Tibet. The main diversity(30%) and frequency (60%) for this subhaplogroup are in theChukchi.

Subhaplogroups Y, N9a, and N9b
Haplogroup N9 characterized by the 5417 substitution (Yao etal. 2002a) phylogenetically comprises three subhaplogroups.Subhaplogroup N9a was mentioned as another N subcluster witha distinctive HVSI motif (16223, 16257A, 16261) by Richardset al. (2000). It appears named as N9a in Yao et al. (2002a),who added as basal substitutions 150 and 5231. Recently, Konget al. (2003) added mutations 12358 and 12372 at the basal branchof N9a, which is according to our Japanese phylogeny (Fig. 2).A Japanese N9a1 lineage (TC2) shares mutations 4386, 12007,16111, and 16129 with the Chinese lineage GD7834 of Kong etal. (2003). Three more N9a Japanese clusters sharing 16172 astheir basal mutation have been considered distinct N9a2 branches(Fig. 2). Subhaplogroup Y was first identified by a set of HVSIpolymorphisms (16126, 16189, 16231, 16266, 16519), an HaeIIIsite loss at 8391 and MboI and DdeI site gains at 7933 and 10394,respectively (Schurr et al. 1999). However, according to theclassification of Kong et al. (2003), all these mutations definethe Y1a1 branch specifically. Our Japanese (Fig. 2) and theChinese (Kong et al. 2003) phylogenies characterize Y by sevenmutations (8392, 10398, 14178, 14693, 16126, and 16231 gainsand a 16223 loss). The branch Y1 would be identified by mutations3834 and 16266, and the Y1a subcluster by 7933 (Fig. 2; Konget al. 2003). In Japan we have found a new subclade (Y1b) characterizedby four mutations (146, 10097, 15221, 15460). Furthermore, anew branch (Y2) with the same phylogenetic consideration asY1, and distinguished by six basal mutations must be aggregatedto the Y phylogeny (Fig. 2). Finally, we have detected a sisterbranch of Y in Japan. This new lineage, named N9b, shares twobasal mutations (5147 and 16519) with Y and is further characterizedby four (10607, 11016, 13183, 14893) additional mutations inits basal branch. All N9b1 representatives seem to have the16189 mutation, and three branches of this trunk (a, b, andc) have been provisionally defined (Fig. 2). The geographicdistribution of subhaplogroup Y is predominantly in NortheastAsia. The highest frequency (22%) is in the Ainu, although onlyone lineage accounts for this frequency. The greatest diversitiesare in northern China (80%), and this group is also very diversein the Nivkhs from northeast Siberia (Torroni et al. 1993a).As for N9a, it has a great diversity in the whole of China (83%)and Korea (79%). In Japan, only mainland Japanese have N9a representatives.Finally, N9b is very scarce, being detected in southern Chinaand Korea. Surprisingly, it is most abundant in the Japaneseincluding the indigenous Ryukyans and Ainu.

Haplogroup F
This haplogroup was first defined as group A by Ballinger etal. (1992), and later renamed as F by Torroni et al. (1994).This group was characterized by the lack of HincII and HpaIsites at 12406. According to the newly proposed nomenclature(Kivisild et al. 2002; Kong et al. 2003), 12406 is now one ofthe six mutations that specifically define subhaplogroup F1.Recently, haplogroup F has been phylogenetically included asa subcluster of haplogroup R9 (Yao et al. 2002a). Besides F1,two new subgroups (F2 and F3) have been defined by Kong et al.(2003). We have found a new subcluster, named F4 (Fig. 2), thatis characterized by three coding-region mutations (5263, 12630,15670). This group has a particularly high incidence in SoutheastAsia (Ballinger et al. 1992), but only subhaplogroup F1b iswell represented in the Japanese, including the indigenous Ainuand Ryukyuan. The highest diversities for this subgroup arein eastern China including Taiwan (100%).

Haplogroup B
Renamed as B after Torroni et al. (1992), this haplogroup wasidentified by the presence of a 9-bp deletion in the COII/tRNA^Lysintergenic region of mtDNA. This polymorphism was first detectedin Asia by RFLP analysis (Cann and Wilson 1983). It was usedto classify Japanese on the basis of the presence/absence ofthis deletion (Horai and Matsunaga 1986). Even in Asia, themonophyletic status of this cluster has been repeatedly questioned(Ballinger et al. 1992; Yao et al. 2000b); but although the9-bp deletion has a high recurrence, it seems that togetherwith transition 16189 it defines fairly well a monophyleticcluster, at least in eastern Asia. Recently, a sister cladeof B, keeping the 16189 mutation but lacking the 9-bp deletion,has been detected in China, being designated as R11 (Kong etal. 2003). Asian subhaplogroups of B have been named as B4,identified by the 16217 mutation and B5, characterized by 10398and 16140 mutations (Yao et al. 2002a). It has been deducedfrom analysis of complete sequences that transitions 709, 8584,and 9950 are also in the basal branch defining B5 (Fig. 2; Konget al. 2003). Lower-level subdivisions have also been proposed.Three subclades (B4a, B4b, and B4c) were defined within B4 (Konget al. 2003). At the same phylogenetic level are our Japanesebranches named B4d, B4e, and B4f; and several new secondaryclusters have also been detected in Japan within B4a, B4b, andB4c (Fig. 2). It is worthwhile to mention that those lineagesharboring 16189, 16217, 16247, and 16261, also known as thePolynesian motif (Soodyall et al. 1995), belong to a branchof B4a, having in addition to 16247, 146, 6719, 12239, 14022,and 15746 as basic mutations. The B5 cluster was also subdividedin B5a and B5b on the basis of the HVSI mutations 16266A and16243, respectively (Yao et al. 2002a), and reinforced withseveral additional positions after the analysis of completeChinese (Kong et al. 2003) and Japanese (Fig. 2) sequences.Within B5b, new subdivisions are necessary to accurately classifythe Japanese sequences (Fig. 2). Finally, on the basis of characteristicHVSI motifs, we had tentatively defined as B4a3 those lineageswith 16189, 16217, 16261, and 16292 transitions. However, thephylogenetic position of a Chinese complete sequence (GD7812)belonging to this HVSI group (Kong et al. 2003) shows that afuture redefinition of B4a might be necessary. The geographicdistribution of haplogroup B is very complex. As expected fromits age, the ancestral motif is widely distributed in Asia excludingKoryacks and other Siberians. The likewise old subhaplogroupB4 has mainly a central-eastern Asian distribution with diversitiesnear 100% from central Asia to Japan. B4a shows a similar distributionas B4, having branches prevalent in Ryukyuans, Lahu of Yunnan,and aborigine Taiwanese (Table 2). In a similar vein, some branchesof B4c are more abundant in southern areas (B4c2), whereas others(B4c1) are mainly detected in Korea and Japan, with derivativesin Taiwan (B4c1b). On the other hand, subhaplogroup B5a hasits greatest diversity in southern-eastern China (89%), includingTaiwan aborigines (67%), but its B5a1 derivative shows the greatestdiversity in northern China (71%), being present in mainlandJapanese. In turn, subhaplogroup B5b has its major diversityin Korea (83%) and also reached the Philippines (50%). Curiously,the B5b1 derivative shows its highest diversity (67%) and frequency(1%) in mainland Japanese.

Lineage Sorting and Population Pooling
A total of 110 clades with different phylogenetic range havebeen proposed on the basis of the pool of the eastern Asiancomplete sequences (Figs. 1A,B and 2). Of these subdivisions(Table 2), 83 have been used to classify all Asian partial sequencesanalyzed in this study. As a test of accuracy in the sortingof partial sequences into haplogroups, we classified our 672Japanese complete sequences by using only their HVSI motifsand found that 34 of them (5%) had an ambiguous status or weremisclassified. The main sources of errors were those sequencesthat differed from CRS in only one or two mutations. For instance,the 16223 mutation was found in M and N backgrounds. The 16189,16223 motif can be D6 or N9b. Within M, sorting into D or Gwas one of the main sources of ambiguity. Some 16223, 16325,16362 lineages were D4 and some G1. The motif 16114A, 16223,16362, classified as D4, was in reality G3. Sometimes furthersubdivision within a haplogroup is rather difficult; for example,there are 16189, 16223, 16362 representatives in D4 and in D5.Because of recurrency and isolation, it can be expected thatthis uncertainty level increases with geographic distance. Forinstance, we have found that several 16129, 16223 Japanese lineagesbelong to D4, but to infer from this that southern Asian sequenceswith the same HVSI motif are also D4 would be inappropriate.From a total of 4713 sequences analyzed, 9.2% had an ambiguousstatus. In spite of this percentage there are enough sequencesleft to carry out population analysis with statistical confidence.

In a first approach, Japanese, Ainu, and Ryukyuan samples werecompared with the rest of Asian samples shown in Table 3 bymeans of F_ST. The closest affinities of mainland Japanese wereto three population groups. The first include Korean and Hanfrom Shandong (mean P-value = 0.29 ± 0.06), the secondHan from Liaoning and Xinjiang, and the Tu ethnic minority (0.20± 0.06), and the third Han from Xi'an and the Sali, abranch of the Yi ethnic group (0.15 ± 0.06). Ryukyuansand Ainu behave as outliers with significant differences withall the samples. Population groups resulting from the F_ST andCLUSTER analysis are defined in Table 3. Although mainland Japanesefrom Aichi were significantly different from other mainlandJapanese because of their high frequency of haplogroup B, theywere merged with them as JPN for comparisons with other areas.Control of the conglomerate number expected in CLUSTER analysisallows for a hierarchical grouping of populations. With twoconglomerates, the first distinguished isolate was the aboriginalSakai from Thailand (Fucharoen et al. 2001). This group wasunique among other Thai people owing to its lack of lineageswith the 9-bp deletion that characterizes haplogroup B, andto the high frequency of the authors' C6 cluster (included inour D4a). The lack of any representative of macrohaplogroupN in a population anthropologically considered one of the oldestgroups in Thailand, if not caused by genetic drift, is compatiblewith the hypothesis that derivatives of macrohaplogroup N had,in southern Asia, a different route from macrohaplogroup M (Maca-Meyeret al. 2001). Also striking is the presence in Sakai of an unequivocalrepresentative (16223-16274-16278-1629416309) of the sub-SaharanAfrican L2a haplogroup (Torroni et al. 2001), which again iscompatible with the physical characteristics of this Negritogroup. Although the suggestion that the first spreading outof Africa of modern humans could have carried some L2 lineagesin addition to the L3 ancestors (Watson et al. 1997) is a temptingexplanation, a recent admixture is more in consonance with thephylogenetic proximity of this lineage to the present Africanones. The next outsiders were the majority of the Siberian isolates,which could not be pooled because of big differences in thefrequency of distinctive haplogroups (Table 2). This considerabledifferentiation was already emphasized (Schurr et al. 1999),with strong genetic drift being its most probable cause. Subsequentisolates belong to some Chinese minorities such as those ofLisu and Nu, Lahu, and Taiwanese aborigines. Unexpectedly, otherChinese minorities (Bai, Sali, and Tu) were left in Han Chinesenorthern clusters. The Bai belong to the Sino-Tibetan Tibeto-Burmanethnic linguistic group and have been strongly influenced byHan. The Sali are a minority within the Yi ethnic group whosemost probable ancestors were the Qiang from northwest China.Finally, the Tu, although belonging to the Mongolian branchof the Altaic Family, show their main genetic affinities tothe Han from Xi'an (P = 0.95), Xinjiang (P = 0.89), and Shanghai(P = 0.79), all of them clustered in the Ch2 group. On the otherhand, Thais, Vietnamese, and Cambodians joined with southernChinese. As already observed (Chunjie et al. 2000; Yao et al.2002a), the Han Chinese do not comprise a homogeneous group.With the exception of cluster Ch4, that includes samples fromHubei and Guandong (Table 3), they appear geographically differentiated.The two central Asian groups detected mainly differ in theirfrequencies for A1b, Z, and G2a. With less than 14 conglomerates,the Japanese, including Ainu and Ryukyuans, were part of a biggroup formed by Korean, Buryat, Tibetans, and northern Chinese.Ainu was the first differentiated Japanese sample. Ryukyuansseparated later, when mainland Japanese and Koreans still compriseda single group. The lack of homogeneity between Ainu and Ryukyuanswas pointed out by Horai et al. (1996), who questioned thatthey shared a recent common ancestor. The main differences betweenthem were attributed to two dominant clusters (C1 and C16, correspondingto our Y and M5/D4a/G1, respectively) present in Ainu but absentin Ryukuyans, and two Ryukyuan dominant clusters (C3 and C13,belonging to our R and M, respectively) absent in Ainu. In addition,applying the present haplogroup nomenclature to the same data,the high frequency of M7a1 and D4a1/D4b in Ryukyuans, but theirabsence in Ainu, stands out. The MDS plot (Fig. 3A), based onF_ST haplogroup frequency distances between final groups (datanot shown), only partially reflects the sequential process describedabove, as only Sakai and Siberians are well differentiated fromthe rest. On the contrary, relationships obtained from haplotypematches (Fig. 3B) show populations highly structured by geographywith the only exceptions being the Ainu and Tuvinian isolates.

View this table:
[in this window]
[in a new window]

Table 3.

Asian Populations Used in This Study

View larger version (20K):
[in this window]
[in a new window]

Figure 3

MDS plots based on (A) F_ST and (B) D match distances. Population groups are as detailed in Table 3.

The Peopling of Japan
To further know the relative affinities of the Japanese betweenthemselves and with the different Asian groups formed, the dataobtained from the global approaches based on haplogroup frequencydistances and on sequence match identities are presented inTable 4. Both values are moderately correlated in the comparisonsinvolving the mainland Japanese (r = -0.479; two-tail probability0.012) but not at all in those involving aborigine Ryukyuans(r = -0.310; two-tail probability 0.115) and Ainu (r = 0.087;two-tail probability 0.667). This result can be explained byassuming that these aboriginal people have suffered importantgenetic drift effects with substantial changes in haplogroupfrequencies and lineage losses or, less probably, that thesepopulations have been isolated long enough to have accumulatednew variation. Results based on haplogroup frequencies by farrelate mainland Japanese to Koreans followed by northern Chinese.Ryukyuans present the smallest distances to Buryats from SouthSiberia, followed in short by southern Chinese. In turn, theAinu have their closest affinities with mainland Japanese, Koreans,and northern Chinese. As regards sequence matches, mainlandJapanese also joins first to Koreans and second to Buryats.Aborigine Ryukyuans are closest to Buryats and then to Koreans.Finally, Ainu show comparatively less shared sequences, theirgreater affinities being toward Chukchi and Koryaks of Kamchatka.This global picture is congruent with an important influenceon mainland Japanese from northern Asian populations throughKorea, that the Ryukyuans had a dual northern and southern Asianbackground previous to the new northern influences acquiredby admixture with mainland Japanese, and that the Ainu representthe most isolated group in Japan in spite of the genetic inputreceived from Kamchatka. Also noticeable is the great distanceand low identity values obtained for the Ainu-Ryukyuan paircompared with those obtained in their respective comparisonto mainland Japanese, which is another hint of its notable maternalisolation.

View this table:
[in this window]
[in a new window]

Table 4.

Frequency-Based F_ST and Sequence Match Identities (In Percentage) Between Japanese Samples and With Other Asian Populations

The distance and identity statistics used above are based onfrequencies of haplogroups and haplotypes, respectively; however,frequencies are more affected by genetic drift than the numberof different haplotypes present in a population. To measurethe relative affinities of Japanese populations between themand to Continental Asia in a frequency-independent way, we chosea haplotype-sharing approach calculating the relative contributionof lineages shared with other areas to the number of differenthaplotypes present in each Japanese population. In these comparisonsall other Asians were merged. Table 5 shows the results of thisanalysis. Note that despite the difference in sample size thehaplotype frequency in mainland Japanese and Ainu is

50%, whereasin Ryukyuans it is 84%; which means that, if there was not abias in the sampling process, in spite of its small size, theAinu sample seems to be representative of that population. However,it would be desirable to enlarge that of the Ryukyuans (Helgasonet al. 2000

). Haplotypes present only in a given populationaccount for 13% in Ainu but

50% in mainland Japanese (60%) andRyukyuans (45%). This finding once more points to the existenceof important drift effects in Ainu. Mainland Japanese exclusivelyshare with Ryukyuans and Ainu only 3% and 2%, respectively,of its lineages, which could reach 6% and 3% if those also sharedwith Continental Asian populations are added. In comparisonthey shared 21% of its lineages with other Asians. On the contrary,Ryukyuans and Ainu share about 50% of their lineages with mainlandJapanese and only 10% and 21%, respectively, with Continentalpopulations, which may reflect other independent Asian influenceson Japan. With respect to those lineages exclusively sharedby Japanese and Continental Asian populations, it is worth mentioningthat, again, Korea is the main contributor, participating in

50% of the haplotype sharing with mainland Japanese (55%), asmuch as with Ryukyuans (50%) and Ainu (50%). However, differencesexist in the provenance of the rest of the shared lineages.Whereas in Ainu (northern China and Siberia) and in Ryukyuans(northern China and central Asia) they are from northern areas,the second region contributing to mainland Japanese is southernChina (17.5%), followed, at the same level (12.5%), by northernChina and central Asia. In addition, there exists a minor percentageof exclusive sharing with Indonesia (2.5%). On the other hand,all the matches with Siberia and Tibet are also shared withother populations. From these results, it can be deduced thatthe ancient Japanese inhabitants came from northern Asia andthat southern areas affected the Japanese by later immigration.Nevertheless, it must be borne in mind that older influencescould be undetectable by lineage sharing. With respect to thehaplogroup affiliation of those lineages that Ainu and Ryukyuansexclusively shared with no Japanese samples, new differencesappear between them. Ainu share derived lineages of haplogroupsA, G, M9, and D5, all of them compatible with a rather recentSiberian influence. In contrast, those shared by Ryukyuans arebasical M lineages, more congruent with an older radiation fromsouthern China. These dual influences are also detected whenthe haplogroup affiliation of the Ainu and Ryukyuan unique lineagesis studied. First, the percentage of lineages belonging to macrohaplogroupN is larger in Ainu (50%) than in Ryukyuans (15%) and from adifferent provenance, as those in Ainu are from haplogroupsN, N9b, and Y, whereas those of Ryukyuans belong to the southernhaplogroups F and B. The remaining 50% of the Ainu lineagesequitably belong to different M haplogroups (M, M7c, G1, andD5a), but in Ryukyuans the remainder are mainly concentratedin M7a (41%) and M7b2 (18%), two groups that have their greatestAsian diversities precisely in Ryukyuans. Although an indigenousfocus of radiation cannot be discarded, it is more conservativeto suppose that the most probable origin of these lineages isagain southern China. Thus, Ainu and Ryukyuans are not onlylargely isolated populations, but they most probably had differentmaternal origins.

View this table:
[in this window]
[in a new window]

Table 5.

Distribution of Unique and Shared Haplotypes in Japanese Populations

Although no matches are involved, the geographic distributionof haplogroup frequency and diversities for some groups presentin Japan and in other distinct Asian areas are also relevantto trace these older connections. For instance, haplogroupsM9, M10, M12, D4b, and F1c have correlated geographic frequencieswith a peak in an area that comprises Tibet (Table 2). Curiously,one of these haplogroups (M12) is today absent in China butpresent in Korea and Japan.

DISCUSSION

Top
ABSTRACT
RESULTS
DISCUSSION
METHODS
REFERENCES
WEB SITE REFERENCES

Although the recent out-of-Africa origin for all modern humans(Cann et al. 1987) is being widely supported (Takahata et al.2001), the most probable time and routes chosen by these earliestmigrants to reach eastern Asia is an open issue. In the followingdiscussion we weigh the different alternatives proposed in lightof the phylogenetic tree obtained from complete mtDNA sequences.One of the first questions raised was whether there was morethan one out-of-Africa dispersion. All the mtDNA lineages detectedin Old World populations belong to one of two M and N macrohaplogroupswith only secondary representatives in Africa. The proposedradiation ages for both, 30,000 to 58,000 years ago and 43,000to 53,000 years ago, respectively (Maca-Meyer et al. 2001),give a temporal frame compatible with only one main dispersionor two successive dispersions, in which case the M precursoris the most probable candidate for the older exit. Even if theone dispersion option is chosen, more than one geographicalroute to eastern Asia is possible. In fact, a northern Continentalroute through the Near East and western-central Asia and a southerncoastal route through the Arabian and Indian peninsulas havebeen proposed (Cavalli-Sforza et al. 1994; Kivisild et al. 1999).The geographical distribution of these two macrohaplogroups,with lack of ancient M representatives and the presence of deepN lineages in western Asia, and the abundance of basal M lineagesin India and southwestern Asia and concomitant lack of equivalent-ageN clades, gave rise to the hypothesis that N represents themain footprint of the northern Continental expansion, whereasM is the equivalent footprint for the southern coastal expansion.The presence of N and M lineages in alternative areas has beenexplained to have been the result of secondary migrations (Maca-Meyeret al. 2001). However, another plausible explanation is thatboth M and N reached southern Asia at the same time, quicklyexpanding to Papua New Guinea (PNG) during maximal glacial ageswhen the permafrost boundary precluded a northern human occupation.During postglacial ages, subsequent migrations northward carriedderivatives of both macrohaplogroups to northern Asia (Forsteret al. 2001). Nevertheless, under this second hypothesis, thepresence of basal N clusters should be expected in India, southernAsia, and PNG; but this is not the case. All N representativesin India belong to R, a clade derived from N by the loss of16223 and 12705 mutations (Fig. 2). In addition, the bulk ofthese Indian lineages belong to western Caucasian haplogroupsthat, most probably, reached India as the result of secondaryimmigrations, as has already been proposed (Kivisild et al.1999; Bamshad et al. 2001). Similarly, the N representativesin southern Asia belong to haplogroups F and B, two sister cladesalso derived from R (Fig. 2). Furthermore, when totally sequencedPNG N lineages (Ingman et al. 2000; Ingman and Gyllensten 2003)are added to the N phylogenetic tree (data not shown), theyform three monophyletic clades that have their roots in thederived R trunk. On the contrary, the geographically northernAsian clades A, N9a, N9b, and Y (Fig. 2) and the western Eurasianclades W, N1b, I, and X all split from the basal N root (Maca-Meyeret al. 2001), although A, N9a, N9b, and Y radiations were delayedcongruent with subsequent northern Asian expansions. Therefore,at present, mtDNA data are compatible with the supposition thatthe northern route, harboring mainly N precursors, met climaticdifficulties and when they finally reached Southeast Asia, theM representatives, brought by the southern route, had alreadycolonized the area. This southern expansion of N derivativeshas, as a lower temporal boundary, the coalescence ages of F,B, and PNG R haplogroups being 46,000 ± 10,000 yearsago. However, when recently published (Ingman et al. 2000; Ingmanand Gyllensten 2003) Australian N lineages are taken into account,it seems evident that the real situation could be far more complexthan the one migration-one lineage hypothesis. Australian Nlineages directly sprout from the basal trunk (data not shown).They most probably differentiated in that continent, supportingthe idea that ancestral N lineages reached Australia but notPNG, although the undemonstrable possibility of lineage extinctionsand subsequent recolonization events in PNG can be an argument.Both hypotheses have difficulties to explain the presence ofancient N lineages in Australia. If the two, M and N lineages,were brought with the southern coastal dispersion, the lackof primitive N in India, southern Asia, and PNG has to be explainedby the subsequent loss of all N lineages carried to Australia;if the northern Continental route of N is favored, the lossof N representatives in all populations formed in route to Australiahas also to be explained. Recently, an N lineage has been detectedin Chenchus, a southern Indian tribal group (Kivisild et al.2003). From the information published, it can be deduced thatthis lineage only shares mutation 1719 with the western EurasianNb1/I and X clades. More extensive studies of populations insouthern India and southern and central Asia would add empiricalsupport to any of these theories.

Concerning macrohaplogroup M, it has already been commentedthat the star radiation of all the main Indian and southeastAsian M clades strongly suggests that this wide geographic colonizationcould have happened in a relatively short time (Maca-Meyer etal. 2001). This star radiation includes the Australian and PNGM complete sequences recently published (Ingman et al. 2000;Ingman and Gyllensten 2003). However, for those clades and subcladeswith later northward expansions, long radiation delays are observed.For instance, whereas M7 and M8 have coalescence ages 35,000to 45,000 years ago, other groups such as G, D4, M7a, or M7chave coalescence ages 15,000 to 30,000 years ago, more in framewith those calculated for A, Y, and N9 derivates, which, althoughbelonging to macrohaplogroup N, share with them a central-northernAsian geographic distribution (see Supplemental material). Itseems that the simultaneous lineage bursts 60,000 to 70,000years ago from Africa (Maca-Meyer et al. 2001), 30,000 to 55,000years ago for macrohaplogroups M and N, and 15,000 to 30,000years ago for clusters with prominent central-northern Asianradiations were related to main climatic changes. The role ofselection in these expansions is an open question (Elson etal. 2004; Ruiz-Pesini et al. 2004).

The application of global pairwise-distance and detailed phylogeographicmethods to the peopling of Japan shows that both approacheshave different grasps but together demonstrate that the actualJapanese population is the result of a complex demographic history,from which the different theories proposed to explain it onlyemphasize partial aspects. Global distances and detailed haplotypecomparisons confirm that Ainu and Ryukyuans are heterogeneouspopulations (Horai et al. 1996) and that both are well differentiatedfrom the mainland Japanese. In spite of this, they have commonpeculiarities such as having the highest frequencies in Asiafor M7a, M7b2, and N9b, shared with mainland Japanese. Furthermore,for both, their closest relatives are northern populations.At first sight, these results are against a supposed southernorigin for the Paleolithic Japanese, favoring the replacementtheory or even that the Paleolithic inhabitants of Japan camefrom northeastern Asia (Nei 1995). Although based on a singlelocus, our results are strikingly coincident with the previouslyproposed northern origin and influences received by the Japanese.In an early study using serum gammaglobulin polymorphisms, itwas concluded that the homeland of all Japanese could have beenin the Lake Baikal area in Siberia (Matsumoto 1988), which agreeswith the close proximity found here between Buryats and Ryukyuansor mainland Japanese. More recently, classical markers (Omotoand Saitou 1997) and mtDNA (Horai et al. 1996) studies demonstratedthat the Japanese are most closely related to the Koreans, whichis also true in our global analysis. It can be added that asubstantial part of this common maternal pool has recent roots,as Korea specifically shares with Ainu, mainland Japanese, andRyukyuans 10%, 7%, and 5%, respectively, of their haplotypes.This particular affinity is increased with the existence ofderived lineages only detected (A1a, B4c1, B4f) or mainly detected(N9b, B4a1, B4b1, G1a, M7b2, M12) in Japanese and Koreans. ThisKorean influence has been attributed to the archeologicallywell-documented Continental immigration to Japan during theYayoi period (Horai et al. 1996). However, specific haplotypematches with other areas increases the geographic range of theserecent influences. Thus, mainland Japanese share part of theirhaplotypes exclusively with South China (2.5%), North China(1.5%), Central Asia (1.5%), and Indonesia (0.3%); and, also,Ryukyuans have specific affinities with North China (2.4%) andCentral Asia (2.4%). The recent Siberian input on the Ainu hasalso been stressed (Schurr et al. 1999). At least, another independentmigratory wave from central Asia also affected mainland Japanese.It was first detected by the peculiar distribution of the Y-chromosomemarker YAP+, and seems to have originated in an area includingTibet (Su et al. 2000). Haplogroup M12 is its mitochondrialcounterpart. As with the Y-chromosome marker, its punctual presencein Tibet and eastern Asia might be explained as the result ofsubsequent migrations in the Continent that erased the routefollowed by the people harboring these markers. In addition,there are clues, at least in Ryukyuans, that a substantial partof their maternal pool had an ancient southern Asian provenance.This fraction is represented by the M, M7a, and M7a1 basic lineages(31%), which the Ryukyuans do not share with northern populations.This southern signal is, in part, congruent with the southernAsian origin for the Paleolithic Japanese proposed by the dualstructure model (Hanihara 1991). Furthermore, the fact thatthe highest diversities for M7a, M7a1, and M7b2 have been foundin Ryukyuans and for N9b and B5b2 in Japan raises the possibilitythat this area was within a focus of migratory radiations tonorthern and southern isles and even to the mainland from Paleolithicto recent times. The significant latitudinal clines detectedin Japan for some genetic markers (Orito et al. 2001; Takeshitaet al. 2001) could also be explained as the result of southernand northern influences on Japanese. Finally, some mtDNA resultsobtained from ancient Jomon remains (Horai et al. 1991; Shinodaand Kanai 1999; K.-I. Shinoda, unpubl.) are congruent with agenetically diverse background for the Paleolithic Japanesepopulation (Horai et al. 1996). A tentative comparison of Jomonwith present-day Japanese populations based on shared lineages(data not shown) significantly relates Jomon first to the indigenousAinu and then to Ryukyuans and last to mainland Japanese. Insummary, Japan could have received several northern and southernAsian maternal inputs since Paleolithic times, with notablenorthern Asian immigrations through Korea in the late Neolithicand more specific gene flows from western Asia, Siberia, andsouthern islands.

METHODS

Top
ABSTRACT
RESULTS
DISCUSSION
METHODS
REFERENCES
WEB SITE REFERENCES

Samples
Complete mtDNA sequences were obtained from a total of 672 unrelatedJapanese including 373 from Tokyo and 299 from the Nagoya area.All subjects gave their written consent to participate in thisstudy, which was approved by the Ethical Committees of the GifuInternational Institute of Biotechnology and collaborative institutions.The sources of 11 additional complete sequences used to buildthe final phylogenetic trees are in Table 1. For the analysisof the peopling of Japan, we used a total of 1438 Japanese and3275 central and eastern Asian HVI sequences, as detailed inTable 3.

Isolation and Amplification of DNA
Total DNA was extracted from the blood with either Dr. Gen TLE(Takara) or MagExtractor System MFX-2000 (Toyobo). The entiremitochondrial genome was amplified as six fragments (3000-3400bp) by the first PCR and 60 overlapping segments (600-1000 bp)by the second PCR. The primer pairs and their nucleotide sequenceswere described previously (Tanaka et al. 1996). The conditionsfor the first and second PCR were the same: an initial denaturationstep for 5 min at 94°C, followed by 40 cycles of denaturationfor 15 sec at 94°C, annealing for 15 sec at 60°C, andextension for 3 min at 72°C, with a final extension for10 min at 72°C. The amplified fragments were analyzed byelectrophoresis on a 1% agarose gel and visualized by stainingwith ethidium bromide. These second PCR products were purifiedby use of the MultiScreen-PCR Plates (Millipore). The qualityof DNA templates was examined by electrophoresis on a 1.2% agarosegel after staining with ethidium bromide by use of a Ready-To-RunSeparation Unit (Amersham Pharmacia Biotech).

Sequence Analysis of Mitochondrial DNA
Sequence reactions were carried out with a BigDye terminatorcycle sequencing FS ready reaction kit (Applied Biosystems).After excess dye terminators had been removed with MultiScreen-HVplates (Millipore) packed with Sephadex G50 superfine (Pharmacia),the purified DNA samples were precipitated with ethanol, dried,and suspended in the template suppression reagent (TSR) or formamidefrom Applied Biosystems. The dissolved DNA samples were heatedfor 2 min at 95°C for denaturation, then immediately cooledon ice. Sequences were analyzed with automated DNA sequencers377 and 310 by use of Sequencing Analysis Program version 4.1(Applied Biosystems). A computer program, Sequencher version4.1 (Gene Codes Co.), was used to indicate possible single nucleotidepolymorphism (SNP) loci. For verification, visual inspectionof each candidate SNP was carried out. At least two overlappingDNA templates amplified with different primer pairs were usedfor identification of each SNP. Mitochondrial SNPs (mtSNPs)were identified by comparison with the revised Cambridge sequence(rCRS) reported by Andrews et al. (1999).

Phylogenetic Analysis of Complete Coding-Region mtDNA Sequences
In this present study, nucleotide positions were numbered asin the Cambridge Reference Sequence (CRS; Anderson et al. 1981),nucleotide substitutions were expressed as differences fromthe revised CRS (Andrews et al. 1999), transitions were denotedonly by their nucleotide positions, and transversions were designatedby their nucleotide positions followed by the changed base.A total of 942 complete coding-region mtDNA sequences, includingour 672 Japanese; one additional Japanese (GenBank accessionno. AB055387 [GenBank] ); 53 worldwide sequences (Ingman et al. 2000);42 worldwide sequences (Maca-Meyer et al. 2001); two Finnishsequences having Asian relatives (Finnilä et al. 2001);17 Asian sequences without concrete geographic assignation (Herrnstadtet al. 2002); 37 sequences from the Bering area (Derbeneva etal. 2002b); 70 Asian, New Guinean, and Australian sequences(Ingman and Gyllensten 2003); and 48 Chinese sequences (Konget al. 2003) were aligned with the rCRS by CLUSTAL V software,and the coding region was used to construct a phylogenetic network(Bandelt et al. 1999) rooted with a chimpanzee sequence (GenBankaccession no. D38113 [GenBank] ) as implemented in the Network 3.1 program(Fluxus Engineering; http://www.fluxus-engineering.com/). Thenoncoding positions were added by hand using molecular weightedparsimony criteria (Bandelt et al. 2000). The phylogenetic relationshipsobtained were also confirmed by means of a neighbor-joiningtree (1000x bootstrapped; Saitou and Nei 1987), built usingMEGA2 (Kumar et al. 2001). From this network (see Supplementalmaterial) we chose 102 Japanese and nine Asiatic sequences thatrepresented the main clusters and subclusters within the twomacrohaplogroups M and N that colonized Asia. To define thesegroups we followed the most generalized cladistic nomenclatureactually used to classify mtDNA lineages (Richards et al. 1998).For the haplogroups previously detected, we maintained the samenotation as their authors proposed (Richards et al. 2000; Bamshadet al. 2001; Kivisild et al. 2002; Yao et al. 2002a; Kong etal. 2003). Those haplogroups introduced here for the first timewere named according to their phylogenetic range deduced fromthe tree of complete sequences.

Haplogroup Assorting of Published Partial mtDNA Sequences
The unambiguously classified complete mtDNA sequences were usedas an initial pool that was hierarchically enlarged by the successiveaddition of those published partial mtDNA sequences with thelargest coding information, ending with those for which informationon only control-region sequences for both mtDNA hypervariablesegments or just one (HVS-I and/or HVS-II) was available, alwaysfollowing sequence matches or, as default, sequence-relatednesscriteria. Some of those partial sequences that could be assignedto more than one haplogroup were tentatively assorted in themost probable one deduced from their geographic origin and therelative haplogroup distribution.

Pooling Small Size Samples and Rare Clades
To avoid small sample sizes and rare alleles in population comparisons,samples with <20 individuals were pooled with others fromthe same geographic and ethnic group. Within populations, individualsbelonging to rare clades were pooled with those classified inthe nearest branch. Pairwise sample distances were calculatedas linearized F_ST distances as implemented in the ARLEQUIN program(Schneider et al. 2000), taking mtDNA as one locus with as manyalleles as the different subhaplogroups considered.

Quantitative Affinities of Japanese Samples
Relative affinities of Japanese samples to the other Asiaticpopulations were assessed by linearized F_ST distances, usingsubhaplogroup frequencies, and haplotype matches' distances(D) estimated simply as D = 1 - {sum} (x_iy_i), x_i and y_i being thefrequency of haplotype i in the two compared populations. Tobe statistically robust, these analyses require large samplesizes, thus further pooling was necessary. Previous studiesin the area prevented us from pooling populations by geographicproximity (Schurr et al. 1999) and/or ethno-linguistic relationship(Comas et al. 1998; Chunjie et al. 2000; Yao et al. 2002a).For this reason, a genetic affinity criterion was chosen. Twoapproaches were used. In the first, all samples with no significantF_ST distances between them and with a similar behavior to therest of the samples studied, were grouped. In the second, poolingwas carried out by means of the CLUSTER algorithm implementedin the SPSS ver 9 package. We followed an iterative method specifyingthe number of conglomerates from 2 to 30. Different groupingswere tested by AMOVA, and that with the least assigned variancewithin areas was chosen. The data were graphically representedby multidimensional scaling (MDS) plots (Kruskal and Wish 1978)using SPSS.

Qualitative Affinities of Japanese Samples
Particular sharing of subhaplogroups and particular haplotypematches of Japanese samples with concrete Continental areaswere phylogeographically analyzed by taking into account therelative genetic diversities of the clades involved in the differentareas, measured as relative haplotypic frequencies, and theirminimum estimates of coalescence ages based on mean divergenceamong lineages for the coding region (Saillard et al. 2000).A constant evolutionary rate of 1.7 x 10^-8 per site per year(Ingman et al. 2000) was used.

Acknowledgements

This work was supported in part by the Support Project for DatabaseDevelopment from the Japan Science and Technology Corporation(to M.T.), Grants-in-Aid for Scientific Research (C2-10832009,A2-15200051) and for Priority Areas from the Ministry of Education,Science, Sports and Culture of Japan (to M.T.), and by grantsBMC2001-3511 and COF2002-015 (to V.M.C.).

Footnotes

¹⁵ Corresponding author.
E-MAIL mtanaka{at}giib.or.jp ; FAX 81-583-71-4412.

[Supplemental material is available online at www.genome.org.]

Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.2286304.

REFERENCES

Top
ABSTRACT
RESULTS
DISCUSSION
METHODS
REFERENCES
WEB SITE REFERENCES

Abe, S., Usami, S., Shinkawa, H., Weston, M.D., Overbeck, L.D., Hoover, D.M., Kenyon, J.B., Horai, S., and Kimberling, W.J. 1998. Phylogenetic analysis of mitochondrial DNA in Japanese pedigrees of sensorineural hearing loss associated with the A1555G mutation. Eur. J. Hum. Genet. 6: 563-569.[CrossRef][Medline]

Anderson, S., Bankier, A.T., Barrell, B.G., de Bruijn, M.H., Coulson, A.R., Drouin, J., Eperon, I.C., Nierlich, D.P., Roe, B.A., Sanger, F., et al. 1981. Sequence and organization of the human mitochondrial genome. Nature 290: 457-465.[CrossRef][Medline]

Andrews, R.M., Kubacka, I., Chinnery, P.F., Lightowlers, R.N., Turnbull, D.M., and Howell, N. 1999. Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA. Nat. Genet. 23: 147.[CrossRef][Medline]

Ballinger, S.W., Schurr, T.G., Torroni, A., Gan, Y.Y., Hodge, J.A., Hassan, K., Chen, K.H., and Wallace, D.C. 1992. Southeast Asian mitochondrial DNA analysis reveals genetic continuity of ancient mongoloid migrations. Genetics 130: 139-152.[Abstract]

Bamshad, M., Kivisild, T., Watkins, W.S., Dixon, M.E., Ricker, C.E., Rao, B.B., Naidu, J.M., Prasad, B.V., Reddy, P.G., Rasanayagam, A., et al. 2001. Genetic evidence on the origins of Indian caste populations. Genome Res. 11: 994-1004.[Abstract/Free Full Text]

Bandelt, H.-J., Forster, P., and Röhl, A. 1999. Median-joining networks for inferring intraspecific phylogenies. Mol. Biol. Evol. 16: 37-48.[Abstract]

Bandelt, H.-J., Macaulay, V., and Richards, M. 2000. Median networks: Speedy construction and greedy reduction, one simulation, and two case studies from human mtDNA. Mol. Phylogenet. Evol. 16: 8-28.[CrossRef][Medline]

Betty, D.J., Chin-Atkins, A.N., Croft, L., Sraml, M., and Easteal, S. 1996. Multiple independent origins of the COII/tRNA(Lys) intergenic 9-bp mtDNA deletion in aboriginal Australians. Am. J. Hum. Genet. 58: 428-433.[Medline]

Cann, R.L. and Wilson, A.C. 1983. Length mutations in human mitochondrial DNA. Genetics 104: 669-711.

Cann, R.L., Stoneking, M., and Wilson, A.C. 1987. Mitochondrial DNA and human evolution. Nature 325: 31-36.[CrossRef][Medline]

Cavalli-Sforza, L.L., Menozzi, P., and Piazza, A. 1994. The history and geography of human genes. Princeton University Press, Princeton, NJ.

Chunjie, X., Cavalli-Sforza, L.L., Minch, E., and Ruofu, D.U. 2000. Principal component analysis of gene frequencies of Chinese populations. Science in China Ser. C 43: 472-481.

Comas, D., Calafell, F., Mateu, E., Perez-Lezaun, A., Bosch, E., Martinez-Arias, R., Clarimon, J., Facchini, F., Fiori, G., Luiselli, D., et al. 1998. Trading genes along the silk road: mtDNA sequences and the origin of central Asian populations. Am. J. Hum. Genet. 63: 1824-1838.[CrossRef][Medline]

Derbeneva, O.A., Starikovskaya, E.B., Wallace, D.C., and Sukernik, R.I. 2002a. Traces of early Eurasians in the Mansi of northwest Siberia revealed by mitochondrial DNA analysis. Am. J. Hum. Genet. 70: 1009-1014.[CrossRef][Medline]

Derbeneva, O.A., Sukernik, R.I., Volodko, N.V., Hosseini, S.H., Lott, M.T., and Wallace, D.C. 2002b. Analysis of mitochondrial DNA diversity in the Aleuts of the commander islands and its implications for the genetic history of Beringia. Am. J. Hum. Genet. 71: 415-421.[CrossRef][Medline]

Derenko, M.V., Malyarchuk, B.A., Dambueva, I.K., Shaikhaev, G.O., Dorzhu, C.M., Nimaev, D.D., and Zakharov, I.A. 2000. Mitochondrial DNA variation in two South Siberian Aboriginal populations: Implications for the genetic history of North Asia. Hum. Biol. 72: 945-973.[Medline]

Elson, J.L., Turnbull, D.M., and Howell, N. 2004. Comparative genomics and the evolution of human mitochondrial DNA: Assessing the effects of selection. Am. J. Hum. Genet. 74: 229-238.[CrossRef][Medline]

Finnilä, S., Lehtonen, M.S., and Majamaa, K. 2001. Phylogenetic network for European mtDNA. Am. J. Hum. Genet. 68: 1475-1484.[CrossRef][Medline]

Forster, P., Harding, R., Torroni, A., and Bandelt, H.J. 1996. Origin and evolution of Native American mtDNA variation: A reappraisal. Am. J. Hum. Genet. 59: 935-945.[Medline]

Forster, P., Torroni, A., Renfrew, C., and Röhl, A. 2001. Phylogenetic star contraction applied to Asian and Papuan mtDNA evolution. Mol. Biol. Evol. 18: 1864-1881.[Abstract/Free Full Text]

Fucharoen, G., Fucharoen, S., and Horai, S. 2001. Mitochondrial DNA polymorphisms in Thailand. J. Hum. Genet. 46: 115-125.[CrossRef][Medline]

Glover, I.C. 1980. Agricultural origins in East Asia. In The Cambridge encyclopedia of archaeology (ed. A. Sherratt), pp. 152-161. Crown, New York.

Hammer, M.F. and Horai, S. 1995. Y chromosomal DNA variation and the peopling of Japan. Am. J. Hum. Genet. 56: 951-962.[Medline]

Hanihara, K. 1991. Dual structure model for the population history of the Japanese. Japan Review 2: 1-33.

Helgason, A., Sigureth Ardottir, S., Gulcher, J.R., Ward, R., and Stefansson, K. 2000. mtDNA and the origin of the Icelanders: Deciphering signals of recent population history. Am. J. Hum. Genet. 66: 999-1016.[CrossRef][Medline]

Helgason, A., Hickey, E., Goodacre, S., Bosnes, V., Stefánsson, K., Ward, R., and Sykes, B. 2001. mtDNA and the islands of the North Atlantic: Estimating the proportions of Norse and Gaelic ancestry. Am. J. Hum. Genet. 68: 723-737.[CrossRef][Medline]

Herrnstadt, C., Elson, J.L., Fahy, E., Preston, G., Turnbull, D.M., Anderson, C., Ghosh, S.S., Olefsky, J.M., Beal, M.F., Davis, R.E., et al. 2002. Reduced-median-network analysis of complete mitochondrial DNA coding-region sequences for the major African, Asian, and European haplogroups. Am. J. Hum. Genet. 70: 1152-1171.[CrossRef][Medline]

Horai, S. and Hayasaka, K. 1990. Intraspecific nucleotide sequence differences in the major noncoding region of human mitochondrial DNA. Am. J. Hum. Genet. 46: 828-842.[Medline]

Horai, S. and Matsunaga, E. 1986. Mitochondrial DNA polymorphism in Japanese. II. Analysis with restriction enzymes of four or five base pair recognition. Hum. Genet. 72: 105-117.[Medline]

Horai, S., Kondo, R., Murayama, K., Hayashi, S., Koike, H., and Nakai, N. 1991. Phylogenetic affiliation of ancient and contemporary humans inferred from mitochondrial DNA. Phil. Trans. R Soc. Lond. B 333: 409-417.[CrossRef][Medline]

Horai, S., Murayama, K., Hayasaka, K., Matsubayashi, S., Hattori, Y., Fucharoen, G., Harihara, S., Park, K.S., Omoto, K., and Pan, I.H. 1996. mtDNA polymorphism in East Asian Populations, with special reference to the peopling of Japan. Am. J. Hum. Genet. 59: 579-590.[Medline]

Imaizumi, K., Parsons, T.J., Yoshino, M., and Holland, M.M. 2002. A new database of mitochondrial DNA hypervariable regions I and II sequences from 162 Japanese individuals. Int. J. Legal. Med. 116: 68-73.[CrossRef][Medline]

Ingman, M. and Gyllensten, U. 2003. Mitochondrial genome variation and evolutionary history of Australian and New Guinean Aborigines. Genome Res. 13: 1600-1606.[Abstract/Free Full Text]

Ingman, M., Kaessmann, H., Pääbo, S., and Gyllensten, U. 2000. Mitochondrial genome variation and the origin of modern humans. Nature 408: 708-713.[CrossRef][Medline]

Jorde, L.B., Bamshad, M.J., Watkins, W.S., Zenger, R., Fraley, A.E., Krakowiak, P.A., Carpenter, K.D., Soodyall, H., Jenkins, T., and Rogers, A.R. 1995. Origins and affinities of modern humans: A comparison of mitochondrial and nuclear genetic data. Am. J. Hum. Genet. 57: 523-538.[Medline]

Kivisild, T., Kaldma, K., Metspalu, M., Parik, J., Papiha, S., and Villems, R. 1999. The place of the Indian mitochondrial DNA variants in the global network of maternal lineages and the peopling of the Old World. In Genomic diversity: Applications in human population genetics (eds. S. Papiha et al.), pp. 135-152. Plenum Press, New York.

Kivisild, T., Tolk, H.-V., Parik, J., Wang, Y., Papiha, S.S., Bandelt, H.-J., and Villems, R. 2002. The emerging limbs and twigs of the East Asian mtDNA tree. Mol. Biol. Evol. 19: 1737-1751.[Abstract/Free Full Text]

Kivisild, T., Rootsi, S., Metspalu, M., Mastana, S., Kaldma, K., Parik, J., Metspalu, E., Adojaan, M., Tolk, H.V., Stepanov, V., et al. 2003. The genetic heritage of the earliest settlers persists both in Indian tribal and caste populations. Am. J. Hum. Genet. 72: 313-332.[CrossRef][Medline]

Kolman, C.J., Sambuughin, N., and Bermingham, E. 1996. Mitochondrial DNA analysis of Mongolian populations and implications for the origin of New World founders. Genetics 142: 1321-1334.[Abstract]

Kong, Q.-P., Yao, Y.-G., Sun, C., Bandelt, H.-J., Zhu, C.-L., and Zhang, Y.-P. 2003. Phylogeny of East Asian mitochondrial DNA lineages inferred from complete sequences. Am. J. Hum. Genet. 73: 671-676.[CrossRef][Medline]

Koyama, H., Iwasa, M., Maeno, Y., Tsuchimochi, T., Isobe, I., Seko-Nakamura, Y., Monma-Ohtaki, J., Matsumoto, T., Ogawa, S., Sato, B., et al. 2002. Mitochondrial sequence haplotype in the Japanese population. Forensic Sci. Int. 125: 93-96.[CrossRef][Medline]

Kruskal, J.B. and Wish, M. 1978. Multidimensional scaling. Sage Publications, Beverly Hills, CA.

Kumar, S., Tamura, K., Jakobsen, I.B., and Nei, M. 2001. MEGA2: Molecular Evolutionary Genetics Analysis software. Bioinformatics 17: 1244-1245.[Abstract/Free Full Text]

Lee, S.D., Shin, C.H., Kim, K.B., Lee, Y.S., and Lee, J.B. 1997. Sequence variation of mitochondrial DNA control region in Koreans. Forensic Sci. Int. 87: 99-116.[CrossRef][Medline]

Lee, S.D., Lee, Y.S., and Lee, J.B. 2002. Polymorphism in the mitochondrial cytochrome B gene in Koreans. An additional marker for individual identification. Int. J. Legal Med. 116: 74-78.[CrossRef][Medline]

Maca-Meyer, N., González, A.M., Larruga, J.M., Flores, C., and Cabrera, V.M. 2001. Major genomic mitochondrial lineages delineate early human expansions. BMC Genet. 2: 13-20.[CrossRef][Medline]

Macaulay, V., Richards, M., Hickey, E., Vega, E., Cruciani, F., Guida, V., Scozzari, R., Bonne-Tamir, B., Sykes, B., and Torroni, A. 1999. The emerging tree of West Eurasian mtDNAs: A synthesis of control-region sequences and RFLP. Am. J. Hum. Genet. 64: 232-249.[CrossRef][Medline]

Malyarchuk, B.A. and Derenko, M.V. 2001. Mitochondrial DNA variability in Russians and Ukrainians: Implication to the origin of the Eastern Slavs. Ann. Hum. Genet. 65: 63-78.[CrossRef][Medline]

Matsumoto, H. 1988. Characteristics of Mongoloid and neighboring populations based on the genetic markers of human immunoglobulins. Hum. Genet. 80: 207-218.[CrossRef][Medline]

Melton, T., Clifford, S., Martinson, J., Batzer, M., and Stoneking, M. 1998. Genetic evidence for the proto-Austronesian homeland in Asia: mtDNA and nuclear DNA variation in Taiwanese aboriginal tribes. Am. J. Hum. Genet. 63: 1807-1823.[CrossRef][Medline]

Nei, M. 1995. The origins of human populations: Genetic, linguistic, and archeological data. In The origin and past of modern humans as viewed from DNA (eds. S. Brenner and K. Hanihara), pp. 71-91. World Scientific, Singapore.

Nishimaki, Y., Sato, K., Fang, L., Ma, M., Hasekura, H., and Boettcher, B. 1999. Sequence polymorphism in the mtDNA HV1 region in Japanese and Chinese. Legal Med. 1: 238-249.[CrossRef][Medline]

Omoto, K. and Saitou, N. 1997. Genetic origins of the Japanese: A partial support for the dual structure hypothesis. Am. J. Phys. Anthropol. 102: 437-446.[CrossRef][Medline]

Oota, H., Kitano, T., Jin, F., Yuasa, I., Wang, L., Ueda, S., Saitou, N., and Stoneking, M. 2002. Extreme mtDNA homogeneity in continental Asian populations. Am. J. Phys. Anthropol. 118: 146-153.[CrossRef][Medline]

Orito, E., Ichida, T., Sakugawa, H., Sata, M., Horiike, N., Hino, K., Okita, K., Okanoue, T., Iino, S., Tanaka, E., et al. 2001. Geographic distribution of hepatitis B virus (HBV) genotype in patients with chronic HBV infection in Japan. Hepatology 34: 590-594.[CrossRef][Medline]

Pfeiffer, H., Steighner, R., Fisher, R., Mornstad, H., Yoon, C.L., and Holland, M.M. 1998. Mitochondrial DNA extraction and typing from isolated dentin-experimental evaluation in a Korean population. Int. J. Legal Med. 111: 309-313.[CrossRef][Medline]

Qian, Y.P., Chu, Z.T., Dai, Q., Wei, C.D., Chu, J.Y., Tajima, A., and Horai, S. 2001. Mitochondrial DNA polymorphisms in Yunnan nationalities in China. J. Hum. Genet. 46: 211-220.[CrossRef][Medline]

Quintana-Murci, L., Semino, O., Bandelt, H.-J., Passarino, G., McElreavey, K., and Santachiara-Benereccetti, A.S. 1999. Genetic evidence of an early exit of Homo sapiens sapiens from Africa through eastern Africa. Nat. Genet. 23: 437-441.[CrossRef][Medline]

Redd, A.J. and Stoneking, M. 1999. Peopling of Sahul: mtDNA variation in aboriginal Australian and Papua New Guinean populations. Am. J. Hum. Genet. 65: 808-828.[CrossRef][Medline]

Richards, M., Macaulay, V., Bandelt, H.-J., and Sykes, B. 1998. Phylogeography of mitochondrial DNA in western Europe. Ann. Hum. Genet. 62: 241-260.[CrossRef][Medline]

Richards, M., Macaulay, V., Hickey, E., Vega, E., Sykes, B., Guida, V., Rengo, C., Sellitto, D., Cruciani, F., Kivisild, T., et al. 2000. Tracing European founder lineages in the Near Eastern mtDNA pool. Am. J. Hum. Genet. 67: 1251-1276.[Medline]

Ruiz-Pesini, E., Mishmar, D., Brandon, M., Procaccio, V., and Wallace, D.C. 2004. Effects of purifying and adaptive selection on regional variation in human mtDNA. Science 303: 223-226.[Abstract/Free Full Text]

Saillard, J., Forster, P., Lynnerup, N., Bandelt, H.J., and Norby, S. 2000. mtDNA variation among Greenland Eskimos: The edge of the Beringian expansion. Am. J. Hum. Genet. 67: 718-726.[CrossRef][Medline]

Saitou, N. and Nei, M. 1987. The neighbor-joining method: A new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4: 406-425.[Abstract]

Schneider, S., Roessli, D., and Excoffier, L. 2000. Arlequin ver. 2000: A software for population genetics data analysis. Genetic and Biometry Laboratory, University of Geneva, Switzerland.

Schurr, T.G., Sukernik, R.I., Starikovskaya, Y.B., and Wallace, D.C. 1999. Mitochondrial DNA variation in Koryaks and Itel'men: Population replacement in Okhotsk Sea-Bering Sea region during the Neolithic. Am. J. Phys. Anthropol. 108: 1-39.[Medline]

Seo, Y., Stradmann-Bellinghausen, B., Rittner, C., Takahama, K., and Schneider, P.M. 1998. Sequence polymorphism of mitochondrial DNA control region in Japanese. Forensic Sci. Int. 97: 155-164.[CrossRef][Medline]

Shields, G.F., Schmiechen, A.M., Frazier, B.L., Redd, A., Voevoda, M.I., Reed, J.K., and Ward, R.H. 1993. mtDNA sequences suggest a recent evolutionary divergence for Beringian and northern North American populations. Am. J. Hum. Genet. 53: 549-562.[Medline]

Shinoda, K.-I. and Kanai, S. 1999. Intracemetry genetic analysis at the Nakazuma Jomon site in Japan by mitochondrial DNA sequencing. Anthropol. Sci. 107: 129-140.

Shiraishi, T. 2002. Wakoku tanjou (The formation of ancient Japanese society). In History of Japan 1 (ed. T. Shiraishi et al.), pp. 8-94. Yoshikawakobunkan, Tokyo, Japan (in Japanese).

Snäll, N., Savontaus, M.-L., Kares, S., Lee, M.S., Cho, E.K., Rinne, J.O., and Huoponen, K. 2002. A rare mitochondrial DNA haplotype observed in Koreans. Hum. Biol. 74: 253-262.[Medline]

Soodyall, H., Jenkins, T., and Stoneking, M. 1995. `Polynesian' mtDNA in the Malagasy. Nat. Genet. 10: 377-378.[CrossRef][Medline]

Stoneking, M., Jorde, L.B., Bhatia, K., and Wilson, A.C. 1990. Geographic variation in human mitochondrial DNA from Papua New Guinea. Genetics 124: 717-733.[Abstract]

Su, B., Xiao, C., Deka, R., Seielstad, M.T., Kangwanpong, D., Xiao, J., Lu, D., Underhill, P., Cavalli-Sforza, L., Chakraborty, R., et al. 2000. Y chromosome haplotypes reveal prehistorical migrations to the Himalayas. Hum. Genet. 107: 582-590.[CrossRef][Medline]

Sykes, B., Leiboff, A., Low-Beer, J., Tetzner, S., and Richards, M. 1995. The origins of the Polynesians: An interpretation from mitochondrial lineage analysis. Am. J. Hum. Genet. 57: 1463-1475.[Medline]

Tajima, A., Sun, C.-S., Pan, I.-H., Ishida, T., Saitou, N., and Horai, S. 2003. Mitochondrial DNA polymorphisms in nine aboriginal groups of Taiwan: Implications for the population history of aboriginal Taiwanese. Hum. Genet. 113: 24-33.[CrossRef][Medline]

Takahata, N., Lee, S.-H., and Satta, Y. 2001. Testing multiregionality of modern human origins. Mol. Biol. Evol. 18: 172-183.[Abstract/Free Full Text]

Takeshita, T., Yasuda, Y., Nakashima, K., Mogi, K., Kishi, H., Shiono, K., Sagisaka, I., Yuasa, H., Nishimukai, H., and Kimura, H. 2001. Geographical north-south decline in DNASE^*2 in Japanese populations. Hum. Biol. 73: 129-134.[Medline]

Tanaka, M., Hayakawa, M., and Ozawa, T. 1996. Automated sequencing of mitochondrial DNA. Methods Enzymol. 264: 407-421.[Medline]

Torroni, A., Schurr, T.G., Yang, C.C., Szathmary, E.J., Williams, R.C., Schanfield, M.S., Troup, G.A., Knowler, W.C., Lawrence, D.N., Weiss, K.M., et al. 1992. Native American mitochondrial DNA analysis indicates that the Amerind and the Nadene populations were founded by two independent migrations. Genetics 130: 153-162.[Abstract]

Torroni, A., Sukernik, R.I., Schurr, T.G., Starikorskaya, Y.B., Cabell, M.F., Crawford, M.H., Comuzzie, A.G., and Wallace, D.C. 1993a. mtDNA variation of aboriginal Siberians reveals distinct genetic affinities with Native Americans. Am. J. Hum. Genet. 53: 591-608.[Medline]

Torroni, A., Schurr, T.G., Cabell, M.F., Brown, M.D., Neel, J.V., Larsen, M., Smith, D.G., Vullo, C.M., and Wallace, D.C. 1993b. Asian affinities and continental radiation of the four founding Native American mtDNAs. Am. J. Hum. Genet. 53: 563-590.[Medline]

Torroni, A., Miller, J.A., Moore, L.G., Zamudio, S., Zhuang, J., Droma, T., and Wallace, D.C. 1994. Mitochondrial DNA analysis in Tibet: Implications for the origin of the Tibetan population and its adaptation to high altitude. Am. J. Phys. Anthropol. 93: 189-199.[CrossRef][Medline]

Torroni, A., Huoponen, K., Francalacci, P., Petrozzi, M., Morelli, L., Scozzari, R., Obinu, D., Savontaus, M.-L., and Wallace, D.C. 1996. Classification of European mtDNAs from an analysis of three European populations. Genetics 144: 1835-1850.[Abstract]

Torroni, A., Rengo, C., Guida, V., Cruciani, F., Sellitto, D., Coppa, A., Calderon, F.L., Simionati, B., Valle, G., Richards, M., et al. 2001. Do the four clades of the mtDNA haplogroup L2 evolve at different rates? Am. J. Hum. Genet. 69: 1348-1356.[CrossRef][Medline]

Tsai, L.C., Lin, C.Y., Lee, J.C., Chang, J.G., Linacre, A., and Goodwin, W. 2001. Sequence polymorphism of mitochondrial D-loop DNA in the Taiwanese Han population. Forensic Sci. Int. 119: 239-247.[CrossRef][Medline]

Voevoda, M.I., Avksentyuk, A.V., Ivanova, A.V., Astakhova, T.I., Babenko, V.N., Kurilovich, S.A., Duffy, L.K., Segal, B., and Shields, G.F. 1994. Molecular genetic studies in the population of native inhabitants of Chukchee Peninsula. Analysis of polymorphism of mitochondrial DNA and of genes controlling alcohol metabolizing enzymes. Sibirskii Ekolog. Z. 1: 139-151.

Watson, E., Forster, P., Richards, M., and Bandelt, H.J. 1997. Mitochondrial footprints of human expansions in Africa. Am. J. Hum. Genet. 61: 691-704.[Medline]

Yao, Y.G., Lu, X.M., Luo, H.R., Li, W.H., and Zhang, Y.P. 2000a. Gene admixture in the silk road region of China: Evidence from mtDNA and melanocortin 1 receptor polymorphism. Genes Genet. Syst. 75: 173-178.[CrossRef][Medline]

Yao, Y.G., Watkins, W.S., and Zhang, Y.P. 2000b. Evolutionary history of the mtDNA 9-bp deletion in Chinese populations and its relevance to the peopling of east and southeast Asia. Hum. Genet. 107: 504-512.[CrossRef][Medline]

Yao, Y.G., Kong, Q.P., Bandelt, H.J., Kivisild, T., and Zhang, Y.P. 2002a. Phylogeographic differentiation of mitochondrial DNA in Han Chinese. Am. J. Hum. Genet. 70: 635-651.[CrossRef][Medline]

Yao, Y.-G., Nie, L., Harpending, H., Fu, Y.-X., Yuan, Z.-G., and Zhang, Y.-P. 2002b. Genetic relationship of Chinese ethnic populations revealed by mtDNA sequence diversity. Am. J. Phys. Anthropol. 118: 63-76.[CrossRef][Medline]

WEB SITE REFERENCES

Top
ABSTRACT
RESULTS
DISCUSSION
METHODS
REFERENCES
WEB SITE REFERENCES

http://www.fluxus-engineering.com/; Network 3.1 program, Fluxus Engineering.

http://www.giib.or.jp/mtsnp/index_e.html; authors' data.

Received December 17, 2003; Revision received June 14, 2004.

This article has been cited by other articles: (Search Google Scholar for Other Citing Articles)

R. W. Carter
Mitochondrial diversity within modern human populations
Nucleic Acids Res., May 14, 2007; 35(9): 3039 - 3045.
[Abstract] [Full Text] [PDF]

M. Tanaka, N. Fuku, Y. Nishigaki, H. Matsuo, T. Segawa, S. Watanabe, K. Kato, K. Yoko, M. Ito, Y. Nozawa, and Y. Yamada
Women With Mitochondrial Haplogroup N9a Are Protected Against Metabolic Syndrome
Diabetes, February 1, 2007; 56(2): 518 - 521.
[Abstract] [Full Text] [PDF]

R Hinttala, R Smeets, J S Moilanen, C Ugalde, J Uusimaa, J A M Smeitink, and K Majamaa
Analysis of mitochondrial DNA sequences in patients with isolated or combined oxidative phosphorylation system deficiency
J. Med. Genet., November 1, 2006; 43(11): 881 - 886.
[Abstract] [Full Text] [PDF]

C. H. Cannon, C. S. Kua, E. K. Lobenhofer, and P. Hurban
Capturing genomic signatures of DNA sequence variation using a standard anonymous microarray platform
Nucleic Acids Res., October 6, 2006; 34(18): e121 - e121.
[Abstract] [Full Text] [PDF]

M. J. Pierson, R. Martinez-Arias, B. R. Holland, N. J. Gemmell, M. E. Hurles, and D. Penny
Deciphering Past Human Population Movements in Oceania: Provably Optimal Trees of 127 mtDNA Genomes
Mol. Biol. Evol., October 1, 2006; 23(10): 1966 - 1975.
[Abstract] [Full Text] [PDF]

Q.-P. Kong, H.-J. Bandelt, C. Sun, Y.-G. Yao, A. Salas, A. Achilli, C.-Y. Wang, L. Zhong, C.-L. Zhu, S.-F. Wu, A. Torroni, and Y.-P. Zhang
Updating the East Asian mtDNA phylogeny: a prerequisite for the identification of pathogenic mutations
Hum. Mol. Genet., July 1, 2006; 15(13): 2076 - 2086.
[Abstract] [Full Text] [PDF]

B. J. Willcox, D. C. Willcox, Q. He, J. D. Curb, and M. Suzuki
Siblings of okinawan centenarians share lifelong mortality advantages.
J. Gerontol. A Biol. Sci. Med. Sci., April 1, 2006; 61(4): 345 - 354.
[Abstract] [Full Text] [PDF]

C. Sun, Q.-P. Kong, M. g. Palanichamy, S. Agrawal, H.-J. Bandelt, Y.-G. Yao, F. Khan, C.-L. Zhu, T. K. Chaudhuri, and Y.-P. Zhang
The Dazzling Array of Basal Branches in the mtDNA Macrohaplogroup M from India as Inferred from Complete Genomes
Mol. Biol. Evol., March 1, 2006; 23(3): 683 - 690.
[Abstract] [Full Text] [PDF]

T. Kivisild, P. Shen, D. P. Wall, B. Do, R. Sung, K. Davis, G. Passarino, P. A. Underhill, C. Scharfe, A. Torroni, R. Scozzari, D. Modiano, A. Coppa, P. de Knijff, M. Feldman, L. L. Cavalli-Sforza, and P. J. Oefner
The Role of Selection in the Evolution of Human Mitochondrial Genomes
Genetics, January 1, 2006; 172(1): 373 - 387.
[Abstract] [Full Text] [PDF]

K K Abu-Amero, T M Bosley, S Bohlega, and D McLean
Complex I respiratory defect in LHON plus dystonia with no mitochondrial DNA mutation
Br. J. Ophthalmol., October 1, 2005; 89(10): 1380 - 1381.
[Full Text] [PDF]

V. Macaulay, C. Hill, A. Achilli, C. Rengo, D. Clarke, W. Meehan, J. Blackburn, O. Semino, R. Scozzari, F. Cruciani, A. Taha, N. K. Shaari, J. M. Raja, P. Ismail, Z. Zainuddin, W. Goodwin, D. Bulbeck, H.-J. Bandelt, S. Oppenheimer, A. Torroni, and M. Richards
Single, Rapid Coastal Settlement of Asia Revealed by Analysis of Complete Mitochondrial Genomes
Science, May 13, 2005; 308(5724): 1034 - 1036.
[Abstract] [Full Text] [PDF]

This Article

Abstract

Full Text (PDF)

Supplemental Research Data

Alert me when this article is cited

Alert me if a correction is posted

Citation Map

Services

Email this article to a friend

Similar articles in this journal

Similar articles in PubMed

Alert me to new issues of the journal

Download to citation manager

Google Scholar

Articles by Tanaka, M.

Articles by Shimodaira, H.

Articles citing this Article

Search for Related Content

PubMed

PubMed Citation

Articles by Tanaka, M.

Articles by Shimodaira, H.

Pubmed/NCBI databases

Gene GEO Profiles

Nucleotide Protein