Berber and Arab DNA in Europe from the Moorish occupation

North African Berber and Arab Influences in the Western Mediterranean Revealed by Y-Chromosome DNA Haplotypes

We have analyzed Y-chromosome diversity in the western Mediterranean area, examining p49a,f TaqI haplotype V and subhaplotypes Vb (Berber) and Va (Arab). A total of 2,196 unrelated DNA samples, belonging to 22 populations from North Africa and the southern Mediterranean coast of occidental Europe, have been typed. Subhaplotype Vb, predominant in a Berber population of Morocco (63.5%), was also found at high frequencies in southern Portugal (35.9%) and Andalusia (25.4%). The Arab subhaplotype Va, predominant in Algeria (53.9%) and Tunisia (50.6%), was also found at a relatively high frequency in Sicily (23.1%) and Naples (16.4%); its highest frequency in Iberia was in northern Portugal (22.8%) and Andalusia (15.5%). In Iberia there is a gradient of decreasing frequencies in latitude for both subhaplotypes Va and Vb, related to eight centuries of Muslim domination (8th to 15th centuries) in southern Iberia.

During the 7th century A.D., Muslim people coming from the Arabian peninsula and the Middle East invaded North Africa. The most important population movement relating both sides of the Mediterranean Sea was the conquest of the Iberian peninsula by North African populations (with recruited Berbers), soon after the first Muslim invasion. More than eight centuries (8th to 15th centuries) of Muslim domination in the southern part of Iberia imparted an important cultural legacy (Conrad 1998) and probable gene exchanges between North African and Iberian populations.

Variations in DNA sequences specific to the nonrecombinant part of the Y chromosome, relating to paternal ancestry, are particularly interesting from a human population genetics point of view. The first published and most informative probe used in Southern blots for this objective is p49 (locus DYSl), which is able to identify at last five TaqI male-specific fragments (A, C, D, F, and I) that are polymorphic between individuals (Lucotte and Ngo 1985).

Sixteen main corresponding haplotypes (numbered I-XVI) were identified using the p49 probe on DNA samples of unrelated males living in France (Ngo et al. 1986). Only recently has the molecular basis of the p49 TaqI polymorphisms been established (Jovelin et al. 2003); the polymorphisms correspond to variable TaqI sites located in the four DAZ genes located in the AZF-c region of the Y chromosome.

In fact, the conventional p49 TaqI polymorphisms were the most popular markers used in various populations because of their ability to detect more than 100 different haplotypes [for a compilation on the subject until the end of 1995, see Poloni et al. (1997)]. Haplotype XV (A3,C1,D2,F1,I1) was the most widespread haplotype in our initial study (Ngo et al. 1986).

Haplotype XV was also predominant in the first European study we published (Lucotte and Hazout 1996), with elevated frequencies in French Basques. The geographic distribution of haplotype XV in Europe reveals a gradient of decreasing frequencies from this Basque focus toward eastern peripheral countries (Lucotte and Loirat 1999) but also toward southwestern countries. According to the Y Chromosome Consortium (2002) nomenclature, haplotype XV corresponds to the M173 lineage (Diéterlen and Lucotte 2005).

Haplotype V (E3b1b) (A2,C0,D0,F1,I1) is the most frequent haplotype in North Africa (Lucotte et al. 2000), with a particularly high frequency (55%) in the populations with a relative predominance of Berber origin. Our previous study on the subject examined the relative frequencies of haplotype V in four Iberian populations compared with a Berber population living in North Africa (Lucotte et al. 2001). The highest frequency of haplotype V (68.9%) was observed in Berbers from Morocco, and the geographic distribution of haplotype V revealed a gradient of decreasing frequencies with latitude in Iberia (40.8% in Andalusia, 36.2% in Portugal, 12.1% in Catalonia, and 11.3% in the Basque Country) (Lucotte et al. 2001); such a cline of decreasing haplotype V frequencies from the south to the north in Iberia clearly established a gene flow from North Africa toward Iberia.

According to the Y Chromosome Consortium (2002) nomenclature, haplogroup E is characterized by the mutations SRY4064, M96, and P29 on a background defined by the insertion of an Alu element (YAP + ). The third clade, E3 (defined by the mutation P2), of haplogroup E is further subdivided into two monophyletic forms, the second one (E3b) being characterized by mutations M35 and M125. All of the 110 p49 TaqI haplotype V subjects from Morocco (51 Berbers and 59 Arabs) that we had previously tested correspond to haplogroup E3b.

In the present study we have subdivided haplotype V into its Berber (Vb) and Arab (Va) components in order to distinguish the relative contributions of these two ethnicity-specific markers in the gene pools of the populations living in Iberia and in other populations in the northern part of the western Mediterranean area.

DNA Samples. This study concerns 2,196 unrelated male DNA samples (Table 1). We collected 904 new unrelated males subjects, from three different countries (Portugal, France, and Italy): 79 from North Portugal and 59 from South Portugal; 243 from the Marseilles region of France; 192 from Genoa, 64 from Rome, and 128 from Naples in continental Italy; 39 from Sicily; and 100 from Sardinia.

All these new samples correspond to adult males, whose origin is based on the local birthplace of their fathers and (at least) grandfathers. We have obtained informed consent from each of the French subjects studied.

We add for comparison the following subjects, already tested as bearing haplotype V in previous studies: 11 subjects from Mauritania, 51 Berbers from Morocco, 59 Arabs from Rabat, 80 subjects from Algeria, 39 subjects from Tunisia, and 17 subjects from Libya (Lucotte et al. 2000); 29 Spaniards from Sevilla (Lucotte et al. 2001); 4 Spaniards from Barcelona and 9 French Catalans from Perpignan (Lucotte and Loirat 1999); 11 French Basques, 1 subject from Montpellier, and 7 subjects from Grasse in France and 6 subjects from Milan in Italy (Lucotte and Hazout 1996); and 44 subjects from Corsica (Lucotte et al. 2002).


Table 1 summarizes the frequencies we obtained for haplotype V and sub-haplotypes Vb and Va in the 22 study populations. For the 2,196 males typed, 491 (22.3%) bear haplotype V. The frequency of haplotype V is 35.5% in Portugal, with a more elevated proportion in the south (49.2%) than in the north (25.3%). The frequency of haplotype V in the Marseilles region (11.1%) has a value similar to the mean value in continental France (9%). In Italy the highest frequency is attained in Sicily (28.2%), followed by Naples at 17.2%. As previously shown (Lucotte et al. 2000), haplotype V is found at the highest frequency (68.9%) in Berbers from Marrakech in Morocco; an apparently increasing east-west cline in haplotype V frequencies is shown in North Africa from Libya (44.7%) to Rabat (57.7%), with intermediate values for Tunisia (53.4%) and Algeria (56.7%). In Spain haplotype V is much more frequent (40.9%) in the south of the country [in Andalusia (Sevilla)] than in the north (12.9%) [in Catalonia (Barcelona)].

Subhaplotype Vb is the Berber subhaplotype because its most elevated relative value (63.5%) is obtained for the Berber population of Marrakech. In the non-Berber population of Rabat in Morocco, the frequency of subhaplotype Vb is only 20.6%, whereas the frequency of subhaplotype Va (Arab) is 37.3%. In order of decreasing values, the subhaplotype Vb frequencies are 40% in Mauritania, 35.9% in South Portugal, 25.4% in Andalusia, and 15.8% in Libya.

Low frequencies of subhaplotype Vb are found in Sicily (5.1%), Algeria (2.8%), Tunisia (2.7%), and North Portugal (2.5%); frequencies less than 2% are found in French Basques (1.9%), in Naples (0.8%), and in Corsica (0.6%), Subhaplotype Vb is absent in Catalonia (Barcelona and Perpignan), in the south of France (Montpellier, Grasse, and the region of Marseilles), in continental Italy (Milan, Genoa, and Rome), and in Sardinia.

Table 2 summarizes the frequencies of subhaplotype Vb in North Africa, Iberia, the south of France, and Italy. The maximum value (63.5%) concerns the Berber population, but this frequency is notably lower (9.3%) for other populations from North Africa. In southern Iberia an elevated value (30%) is observed, but the frequency of subhaplotype Vb is only 1.8% in northern Iberia. These frequencies are less than 1% in France and Italy.

Figure 2 shows the isofrequency map of subhaplotype Vb in the western Mediterranean area (coordinates on the map: x = longitude, y = latitude). From the Berber focus in Berbers from southern Morocco, the frequencies of subhaplotype Vb decrease in North Africa to the north of Morocco and to the east in Algeria and Tunisia. For Iberia the most elevated value of subhaplotype Vb frequencies is in southern Portugal; relatively elevated values are observed in Andalusia, moderate values are observed in the southern part of Spain, and low values are seen in Catalonia.

In the present study all haplotype V non-subhaplotype Vb subjects are termed subhaplotype Va (Arab) subjects. Their maximum relative frequencies are 53.9% (Algeria), 50% (Tunisia), and 37.3% (Rabat) in North Africa. Table 3 summarizes the frequencies of subhaplotype Va in North Africa, Iberia, southern France, and Italy. The maximum value (45.8%) is found in North Africa. In northern Iberia a slightly more elevated value is observed (20%) compared to southern Iberia (14.6%). A frequency of 10.3% is seen in France, and in Italy the 14.6% value observed in the south is relatively more elevated than in the north (3.4%).

Figure 3 gives the isofrequency map of subhaplotype Vb. In North Africa frequencies decrease from east to west and southward. For southern Europe the map shows the relatively higher percentages observed in the south of Italy versus the north and (to a lesser degree) in the north of Iberia versus the south.

In our PCR assay the 68 Moroccan subjects with subhaplotype Vb (47 Berbers and 21 Arabs) were tested for the M81 marker: All subjects were positive for the M81 marker, so subhaplotype Vb is homologous with subhaplogroup E3b2. The 38 Moroccan non-Berber subjects were further tested for the M78 marker: Only 31 of them (80.8%) were positive for the M78 marker; we conclude that, in Morocco at least, subhaplotype Va corresponds only partly to subhaplogroup E3b1.

P49a,f TaqI haplotype V, which is homologous with haplogroup E3b according to the Y Chromosome Consortium (2002) nomenclature, is the predominant Y-chromosome haplotype in North Africa (Lucotte et al. 2000), where its geographic distribution shows an east to west cline. In the present study we have extended the research of haplotype V frequencies (Lucotte et al. 2001) in various European populations located in the western Mediterranean basin to include France, Portugal, and Italy. The frequency of haplotype V in the Marseilles region is 11.1%, a value similar to the main value we obtained previously for continental France (Lucotte and Hazout 1996). In continental Italy we observed the highest haplotype V frequency in Naples (17.2%); Sicily, with a frequency of 28.2%, corresponds to the most elevated value we observed for Italy. In South Portugal the frequency of haplotype V is very high (49.2%); we had previously obtained a similar value for Libya and for Mauritania. The frequency of haplotype V for North Portugal (25.3%) is similar to the value we obtained for Sicily in the present study.

Related Results
Y-chromosome DNA haplotypes in North African populations

To better divide haplotype V into its ethnic components, we have subdivided it into subhaplotypes Vb (Berber) and Va (Arab). We have established that subhaplotype Vb is the Berber haplotype, because it is present at very elevated frequencies (63.5%) in our Berber population from Morocco but at relatively low frequencies (20.6%) in our non-Berber population of Rabat. Such a distinction of a Berber component was also realized by Scozzari et al. (2001), because they observed that the haplogroup they named 25.2 was also more frequent in the Berber population from Morocco than in Arabs. Our present results show that subhaplotype Vb frequencies in North Africa decrease from west to east, starting from the Berber focus in Morocco; in the western Mediterranean area subhaplotype Vb is at low frequencies along the south coast of Europe but occurs at relatively elevated frequencies in southern Iberia (peaking at 35.9% in South Portugal). Flores et al. (2004), in their important study of various locations in Iberia, observed that subhaplogroup E3b2 is more frequent in southern Iberia, attaining a maximum value of 11.5% in the region of Málaga.

In the present study all the non-subhaplotype Vb subjects bearing haplo-type V are classified as subhaplotype Va (Arab); they probably correspond to a heterogeneous group representing various ethnicities (our results concerning the incomplete correspondence between subhaplotypes Va and E3b1 in Morocco suggest that). We have shown here that in North Africa the focus of subhaplotype Va frequencies is in Algeria (53.9%) and Tunisia (50.6%); from this focus frequencies of subhaplotype Va decrease in the south and the west of the region.

Subhaplotype Va attains substantial frequencies along the southern coast of Europe; these frequencies reached relatively elevated frequencies in France (Perpignan, 11.8%) and in southern Italy (Naples, 16.4%; Sicily, 23.1%). For Iberia, relatively more elevated values are attained for Andalusia (15.5%) and for North Portugal (22.8%). Brion et al. (2004) also showed relatively higher frequencies of haplogroup E* (xE3a) (up to 18.3%) in their study concerning northern Iberia.

We had previously established (Lucotte et al. 2001) that haplotype V showed a gradient of decreasing frequencies with latitude in Iberia, and we interpreted this pattern as a consequence of the historical Islamic occupation of the peninsula (Conrad 1998). The results reported in the present study concerning subhaplotypes Vb and Va (subhaplotype isofrequencies maps given in Figures 2 and 3) have again shown both of these gradients. From this perspective, the opposite pattern of gradient frequencies observed in Iberia for the western European haplotype XV (Diéterlen and Lucotte 2005) is reconciled with the slow reconquest of the Iberian peninsula from the north by the Christians, which lasted seven centuries and ended in Granada in 1492.

Unfortunately I can’t find the diagrams for this piece. This shows a larger Arab and Berber than the other study I’ve seen into this. I shall have to have another look at it. Notably, for all you Afrocentrists who think Moors were black Africans, there are only Arab and Berber Y chromosomes found. Sub Saharan ancestry only shows up at trace in these studies.