I've been away for some time, but one thing that has always stuck with me is the set of results for the Iberomaurusian (Taforalt) samples, both from the original Loosdrecht et al. paper that published them [1] and from the later Dzudzuana preprint by Iosif Lazaridis et al. [2].
A two-way admixture model, comprising Natufian and a sub-Saharan African population, does not significantly deviate from our data (χ2 p ≥ 0.128) with 63.5% Natufian and 36.5% sub-Saharan African ancestry on average (table S8).
As quoted above, the original paper posited, though with only a modest fit, that the Iberomaurusians were roughly 40:60 Sub-Saharan:Eurasian, modeling them as a two-way admixture between something Natufian-like and something Sub-Saharan. Very intriguingly, that Sub-Saharan side had broad SSA affinities, seeming part West-African, part East-African and even part various sorts of SSA Hunter-Gatherer:
These results can only be explained by Taforalt harboring an ancestry that contains additional affinity with South, East and Central African outgroups.
However, after the Dzudzuana samples from the Caucasus turned up and appeared to be an early form of Anatolian-type Hunter-Gatherers and farmers, Lazaridis et al. put forward a new model, once again with a rather modest fit (z-score of 2.4), in which the Iberomaurusians are instead modeled as close to 50:50 Sub-Saharan:Eurasian:
They also, as you can see, flipped the narrative: it was most likely an Iberomaurusian-like population that contributed ancestry to Natufians rather than the other way around. Their percentages of Sub-Saharan-related and Eurasian ancestry in Iberomaurusians were also reached independently by another, unrelated paper that touched on the Iberomaurusians mostly offhandedly [3]:
This paper also landed on a more or less 50:50 Sub-Saharan:Eurasian background for the Iberomaurusians, as you can see above, and I'm inclined to agree with it and the Dzudzuana preprint because their results are very consistent with how Iberomaurusians cluster on a global PCA between Sub-Saharan and Eurasian populations:
At any rate, the study by Pickrell et al. [4] did a better job than many others before or since because it utilized ALDER, a linkage-disequilibrium-based method that estimates how much Eurasian ancestry entered African populations and when, rather than just assuming a simple one-time event. This allowed them to separate different waves of admixture rather than lumping everything together. They also reconstructed the ancestral Eurasian source instead of using modern populations like Sardinians or Europeans, which can distort results.
By doing this, they provided a more accurate estimate of true Eurasian ancestry in African populations rather than forcing them into comparisons with groups that don’t fully represent their ancient genetic makeup.
Finally, they recognized that bad reference choices can make ancestry "disappear." Studies before the Dzudzuana preprint failed to detect anything remotely Sub-Saharan-related in populations like Natufians, or as much Sub-Saharan-related ancestry as there likely is in Iberomaurusians, because they compared them to the wrong groups, such as Yorubas, Dinkas, the Mbuti or whoever else, who may not be relevant to the type of SSA ancestry these populations actually carried.
Pickrell and his colleagues avoided this sort of pitfall, making their results much more reliable than most. Theirs is the only study I've ever seen where the percentages almost perfectly match a given population's global PCA position with respect to where it sits between Eurasians and Sub-Saharan Africans:
The results you see above are my own concoction using R and David Wesolowski's Global25 PCA coordinates. I calculated ancestry proportions by measuring the distance of each population to two reference clusters—one Sub-Saharan and one Eurasian—using mean PCA coordinates. The closer a population is to one cluster, the higher its assigned ancestry from that group, creating a proportional estimate of SSA vs. Eurasian ancestry.
The Sub-Saharan cluster in this case was represented by Mota, the Gumuz, and Mbutis, whereas the Eurasian cluster was represented by the previously mentioned WHG and Anatolian HG samples (AHG). I find it quite remarkable how I was able to so closely match Pickrell's results. In my opinion, much of the difference is attributable to the minor Eurasian ancestry in the Gumuz. I would not be shocked to see an almost 1:1 match between my results and Pickrell's if we ever have a "pure" AEA population to include among the SSA cluster.
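A minimal Python sketch of this distance-based idea might look like the following. The exact formula is an assumption (each share is taken as the distance to the opposite centroid divided by the sum of both distances), the function names are made up for illustration, and the two-dimensional coordinates are toy values, not real Global25 data. The R scripts linked at the end of the post remain the reference for what was actually run.

```python
import math

def centroid(rows):
    """Average a list of equal-length coordinate vectors, dimension by dimension."""
    return [sum(col) / len(rows) for col in zip(*rows)]

def ancestry_split(sample, ssa_refs, eurasian_refs):
    """Return assumed SSA and Eurasian shares for one sample.

    Assumption: a share is the distance to the OPPOSITE centroid divided by
    the sum of both distances, so sitting closer to a cluster means a larger
    share from it.
    """
    d_ssa = math.dist(sample, centroid(ssa_refs))
    d_eur = math.dist(sample, centroid(eurasian_refs))
    total = d_ssa + d_eur
    return {"ssa": d_eur / total, "eurasian": d_ssa / total}

# Toy 2-D coordinates (hypothetical, not real Global25 values).
ssa_refs = [[-0.6, 0.0], [-0.4, 0.0]]
eurasian_refs = [[0.1, 0.1], [0.1, -0.1]]
sample = [-0.2, 0.0]

shares = ancestry_split(sample, ssa_refs, eurasian_refs)
print({name: round(value, 3) for name, value in shares.items()})
```

A ratio like this only reads cleanly as a proportion when the sample sits roughly on the line between the two centroids; the further off that line a population falls, the less the numbers mean.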
It’s always encouraging when distinct methodologies yield similar results and this to me suggests Iberomaurusians were indeed roughly 50:50 Sub-Saharan:Eurasian with a slight skew toward the latter, as indicated by the Dzudzuana preprint, the Shum Laka paper, and the PCA positionings.
But moving on from that, you might wonder why I refer to Ancestral North African (ANA) ancestry as "Sub-Saharan" admixture. The Dzudzuana preprint and Shum Laka paper appear to theorize that the ANA lineage, if it existed, may have originated and spent much of its history no farther south than the Sahel, making "SSA" somewhat of a misnomer. While this might be true geographically, genetically it is very much a Sub-Saharan population:

Above are a public simulated ANA component of unknown origin that I found circulating around the anthro-sphere (possibly made using Genoplot) and my own concoction in R.
Funnily enough, I initially attempted to reconstruct ANA following Loosdrecht et al.'s ~65% Natufian-like and ~35% Sub-Saharan model while trying to explain the Iberomaurusians' clustering. Instead, I noticed that this population, albeit with a poor fit, looked more like it had contributed ancestry to both Natufians and Iberomaurusians, with AHGs fitting the bill for the other side of the Iberomaurusians' roots:

That led me to try a different approach: instead of assuming Iberomaurusians were a fixed proportion of different ancestries, I used linear algebra to infer the missing population. I considered their PCA clustering position, the clustering position of AHGs, and calculated where another population must have been positioned to explain Iberomaurusians' pull away from AHGs:
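ANA = Iberomaurusian + λ(Iberomaurusian − AHG)
Here λ is a positive scaling factor: the larger it is, the farther past the Iberomaurusians the inferred population lands on the line running from AHGs through them. Solve for that using something like R or Python and you should be able to get my final simulated ANA population's coordinates (it is the one used in the PCAs above):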
My_Simulated_ANA,-0.494675,-0.00507719999999998,-0.0769332,-0.091086,-0.018835,-0.0725678,-0.1372456,0.0376134,0.2645316,-0.0786172,0.0414416,-0.0735542,0.1736056,-0.0870324,0.1799378,-0.067753,-0.027589,-0.1310216,-0.2921226,0.0829396,-0.064087,-0.2577662,0.1478732,-0.0187496,0.0461032
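For readers who want to see the formula in action, here is a small Python sketch. The value of λ behind the coordinates above is not stated here, so the sketch adds one assumption: if Iberomaurusians are treated as a linear two-way mix in PCA space, Iberomaurusian = (1 − a)·AHG + a·ANA, then rearranging gives λ = (1 − a)/a, so a 50:50 mix corresponds to λ = 1. The function names and the two-dimensional vectors are hypothetical and only there to show that the extrapolation recovers a hidden source.

```python
def lam_from_share(unknown_share):
    """Lambda implied by a linear two-way mix with the given unknown-source share."""
    if not 0 < unknown_share <= 1:
        raise ValueError("unknown_share must be in (0, 1]")
    return (1 - unknown_share) / unknown_share

def extrapolate_source(mixed, known_source, lam):
    """Apply unknown = mixed + lam * (mixed - known_source), coordinate by coordinate."""
    return [m + lam * (m - k) for m, k in zip(mixed, known_source)]

# Toy 2-D vectors (hypothetical, not real Global25 coordinates).
ahg_like = [0.1, 0.0]      # stands in for the known source
hidden_ana = [-0.5, 0.2]   # the source we pretend not to know
share = 0.5                # assumed share of the hidden source

# Build the "observed" mixed population from the two sources.
mixed = [(1 - share) * k + share * h for k, h in zip(ahg_like, hidden_ana)]

# Extrapolate past the mixed population, away from the known source.
recovered = extrapolate_source(mixed, ahg_like, lam_from_share(share))
print([round(x, 6) for x in recovered])
```

The same call works on full 25-dimension Global25 rows; only the choice of λ (or, equivalently, the assumed share) changes where the simulated population ends up.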
What's incredibly interesting about this simulated sample is that it plainly clusters like a Sub-Saharan population. Like the 63.5% Natufian and 36.5% ANA based simulation and the public ANA simulation, it also flips Loosdrecht et al.'s model around: both Iberomaurusians and Natufians look more like varying mixtures between something Anatolian Hunter-Gatherer related and this component, rather than Iberomaurusians looking Natufian-like admixed:
Despite the poor fit for Natufians and the overdone fit for Iberomaurusians (we will surely need more appropriate samples such as Dzudzuana, real ANAs and ancient DNA from Egypt), these are quite interesting results, because I effectively came to the same conclusion as the Dzudzuana preprint before rereading it and refreshing my memory on its findings, and did so via a totally different methodology.
Again, it is always encouraging when differing methodologies point in the same direction like this. In both my PCA clustering and the Dzudzuana preprint, ANA clusters plainly like a Sub-Saharan component, albeit one that seems to have elevated affinities to Eurasians, similar to but greater than Mota's, showing something of a pull toward Eurasians mostly along the y-axis of the following PCA:

I strongly suspect that this is because, as the Dzudzuana preprint's qpGraph essentially depicts, it is a cousin to the AEA cluster often discussed on this blog, a component that makes up most of the ancestry in Mota as well as the modern Dinka and Gumuz. Even more so than AEA, ANA might have remained related to the Proto-Eurasians longer.
I'm sure those of you well-versed in population genetics are well aware that all the populations outside of Africa, including Native Americans, Pacific Islanders and Australasians, appear to descend from a single population that expanded out of Africa sometime between 50,000 and 75,000 years ago: the "Proto-Eurasians".
This bottleneck may have formed in Africa itself before they left, and I ultimately suspect that, whatever ANA is, it probably represents a population that remained a sister population to these Proto-Eurasians right up until the last second. In a way, it is Basal Eurasian before Basal Eurasian, but since, unlike Basal Eurasian, it doesn't seem to have participated in the Eurasian bottleneck, it remains a firmly "Sub-Saharan" component in appearance:

And I would say it is possibly the source of most of the SSA ancestry we've always seen across the Middle-East, North-Africa and wider Mediterranean prior to the ancient DNA revolution, when we were heavily dependent on global and regional ADMIXTURE runs. You will quickly notice that when ANA, or a seemingly ANA-admixed population such as Natufians or Iberomaurusians, is introduced into models of various MENA and Mediterranean peoples' ancestries, their previous AEA and West-African (WA) affinities drop sharply:
I would venture to say that some populations, as you can see above, possibly never had AEA or WA admixture. It was all perhaps just ANA related ancestry being misattributed.
Then, on the other side of things, populations such as Natufians, Neolithic Levantines and Iron-Age Egyptians, who previously couldn't be modeled as part SSA, can now show such ancestry because we found the unique SSA population we needed in the form of these hypothesized ANAs. They finally explain why groups such as Natufians and those Egyptians have a plain-as-day SSA pull in global PCAs:

I would strongly argue that, barring the interesting but unrelated things going on with East-pulling groups such as Mal'ta boy, Ust-Ishim and the Han, you simply cannot pull toward SSA populations on the x-axis and away from WHGs and AHGs in the manner above without possessing some sort of SSA ancestry. I suspect that, whatever ANA is, it may have existed in different forms along some sort of admixture continuum with AHGs from the southern Levant to what is now Morocco. Time and ancient DNA results will tell.
If and when this ANA group is found, I personally don't doubt it will be shown to have originated and spent much of its time no farther south than the Sahel. In fact, the same is likely true for AEA, which only appears farther south than modern Sudan in an admixed form, such as in the case of Mota. But, in the end, both ANA and AEA clearly cluster like populations that did not participate in the Eurasian bottleneck, just like any other "SSA" group.
References
1. Loosdrecht MV, Bouzouggar A, Humphrey L, Posth C, Barton N, Aximu-Petri A, et al. Pleistocene North African genomes link Near Eastern and sub-Saharan African human populations. Science. 2018;360(6387):548-552. doi:10.1126/science.aar8380. Available from: https://doi.org/10.1126/science.aar8380
2. Lazaridis I, Belfer-Cohen A, Mallick S, Patterson N, Cheronet O, Rohland N, et al. Paleolithic DNA from the Caucasus reveals core of West Eurasian ancestry [preprint]. Cold Spring Harbor (NY): bioRxiv; 2018. doi:10.1101/423079. Available from: https://doi.org/10.1101/423079
3. Lipson M, Ribot I, Mallick S, Rohland N, Olalde I, Adamski N, et al. Ancient West African foragers in the context of African population history. Nature. 2020;577(7792):665-670. doi:10.1038/s41586-020-1929-1. Available from: https://doi.org/10.1038/s41586-020-1929-1
4. Pickrell JK, Patterson N, Loh P-R, Lipson M, Berger B, Stoneking M, et al. Ancient west Eurasian ancestry in southern and eastern Africa. Proc Natl Acad Sci U S A. 2014;111(7):2632-2637. doi:10.1073/pnas.1313787111. Available from: https://doi.org/10.1073/pnas.1313787111
R Scripts and other files for the PCAs, charts and ANA simulations: https://github.com/Awale-Abdi/Anthromadness_ANA_post
Simplified and shortened for complete laymen:
Based on the findings of a peer-reviewed paper, a groundbreaking yet-to-be-peer-reviewed paper, and my own data analysis that corroborates and attempts to build upon their findings, there appears to have been an ancient population from over 15,000 years ago in North Africa that genetically clustered with present-day "Black" Africans. This group likely contributed ancestry to prehistoric populations such as the Natufians, and further influenced the genetic makeup of modern Middle Easterners, North Africans, and potentially other populations with Middle Eastern ancestral ties such as those in Central Asia, South Asia, and at least Southern Europe. More on this in future posts and with future ancient DNA that rolls in.
Great post. Really appreciate this kind of analysis. It's very much needed.
Isn't there an official Dzudzuana G25 coord?
Not that I know of. The paper hasn't even been published in a journal. Still a pre-print. No idea why. Perfectly good paper. Academic politics? I don't know.
From what I remember, the coordinates were eventually released. Like the real ones. I found it, here it is:
Georgia_UP_Dzudzuana_(n=1),0.062603,0.099522,-0.009428,-0.002261,0.029852,-0.017012,0.001175,-0.001846,0.03211,0.02606,0.007632,-0.011989,0.001635,0.012799,0.000271,-0.003182,-0.01343,0.004687,-0.007416,0.003377,0.026453,0.007172,-0.01479,-0.029281,-0.00012
Great in-depth post bro. Btw how much do you know about Homo sapiens idaltu (Herto Man)?
Know as much about it as what's on its Wikipedia page, I would say. Why?
Not sure if you saw this entry's supplementary materials: 'A genetic probe into the ancient and medieval history of Southern Europe and West Asia' - 2022
Supplementary PDF, Page 9, 'Non-West Eurasian ancestry in the Southern Arc'
"The African-maximized “black” component is found in Levantine individuals as early as the Natufians and should thus not be interpreted as evidence of recent African influence in West Eurasia. A likely explanation is the partial derivation of the Natufians from Paleolithic Iberomaurusian (48) North African-related ancestors as suggested in (49) Indeed, the average proportion of this component in all Natufian individuals (including those for which it is less than the detection threshold of 10%) is 9.1%, while in Taforalt from Morocco it is 41.4%, thus suggesting ~22% of North African influence, similar to the ~27% inferred using an admixture graph framework in (49)"
Seems I missed this paper. Yet another one to the list, I suppose. Thank you for sharing, akhi.
I have a few questions:
• I thought Somalis were 38% to 45% MENA, but above, you mention that Somalis are 38-40% Near Eastern. Does the remaining 5% come from another source?
• Is there any North-South variation among Somalis?
• Lastly, considering your suggestion that Natufians had more African DNA, does that mean Somalis are less MENA than previously thought?
Somalis being ~45% MENA is pretty much accurate, I would say. What this basically means is that our MENA ancestors were themselves part SSA, like the Natufians were. If you're 45% a group that is itself only about 80-85% Eurasian are you going to turn out 45% Eurasian? That's how Somalis can be about 38% Eurasian but about 45% MENA. MENA = mixed.
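The arithmetic behind that can be sketched in a few lines of Python. It assumes, for simplicity, that all of the Eurasian ancestry arrives through the MENA side and that the MENA source was 80-85% Eurasian as stated above; the function name is just for illustration.

```python
def eurasian_share(mena_share, mena_eurasian_fraction):
    """Eurasian ancestry inherited through a MENA source that is itself mixed."""
    return mena_share * mena_eurasian_fraction

# 45% MENA, where the MENA source is 80% or 85% Eurasian.
low = eurasian_share(0.45, 0.80)   # about 0.36
high = eurasian_share(0.45, 0.85)  # about 0.38

print(f"Eurasian share: {low:.1%} to {high:.1%}")
```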
And no, there's no northern and southern cline among Somalis. Unless the Somali has recent and usually known outside admixture -- from a group like Arabs, Habeshas, Oromos or Somali Bantus -- we're a remarkably homogenous ethnic group with only a ~2% variation across all the tribes and regions of Greater Somalia, the same mtDNA frequencies and overwhelmingly either an E-Z813 or T-L208 Y-DNA subclade. That's become quite clear through the large amount of commercial and academic sampling that's taken place.
A historical population bottleneck probably occurred, making all modern Somalis descendants of a common ancestral population rather than a hodge-podge of Cushites and Ethiosemites joined together under the Somali language and intermixed to some extent like with Oromos. This explains why Somalis consistently form a distinct regional ADMIXTURE component (Ethio-Somali) at the higher K-values in clustering analyses.
So this means Somalis are maximum 40% Eurasian?
What about CHG Iranian is that in our DNA?
Lastly how similar are these Ancestral North African to our 60% African side, do we have any idea what ANA looked like?
Yes, Somalis are about 40% Eurasian max. Anyone above that is pretty much admixed in some way. Oromo, Habesha, Arab or whatever else admixture.
As for CHG, Iron-Age Yemenis would be part Iran-Chalcolithic which carries with it CHG and Iran-Neolithic related ancestry so while in quite small amounts, we do carry such ancestry through them. Feel free to noodle about with the VahaduoJS and G25 coordinates yourself. Run models like "Natufian, CHG, Dinka, Mota" as the source populations for Somalis (target) and you'll notice we always need a bit of CHG or Iran-Neolithic or Chalcolithic on top of the Natufian-like ancestry. That would be the seemingly Iron-Age Yemeni ancestry in us and all other Cushites and Ethiosemites coming through with their Chalcolithic Iranian influences.
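To make the idea of such a source/target model concrete, here is a rough Python sketch of what fitting a target as a mixture of sources means. It is only a brute-force illustration and not Vahaduo's actual algorithm: it tries every set of non-negative weights that sum to 1 (in steps of 1/steps) and keeps the one whose weighted average of the sources lands closest to the target. The function names and the three-dimensional vectors are hypothetical toy values, not real G25 coordinates.

```python
from itertools import product

def mix(sources, weights):
    """Weighted average of the source vectors, dimension by dimension."""
    dims = len(sources[0])
    return [sum(w * s[i] for w, s in zip(weights, sources)) for i in range(dims)]

def fit_mixture(target, sources, steps=20):
    """Search weights (multiples of 1/steps, summing to 1) for the least squared error."""
    best_weights, best_err = None, float("inf")
    for combo in product(range(steps + 1), repeat=len(sources) - 1):
        if sum(combo) > steps:
            continue  # the last weight would be negative
        counts = combo + (steps - sum(combo),)
        weights = [c / steps for c in counts]
        err = sum((t - m) ** 2 for t, m in zip(target, mix(sources, weights)))
        if err < best_err:
            best_weights, best_err = weights, err
    return best_weights, best_err

# Toy 3-D "populations" (hypothetical values for illustration only).
sources = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
target = [0.5, 0.25, 0.25]

weights, err = fit_mixture(target, sources)
print(weights, err)
```

If a source is genuinely needed, dropping it from the list should make the best achievable error noticeably worse, which is the same thing you would be watching for when a model "always needs a bit of CHG".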
Finally, I'll dive deeper into ANA with future posts. Stay tuned.
What’s your thoughts on the theory of the basal eurasians never existing and it being just ANA?
I addressed this about a decade ago:
https://anthromadness.blogspot.com/2016/06/new-information-on-basal-eurasian.html
Looking at things now, I don't think that's the case. If it was then groups like the Dzudzuana samples and AHGs that appear to lack ANA admixture would not continue to require Basal Eurasian for their models in the presence of ANA in those models. They would also probably look identical to WHGs but they simply do not.
So no, I think Basal Eurasian is something real that was holed up in West Asia—perhaps along the Persian Gulf Oasis—that mixed with a West-Asian subgroup of what were likely a type of "WHG" (Villabruna cluster) local to West-Asia. Plan to do posts on this and many more subjects going forward so stay tuned.
Thanks, bro, for answering all my questions. It's great to have you back!
About that 40% Near Eastern portion: is it all Natufian? And did these people come from Arabia? I’ve heard they were ancient Egyptians.
Personally, I grew up around Ethiopians, Eritreans, and Sudanese Arabs. Despite cultural and religious differences, I never felt like I was around total strangers. Even though I could usually tell an Amhara from a Sudanese or a Somali, I’ve always seen us as kind of a shared race. Would it be inaccurate to describe everything from Sudan to Somalia in that way?
Also, how different is a Somali from, say, a Tigrayan?
And how about Raxanwayn Somalis, are they genetically the same as regular Somalis given their linguistic differences?
Take a look at a global PCA and you'll see that we clearly do form our own cluster with them. Africa PCA will exacerbate what is seen on global as Fulanis, FBAs and other admixed groups won't be on our cline. Biologically, yes, we are our own race. Some call it "Qeyh" - Ge'ez word for "red" used to describe red noba as well as refer to themselves - but that's awful larpy. Cushites is too loaded too. It's all larpy and politically incorrect tbh.
Bro are you in contact with David Reich? If so can you please tell him to replace the admixed Somali Kenyan ayodoo sample with a more representative mainland Somali sample?
The only reason people ever thought otherwise pertaining to ANA and their sub-Saharanity is because of Magnats (Maghrebi nationalists). Weird bunch.