Faunal remains from archaeological sites allow for the identification of animal species that enables the better understanding of the relationships between humans and animals, not only from their morphological information, but also from the ancient biomolecules (lipids, proteins, and DNA) preserved in these remains for thousands and even millions of years. However, due to the costs and efforts required for ancient biomolecular analysis, there has been considerable research into development of accurate and efficient screening approaches for archaeological remains. FTIR spectroscopy is one such approach that has been considered for screening of proteins, but its widespread use has been hindered by the fact that its predictive accuracy can vary widely depending on the extent of sample preservation and the instrument used. Further, screening methods for ancient DNA (aDNA) analysis are scarce. Here we present a new approach to vastly improve upon FTIR-based screening methods prior to ZooMS (Zooarchaeology by Mass Spectrometry) and aDNA analysis through the use of random forest-based machine learning. To do so, we use ATR-FTIR to examine three sets of archaeological bone assemblages and analyse them by ZooMS (for taxonomic identification). Two of these are from Palaeolithic contexts, dominated by terrestrial fauna and include specimens with a variety of preservational conditions. The third set consists of Holocene faunal remains, with variable levels of preservation and is dominated by cetaceans. Using the Holocene faunal remains, we were able to more consistently evaluate ATR-FTIR-based screening for mtDNA as well as ZooMS success. We report on the potential of machine learning in ATR-FTIR-based screening for ancient mtDNA analysis, and our machine learning models conclusively improve the accuracy prior to usage of ATR-FTIR-based screening for ZooMS by 20–40%. The results also suggest this approach potentially allows for a universal screening system, applicable across multiple sites and largely independent of the spectrometers used.

Machine learning ATR-FTIR spectroscopy data for the screening of collagen for ZooMS analysis and mtDNA in archaeological bone

Fabio Negrino;
2021-01-01

Abstract

Faunal remains from archaeological sites allow for the identification of animal species that enables the better understanding of the relationships between humans and animals, not only from their morphological information, but also from the ancient biomolecules (lipids, proteins, and DNA) preserved in these remains for thousands and even millions of years. However, due to the costs and efforts required for ancient biomolecular analysis, there has been considerable research into development of accurate and efficient screening approaches for archaeological remains. FTIR spectroscopy is one such approach that has been considered for screening of proteins, but its widespread use has been hindered by the fact that its predictive accuracy can vary widely depending on the extent of sample preservation and the instrument used. Further, screening methods for ancient DNA (aDNA) analysis are scarce. Here we present a new approach to vastly improve upon FTIR-based screening methods prior to ZooMS (Zooarchaeology by Mass Spectrometry) and aDNA analysis through the use of random forest-based machine learning. To do so, we use ATR-FTIR to examine three sets of archaeological bone assemblages and analyse them by ZooMS (for taxonomic identification). Two of these are from Palaeolithic contexts, dominated by terrestrial fauna and include specimens with a variety of preservational conditions. The third set consists of Holocene faunal remains, with variable levels of preservation and is dominated by cetaceans. Using the Holocene faunal remains, we were able to more consistently evaluate ATR-FTIR-based screening for mtDNA as well as ZooMS success. We report on the potential of machine learning in ATR-FTIR-based screening for ancient mtDNA analysis, and our machine learning models conclusively improve the accuracy prior to usage of ATR-FTIR-based screening for ZooMS by 20–40%. The results also suggest this approach potentially allows for a universal screening system, applicable across multiple sites and largely independent of the spectrometers used.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11567/1036534
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 9
  • ???jsp.display-item.citation.isi??? 8
social impact