Developing a digital archaeology classification system using Natural Language Processing and Machine Learning techniques

Caravale, Alessandra and Duran-Silva, Nicolau and Grimau, Berta and Moscati, Paola and Rondelli, Bernardo (2023) Developing a digital archaeology classification system using Natural Language Processing and Machine Learning techniques. Archeologia e Calcolatori, 34 (2). pp. 9-32. ISSN 1120-6861

01_Caravale_et_al (9).pdf - Published Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.

Official URL: http://www.archcalc.cnr.it/journal/id.php?id=1259

Abstract

The authors propose a knowledge map for analysing and accessing scientific content related to Digital Archaeology by leveraging various Machine Learning (ML) techniques. The case study concerns the articles published in our international journal «Archeologia e Calcolatori» in the decade from 2011 to 2020 and, as a benchmark, the publications in the ‘Computer Applications and Quantitative Methods in Archaeology’ (CAA) conference proceedings and journal. The titles and abstracts of the publications in these two data sets were classified into subfields of computer science using a supervised approach based on the ACM’s taxonomy. They were further analysed by applying topic modelling techniques to discover emergent topics, Named Entity Recognition to identify specific archaeologically relevant entities, and geotagging techniques to link articles with the geographical locations they discuss. The results achieved, although preliminary, provide some methodological suggestions: i) the opportunity to build custom analyses by taking advantage of the increasing availability of open data and metadata; ii) the scope of the contribution of archaeology, and in particular of computational archaeology, to the Heritage Science interdisciplinary domain; iii) the heuristic and predictive role of different ML techniques in gaining multi-faceted access to data analysis and interpretation.

Item Type: Article
Additional Information: Licensed under CC BY-NC-ND 4.0
Uncontrolled Keywords: Simulation AI; Theoretical and methodological problems
Subjects: 900 History, Geography and auxiliary disciplines > 930 History of the ancient world to ca. 499 > 930.1 Archaeology (Class here history to 4000 B.C., prehistoric archaeology, interdisciplinary works on archaeology) > 930.102 Archaeology - Miscellaneous works > 930.1028 Archaeology - Techniques, methodologies, apparatus and instruments (includes: Archaeometry) > 930.10285 Archaeology - Computer applications (includes: dating techniques)
Depositing User: Dr. Paola Moscati
Date Deposited: 04 Oct 2024 08:17
Last Modified: 04 Oct 2024 08:17
URI: http://eprints.bice.rm.cnr.it/id/eprint/23201
