Extracting process information from archival records

By Isto Huvila, 20 May, 2022

Date

Wednesday, August 24, 2022 - 08:00

Until

Friday, August 26, 2022 - 13:00

Body

Presentation together with Ekta Vats, Zanna Friberg, Lisa Börjesson, Jessica Kaiser and Olle Sköld at Final conference of the international network Digitization and the Future of Archives: Digital archives, Big Data and Memory in Copenhagen.

Abstract

Apart from the lack of information on what archival records are about—described using metadata—there is an increasing awareness of that the lack of understanding of the contexts and processes of how records were created and how they have been manipulated (i.e. data about creation, curation and use processes, or paradata). This poses a significant hindrance to their effective management, preservation, findability and use. However, typically the records themselves contain a lot of information that qualifies as paradata. The problem is that it is dispersed throughout the material and can be difficult to find and use. Moreover, paradata can be identified in text, images (incl. photographs and drawings) and tabular data in the records. This presentation reports findings from a pilot project that investigates how AI-based text and image analysis techniques can be used for mining paradata from archival records pertaining to archaeological excavations. The talk describes how the developed approach is promising in extracting meaningful information on how records and their contents have been created and processed. Further, the presentation outlines key lessons learned during the development and implementation analysis workflow. The heterogeneity of records and especially that of the expressions of paradata causes problems for computational analysis but considering that they also slow down manual processing of the data, the approach discussed in the project emerges as successful. The reported work is a part of the research project CApturing Paradata for documenTing data creation and Use for the REsearch of the future (CAPTURE) that has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme grant agreement No 818210 and InterPARES Trust AI funded by a Canadian SSHRC grant. The work has also received funding from the Centre for Digital Humanities Uppsala (CDHU) pilot project scheme.

File attachments

HuvilaEtAl-IRFD2022-handout.pdf (1.99 MB)

Latest Publications

Regimes of Participation: Theorising Participatory Archives from the Outset of Archivists Views on Archival Institutions and User Participation in Scandinavia

Huvila, I. (2024). Regimes of Participation: Theorising Participatory Archives from the Outset of Archivists Views on Archival Institutions and User Participation in Scandinavia. Information Research, 29, 121-146. http://doi.org/10.47989/ir291539 (Original work published mar)

On Infrastructural Speculation

Huvila, I. (2023). On Infrastructural Speculation. Current Swedish Archaeology, 31, 39-42. http://doi.org/10.37718/CSA.2023.02

My Personal Doctor Will Not Be Replaced with Any Robot Service! : Older Adults Experiences with Personal Health Information and eHealth Services

Enwald, H., Eriksson-Backa, K., Hirvonen, N., & Huvila, I. (2024). My Personal Doctor Will Not Be Replaced with Any Robot Service! : Older Adults Experiences with Personal Health Information and eHealth Services. In S. Kurbanoğlu, Špiranec, S., Boustany, J., Ünal, Y., Sencan, I., Kos, D., et al. (Eds.), European Conference on Information Literacy (pp. 145-157). Cham: Springer Nature Switzerland. http://doi.org/10.1007/978-3-031-53001-2_13

Conceptualizing Data Needs within Contexts of Data Discoverability and Reuse: A Study of Environmental and Social Scientists

Liu, Y. -H., Huvila, I., Kaiser, J., Friberg, Z., Sköld, O., Andersson, L., et al. (2023). Conceptualizing Data Needs within Contexts of Data Discoverability and Reuse: A Study of Environmental and Social Scientists. In IST23 Conference: Information Science Perspectives to Documenting Processes and Practices. Uppsala: ASIS&T European Chapter. http://doi.org/10.5281/ZENODO.7937097 (Original work published may)

Users Experiences With Online Access to Electronic Health Records in Mental and Somatic Health Care: Cross-Sectional Study

Wang, B., Kristiansen, E., Fagerlund, A., Zanaboni, P., Hägglund, M., Bärkås, A., et al. (2023). Users Experiences With Online Access to Electronic Health Records in Mental and Somatic Health Care: Cross-Sectional Study. Journal Of Medical Internet Research, 25, e47840. http://doi.org/10.2196/47840 (Original work published dec)

Extracting process information from archival records

Abstract

Forthcoming presentations

Latest Publications

Latest toots

Isto Huvila