Contained in:
Book Chapter

Semantic analysis of web archive historical data: 1983 “Marche pour l’égalité et contre le racisme”

  • Davide Rendina
  • Sophie Gebeil
  • Mathieu Génois
  • Patrice Bellot

Based on a corpus composed by data obtained from the web archive of the French National Audiovisual Institute, including web pages referencing the history of the 1983 March for Equality and Against Racism, we explored how the memory of a historical event is built through the recounting of web media and the possibilities afforded by computational text analysis methods for the study of large corpuses of historical data from the archived web. This chapter presents the methodology and results of Davide Rendina's master's thesis in computer sciences under the supervision of Sophie Gebeil, Mathieu Génois, and Patrice Bellot. The objective is to demonstrate how historians can utilize archived HTML pages to study the media coverage of historical subjects on the web.

  • Keywords:
  • anti-racism,
  • media web archive,
  • memory studies,
  • topic modeling,
  • 1983,
+ Show More

Davide Rendina

Aix-Marseille University, France - ORCID: 0009-0001-3001-8864

Sophie Gebeil

Aix-Marseille University, France - ORCID: 0000-0002-9883-733X

Mathieu Génois

Aix-Marseille University, France - ORCID: 0000-0001-5492-8750

Patrice Bellot

Aix-Marseille University, France - ORCID: 0000-0001-8698-5055

  1. Davide Rendina, Sophie Gebeil, Mathieu Génois, Patrice Bellot. “Semantic analysis of web archive historical data: the 1983 'Marche pour l'égalité et contre le racisme'.“ Master Thesis. Erasmus Mundus Joint Master's Degree in Big Data Management and Analytics (BDMA). Data Analysis, Statistics and Probability [physics.data-an]. 2023. ⟨dumas-04541382⟩
  2. Davide Rendina, Sophie Gebeil, Mathieu Génois, Patrice Bellot. “Master Thesis Report - Semantic Analysis of Web Archive Historical Data the 1983 “Marche Pour L'égalité Et Contre Le Racisme“ ». Zenodo, 10 août 2023. DOI: 10.5281/zenodo.10972646
  3. De Lange, Sarah L. and Mudde Cas. “Political extremism in Europe.“ European Political Science 4(4): 476–88 (2005). http://www.cambridge.org/9780521850810
  4. Ehrmann, Maud, Ahmed Hamdi, Elvys Linhares Pontes, Matteo Romanello, and Antoine Doucet. 2024. “Named Entity Recognition and Classification on Historical Documents: A Survey“. ACM Computing Surveys 56 (2): 1‑47. DOI: 10.1145/3604931
  5. Fortunato, S. “Community detection in graphs.“ Physics Reports, 486 (3–5), 75-174 (2009). DOI: 10.1016/j.physrep.2009.11.002
  6. Gimenez, Elsa, and Voirol Olivier. “Les agitateurs de la toile. L’Internet des droites extrêmes. Présentation du numéro.“ Réseaux, vol. 202-203, no. 2–3, 2017, pp. 9-37. DOI: 10.3917/res.202.0009
  7. Pippa, Noris. “Preaching to the converted?: Pluralism, participation and party websites“. Party Politics 9(1): 21–45 (2003).
  8. Röder, M., Both, A., and Hinneburg, A. “Exploring the space of topic coherence measures.“ In Proceedings of the Eighth ACM International Conference on Web Search and Data Mining, WSDM 2015, Shanghai, China, February 2–6, 2015 (2015), X. Cheng, H. Li, E. Gabrilovich, and J. Tang, Eds., ACM, pp. 399–408.
  9. Tedeschi, S., Maiorca, V., Campolungo, N., Cecconi, F., and Navigli, R. “Wikineural: Combined neural and knowledge-based silver data creation for multilingual NER.“ In Findings of the Association for Computational Linguistics: EMNLP 2021, Virtual Event / Punta Cana, Dominican Republic, 16–20 November, 2021 (2021), M. Moens, X. Huang, L. Specia, and S. W. Yih, Eds. Association for Computational Linguistics, pp. 2521–2533.
PDF
  • Publication Year: 2024
  • Content License: CC BY 4.0
  • © 2024 Author(s)

XML
  • Publication Year: 2024
  • Content License: CC BY 4.0
  • © 2024 Author(s)

Chapter Information

Chapter Title

Semantic analysis of web archive historical data: 1983 “Marche pour l’égalité et contre le racisme”

Authors

Davide Rendina, Sophie Gebeil, Mathieu Génois, Patrice Bellot

Language

English

DOI

10.36253/979-12-215-0413-2.22

Peer Reviewed

Publication Year

2024

Copyright Information

© 2024 Author(s)

Content License

CC BY 4.0

Metadata License

CC0 1.0

Bibliographic Information

Book Title

Exploring the Archived Web during a Highly Transformative Age

Book Subtitle

Proceedings of the 5th international RESAW conference, Marseille, June 2023

Editors

Sophie Gebeil, Jean-Christophe Peyssard

Peer Reviewed

Number of Pages

362

Publication Year

2024

Copyright Information

© 2024 Author(s)

Content License

CC BY 4.0

Metadata License

CC0 1.0

Publisher Name

Firenze University Press

DOI

10.36253/979-12-215-0413-2

ISBN Print

979-12-215-0412-5

eISBN (pdf)

979-12-215-0413-2

eISBN (xml)

979-12-215-0414-9

Series Title

Proceedings e report

Series ISSN

2704-601X

Series E-ISSN

2704-5846

36

Fulltext
downloads

28

Views

Export Citation

1,346

Open Access Books

in the Catalogue

2,262

Book Chapters

3,790,127

Fulltext
downloads

4,420

Authors

from 923 Research Institutions

of 65 Nations

65

scientific boards

from 348 Research Institutions

of 43 Nations

1,248

Referees

from 381 Research Institutions

of 38 Nations