This project was among the finalists of a competition run by Arquivo.pt, which rewards innovative work based on the historical information that Arquivo.pt preserves.
A Minha Terra is a web platform that lets users explore interactively how the different municipalities of Portugal are represented by the main generalist newspapers. The idea arose from curiosity about whether the national media show any bias in how they cover different cities, and from a wish to understand how each municipality is portrayed by the media as a whole.
The project had 4 major steps:
First, the pages were collected through the Arquivo.pt API by searching for the names of the 308 national municipalities in the five generalist newspapers with the largest national circulation: Correio da Manhã, Diário de Notícias, Expresso, Jornal de Notícias, and Público.
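As a rough illustration of this collection step, the sketch below builds one search URL per municipality and newspaper. It assumes the Arquivo.pt full-text search endpoint (`/textsearch`) with its `q`, `siteSearch`, `maxItems`, and `offset` parameters. The newspaper domains, the page size of 50, and the function names are assumptions made for the example, not details taken from the project.

```typescript
// Assumed domains for the five newspapers; the project may have queried others.
const NEWSPAPER_SITES: { name: string; site: string }[] = [
  { name: "Correio da Manhã", site: "cmjornal.pt" },
  { name: "Diário de Notícias", site: "dn.pt" },
  { name: "Expresso", site: "expresso.pt" },
  { name: "Jornal de Notícias", site: "jn.pt" },
  { name: "Público", site: "publico.pt" },
];

// Build one search URL for an exact-phrase match of a municipality name,
// restricted to a single newspaper site.
function buildSearchUrl(
  municipality: string,
  site: string,
  offset: number = 0,
  maxItems: number = 50 // assumed page size
): string {
  const query = [
    "q=" + encodeURIComponent('"' + municipality + '"'),
    "siteSearch=" + encodeURIComponent(site),
    "maxItems=" + maxItems,
    "offset=" + offset,
  ].join("&");
  return "https://arquivo.pt/textsearch?" + query;
}

// One URL per (municipality, newspaper) pair; with 308 municipalities
// and 5 newspapers this yields 1540 first-page queries.
function buildAllQueries(municipalities: string[]): string[] {
  const urls: string[] = [];
  for (const municipality of municipalities) {
    for (const paper of NEWSPAPER_SITES) {
      urls.push(buildSearchUrl(municipality, paper.site));
    }
  }
  return urls;
}
```

Each URL can then be fetched and paged through by increasing `offset` until no more items are returned.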
The collected data was then stored in an SQL database, both to make processing easier and so that the web app could consume it in a structured way.
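A minimal sketch of how one search result might be mapped to a database row is shown below. The item fields (`title`, `originalURL`, `linkToArchive`, `tstamp`) are assumed to follow the Arquivo.pt search response, where `tstamp` is a 14-digit `YYYYMMDDhhmmss` string. The row shape and column names are hypothetical, since the source does not describe the actual schema.

```typescript
// Subset of the fields assumed to be present in each search result item.
interface ArchivedItem {
  title: string;
  originalURL: string;
  linkToArchive: string;
  tstamp: string; // assumed format: YYYYMMDDhhmmss
}

// Hypothetical row shape for an "articles" table.
interface ArticleRow {
  municipality: string;
  newspaper: string;
  title: string;
  original_url: string;
  archive_url: string;
  archived_at: string; // ISO 8601 timestamp
}

// Convert a 14-digit archive timestamp into an ISO 8601 string (UTC assumed).
function parseTimestamp(tstamp: string): string {
  const m = /^(\d{4})(\d{2})(\d{2})(\d{2})(\d{2})(\d{2})$/.exec(tstamp);
  if (!m) {
    throw new Error("Unexpected timestamp: " + tstamp);
  }
  return m[1] + "-" + m[2] + "-" + m[3] + "T" + m[4] + ":" + m[5] + ":" + m[6] + "Z";
}

// Map one API item to a row, tagging it with the query that produced it.
function toRow(item: ArchivedItem, municipality: string, newspaper: string): ArticleRow {
  return {
    municipality: municipality,
    newspaper: newspaper,
    title: item.title.trim(),
    original_url: item.originalURL,
    archive_url: item.linkToArchive,
    archived_at: parseTimestamp(item.tstamp),
  };
}
```

Storing the municipality and newspaper alongside each page keeps later aggregation (for example, counting pages per municipality per newspaper) a simple grouped query.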
The next step was cleaning the database to eliminate possible false positives (e.g. homograph words such as Guardar, Almeida, and Santana, among others), remove generic pages (e.g. homepages), and eliminate duplicate pages.
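Two of these cleaning rules, dropping generic pages and removing duplicates, can be sketched as follows. This is an illustrative heuristic, not the project's actual procedure: the URL normalization rules, the definition of a "generic" page (a bare domain or an index file), and the function names are all assumptions. Filtering homographs is not covered here, since it usually needs context about how the word is used on the page.

```typescript
// Normalize a URL so that trivially different forms compare equal.
// Assumption: scheme, "www.", fragment, trailing slashes, and letter case
// do not distinguish pages on these sites.
function normalizeUrl(url: string): string {
  return url
    .trim()
    .toLowerCase()
    .replace(/^https?:\/\//, "")
    .replace(/^www\./, "")
    .replace(/#.*$/, "")
    .replace(/\/+$/, "");
}

// Treat a bare domain or a top-level index file as a generic page.
function isGenericPage(url: string): boolean {
  const normalized = normalizeUrl(url);
  const slash = normalized.indexOf("/");
  if (slash === -1) {
    return true; // no path at all: the homepage
  }
  const path = normalized.slice(slash + 1);
  return /^index\.(html?|php)$/.test(path);
}

// Drop generic pages, then keep only the first page seen for each normalized URL.
function cleanPages<T extends { url: string }>(pages: T[]): T[] {
  const seen: Record<string, boolean> = {};
  const kept: T[] = [];
  for (const page of pages) {
    if (isGenericPage(page.url)) {
      continue;
    }
    const key = normalizeUrl(page.url);
    if (seen[key]) {
      continue;
    }
    seen[key] = true;
    kept.push(page);
  }
  return kept;
}
```

Keeping the first occurrence preserves whichever capture was collected earliest. A different policy, such as keeping the most recent capture, would only need a sort before the loop.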
Finally, a web platform was developed to enable the visualization of the collected data in an accessible and interactive way.
Built with React, Next.js, and Supabase.