The Adressa Dataset is a news dataset that links news articles (in Norwegian) to anonymized users. We hope that this dataset will help achieve a better understanding of news articles in conjunction with their readers.
This dataset is published in collaboration between the Norwegian University of Science and Technology (NTNU) and Adressavisen (a local newspaper in Trondheim, Norway) as part of the RecTech project on recommendation technology. For further details on the project and the dataset, please refer to the paper cited below.
- Light version 1: 1.4 GB (compressed) - 1 week of data collection - 923 articles (in Norwegian), 15,514 users, average article length of 518.6 words
- Light version 2: 16 GB (compressed) - 10 weeks of data collection
- Description of the dataset fields and other details about the dataset
- Licensing information: This dataset is available for non-commercial purposes only.
Citation
If you use the SmartMedia Adressa Dataset, please cite the following paper:
Gulla, J. A., Zhang, L., Liu, P., Özgöbek, Ö., & Su, X. (2017, August). The Adressa dataset for news recommendation. In Proceedings of the International Conference on Web Intelligence (pp. 1042-1048). ACM.