unicaen greyc

Research Datasets & Demos



AnnoTag: Concise Content Annotation via LOD Tags derived from Entity-level Analytics


Digital libraries build on classifying contents by capturing their semantics and (optionally) aligning the description with an underlying categorization scheme. This process is usually based on human intervention, either by the content creator or a curator. As such, this procedure is highly time-consuming and - thus - expensive. In order to support the human in data curation, we introduce an annotation tagging system called “AnnoTag”. AnnoTag aims at providing concise content annotations by employing entity-level analytics in order to derive semantic descriptions in the form of tags. In particular, we are generating “Semantic LOD Tags” (linked open data) that allow an interlinking of the derived tags with the LOD cloud. Based on a qualitative evaluation on Web news articles we prove the viability of our approach and the high-quality of the automatically extracted information.


Video and Demo


AnnoTag Presentation Video
AnnoTag Demo


Download and Datasets


AnnoTag Evaluation
AnnoTag API


Publication


A. Kumar and M. Spaniol
AnnoTag: Concise Content Annotation via LOD Tags derived from Entity-level Analytics
Proceedings of the 25th International Conference on Theory and Practice of Digital Libraries (TPDL 2021), virtual conference, September 13-17, 2021, 6 pages (to appear).
BibTeX