Code: EDI-2019-16-VRT_1
Domain: Internet & Media
Summary
Assigning meaningful and detailed tags to VRT’s news stories, based on the content.
Proposed by
VRT NWS is the news service of the VRT, the Flemish public broadcast. VRT NWS is active in the field of television, radio and online.
Description
VRT’s news website VRTNWS produces around one hundred stories every day. Most of these are in Dutch. For various purposes, like recommendations and archiving, metadata are necessary. The stories are currently labelled with categories, but these are too broad, and we need detailed, meaningful tags. The tagging should be done automatically. Note that there is no training set at this moment: none of the stories have meaningful tags.
The challenge is to develop a system capable of assigning meaningful and detailed tags to VRT’s news stories, based on the content.
Data
The challenge has the following sample datasets available for download
Expected outcomes
The main result will be a system capable of assigning meaningful and detailed tags to VRT news stories. Priority goes to the stories written in Dutch, as this is the main portion of VRT’s newsroom’s output.
In later stages, the stories in French, English and German should also be tagged.
The evaluation of the accuracy and completeness of the tags will be done by a team of experts, on a randomly drawn sample of stories and their tags, generated by the system.
The system should be provided as both:
- a demonstrator for expert evaluation
-
the source code
How do we apply?
- Read the Guidelines for Applicants
- Doubts or questions? Read more about EDI on the About Us page, have a look at our FAQ section or drop us an email at opencall@edincubator.eu.