Information Extraction - Covid-19 MLIA @ Eval

Task Description

The goal of the Information Extraction task is to identify medical information in texts. We defined six major types of entities to be identified. Those categories are mainly related to the Covid-19 issue. The main objective is to mine texts in order to access relevant information concerning the Covid-19, and more specifically information that may help the health professional to find outcomes.

During the first round of this task, participants will have access only to unannotated data (namely, the data collected from the two other tasks) in a plain text format. The evaluation will consist in a rover of system outputs. We encourage the participants to try experimental methods and to submit several system outputs in order to exchange different views during the discussion at the virtual meeting.

To participate in the Information Extraction task, the groups need to register at the following link:

Register

Important Dates - Round 2

Round starts: June 14, 2021

Corpora released: June 14, 2021

Runs due from participants (BRAT format): October 15, 2021

Ground-truth released and runs scored: October 22, 2021

Rolling report submission deadline (camera ready): November 19, 2021

Slot for a virtual meeting to discuss the results: November 30-December 2, 2021

Round ends: December 2, 2021

Participation Guidelines

In this information extraction task, participants are expected to identify entities belonging to six categories of entities (the tag to be used in the outputs is shown in a box at the beginning of each line):

drug-trt drug names, treatments, general intervention: this category concerns both commercial and generic names of drugs, as well as general intervention in the health domain; elements from this category usually come from advices from a professional (medical doctor, pharmacist) or from self-medication, e.g., Posaconazole AHCL, Allegra, Fexofenadine HCL, Xarelto, quarantine
sosy-dis signs, symptoms, diseases: this category deals with medical problems and merges together all signs, symptoms, and diseases shortness of breath, extreme fatigue, fever, skin infection, weightloss
findings findings, efficacy of treatments: this category is more complex since it concerns all elements related to positive or negative effets of treatments, including non expected stuff
tests tests: this category concerns all tests performed to diagnose medical problems such as blood sample, physical exam, serological test
behavior behaviors, everyday life actions: this category concerns all actions performed by each of us such as to wash one's hands, to cough into his elbow, to self-confine, use of face masks, physical distancing
legal-reg legal dispositions, regulations: this category concerns all actions decided by local or national authorities (Government, Ministry, etc.), such as to download the employer certificate, list of authorized move, prolonged border closure, closure of educational institutions

Contrary to traditional named entities which generally fit short spans of text, this task may concern both short and long spans of text to be annotated.

Corpora

Description

For round 2, training and testing datasets are composed of files available in four languages (English, French, German, and Spanish). Two types of content are provided:

news from six websites: deutschland.de, Deutsche Welle, EuroNews, EuroParl, Global Voices, and Global Voices (Covid-19)
scientific abstracts published on PubMed (queries: Covid-19 bacteria, Covid-19 pfizer, long Covid-19, long Covid-19 asthma). Since abstracts are only available in English, we used a deep neural translation service on those abstracts in order to produce abstracts in French, German, and Spanish.

Manual annotations have been made on scientific abstracts (DE, EN, ES, FR) and a few news files (EN and FR only), see statistics below. We plan to release annotations for DE and ES on a few news files in the future weeks.

Statistics

On the training dataset from round #2:

Number of files:

English: 14 annotated PMID files + 39 annotated web files + 625 non-annotated web files
French: 14 annotated PMID files + 28 annotated web files + 227 non-annotated web files
German: 14 annotated PMID files + 453 non-annotated web files
Spanish: 14 annotated PMID files + 403 non-annotated web files

Number of annotations:

English: 1419 entities: 36 behavior, 158 drug-trt, 30 findings, 199 legal-reg, 937 sosy-dis, 59 tests
French: 833 entities: 13 behavior, 126 drug-trt, 13 findings, 154 legal-reg, 477 sosy-dis, 50 tests
German: 216 entities (PMID only): 38 drug-trt, 10 findings, 135 sosy-dis, 33 tests
Spanish: 214 entities (PMID only): 36 drug-trt, 8 findings, 137 sosy-dis, 33 tests

Participant Repository:

Participants are provided with a single repository for all the tasks they take part in. The repository contains the runs, resources, code, and report of each participant.

The repository is organised as follows:

submission: this folder contains the runs submitted for the different tasks in the different evaluation rounds.
score: this folder contains the performance scores of the submitted runs.
code: this folder contains the source code of the developed system.
resource: this folder contains (language) resources created during the participation.
report: this folder contains the rolling technical report describing the techniques applied and insights gained during participation, round after round.

Covid-19 MLIA Eval consists of three tasks run in three rounds. Therefore, the submission and score folders are organized into sub-folders for each task and round as follows:

submission/task1/round2: for the runs submitted to the second round of the first taks. Similar structure for the other tasks and rounds. Participants are encouraged to submit several runs with distinct results.
score/task1/round2: for the performance scores of the runs submitted to the second round of the first taks. Similar structure for the other tasks and rounds.

Participants which do not take part in a given task or round can simply delete the corresponding sub-folders.

The goal of Covid-19 MLIA Eval is to speed up the creation of multilingual information acces systems and (language) resources for Covid-19 as well as openly share these systems and resources as much as possible. Therefore, participants are more than encouraged to share their code and any additional (language) resources they have used or created.

All the contents of these repositories are realeased under the Creative Commons Attribution-ShareAlike 4.0 International License.

Task Repository:

Organizers share contents common to all participants through the Information Extraction task repository.

The repository is organised as follows:

topics: this folder contains the topics to be used for task.
ground-truth: this folder contains the ground-truth, i.e. the qrels, for the task.
report: this folder contains the rolling technical report describing the overall outcomes of the task, round after round.

Covid-19 MLIA Eval runs in three rounds. Therefore, the topics and ground-truth folders are organized into sub-folders for each round, i.e. round1, round2, and round3.

All the contents of this repository are realeased under the Creative Commons Attribution-ShareAlike 4.0 International License.

Rolling Technical Report:

The rolling technical report should be formatted according to the Springer LNCS format, using either the LaTeX template or the Word template. LaTeX is the preferred format.

Submission Guidelines

We do not give strict rules to annotate the *txt files except the following one: please try to produce as many spans than expressed ideas (one idea per span; e.g., one drug name, one symptom, one legal regulartion, etc.).

input files: *txt
output files: *ann

All system outputs are expected to be in the BRAT annotation format (i.e., a tabular *.ann file for each *.txt file, composed of three columns: (i) an annotation ID, (ii) category, starting offset, ending offset, and (iii) the corresponding text span). An example is shown below (please see sample.{ann,txt} files in the archives to be downloaded):

T1	drug-trt 34 68	Irbesartan Hydrochlorothiazide BMS
T2	sosy-dis 116 125	dizziness

The six categories to be used in those *.ann files are: drug-trt sosy-dis behavior legal-reg tests findings. A 7th category named other can be used if the participant consider there is a missing class for a useful Covid-19 related information.

Participants are expected to check that their submissions fit all previously described elements: file names, format, entity tags, correct offsets of characters

Participating teams should satisfy the following guidelines:

The runs should be submitted in BRAT format (previously described);
Each group can submit a maximum of three runs per language (i.e., a total number of 3 runs × 4 languages = 12 submissions for round #2);
One run in one language is an archive (see below for convention name) composed of as many id.ann files than exist id.txt files for the given language; the ID between *.txt and *.ann files must be the same;
The code used to produce the runs should be uploaded in a Bitbucket repository provided by the organizers upon the registration to Covid-19 MLIA Eval.

Submission Upload:

Runs should be uploaded in the repository provided by the organizers. Following the repository structure discussed above, for example, a run submitted for the first round of the Information Extraction task should be included in submission/task1/round1.

Runs are composed of a set of several id.ann files (where the id fits the id from the corresponding id.txt file) and should be uploaded in an archive (one archive per run and for each language) named with the following name convention: <teamname>_task1_<round>_<language>_<freefield>.tar.gz where:

teamname is the name of the participating team;
task1 is the fixed identifier of the Information Extraction task;
round is the round of Covid-19 MLIA @ Eval the run is submitted to. It could be round1, round2, or round3
language specifies the ISO 639-1 language code according to the language code used in the archive name (de, en, es, fr) of the language processed in the submitted file.
freefield is a free field that participants can use as they prefer.

Performance scores for the submitted runs will be returned by the organizers in the score folder, which follows the same structure as the submission folder.

The rolling technical report has to be uploaded and kept update in the report folder.

Here, you can find a sample participant repository to get a better idea of its layout.

Evaluation:

For this task, we will use the standard evaluation metrics: recall, precision, and F1-score, as well as macro and micro averages.

Organizers

Thierry Declerck, DFKI, Germany
declerckdfki.de

Cyril Grouin, LISN, France
cyril.grouinlisn.upsaclay.fr

Pierre Zweigenbaum, LISN, France
pzlisn.upsaclay.fr