MULTILINGUALITY AND LANGUAGE TECHNOLOGY
Your Company Address
Deutsches Forschungszentrum für Künstliche Intelligenz GmbH (DFKI) Campus D3 2 Stuhlsatzenhausweg 3 66123 Saarbrücken Germany
– What is the best way to represent mentions? Is it enough to use a concatenation of different embeddings? If yes, which embeddings work best for which type of mentions?
– Can we improve the results by encoding/embedding other (linguistic) features?
– To what extent does context affect the choice of markables?
-How to represent the span with multipable tokens? sum, average, concat?
Data: we can use (part of) the data from the CODI-CRAC Anaphora Resolution Shared Task 2021 and/or the OneCommon dataset (https://github.com/Alab-NII/onecommon/tree/master/aaai2020).