Repository logo
 
Publication

Robust question answering

dc.contributor.authorCarvalho, Gracinda
dc.contributor.authorMatos, David Martins de
dc.contributor.authorRocio, Vitor
dc.date.accessioned2017-01-24T11:25:00Z
dc.date.available2017-01-24T11:25:00Z
dc.date.issued2012
dc.description.abstractA Question Answering (QA) system should provide a short and precise answer to a question in natural language, by searching a large knowledge base consisting of natural language text. The sources of the knowledge base are widely available, for written natural language text is a preferential form of human communication. The information ranges from the more traditional edited texts, for example encyclopaedias or newspaper articles, to text obtained by modern automatic processes, as automatic speech recognizers. The work developed in the present thesis focuses on the Portuguese language and open domain question answering, meaning that neither the questions nor the texts are restricted to a specific area, and it aims to address both types of written text. Since information retrieval is essential for a QA system, a careful analysis of the current state-of-the-art in information retrieval and question answering components was conducted. A complete, efficient and robust question answering system is developed in this thesis, consisting of new modules for information retrieval and question answering, that is competitive with current QA systems. The system was evaluated at the Portuguese monolingual task of QA@CLEF 2008 and achieved the 3rd place in 6 Portuguese participants and 5th place among the 21 participants of 11 languages. The system was also tested in Question Answering over Speech Transcripts (QAST), but outside the official evaluation QAST of QA@CLEF, since Portuguese was not among the available languages for this task. For that reason, an entire test environment consisting of a corpus of transcribed broadcast news and a matching question set was built in the scope of this work, so that experiments could be made. The system proved to be robust in the presence of automatically transcribed data, with results in line with the best reported at QAST.pt_PT
dc.description.versioninfo:eu-repo/semantics/publishedVersionpt_PT
dc.identifier.urihttp://hdl.handle.net/10400.2/5968
dc.language.isoengpt_PT
dc.peerreviewedyespt_PT
dc.titleRobust question answeringpt_PT
dc.typeconference object
dspace.entity.typePublication
person.familyNameCarvalho
person.familyNameMARTINS DE MATOS
person.familyNameRocio
person.givenNameGracinda
person.givenNameDAVID MANUEL
person.givenNameVitor
person.identifierC-5546-2008
person.identifierR-000-HKF
person.identifier.ciencia-idCA1C-3C6E-C757
person.identifier.ciencia-id0418-C5A8-59E2
person.identifier.orcid0000-0003-4793-6917
person.identifier.orcid0000-0001-8631-2870
person.identifier.orcid0000-0002-3314-898X
person.identifier.scopus-author-id35184805000
person.identifier.scopus-author-id54959008500
rcaap.rightsopenAccesspt_PT
rcaap.typeconferenceObjectpt_PT
relation.isAuthorOfPublicationd6be5630-7fac-412a-bca3-9b16888ada6f
relation.isAuthorOfPublication34e7fc2b-ec3c-4c8c-ab36-efec3de9bddb
relation.isAuthorOfPublication7cab4248-456c-46bf-a1cf-bbd212928171
relation.isAuthorOfPublication.latestForDiscovery7cab4248-456c-46bf-a1cf-bbd212928171

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2012 PROPOR Robust_question_answering.pdf
Size:
201.92 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.97 KB
Format:
Item-specific license agreed upon to submission
Description: