Exploring diagnostic captioning methods

Karatzas, Vasilis; Καρατζάς, Βασίλης

dc.contributor.degreegrantinginstitution	Athens University of Economics and Business, Department of Informatics	el
dc.contributor.opponent	Vassalos, Vasilios	en
dc.contributor.opponent	Koutsopoulos, Iordanis	en
dc.contributor.thesisadvisor	Androutsopoulos, Ion	en
dc.creator	Karatzas, Vasilis	en
dc.creator	Καρατζάς, Βασίλης	el
dc.date.accessioned	2021-11-22	*
dc.date.available	2025-03-26T20:02:40Z
dc.date.issued	2021-11-09	*
dc.date.issuedoriginal	11/09/2021	*
dc.date.submitted	2021-11-22 10:27:06
dc.description.abstract	Image captioning has been studied extensively in recent years, but little of that research has been applied to the biomedical domain. Diagnostic Captioning, the process of predicting diagnoses for medical images, can be very helpful for medical experts, since writing a diagnosis can be time-consuming and there is high demand for it. This master's thesis studies the behavior of three types of models for diagnostic captioning: image-unaware models, retrieval models, and image encoders combined with language models. The thesis also reports important findings on how much the preprocessing of the test captions can affect evaluation scores. Finally, the thesis covers the participation of AUEB's NLP Group in the 2021 ImageCLEFmedical Caption competition, in which the author was the main contributor. The team placed 2nd among 8 teams with a retrieval-based model.	en
dc.description.abstract	Το πεδίο της παραγωγής περιγραφών εικόνων (Image Captioning) έχει ερευνηθεί αρκετά τελευταία, αλλά δεν έχει εφαρμοστεί πολλή από αυτήν την έρευνα πάνω στον βιοϊατρικό τομέα. Η παραγωγή διαγνωστικών περιγραφών εικόνων (Diagnostic Captioning), η διαδικασία πρόβλεψης διαγνώσεων για ιατρικές εικόνες, μπορεί να βοηθήσει αρκετά τους γιατρούς που κάνουν διαγνώσεις, καθώς η συγγραφή διαγνώσεων απαιτεί μερικές φορές αρκετή ώρα, και υπάρχει μεγάλη ανάγκη για υποστήριξη των γιατρών. Σε αυτήν την μεταπτυχιακή εργασία παρατηρούμε τη συμπεριφορά τριών τύπων μοντέλων για παραγωγή διαγνωστικών περιγραφών εικόνων: μοντέλα χωρίς γνώση της εικόνας, μοντέλα ανάκτησης, και κωδικοποιητές εικόνας σε συνδυασμό με γλωσσικά μοντέλα. Κάνουμε επίσης σημαντικές παρατηρήσεις σχετικά με τη διαφορά που μπορεί να κάνει η προεπεξεργασία των κειμένων στις βαθμολογίες. Συμμετείχαμε επίσης στον διαγωνισμό ImageCLEFmedical Caption του 2021, όπου πήραμε τη 2η θέση μεταξύ 8 ομάδων με μοντέλο βασισμένο στην ανάκτηση.	el
dc.embargo.expire	2021-11-22 10:27:06
dc.embargo.rule	Open access
dc.format.extent	54p.
dc.identifier	http://www.pyxida.aueb.gr/index.php?op=view_object&object_id=8944
dc.identifier.uri	https://pyxida.aueb.gr/handle/123456789/10431
dc.identifier.uri	https://doi.org/10.26219/heal.aueb.4826
dc.language	en
dc.rights	CC BY: Attribution alone 4.0
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	Image captioning	en
dc.subject	Retrieval	en
dc.subject	Encoder-decoder	en
dc.subject	Περιγραφή εικόνων	el
dc.subject	Ανάκτηση	el
dc.subject	Κωδικοποιητής-αποκωδικοποιητής	el
dc.title	Exploring diagnostic captioning methods	en
dc.title.alternative	Ερευνώντας μοντέλα για διαγνωστική περιγραφή εικόνων	el
dc.type	Text

Files

Original bundle

Name: Karatzas_2021.pdf
Size: 4.06 MB
Format: Adobe Portable Document Format

Collections

Department of Informatics

Master's Theses