Incorporating Concreteness in Multi-Modal Language Models With Curriculum Learning

dc.contributor.author Sezerer, Erhan
dc.contributor.author Tekir, Selma
dc.date.accessioned 2021-11-06T09:48:29Z
dc.date.available 2021-11-06T09:48:29Z
dc.date.issued 2021
dc.description.abstract Over the last few years, there has been an increase in the studies that consider experiential (visual) information by building multi-modal language models and representations. It is shown by several studies that language acquisition in humans starts with learning concrete concepts through images and then continues with learning abstract ideas through the text. In this work, the curriculum learning method is used to teach the model concrete/abstract concepts through images and their corresponding captions to accomplish multi-modal language modeling/representation. We use the BERT and Resnet-152 models on each modality and combine them using attentive pooling to perform pre-training on the newly constructed dataset, which is collected from the Wikimedia Commons based on concrete/abstract words. To show the performance of the proposed model, downstream tasks and ablation studies are performed. The contribution of this work is two-fold: A new dataset is constructed from Wikimedia Commons based on concrete/abstract words, and a new multi-modal pre-training approach based on curriculum learning is proposed. The results show that the proposed multi-modal pre-training approach contributes to the success of the model. en_US
dc.identifier.doi 10.3390/app11178241
dc.identifier.issn 2076-3417
dc.identifier.scopus 2-s2.0-85114487960
dc.identifier.uri https://doi.org/10.3390/app11178241
dc.identifier.uri https://hdl.handle.net/11147/11404
dc.language.iso en en_US
dc.publisher MDPI en_US
dc.relation.ispartof Applied Sciences en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject Multi-modal dataset en_US
dc.subject Wikimedia Commons en_US
dc.subject Multi-modal language model en_US
dc.subject Concreteness en_US
dc.subject Curriculum learning en_US
dc.title Incorporating Concreteness in Multi-Modal Language Models With Curriculum Learning en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.id 0000-0002-0488-9682
gdc.author.id 0000-0002-0488-9682 en_US
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C5
gdc.coar.access open access
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.department İzmir Institute of Technology. Computer Engineering en_US
gdc.description.issue 17 en_US
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q2
gdc.description.volume 11 en_US
gdc.description.wosquality Q2
gdc.identifier.openalex W3197691789
gdc.identifier.wos WOS:000695573500001
gdc.index.type WoS
gdc.index.type Scopus
gdc.oaire.accesstype GOLD
gdc.oaire.diamondjournal false
gdc.oaire.impulse 1.0
gdc.oaire.influence 2.7062748E-9
gdc.oaire.isgreen false
gdc.oaire.keywords Wikimedia Commons
gdc.oaire.keywords concreteness
gdc.oaire.keywords Technology
gdc.oaire.keywords QH301-705.5
gdc.oaire.keywords T
gdc.oaire.keywords Physics
gdc.oaire.keywords QC1-999
gdc.oaire.keywords Engineering (General). Civil engineering (General)
gdc.oaire.keywords multi-modal language model
gdc.oaire.keywords multi-modal dataset
gdc.oaire.keywords Chemistry
gdc.oaire.keywords curriculum learning
gdc.oaire.keywords TA1-2040
gdc.oaire.keywords Biology (General)
gdc.oaire.keywords QD1-999
gdc.oaire.popularity 2.949874E-9
gdc.oaire.publicfunded false
gdc.oaire.sciencefields 02 engineering and technology
gdc.oaire.sciencefields 0202 electrical engineering, electronic engineering, information engineering
gdc.openalex.collaboration National
gdc.openalex.fwci 0.20443896
gdc.openalex.normalizedpercentile 0.49
gdc.opencitations.count 1
gdc.plumx.crossrefcites 2
gdc.plumx.mendeley 8
gdc.plumx.scopuscites 2
gdc.scopus.citedcount 2
gdc.wos.citedcount 2
relation.isAuthorOfPublication.latestForDiscovery 57639474-3954-4f77-a84c-db8a079648a8
relation.isOrgUnitOfPublication.latestForDiscovery 9af2b05f-28ac-4014-8abe-a4dfe192da5e

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Name:
applsci-11-08241.pdf
Size:
1002.99 KB
Format:
Adobe Portable Document Format