DSpace@Çankaya

Binary background model with geometric mean for author-independent authorship verification

Basit öğe kaydını göster

dc.contributor.author Canbay, Pelin
dc.contributor.author Sezer, Ebru A.
dc.contributor.author Sever, Hayri
dc.date.accessioned 2023-11-28T12:58:36Z
dc.date.available 2023-11-28T12:58:36Z
dc.date.issued 2023-04
dc.identifier.citation Canbay, Pelin; Sezer, Ebru A.; Sever, Hayri. (2023). "Binary background model with geometric mean for author-independent authorship verification", Journal of Information Science, Vol.49, No.2, pp.448-464. tr_TR
dc.identifier.issn 01655515
dc.identifier.uri http://hdl.handle.net/20.500.12416/6665
dc.description.abstract Authorship verification (AV) is one of the main problems of authorship analysis and digital text forensics. The classical AV problem is to decide whether or not a particular author wrote the document in question. However, if there is one and relatively short document as the author’s known document, the verification problem becomes more difficult than the classical AV and needs a generalised solution. Regarding to decide AV of the given two unlabeled documents (2D-AV), we proposed a system that provides an author-independent solution with the help of a Binary Background Model (BBM). The BBM is a supervised model that provides an informative background to distinguish document pairs written by the same or different authors. To evaluate the document pairs in one representation, we also proposed a new, simple and efficient document combination method based on the geometric mean of the stylometric features. We tested the performance of the proposed system for both author-dependent and author-independent AV cases. In addition, we introduced a new, well-defined, manually labelled Turkish blog corpus to be used in subsequent studies about authorship analysis. Using a publicly available English blog corpus for generating the BBM, the proposed system demonstrated an accuracy of over 90% from both trained and unseen authors’ test sets. Furthermore, the proposed combination method and the system using the BBM with the English blog corpus were also evaluated with other genres, which were used in the international PAN AV competitions, and achieved promising results. tr_TR
dc.language.iso eng tr_TR
dc.relation.isversionof 10.1177/01655515211007710 tr_TR
dc.rights info:eu-repo/semantics/closedAccess tr_TR
dc.subject Authorship Verification tr_TR
dc.subject Binary Background Model tr_TR
dc.subject Document-Pair Verification tr_TR
dc.subject Forensic Authorship tr_TR
dc.subject Geometric Mean tr_TR
dc.subject Turkish Blog Authorship Corpus tr_TR
dc.title Binary background model with geometric mean for author-independent authorship verification tr_TR
dc.type article tr_TR
dc.relation.journal Journal of Information Science tr_TR
dc.contributor.authorID 11916 tr_TR
dc.identifier.volume 49 tr_TR
dc.identifier.issue 2 tr_TR
dc.identifier.startpage 448 tr_TR
dc.identifier.endpage 464 tr_TR
dc.contributor.department Çankaya Üniversitesi, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü tr_TR


Bu öğenin dosyaları:

Dosyalar Boyut Biçim Göster

Bu öğe ile ilişkili dosya yok.

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Basit öğe kaydını göster