University of Lincoln
Browse

Two-layered Blogger identification model integrating profile and instance-based methods

Version 2 2024-03-12, 20:11
Version 1 2023-10-19, 19:14
journal contribution
posted on 2024-03-12, 20:11 authored by Haytham Mohtasseb Billah, Amr Ahmed

This paper introduces a two-layered framework that improves the result of authorship identification within larger sample numbers of bloggers as compared with earlier work. Previous studies are mainly divided into two categories: profile-based and instance-based methods. Each of these approaches has its advantages and limitations. The two-layered framework presented here integrates the two previous approaches and presents a new solution to a key problem in authorship identification, namely the drop in accuracy experienced as the number of authors increases. The paper begins by illustrating the regular instance-based core model and the investigated features. It then introduces a new psycholinguistic profile representation of authors, presents similarity grouping extraction over profiles, and applies blogger identification utilizing the two-layered approach. The results confirm the improvement introduced by the proposed two-layered approach against our regular classifier, as well as a selected baseline, for an extended number of users.

History

School affiliated with

  • School of Computer Science (Research Outputs)

Publication Title

Knowledge and Information Systems

Volume

31

Issue

1

Pages/Article Number

1-21

Publisher

Springer London

ISSN

0219-1377

eISSN

0219-3116

Date Submitted

2012-01-30

Date Accepted

2012-01-30

Date of First Publication

2012-01-30

Date of Final Publication

2012-01-30

ePrints ID

4890