دورية أكاديمية

Automatically identifying the function and intent of posts in underground forums.

التفاصيل البيبلوغرافية
العنوان: Automatically identifying the function and intent of posts in underground forums.
المؤلفون: Caines, Andrew, Pastrana, Sergio, Hutchings, Alice, Buttery, Paula J.
المصدر: Crime Science; 11/29/2018, Vol. 7 Issue 1, p1-1, 1p
مصطلحات موضوعية: COMPUTER crimes, SOCIAL networks, NATURAL language processing, INTERNET forums, MACHINE learning
مستخلص: The automatic classification of posts from hacking-related online forums is of potential value for the understanding of user behaviour in social networks relating to cybercrime. We designed annotation schema to label forum posts for three properties: post type, author intent, and addressee. The post type indicates whether the text is a question, a comment, and so on. The author's intent in writing the post could be positive, negative, moderating discussion, showing gratitude to another user, etc. The addressee of a post tends to be a general audience (e.g. other forum users) or individual users who have already contributed to a threaded discussion. We manually annotated a sample of posts and returned substantial agreement for post type and addressee, and fair agreement for author intent. We trained rule-based (logical) and machine learning (statistical) classification models to predict these labels automatically, and found that a hybrid logical-statistical model performs best for post type and author intent, whereas a purely statistical model is best for addressee. We discuss potential applications for this data, including the analysis of thread conversations in forum data and the identification of key actors within social networks. [ABSTRACT FROM AUTHOR]
Copyright of Crime Science is the property of Springer Nature and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
قاعدة البيانات: Complementary Index
الوصف
تدمد:21937680
DOI:10.1186/s40163-018-0094-4