Using content and non-dictionary words for author profiling of Vietnamese forum posts

This paper reports the results of an author profiling task for Vietnamese forum posts, which aims to identify personal traits of the author, such as gender, age, occupation, and location, using content and non-dictionary words. Experiments were conducted on different types of features to compare their performance: stylometric features (lexical, syntactic, and structural features), content-based features (the most important content words), and non-dictionary words (such as slang and abbreviations). The experiments used datasets we collected from popular Vietnamese forums.
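To make the three feature families concrete, the following is a minimal Python sketch of how a single post might be turned into a feature dictionary. It is an illustration under stated assumptions, not the authors' implementation: the function names, the particular stylometric measurements, the toy `DICTIONARY`, and the regex tokenizer are all hypothetical, and a real system would likely use a proper Vietnamese word segmenter and a full lexicon.

```python
import re
import unicodedata
from collections import Counter

# Hypothetical toy lexicon; a real system would load a full Vietnamese dictionary.
DICTIONARY = {"tôi", "không", "biết", "đi", "học"}

TOKEN_RE = re.compile(r"\w+")


def tokenize(text):
    """Lowercase, normalize to NFC, and split into word-like tokens.

    Assumption: simple regex tokenization stands in for real word segmentation.
    """
    normalized = unicodedata.normalize("NFC", text).lower()
    return TOKEN_RE.findall(normalized)


def stylometric_features(text):
    """A few illustrative lexical and structural measurements."""
    tokens = tokenize(text)
    n_tokens = len(tokens)
    return {
        "n_chars": len(text),
        "n_tokens": n_tokens,
        "avg_token_len": sum(len(t) for t in tokens) / n_tokens if n_tokens else 0.0,
        "n_lines": text.count("\n") + 1,
        "n_punct": sum(1 for ch in text if ch in ".,!?;:"),
    }


def content_word_features(text, vocabulary):
    """Counts of preselected content words (the vocabulary is assumed given)."""
    counts = Counter(tokenize(text))
    return {f"content:{w}": counts[w] for w in vocabulary}


def non_dictionary_features(text, dictionary):
    """Counts of tokens missing from the dictionary (slang, abbreviations)."""
    counts = Counter(
        t for t in tokenize(text) if t not in dictionary and not t.isdigit()
    )
    return {f"nondict:{w}": c for w, c in counts.items()}


def extract_features(text, vocabulary, dictionary=DICTIONARY):
    """Merge the three feature families into one dictionary."""
    features = {}
    features.update(stylometric_features(text))
    features.update(content_word_features(text, vocabulary))
    features.update(non_dictionary_features(text, dictionary))
    return features
```

A dictionary like this could be fed to any standard classifier after vectorization; comparing performance across feature types then amounts to training with one family, or a combination of families, at a time.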