Skip to main content

Home/ DGL RJC Group/ Group items tagged Vocabulary

Rss Feed Group items tagged

qilabs

Enabling Management Oversight in Corporate Blog Space.pdf - 0 views

  • Analysis of a corpus of tens of thousands of blogs –incorporating close to 300 million words – indicates significant differences in writing style and content between male and female bloggers as well as among authors of different ages. Such differences can be exploited to determine an unknown author’s age and gender on the basis of a blog’s vocabulary.
  • UTOMATED AUTHOR PROFILINGWhile we have found significant differences among bloggers of different ages and genders, the truest test of the significance of these differences is the extent to which they enable us to correctly predict an author’s age and gender
  • Learning AlgorithmWe use the learning algorithm Multi-Class Real Winnow (MCRW) to learn models that classify blogs according to author gender and age, respectively. Since this algorithm is not well known, we describe it briefly
  • ...2 more annotations...
  • 40.0%50.0%60.0%70.0%80.0%90.0%100.0%genderagestyle contentallFigure 1 10-fold cross validation results for the age and gender classifiersClassed as 10's20's30's10's7036102717720's916632684430's17814651351Table 7 Confusion matrix for the age classifier using all featuresFor age, content proves to be slightly more useful than style, but – as in gender – the combination is most useful. The confusion matrix indicates that, using content and style features together, 10s are distinguishable from 30’s with accuracy above 96% and distinguishing 10s from 20s is also achievable with accuracy of 87.3%. Many 30s are mis-classed as 20s, however, yielding overall accuracy is 76.2%CONCLUSIONSWe have assembled a large corpus of blogs labeled for a variety of demographic attributes. This large sample permits us insight into the demographic distribution of bloggers. We have found that teenage bloggers are predominantly female, while older bloggers are predominantly male. Moreover, within each age group, male and female bloggers blog about different thing and use different blogging styles
  • We have assembled a large corpus of blogs labeled for a variety of demographic attributes. This large sample
qilabs

Wiki Co-Founder Creates 'Wikipedia for News' to Fight Bias in the Media | Betabeat - 0 views

  •  
    arry Sanger made Infobitt, a free, open content news resource he's calling "Wikipedia for the news." No, it's not Wikinews; this site grabs facts from news sources, summarizes them and organizes the information to make it a news go-to. Like our beloved online encyclopedia, Infobitt is a collaborative effort.
1 - 20 of 110 Next › Last »
Showing 20 items per page