Skip to main content

Home/ Groups/ Library in Transition
Lisa Spiro

ALA TechSource | Dear Library of Congress... - 0 views

  •  
    While interesting, I think this can be deleted as it doesn't focus on the feasibility of an all-digital library.
Geneva Henry

David Mimno - Publications - 0 views

  •  
    BROWSING VIRTUALLY
  •  
    "Organizing the OCA: Learning faceted subjects from a library of digital books. David Mimno and Andrew McCallum. Joint Conference on Digital Libraries (JCDL) 2007, Vancouver, BC, Canada. PDF The Open Content Alliance is one of several large-scale digitization projects currently producing huge numbers of digital books. Statistical topic models are a natural choice for organizing and describing such large text corpora, but scalability becomes a problem when we are dealing with multi-billion word corpora. This paper presents a new method for topic modeling, DCM-LDA. In this model, we train an independent topic model for every book, using pages as "documents". We then gather the topics discovered, cluster them, and then fit a Dirichlet prior for each topic cluster. Finally, we retrain the individual book topic models using these new shared topics. " via Dan Cohen working on virtual shelves project, using information within texts (OCA) as organizing principle instead of LCSH; former Perseus programmer
« First ‹ Previous 61 - 80 Next › Last »
Showing 20 items per page