Skip to main content

Home/ sensemaking/ Group items tagged archiving

Rss Feed Group items tagged

Jack Park

nutchwax - Home Page - 0 views

  •  
    NutchWAX ("Nutch + Web Archive eXtensions" ) searches web archive collections. The Web Archive eXtensions (WAX) include adaptation of the Nutch fetcher step to go against web archives rather than crawl the open net -- adaptation currently does Internet Archive ARC files only -- and plugins to add extra fields to the index that return an Archive Records' location in the repository, its collection name, etc.
Jack Park

CKAN - Comprehensive Knowledge Archive Network - Home - 0 views

  •  
    CKAN is the Comprehensive Knowledge Archive Network, a registry of open knowledge packages and projects (and a few closed ones). CKAN is the place to search for open knowledge resources as well as register your own.
Jack Park

Europeana - Connecting Cultural Heritage - 0 views

  •  
    Europeana - the European digital library, museum and archive - is a 2-year project that began in July 2007. It will produce a prototype website giving users direct access to some 2 million digital objects, including film material, photos, paintings, sounds, maps, manuscripts, books, newspapers and archival papers. The prototype will be launched in November 2008 by Viviane Reding, European Commissioner for Information Society and Media.
Jack Park

The effect of open access and downloads ('hits') on citation impact: a bibliography of ... - 0 views

  •  
    Despite significant growth in the number of research papers available through open access, principally through author self-archiving in institutional archives, it is estimated that only c. 20% of the number of papers published annually are open access. It is up to the authors of papers to change this. Why might open access be of benefit to authors? One universally important factor for all authors is impact, typically measured by the number of times a paper is cited (some older studies have estimated monetary returns to authors from article publication via the role citations play in determining salaries). Recent studies have begun to show that open access increases impact. More studies and more substantial investigations are needed to confirm the effect, although a simple example demonstrates the effect.
Jack Park

SWAML - Semantic Web Archive of Mailing Lists - 0 views

  •  
    SWAML, pronounced [swæml], is a research project around the semantic web technologies to publish the mailing lists' archives into a RDF format. It has been developed by the CTIC Foundation and the WESO-RG at University of Oviedo (Spain). You can visit the project page at BerliOS for more details. SWAML process description SWAML reads a collection of email messages stored in a mailbox (from a mailing list compatible with RFC 4155) and generates a RDF description. It is written in Python using SIOC as the main ontology to represent in RDF a mailing list.
Jack Park

Heritrix - Home Page - 0 views

  •  
    eritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
Jack Park

Yotify - My Scouts - 0 views

  •  
    When it comes to product searches, Yotify is very smart. It doesn't just look at keywords, but also lets you know what the current best price is and then lets you select a checkbox to have the service alert you if the price drops below a certain point. You can also optionally check to be alerted when there are new product reviews available. The shopping section features scouts for common searches like digital cameras and laptops, but the shortcuts section lets you create more specific searches for a keyword, like a product ID or model number. See http://www.readwriteweb.com/archives/stop_searching_the_web_let_yotify_do_it.php
Jack Park

ecai2008_naturalowl.pdf (application/pdf Object) - 0 views

  •  
    See also: http://lists.w3.org/Archives/Public/semantic-web/2008Apr/0005.html NaturalOWL is an open-source natural language generation engine written in Java. It produces descriptions of individuals (e.g., items for sale, museum exhibits) and classes (e.g., types of exhibits) in English and Greek from OWL DL ontologies. The ontologies must have been annotated in RDF with linguistic and user modeling resources. We demonstrate a plug-in for Protege that can be used to produce these resources and to generate texts by invoking NaturalOWL. We also demonstrate how NaturalOWL can be used by robotic avatars in Second Life to describe the exhibits of virtual museums. NaturalOWL demonstrates the benefits of Natural Language Generation (NLG) on the Semantic Web. Organizations that need to publish information about objects, such as exhibits or products, can publish OWL ontologies instead of texts. NLG engines, embedded in browsers or Web servers, can then render the ontologies in multiple natural languages, whereas computer programs may access the ontologies directly.
Jack Park

ORE Specification and User Guide - Table of Contents - 0 views

  •  
    Open Archives Initiative Object Reuse and Exchange (OAI-ORE) defines standards for the description and exchange of aggregations of Web resources. This document provides an introduction and lists the specifications and user guide documents that make up the OAI-ORE standards.
Jack Park

Center for History and New Media » Zotero - 0 views

  •  
    Zotero is an easy-to-use yet powerful research tool that helps you gather, organize, and analyze sources (citations, full texts, web pages, images, and other objects), and lets you share the results of your research in a variety of ways. An extension to the popular open-source web browser Firefox, Zotero includes the best parts of older reference manager software (like EndNote)-the ability to store author, title, and publication fields and to export that information as formatted references-and the best parts of modern software and web applications (like iTunes and del.icio.us), such as the ability to interact, tag, and search in advanced ways. Zotero integrates tightly with online resources; it can sense when users are viewing a book, article, or other object on the web, and-on many major research and library sites-find and automatically save the full reference information for the item in the correct fields. Since it lives in the web browser, it can effortlessly transmit information to, and receive information from, other web services and applications; since it runs on one's personal computer, it can also communicate with software running there (such as Microsoft Word). And it can be used offline as well (e.g., on a plane, in an archive without WiFi).
Jack Park

JSTOR: Home - 0 views

  •  
    JSTOR is a not-for-profit organization dedicated to helping the scholarly community discover, use, and build upon a wide range of intellectual content in a trusted digital archive. Our overarching aims are to preserve a record of scholarship for posterity and to advance research and teaching in cost-effective ways. We operate a research platform that deploys information technology and tools to increase productivity and facilitate new forms of scholarship. We collaborate with organizations that can help us achieve our objectives and maximize the benefits for the scholarly community
Jack Park

Home - MarkMail - 0 views

  •  
    MarkMail is a free service for searching mailing list archives, with huge advantages over traditional search engines. It is powered by MarkLogic Server: Each email is stored internally as an XML document, and accessed using XQuery. All searches, faceted navigation, analytic calculations, and HTML page renderings are performed by a small MarkLogic Server cluster running against millions of messages.
Jack Park

Open Library API (Open Library) - 0 views

  •  
    The Open Library is a project of the Internet Archive. Its goal is to create an online catalog that contains one web page for every book ever published. To do this, it accepts data from a variety of sources: libraries, publishers, book-sellers, and individuals.
Jack Park

Blue Dot is not just another social bookmarking system - 1 views

  •  
    The basic premise of the system is that users can tag items into their online archives and befriend other users to share access to part or all of their items saved. The real differentiation, however, is found in the feature set.
Jack Park

Anecdote: Sensemaking Archives - 0 views

  •  
    I was listening to Melvyn Bragg's radio program, In Our Time , this morning on my iPod. The topic was Albert Camus. In discussing his novel, The Stranger, one of the distinguished panellists felt that Camus was suggesting that meaning is not pre-inscribed in the world around us and we are continuously seeking meaning in an inherently meaningless world. I almost toppled off the step machine. Do we live in an inherently meaningless world? On first thought I think the answer is yes. The onus is on us to make sense of our world. By the way, Melvyn's podcast is a joy. I particularly like its eclectic nature. Today it's Camus, last week The Four Humours, and before that we had The Sassanian Empire, Discovery of Oxygen, Mutation and The Fibonacci Series.
Jack Park

Sputnik Observatory for the Study of Contemporary Culture - 0 views

  •  
    The mission of Sputnik Observatory is to be the world's foremost institute dedicated to the study of contemporary culture. Sputnik Observatory manifests this commitment by documenting, archiving and disseminating the ideas that are shaping the arts, sciences and technology. Central to Sputnik Observatory's mission is the encouragement of an ever deeper understanding and enjoyment of life-long learning that aims to support the advancement of modern thought in society.
Jack Park

Media Cloud - 0 views

  •  
    Media Cloud is a system that lets you see the flow of the media. The Internet is fundamentally altering the way that news is produced and distributed, but there are few comprehensive approaches to understanding the nature of these changes. Media Cloud automatically builds an archive of news stories and blog posts from the web, applies language processing, and gives you ways to analyze and visualize the data.
Jack Park

About in nLab - 0 views

  •  
    The nLab is a collaborative wiki which has grown out of the desire (I, II) of an on-line community communicating via the weblog The n-Category Café of people interested in discussion of expository and research nature about mathematics, physics and philosophy in the light of category theory and higher category theory (the "n" in "nLab") to have a place for development (the "Lab" in "nLab") and indexed archivation of the ideas and concepts that were, are and will be subject of or that developed out of the discussion at the weblog.
Jack Park

Kaleidoscope - 0 views

  •  
    TeLearn is the first international open archive dedicated to research in the field of technology enhanced learning. It accepts research papers and videos, in any language.
Jack Park

Cognition Announces "World's Largest Semantic Map" - ReadWriteWeb - 0 views

  •  
    A Semantic Map is kind of like a dictionary, in that it's a representation of Cognition's ability to define things. Cognition claims that its Semantic Map has over 10 million semantic connections; over 4 million semantic contexts (word meanings that create contexts for specific meanings of other related words); over 536,000 word senses (word and phrase meanings); 75,000 concept classes (or synonym classes of word meanings); 7,500 nodes in the technology's ontology or classification scheme; and 506,000 word stems (roots of words) for the English language.
1 - 20 of 49 Next › Last »
Showing 20 items per page