Group items tagged

PLoS Computational Biology: Defrosting the Digital Library: Bibliographic Tools for the... - 0 views

www.ploscompbiol.org/...10.1371%2Fjournal.pcbi.1000204

shared by Amy West on 12 Nov 08 - Cached

Presently, the number of abstracts considerably exceeds the number of full-text papers,
full papers that are available electronically are likely to be much more widely read and cited
Since all of these libraries are available on the Web, increasing numbers of tools for managing digital libraries are also Web-based. They rely on Uniform Resource Identifiers (URIs [25] or “links”) to identify, name, and locate resources such as publications and their authors.
We often take URIs for granted, but these humble strings are fundamental to the way the Web works [58] and how libraries can exploit it, so they are a crucial part of the cyberinfrastructure [59] required for e-science on the Web.
link to data (the full-text of a given article),
To begin with, a user selects a paper, which will have come proximately from one of four sources: 1) searching some digital library, “SEARCH” in Figure 4; 2) browsing some digital library (“BROWSE”); 3) a personal recommendation, word-of-mouth from colleague, etc., (“RECOMMEND”); 4) referred to by reading another paper, and thus cited in its reference list (“READ”)
There is no universal method to retrieve a given paper, because there is no single way of identifying publications across all digital libraries on the Web
Publication metadata often gets “divorced” from the data it is about, and this forces users to manage each independently, a cumbersome and error-prone process.
There is no single way of representing metadata, and without adherence to common standards (which largely already exist, but in a plurality) there never will be.
Where DOIs exist, they are supposed to be the definitive URI. This kind of automated disambiguation, of publications and authors, is a common requirement for building better digital libraries
Publication metadata are essential for machines and humans in many tasks, not just the disambiguation described above. Despite their importance, metadata can be frustratingly difficult to obtain.
So, given an arbitrary URI, there are only two guaranteed options for getting any metadata associated with it. Using http [135], it is possible for a human (or machine) to do the following.
This technique works, but is not particularly robust or scalable because every time the style of a particular Web site changes, the screen-scraper will probably break as well
This returns metadata only, not the whole resource. These metadata will not include the author, journal, title, date, etc., of
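
The "metadata only" request mentioned in this highlight reads like an HTTP HEAD request. As a minimal sketch under that assumption, the helper below (the name `head_metadata` and the injectable `opener` parameter are illustrative, not from the paper) asks a server for headers without downloading the resource. The headers describe the transfer (content type, length, dates), not the publication's author or title.

```python
import urllib.request


def head_metadata(uri, opener=urllib.request.urlopen, timeout=10):
    """Return the HTTP response headers for `uri` without fetching the body.

    `opener` is injectable (an assumption made for this sketch) so the
    function can be exercised offline with a stand-in for urlopen.
    """
    # method="HEAD" asks the server for headers only.
    request = urllib.request.Request(uri, method="HEAD")
    with opener(request, timeout=timeout) as response:
        # These headers are transport-level metadata, not bibliographic data.
        return dict(response.headers.items())
```

Calling `head_metadata("http://example.org/paper")` (a placeholder URI) would typically return entries such as `Content-Type` and `Content-Length`, which is why bibliographic metadata still has to come from somewhere else.
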
As it stands, it is not possible to perform mundane and seemingly simple tasks such as, “get me all publications that fulfill some criteria and for which I have licensed access as PDF” to save locally, or “get me a specific publication and all those it immediately references”.
Having all these different metadata standards would not be a problem if they could easily be converted to and from each other, a process known as “round-tripping”.
many of these mappings are non-trivial, e.g., XML to RDF and back again
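
To make "round-tripping" concrete, here is a small sketch that converts a record to a tiny subset of RIS-style tagged lines and back. The record layout, the field names, and the choice of RIS tags covered are assumptions for illustration; real converters handle far more tags and edge cases.

```python
# Mapping between illustrative record fields and a small subset of RIS tags.
RIS_TAGS = {"type": "TY", "authors": "AU", "title": "TI", "journal": "JO", "year": "PY"}
FIELDS = {tag: field for field, tag in RIS_TAGS.items()}


def to_ris(record):
    """Serialize a record dict to RIS-style lines (only the mapped fields)."""
    lines = [f"TY  - {record['type']}"]
    for author in record.get("authors", []):
        lines.append(f"AU  - {author}")
    for field in ("title", "journal", "year"):
        if field in record:
            lines.append(f"{RIS_TAGS[field]}  - {record[field]}")
    lines.append("ER  - ")  # end-of-record marker
    return "\n".join(lines)


def from_ris(text):
    """Parse the RIS-style lines produced by to_ris back into a dict."""
    record = {}
    for line in text.splitlines():
        tag, _, value = line.partition("  - ")
        if tag == "ER":
            break
        field = FIELDS.get(tag)
        if field is None:
            continue  # unknown tags are silently dropped
        if field == "authors":
            record.setdefault("authors", []).append(value)
        else:
            record[field] = value
    return record
```

A record that uses only the mapped fields survives `from_ris(to_ris(record))` unchanged. Any extra field, such as a list of user tags, is lost on the way out, which is the kind of mismatch that makes mappings between richer formats non-trivial.
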
more complex metadata such as the inbound and outbound citations, related articles, and “supplementary” information.
Personalization allows users to say this is my library, the sources I am interested in, my collection of references, as well as literature I have authored or co-authored. Socialization allows users to share their personal collections and see who else is reading the same publications, including added information such as related papers with the same keyword (or “tag”) and what notes other people have written about a given publication.
CiteULike normalizes bookmarks before adding them to its database, which means it calculates whether each URI bookmarked identifies an identical publication added by another user, with an equivalent URI. This is important for social tagging applications, because part of their value is the ability to see how many people (and who) have bookmarked a given publication. CiteULike also captures another important bibliometric, viz how many users have potentially read a publication, not just cited it.
Connotea uses MD5 hashes [157] to store URIs that users bookmark, and normalizes them after adding them to its database, rather than before.
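
The two highlights above describe normalizing bookmarked URIs and keying them by MD5 hash. The sketch below shows one way the two steps could fit together. The normalization rules here (lowercase scheme and host, drop default ports and fragments) are assumptions chosen for illustration, not the actual rules used by CiteULike or Connotea.

```python
import hashlib
from urllib.parse import urlsplit, urlunsplit

DEFAULT_PORTS = {"http": 80, "https": 443}


def normalize_uri(uri):
    """Reduce equivalent spellings of a URI to one canonical string."""
    parts = urlsplit(uri.strip())
    scheme = parts.scheme.lower()
    host = (parts.hostname or "").lower()
    port = parts.port
    # Keep the port only when it is not the default for the scheme.
    netloc = host if port in (None, DEFAULT_PORTS.get(scheme)) else f"{host}:{port}"
    path = parts.path or "/"
    # The fragment is dropped: it points inside a page, not at a different one.
    return urlunsplit((scheme, netloc, path, parts.query, ""))


def uri_key(uri):
    """Return the MD5 hex digest of the normalized URI, usable as a lookup key."""
    return hashlib.md5(normalize_uri(uri).encode("utf-8")).hexdigest()
```

With normalization applied before hashing, two users who bookmark `HTTP://Example.org:80/paper?id=1#abstract` and `http://example.org/paper?id=1` (placeholder URIs) end up with the same key, so their bookmarks can be counted as the same publication.
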
The source code for Connotea [159] is available, and there is an API that allows software engineers to build extra functionality around Connotea, for example the Entity Describer [160].
Personalization and socialization of information will increasingly blur the distinction between databases and journals [175], and this is especially true in computational biology where contributions are particularly of a digital nature.
This is usually because they are either too “small” or too “big” to fit into journals.
As we move in biology from a focus on hypothesis-driven to data-driven science [1],[181],[182], it is increasingly recognized that databases, software models, and instrumentation are the scientific output, rather than the conventional and more discursive descriptions of experiments and their results.
In the digital library, these size differences are becoming increasingly meaningless as data, information, and knowledge become more integrated, socialized, personalized, and accessible. Take Postgenomic [183], for example, which aggregates scientific blog posts from a wide variety of sources. These posts can contain commentary on peer-reviewed literature and links into primary database sources. Ultimately, this means that the boundaries between the different types of information and knowledge are continually blurring, and future tools seem likely to continue this trend.
The identity of people is a twofold problem because applications need to identify people as users in a system and as authors of publications.
Passing valuable data and metadata onto a third party requires that users trust the organization providing the service. For large publishers such as Nature Publishing Group, responsible for Connotea, this is not necessarily a problem.
business models may unilaterally change their data model, making the tools for accessing their data backwards incompatible, a common occurrence in bioinformatics.
Although the practice of sharing raw data immediately, as with Open Notebook Science [190], is gaining ground, many users are understandably cautious about sharing information online before peer-reviewed publication.

Amy West on 12 Nov 08

Yes, but Alexandria was also a lot smaller; not totally persuaded by analogy here...

Chronopolis -- Digital Preservation Program -- Long-Term Mass-Scale Federated Digital P... - 0 views

chronopolis.sdsc.edu/about.html

shared by Lisa Johnston on 30 Dec 09 - Cached

Lisa Johnston on 30 Dec 09

The Chronopolis Digital Preservation Demonstration Project, one of the Library of Congress' latest efforts to collect and preserve at-risk digital information, has been officially launched as a multi-member partnership to meet the archival needs of a wide range of cultural and social domains. Chronopolis is a digital preservation data grid framework being developed by the San Diego Supercomputer Center (SDSC) at UC San Diego, the UC San Diego Libraries (UCSDL), and their partners at the National Center for Atmospheric Research (NCAR) in Colorado and the University of Maryland's Institute for Advanced Computer Studies (UMIACS). A key goal of the Chronopolis project is to provide cross-domain collection sharing for long-term preservation. Using existing high-speed educational and research networks and mass-scale storage infrastructure investments, the partnership is designed to leverage the data storage capabilities at SDSC, NCAR, and UMIACS to provide a preservation data grid that emphasizes heterogeneous and highly redundant data storage systems.

DigitalKoans » Blog Archive » Planets Project Deposits "Digital Genome" Ti... - 0 views

digital-scholarship.org/...ime-capsule-in-swiss-fort-knox

shared by Lisa Johnston on 01 Jun 10 - Cached

Lisa Johnston on 01 Jun 10

Over the last decade the digital age has seen an explosion in the rate of data creation. Estimates from 2009 suggest that over 100 GB of data has already been created for every single individual on the planet, ranging from holiday snaps to health records. That's over 1 trillion CDs' worth of data, equivalent to 24 tons of books per person!

Sustainable Digital Preservation and Access - 0 views

www.sdsc.edu/...PR121608_brtf_report.html

data storage best practices

shared by Lisa Johnston on 08 Jan 09 - Cached

Lisa Johnston on 08 Jan 09

While storage and technological issues have been at the forefront of the discussion on digital information, relatively little focus has been on the economic aspect of preserving vast amounts of digital data fundamental to the modern world.

Digital Curation Centre: DCC SCARP Project - 0 views

www.dcc.ac.uk/scarp

report DCC

shared by Lisa Johnston on 25 Jan 10 - Cached

Lisa Johnston on 25 Jan 10

18 January 2010 | Key perspectives | Type: report The Digital Curation Centre is pleased to announce the report "Data Dimensions: Disciplinary Differences in Research Data Sharing, Reuse and Long term Viability" by Key Perspectives, as one of the final outputs of the DCC SCARP project. The project investigated attitudes and approaches to data deposit, sharing and reuse, curation and preservation, over a range of research fields in differing disciplines. The synthesis report (which drew on the SCARP case studies plus a number of others, identified in the Appendix), identifies factors that help understand how curation practices in research groups differ in disciplinary terms. This provides a backdrop to different digital curation approaches.

Digital Scholarship Embraces Tradition and Change, Report Says - Chronicle.com - 0 views

chronicle.com/...7037n.htm

digital preservation

shared by Lisa Johnston on 10 Nov 08 - Cached

Lisa Johnston on 10 Nov 08

Reports on the release of ARL study "Current Models of Digital Scholarly Communication"

Digital Representations of Performing Arts - ESIWiki - 0 views

wiki.esi.ac.uk/esentations_of_Performing_Arts

escience arts

shared by umgeoglib on 28 Jan 09 - Cached

umgeoglib on 28 Jan 09

investigated how e-Science could assist in accurate and appropriate digital representations for future scholarship.

DigCCurr 2009 - Draft Schedule - 0 views

www.ils.unc.edu/...schedule

digital curation

shared by Lisa Johnston on 12 Jan 09 - Cached

Lisa Johnston on 12 Jan 09

Digital Curation conference at UNC-Chapel Hill, April 1-3, 2009

Yale Adopts Open Access Policy for Digitized Images - 2 views

digital-scholarship.org/...ss-policy-for-digitized-images

shared by Lisa Johnston on 16 May 11 - No Cached

Lisa Johnston on 16 May 11

Great approach!

Digital Preservation Courses & Workshops - Digital Preservation Outreach and Education ... - 0 views

digitalpreservation.gov/...index.html

shared by Lisa Johnston on 20 Oct 11 - No Cached

Lisa Johnston on 20 Oct 11

more online training opportunities...the DPOE program

DigitalCommons@UConn - David Lowe and Michael J. Bennett: Digital Project Staff Survey ... - 0 views

digitalcommons.uconn.edu/...16

jpeg2000 digital migration format

shared by Lisa Johnston on 27 Jan 09 - Cached

Lisa Johnston on 27 Jan 09

issues with JPEG 2000 in libraries

Digital Curation Centre: Events: 6th International Digital Curation Conference - 0 views

www.dcc.ac.uk/...index.php

conference workshop

shared by Lisa Johnston on 25 Jan 10 - Cached

Lisa Johnston on 25 Jan 10

The conference is being presented jointly with the Graduate School of Library and Information Science of the University of Illinois at Urbana-Champaign, USA and in partnership with the Coalition for Networked Information (CNI). The 6 December will offer a programme of workshops. The main conference will take place 7-8 December. The call for papers will be released in March and registration will open in September 2010.

What's New: Digital Data Management: What Faculty Told Us - 1 views

wulibraries.typepad.com/...ment-what-faculty-told-us.html

shared by Lisa Johnston on 16 May 11 - No Cached

Interagency Data Stewardship/Citations/provider guidelines - Federation of Earth Scienc... - 0 views

wiki.esipfed.org/...provider_guidelines

data citations guidelines earth science

shared by Amy West on 13 Sep 11 - No Cached

Amy West on 13 Sep 11

Little confused by what's meant by "data sets should be cited like books" since they go on to provide really good reasons why data aren't like books, e.g. need subsetting information, access date for dynamic databases.
  
The guidelines build from the IPY Guidelines and are compatible with the DataCite Metadata Scheme for the Publication and Citation of Research Data, Version 2.2, July 2011.
In some cases, the data set authors may have also published a paper describing the data in great detail. These sorts of data papers should be encouraged, and both the paper and the data set should be cited when the data are used.
Ongoing updates to a time series do change the content of the data set, but they do not typically constitute a new version or edition of a data set. New versions typically reflect changes in sampling protocols, algorithms, quality control processes, etc. Both a new version and an update may be reflected in the release date.
Locator, Identifier, or Distribution Medium
Then it is necessary to include a persistent reference to the location of the data.
This may be the most challenging aspect of data citation. It is necessary to enable "micro-citation" or the ability to refer to the specific data used--the exact files, granules, records, etc.
Data stewards should suggest how to reference subsets of their data. With Earth science data, subsets can often be identified by referring to a temporal and spatial range.
A particular data set may be part of a compilation, in which case it is appropriate to cite the data set somewhat like a chapter in an edited volume.
Increasingly, publishers are allowing data supplements to be published along with peer-reviewed research papers. When using the data supplement one need only cite the parent reference.
Confusingly, a Digital Object Identifier is a locator. It is a Handle based scheme whereby the steward of the digital object registers a location (typically a URL) for the object. There is no guarantee that the object at the registered location will remain unchanged. Consider a continually updated data time series, for example.
While it is desirable to uniquely identify the cited object, it has proven extremely challenging to identify whether two data sets or data files are scientifically identical.
At this point, we must rely on location information combined with other information such as author, title, and version to uniquely identify data used in a study.
The key to making registered locators, such as DOIs, ARKs, or Handles, work unambiguously to identify and locate data sets is through careful tracking and documentation of versions.
how to handle different data set versions relative to an assigned locator.
Track major_version.minor_version.[archive_version].
Typically, something that affects the whole data set like a reprocessing would be considered a major version.
Assign unique locators to major versions.
Old locators for retired versions should be maintained and point to some appropriate web site that explains what happened to the old data if they were not archived.
A new major version leads to the creation of a new collection-level metadata record that is distributed to appropriate registries. The older metadata record should remain with a pointer to the new version and with explanation of the status of the older version data.
Major and minor version should be listed in the recommended citation.
Minor versions should be explained in documentation
Ongoing additions to an existing time series need not constitute a new version. This is one reason for capturing the date accessed when citing the data.
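
The versioning highlights above (track `major_version.minor_version.[archive_version]`, assign unique locators to major versions, list major and minor version in the citation) can be sketched as a few small helpers. The function and class names are hypothetical, and the rule that only a major-version change triggers a new locator is this sketch's reading of the highlighted guidance.

```python
from typing import NamedTuple, Optional


class DataSetVersion(NamedTuple):
    major: int
    minor: int
    archive: Optional[int] = None  # optional third component


def parse_version(text):
    """Parse 'major.minor' or 'major.minor.archive' into a DataSetVersion."""
    parts = [int(p) for p in text.split(".")]
    if len(parts) not in (2, 3):
        raise ValueError(f"expected major.minor[.archive], got {text!r}")
    return DataSetVersion(*parts)


def needs_new_locator(old, new):
    """A new locator is assigned only when the major version changes."""
    return parse_version(new).major != parse_version(old).major


def citation_version(text):
    """Return the version string for a citation: major and minor only."""
    v = parse_version(text)
    return f"Version {v.major}.{v.minor}"
```

Under these assumptions, moving from `1.4` to `2.0` calls for a new locator, moving from `1.4` to `1.5.2` does not, and ongoing additions to a time series change neither number, which is why the access date still matters in the citation.
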
we believe it is currently impossible to fully satisfy the requirement of scientific reproducibility in all situations
To aid scientific reproducibility through direct, unambiguous reference to the precise data used in a particular study. (This is the paramount purpose and also the hardest to achieve). To provide fair credit for data creators or authors, data stewards, and other critical people in the data production and curation process. To ensure scientific transparency and reasonable accountability for authors and stewards. To aid in tracking the impact of data set and the associated data center through reference in scientific literature. To help data authors verify how their data are being used. To help future data users identify how others have used the data.
The ESIP Preservation and Stewardship cluster has examined these and other current approaches and has found that they are generally compatible and useful, but they do not entirely meet all the purposes of Earth science data citation.
In general, data sets should be cited like books.
They need to use the style dictated by their publishers, but by providing an example, data stewards can give users all the important elements that should be included in their citations of data sets
Access Date and Time--because data can be dynamic and changeable in ways that are not always reflected in release dates and versions, it is important to indicate when on-line data were accessed.
Additionally, it is important to provide a scheme for users to indicate the precise subset of data that were used. This could be the temporal and spatial range of the data, the types of files used, a specific query id, or other ways of describing how the data were subsetted.
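
As a rough illustration of how the elements highlighted above (version, locator, access date, subset description) might be assembled into an example citation, here is a small sketch. The field names, the ordering, and the punctuation are assumptions made for this example and should not be read as the format the guidelines prescribe; all sample values in the usage note are placeholders.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass
class DataCitation:
    authors: str
    release_year: int
    title: str
    version: str
    archive: str
    locator: str
    accessed: date
    subset: Optional[str] = None  # e.g. a temporal and spatial range

    def render(self):
        """Join the citation elements into one line of text."""
        parts = [
            f"{self.authors.rstrip('.')}. {self.release_year}.",
            f"{self.title}. Version {self.version}.",
        ]
        if self.subset:
            # The subset records exactly which part of the data was used.
            parts.append(f"[Subset: {self.subset}].")
        parts.append(f"{self.archive}.")
        parts.append(self.locator)
        # The access date covers changes not reflected in version numbers.
        parts.append(f"Accessed {self.accessed.isoformat()}.")
        return " ".join(parts)
```

A data steward could publish a rendered example, for instance `DataCitation("Doe, J.", 2010, "Example Sea Ice Extent", "2.1", "Example Data Center", "https://example.org/data/sea-ice", date(2011, 9, 13), "2005-01-01 to 2005-12-31, 60N to 90N").render()`, and let users adapt it to their publisher's style.
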

University of Minnesota Digital Conservancy: Understanding Research Behaviors, Informat... - 0 views

conservancy.umn.edu/5546

reading InfoGatheringGroup

shared by Lisa Johnston on 07 Nov 08 - Cached

Lisa Johnston on 07 Nov 08

Science Assessment of 2006

PLoS Computational Biology: Defrosting the Digital Library: Bibliographic Tools for the... - 0 views

www.ploscompbiol.org/...journal.pcbi.1000204

socialCitationTools InfoGatheringGroup

shared by Lisa Johnston on 10 Nov 08 - Cached

UC3 Webinars: California Digital Library - 2 views

www.cdlib.org/...uc3webinars.html

shared by Lisa Johnston on 03 Jun 11 - No Cached

Lisa Johnston on 03 Jun 11

free webinars by UC...see the June 30th event on the DMP tool

STFC Rutherford Appleton Laboratory at the Leading Edge when Preserving Digital Scienti... - 2 views

www.prweb.com/...prweb8736998.htm

shared by Lisa Johnston on 24 Aug 11 - No Cached

Penn State Launches Digital Library Archive Initiative with HP - ITS News - 0 views

news.its.psu.edu/story-1110

Unified Digital Formats Registry (UDFR) - 0 views

www.gdfr.info/udfr.html

metadata repostiory

shared by Lisa Johnston on 30 Apr 09 - Cached
