
Contents contributed and discussions participated by Amy West

Amy West

2011AGUworkshop - Federation of Earth Science Information Partners - 1 views

  •  
    All the presentations are good, but I found the Data Formats, Creating Documentation & Metadata, Working with an Archive, and Preservation Strategies presentations particularly good. Solid examples of formats, metadata, and real-life preservation. Plus, as managers of UDC/AgEcon, and hopefully more archives over time, I think we should look hard at what they tell researchers to look for in an archive.
Amy West

Education | DataONE - 1 views

  •  
    Like the quizzes embedded at the end of each ppt. Clever.
Amy West

Interagency Data Stewardship/Citations/provider guidelines - Federation of Earth Scienc... - 0 views

    • Amy West
       
      Little confused by what's meant by "data sets should be cited like books," since they go on to give really good reasons why data aren't like books, e.g. the need for subsetting information and an access date for dynamic databases. (A sketch of how those citation elements might be assembled appears after the highlights below.)
  • The guidelines build from the IPY Guidelines and are compatible with the DataCite Metadata Scheme for the Publication and Citation of Research Data, Version 2.2, July 2011.
  • In some cases, the data set authors may have also published a paper describing the data in great detail. These sort of data papers should be encouraged, and both the paper and the data set should be cited when the data are used.
  • ...27 more annotations...
  • Ongoing updates to a time series do change the content of the data set, but they do not typically constitute a new version or edition of a data set. New versions typically reflect changes in sampling protocols, algorithms, quality control processes, etc. Both a new version and an update may be reflected in the release date.
  • Locator, Identifier, or Distribution Medium
  • Then it is necessary to include a persistent reference to the location of the data.
  • This may be the most challenging aspect of data citation. It is necessary to enable "micro-citation" or the ability to refer to the specific data used--the exact files, granules, records, etc.
  • Data stewards should suggest how to reference subsets of their data. With Earth science data, subsets can often be identified by referring to a temporal and spatial range.
  • A particular data set may be part of a compilation, in which case it is appropriate to cite the data set somewhat like a chapter in an edited volume.
  • Increasingly, publishers are allowing data supplements to be published along with peer-reviewed research papers. When using the data supplement one need only cite the parent reference.
  • Confusingly, a Digital Object Identifier is a locator. It is a Handle based scheme whereby the steward of the digital object registers a location (typically a URL) for the object. There is no guarantee that the object at the registered location will remain unchanged. Consider a continually updated data time series, for example.
  • While it is desirable to uniquely identify the cited object, it has proven extremely challenging to identify whether two data sets or data files are scientifically identical.
  • At this point, we must rely on location information combined with other information such as author, title, and version to uniquely identify data used in a study.
  • The key to making registered locators, such as DOIs, ARKS, or Handles, work unambiguously to identify and locate data sets is through careful tracking and documentation of versions.
  • how to handle different data set versions relative to an assigned locator.
  • Track major_version.minor_version.[archive_version].
  • Typically, something that affects the whole data set like a reprocessing would be considered a major version.
  • Assign unique locators to major versions.
  • Old locators for retired versions should be maintained and point to some appropriate web site that explains what happened to the old data if they were not archived.
  • A new major version leads to the creation of a new collection-level metadata record that is distributed to appropriate registries. The older metadata record should remain with a pointer to the new version and with explanation of the status of the older version data.
  • Major and minor version should be listed in the recommended citation.
  • Minor versions should be explained in documentation.
  • Ongoing additions to an existing time series need not constitute a new version. This is one reason for capturing the date accessed when citing the data.
  • we believe it is currently impossible to fully satisfy the requirement of scientific reproducibility in all situations
  • To aid scientific reproducibility through direct, unambiguous reference to the precise data used in a particular study. (This is the paramount purpose and also the hardest to achieve). To provide fair credit for data creators or authors, data stewards, and other critical people in the data production and curation process. To ensure scientific transparency and reasonable accountability for authors and stewards. To aid in tracking the impact of data set and the associated data center through reference in scientific literature. To help data authors verify how their data are being used. To help future data users identify how others have used the data.
  • The ESIP Preservation and Stewardship cluster has examined these and other current approaches and has found that they are generally compatible and useful, but they do not entirely meet all the purposes of Earth science data citation.
  • In general, data sets should be cited like books.
  • They need to use the style dictated by their publishers, but by providing an example, data stewards can give users all the important elements that should be included in their citations of data sets.
  • Access Date and Time--because data can be dynamic and changeable in ways that are not always reflected in release dates and versions, it is important to indicate when on-line data were accessed.
  • Additionally, it is important to provide a scheme for users to indicate the precise subset of data that were used. This could be the temporal and spatial range of the data, the types of files used, a specific query id, or other ways of describing how the data were subsetted.
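    A rough sketch, in Python, of how the citation elements recommended above (authors, release date, title, major and minor version, locator, distributor, access date, and subset description) might be assembled into a recommended citation. The class name, field names, and sample values are invented for illustration and are not taken from the ESIP guidelines themselves.

        # Hypothetical helper: assemble a data set citation from the elements
        # the guidelines recommend (names and values are invented here).
        from dataclasses import dataclass
        from datetime import datetime, timezone
        from typing import Optional

        @dataclass
        class DataSetCitation:
            authors: str                  # e.g. "Smith, J. and Lee, K."
            release_year: int             # release date of the cited version
            title: str
            major_version: int            # a new locator is assigned per major version
            minor_version: int            # minor versions are documented and listed
            locator: str                  # DOI, ARK, Handle, or URL
            distributor: str              # archive or data center distributing the data
            subset: Optional[str] = None  # temporal/spatial range, files, or query id

            def formatted(self) -> str:
                # Access date matters because on-line data can change in ways
                # not reflected in release dates or versions.
                accessed = datetime.now(timezone.utc).strftime("%Y-%m-%d")
                parts = [
                    f"{self.authors} ({self.release_year}).",
                    f"{self.title}, Version {self.major_version}.{self.minor_version}.",
                    f"{self.distributor}. {self.locator}.",
                    f"Accessed {accessed}.",
                ]
                if self.subset:
                    parts.append(f"Subset used: {self.subset}.")
                return " ".join(parts)

        # Example with made-up values:
        print(DataSetCitation(
            authors="Smith, J. and Lee, K.",
            release_year=2011,
            title="Example Surface Temperature Time Series",
            major_version=2,
            minor_version=1,
            locator="doi:10.0000/example",
            distributor="Example Data Center",
            subset="2000-01-01 to 2009-12-31, 40-50N, 90-100W",
        ).formatted())

    Listing the major and minor version in the citation, together with the access date and a subset description, is one way to cover the micro-citation and reproducibility concerns raised in the highlights.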
Amy West

The Enduring Value of Social Science Research: The Use and Reuse of Primary Research Da... - 2 views

  •  
    Paper on data sharing from social sciences perspective; also some analysis of sharing so far.
Amy West

Open access to research data a lot tougher than you think - 2 views

  • It means that researchers need to deal with the formatting and deposition of data, an annoying step when they would rather be focusing on their next project. Given the time lag, it's also difficult to associate the correct metadata with the material that's being archived.
  • According to the commentary, scientists view data deposition as a burden due to the extra work it involves. Research data is usually not in the correct format for submission to repositories when the project is completed, and so the scientist must take the time to convert it.
  • The authors here propose a new approach to data management, where each research institution should employ data managers to work with scientists and administer local, structured data storage. Local storage and support is the preference of most scientists, who would rather not hand off control of their data to remote strangers.
Amy West

Data Citation from the perspective of tracking data reuse - 3 views

  •  
    Heather Piwowar
Amy West

total-impact.org - 2 views

  •  
    Welcome to Total-Impact. This site allows you to track the impact of various online research artifacts. It grabs metrics from many different sites and displays them all in one place.
Amy West

In case you can't read…. | Prof-Like Substance - 1 views

  • When I am putting a talk together it would never occur to me not to include a healthy dose of unpublished data. The only times in my career that I have talked about mostly published data have been when I first started as a postdoc and in the early days of being a PI, when I didn't have enough new data to even make a coherent story, but that accounts for maybe three professional talks out of many.
  • Is it a fear of being scooped or a penchant for keeping one's ideas close to the chest that promotes the Summary Talk?
  • I think it's field dependent. Personally, I can rarely get enough information from a talk to know whether to believe a result or not. This means that unpublished data usually ends up with me thinking "maybe, maybe not".
  • ...10 more annotations...
  • (A good talk like this has enough of a citation on the slide that I can jot down where to go if I want to know details on any particular result.)
  • I'm in a highly competitive biomed field, and I was taught never to present something unless it was either submitted or ready to be submitted.
  • I don't really spend any time worrying about being scooped because I collect my own data.
  • Why look at a poster or talk of 100% published work, I've already seen the stuff in a journal to start with
  • Final year materials chemist = keeping cards close to my chest. Once bitten, never again.
  • In neuro, I'd say that at smaller conferences and less high-profile talks at big conferences (i.e. not keynotes or featured lectures), the bulk of what you're hearing is unpublished. ALL posters are unpublished--in fact, I think (?) it's a rule at SfN that the content of posters can't be published already.
  • In my field I'd guess that most talks include data that is in press or at some close-to-publication stage.
  • A big name should be more generous, but then again they do have to safeguard the career of the student/postdoc who generated the data. Also, the star or keynote speaker is expected to address a wider audience and make their talk relevant to the overall theme of the conference.
  • In my (experimental) social science, most conferences explicitly say that you cannot submit to present already published or even accepted work.
  • In my field (Astronomy), I'd say 95% of the talks are about unpublished data.
  •  
    A blog post & comments on what's preferred in conference presentations: published or unpublished data. Interesting.
Amy West

WHAT EXPLAINS THE GERMAN LABOR MARKET MIRACLE IN THE GREAT RECESSION? - 0 views

  •  
    This paper uses, among other sources, US Bureau of Labor Statistics CPS data covering 1960-2009 to analyze just 2 years of data. The authors do cite the whole CPS, but you have to read the paper to see which bits of that set matter here. The bulk of the paper itself is their explanation of the various statistical methods they used to support their conclusions. The data are neither novel nor unique to them; their analysis, however, may be novel and is certainly unique to them. They also provide some technical documentation, e.g. "we did x with SPSS." So, ideally, it would be nice to have a citation to the paper, a citation to the 2-year subset of data relevant to it, and a citation to the entire BLS CPS data. This is not agricultural economics, but I think pretty similar patterns will be found there too.
Amy West

Democratic Dividends: Stockholding, wealth and politics in New York - 1 views

  •  
    Interesting and frustrating paper. Has a "data appendix" which talks about the data and methodology (good), but doesn't include the data files that had to have been created in order to generate the tables.
Amy West

Data Preservation - Home - 1 views

  •  
    USGS is attempting to corral / manage geological & geophysical preservation efforts.
Amy West

ingentaconnect Citing data sources in the social sciences: do authors do it? - 1 views
