OpenSciInfo / Group items tagged: data

Mike Chelen

Protocol for Implementing Open Access Data - 0 views

  • information for the Internet community
  • distributing data or databases
  • “open” and “open access”
  • ...69 more annotations...
  • requirements for gaining and using the Science Commons Open Access Data Mark and metadata
  • interoperability of scientific data
  • terms and conditions around data make integration difficult to legally perform
  • single license
  • data with this license can be integrated with any other data under this license
  • too many databases under too many terms already
  • unlikely that any one license or suite of licenses will have the correct mix of terms
  • principles for open access data and a protocol for implementing those principles
  • Open Access Data Mark and metadata
  • databases and data
  • the foundation to legally integrate a database or data product
  • another database or data product
  • no mechanisms to manage transfer or negotiations of rights unrelated to integration
  • submitted to Science Commons for certification as a conforming implementation
  • Open Access Data trademarks (icons and phrases) and metadata on databases
  • protocol must promote legal predictability and certainty
  • easy to use and understand
  • lowest possible transaction costs on users
  • Science Commons’ experience in distributing a database licensing Frequently Asked Questions (FAQ) file
  • hard to apply the distinction between what is copyrightable and what is not copyrightable
  • lack of simplicity restricts usage
  • reducing or eliminating the need to make the distinction between copyrightable and non-copyrightable elements
  • satisfy the norms and expectations of the disciplines providing the database
  • norms for citation will differ
  • norms must be attached
  • Converge on the public domain by waiving all rights based on intellectual property
  • reconstruction of the public domain
  • scientific norms to express the wishes of the data provider
  • public domain
  • waiving the relevant rights on data and asserting that the provider makes no claims on the data
  • Requesting behavior, such as citation, through norms rather than as a legal requirement based on copyright or contracts, allows for different scientific disciplines to develop different norms for citation.
  • waive all rights necessary for data extraction and re-use
  • copyright
  • sui generis database rights
  • claims of unfair competition
  • implied contracts
  • and other legal rights
  • any obligations on the user of the data or database such as “copyleft” or “share alike”, or even the legal requirement to provide attribution
  • non-legally binding set of citation norms
  • waiving other statutory or intellectual property rights
  • there are other rights, in addition to copyright, that may apply
  • uncopyrightable databases may be protected in some countries
  • sui generis rights apply in the European Union
  • waivers of sui generis and other legal grounds for database protection
  • no contractual controls
  • using contract, rather than intellectual property or statutory rights, to apply terms to databases
  • affirmatively declare that contractual constraints do not apply to the database
  • interoperation with databases and data not available under the Science Commons Open Access Data Protocol through metadata
  • data that is not or cannot be made available under this protocol
  • owner provides metadata (as data) under this protocol so that the existence of the non-open access data is discoverable
  • digital identifiers and metadata describing non-open access data
  • “Licensing” a database typically means that the “copyrightable elements” of a database are made available under a copyright license
  • Database FAQ, in its first iteration, recommended this method
  • recommendation is now withdrawn
  • copyright begins in and ends in many databases
  • database divided into copyrightable and non copyrightable elements
  • user tends to assume that all is under copyright or none is under copyright
  • share-alike license on the copyrightable elements may be falsely assumed to operate on the factual contents of a database
  • copyright in situations where it is not necessary
  • query across tens of thousands of data records across the web might return a result which itself populates a new database
  • selective waiving of intellectual property rights fail to provide a high degree of legal certainty and ease of use
  • problem of false expectations
  • apply a “copyleft” term to the copyrightable elements of a database, in hopes that those elements result in additional open access database elements coming online
  • uncopyrightable factual content
  • republish those contents without observing the copyleft or share-alike terms
  • cascading attribution if attribution is required as part of a license approach
  • Would a scientist need to attribute 40,000 data depositors in the event of a query across 40,000 data sets?
  • conflict with accepted norms in some disciplines
  • imposes a significant transaction cost
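The waiver-plus-norms split in the annotations above can be made concrete as a small metadata record: rights are waived at the legal layer, while citation expectations live in non-binding community norms. The sketch below is hypothetical; every field name and URI is illustrative, not part of the Science Commons protocol itself.

```python
import json

# Every field name and URI below is illustrative, not part of the protocol.
def make_open_data_record(dataset_uri, waiver_uri, norms_uri):
    """Metadata record asserting a public-domain waiver plus citation norms."""
    return {
        "dataset": dataset_uri,
        "rights_waiver": waiver_uri,       # legal layer: all rights waived
        "citation_norms": norms_uri,       # social layer: non-binding norms
        "contractual_restrictions": None,  # affirmatively: none apply
    }

record = make_open_data_record(
    "http://example.org/data/expression-atlas",
    "https://creativecommons.org/publicdomain/zero/1.0/",
    "http://example.org/norms/citation",
)
print(json.dumps(record, indent=2))
```

Keeping `contractual_restrictions` explicitly `None` mirrors the protocol's point that providers should affirmatively declare that no contractual controls apply, rather than leave the question open.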
Mike Chelen

Science in the open » A breakthrough on data licensing for public science? - 0 views

  • Peter Murray-Rust and others at the Unilever Centre for Molecular Informatics at Cambridge
  • conversation we had over lunch with Peter, Jim Downing, Nico Adams, Nick Day and Rufus Pollock
  • appropriate way to license published scientific data
  • ...27 more annotations...
  • value of share-alike or copyleft provisions of GPL and similar licenses
  • spreading the message and use of Open Content
  • prevent “freeloaders” from being able to use Open material and not contribute back to the open community
  • presumption in this view is that a license is a good, or at least acceptable, way of achieving both these goals
  • allow people the freedom to address their concerns through copyleft approaches
  • Rufus
  • concerned more centrally with enabling re-use and re-purposing of data as far as is possible
  • make it easy for researchers to deliver on their obligations
  • worried by the potential for licensing to make it harder to re-use and re-mix disparate sets of data and content into new digital objects
  • “license”, will have scientists running screaming in the opposite direction
  • we focused on what we could agree on
  • common position statement
  • area of best practice for the publication of data that arises from public science
  • there is a window of opportunity to influence funder positions
  • data sharing policies
  • “following best practice”
  • don’t tend to be concerned about freeloading
  • providing clear guidance and tools
  • if it is widely accepted by their research communities
  • “best practice is X”
  • enable re-use and re-purposing of that data
  • share-alike approaches as a community expectation
  • Explicit statements of the status of data are required and we need effective technical and legal infrastructure to make this easy for researchers.
  • “Where a decision has been taken to publish data deriving from public science research, best practice to enable the re-use and re-purposing of that data, is to place it explicitly in the public domain via {one of a small set of protocols e.g. cc0 or PDDL}.”
  • focuses purely on what should be done once a decision to publish has been made
  • data generated by public science
  • describing this as best practice it also allows deviations that may, for whatever reason, be justified by specific people in specific circumstances
Mike Chelen

BioMart - 0 views

shared by Mike Chelen on 11 Dec 08
  •  
    BioMart is a query-oriented data management system developed jointly by the Ontario Institute for Cancer Research (OiCR) and the European Bioinformatics Institute (EBI). The system can be used with any type of data and is particularly suited to providing 'data mining'-like searches of complex descriptive data. BioMart comes with an 'out of the box' website that can be installed, configured and customised according to user requirements. Further access is provided by graphical and text-based applications, or programmatically using web services or APIs written in Perl and Java. BioMart has built-in support for query optimisation and data federation, and in addition can be configured to work as a DAS 1.5 Annotation server. The process of converting a data source into BioMart format is fully automated by the tools included in the package. Currently supported RDBMS platforms are MySQL, Oracle and Postgres. BioMart is completely Open Source, licensed under the LGPL, and freely available to anyone without restrictions.
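A minimal sketch of building a BioMart query programmatically, modelled on the XML format accepted by the martservice endpoint. The dataset, filter, and attribute names are assumptions and would need to match a real mart's configuration to return data.

```python
# Sketch only: dataset, filter, and attribute names are assumptions that
# must match a real mart's configuration to return data.
def build_biomart_query(dataset, filters, attributes):
    """Return BioMart query XML for a single dataset, TSV output."""
    filter_xml = "".join(
        f'<Filter name="{name}" value="{value}"/>'
        for name, value in filters.items())
    attr_xml = "".join(f'<Attribute name="{a}"/>' for a in attributes)
    return (
        '<?xml version="1.0" encoding="UTF-8"?>'
        '<Query virtualSchemaName="default" formatter="TSV" header="0">'
        f'<Dataset name="{dataset}" interface="default">'
        f"{filter_xml}{attr_xml}"
        "</Dataset></Query>"
    )

query = build_biomart_query(
    "hsapiens_gene_ensembl",
    {"chromosome_name": "21"},
    ["ensembl_gene_id", "external_gene_name"],
)
print(query)
# To submit (a live network call, so left commented):
#   import urllib.parse, urllib.request
#   resp = urllib.request.urlopen(
#       "https://www.ensembl.org/biomart/martservice",
#       urllib.parse.urlencode({"query": query}).encode())
#   print(resp.read().decode())
```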
Mike Chelen

Open Knowledge Foundation Blog » Blog Archive » Open Data: Openness and Licen... - 0 views

  • Why bother about openness and licensing for data
  • It’s crucial because open data is so much easier to break up and recombine, to use and reuse.
  • want people to have incentives to make their data open and for open data to be easily usable and reusable
  • ...8 more annotations...
  • good definition of openness acts as a standard that ensures different open datasets are ‘interoperable’
  • Licensing is important because it reduces uncertainty. Without a license you don’t know where you, as a user, stand: when are you allowed to use this data? Are you allowed to give it to others? To distribute your own changes, etc.?
  • licensing and definitions are important even though they are only a small part of the overall picture
  • If we get them wrong they will keep on getting in the way of everything else.
  • Everyone agrees that requiring attribution is OK
    • Mike Chelen
       
      My opinion is that there should be no requirements, including attribution, and that standards should be community-based instead of legal.
  • Even if a basic license is used it can be argued that any ‘requirements’ for attribution or share-alike should not be in a license but in ‘community norms’.
    • Mike Chelen
       
      Licenses and community norms are not exclusive. It's recommended to adopt a Public Domain license, and encourage attribution through community standards.
  • A license is likely to elicit at least as much, and almost certainly more, conformity with its provisions than community norms.
    • Mike Chelen
       
      Ease of access should be the goal, not conformity.
  • (even to a user it is easy to comply with the open license)
    • Mike Chelen
       
      It is important to specifically publish using a Public Domain dedication.
  •  
    Why bother about openness and licensing for data? After all they don't matter in themselves: what we really care about are things like the progress of human knowledge or the freedom to understand and share.
Mike Chelen

SWAN (Semantic Web Applications in Neuromedicine) Project - 0 views

  •  
    SWAN (Semantic Web Applications in Neuromedicine) is a Web-based collaborative program that aims to organize and annotate scientific knowledge about Alzheimer disease (AD) and other neurodegenerative disorders. Its goal is to facilitate the formation, development and testing of hypotheses about the disease. The ultimate goal of this project is to create tools and resources to manage the evolving universe of data and information about AD in such a way that researchers can easily comprehend their larger context ("what hypothesis does this support or contradict?"), compare and contrast hypotheses ("where do these two hypotheses agree and disagree?"), identify unanswered questions and synthesize concepts and data into ever more comprehensive and useful hypotheses and treatment targets for this disease. The SWAN project is designed to allow the community of AD researchers to author, curate and connect a diversity of data and ideas about AD via secure personal and public SWAN workspaces, using the emerging Semantic Web paradigm for deep interconnection of data, information and knowledge. We are initially focusing on developing a fully public Web resource deployed as part of the Alzheimer Research Forum web site (www.alzforum.org). After the public resource has been launched, we will also develop secure personal workspaces (MySWAN) and semi-private lab workspaces (LabSWAN). An essential component of this project is development of an initial, core knowledge base within SWAN, which will provide immediate value to researchers at the time of deployment. This is a critically important part of our strategy to ensure that the SWAN system gains wide adoption and active participation by the AD research community. As part of our development strategy, we are also recruiting a "beta test" community of AD researchers to enter their own hypotheses, add commentaries and citations, and provide feedback on the technology and content. SWAN is being developed by a collaborative team from
Mike Chelen

FTP Download - 0 views

  •  
    If required, entire databases can be downloaded from our FTP site in a variety of formats, from flat files to MySQL dumps. Please be aware that these files can run to many gigabytes of data. To facilitate storage and download all databases are GNU Zip (gzip, *.gz) compressed. Please note: Ensembl supports downloading of many correlation tables via the highly customisable BioMart data mining tool. You may find exploring this web-based data mining tool easier than extracting information from our database dumps.
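Since the dumps are gzip-compressed and can run to many gigabytes, decompression is best done as a stream rather than in memory. A short Python sketch (the filename and file contents below are invented; a real dump would be fetched from the FTP site first):

```python
import gzip
import os
import shutil
import tempfile

def gunzip(src_path, dest_path):
    """Stream-decompress a .gz dump without loading it all into memory."""
    with gzip.open(src_path, "rb") as src, open(dest_path, "wb") as dest:
        shutil.copyfileobj(src, dest)

# Demo with an invented two-line table standing in for a real dump file.
with tempfile.TemporaryDirectory() as tmp:
    gz_path = os.path.join(tmp, "gene_table.txt.gz")
    txt_path = os.path.join(tmp, "gene_table.txt")
    with gzip.open(gz_path, "wt") as f:
        f.write("gene_id\tname\nENSG0000001\tTP53\n")
    gunzip(gz_path, txt_path)
    with open(txt_path) as f:
        print(f.read(), end="")
```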
Mike Chelen

Open Knowledge Foundation Blog » Blog Archive » Comments on the Science Commo... - 0 views

  • the protocol does not discuss any of the possible attractions of allowing such provisions
  • Protocol gives 3 basic reasons for preferring the ‘PD’ approach
  • Science Commons Protocol for Implementing Open Access Data
  • ...7 more annotations...
  • I am not really convinced by any of these points that attribution or share-alike provisions should not be included in open data licenses
  • application of obligations based on copyright in situations where it is not necessary
  • non-copyrightable elements extends to the entire database and inadvertently infringe
  • If intellectual property rights are involved
  • requirements carrying a stiff penalty for failure
  • selective waiving of intellectual property rights
  • interpretative problems
Mike Chelen

The National Center for Biomedical Ontology - 0 views

  •  
    The National Center for Biomedical Ontology is a consortium of leading biologists, clinicians, informaticians, and ontologists who develop innovative technology and methods allowing scientists to create, disseminate, and manage biomedical information and knowledge in machine-processable form. Our vision is that all biomedical knowledge and data are disseminated on the Internet using principled ontologies, such that they are semantically interoperable and useful for improving biomedical science and clinical care. Our resources include the Open Biomedical Ontologies (OBO) library, the Open Biomedical Data (OBD) repositories, and tools for accessing and using this information in research. The Center collaborates with biomedical researchers conducting Driving Biological Projects to enable outside research and stimulate technology development in the Center. The Center undertakes outreach and educational activities (Biomedical Informatics Program) to train future researchers to use biomedical ontologies and related tools with the goal of enhancing scientific discovery.
Mike Chelen

SourceForge.net: CloudBurst - cloudburst-bio - 0 views

  •  
    CloudBurst: Highly Sensitive Short Read Mapping with MapReduce Michael Schatz Center for Bioinformatics and Computational Biology, University of Maryland Next-generation DNA sequencing machines are generating an enormous amount of sequence data, placing unprecedented demands on traditional single-processor read mapping algorithms. CloudBurst is a new parallel read-mapping algorithm optimized for mapping next-generation sequence data to the human genome and other reference genomes, for use in a variety of biological analyses including SNP discovery, genotyping, and personal genomics. It is modeled after the short read mapping program RMAP, and reports either all alignments or the unambiguous best alignment for each read with any number of mismatches or differences. This level of sensitivity could be prohibitively time consuming, but CloudBurst uses the open-source Hadoop implementation of MapReduce to parallelize execution using multiple compute nodes. CloudBurst's running time scales linearly with the number of reads mapped, and with near linear speedup as the number of processors increases. In a 24-processor core configuration, CloudBurst is up to 30 times faster than RMAP executing on a single core, while computing an identical set of alignments. In a large remote compute cloud with 96 cores, CloudBurst reduces the running time from hours to mere minutes for typical jobs involving mapping of millions of short reads to the human genome. CloudBurst is available open-source as a model for parallelizing other bioinformatics algorithms with MapReduce.
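The seed-and-extend idea behind CloudBurst can be illustrated without Hadoop. The toy sketch below mimics the map phase (emit fixed-length seeds from the reference and the reads) and the reduce phase (join on shared seeds, then verify full alignments within a mismatch budget). The sequences are invented, and the real tool distributes both phases across compute nodes with MapReduce.

```python
from collections import defaultdict

# Pigeonhole principle: a read with at most k mismatches must share at
# least one exact seed of length len(read) // (k + 1) with the reference.
def map_seeds(reference, reads, seed_len):
    """'Map' phase: emit seed -> positions for the reference and the reads."""
    ref_seeds = defaultdict(list)
    for i in range(len(reference) - seed_len + 1):
        ref_seeds[reference[i:i + seed_len]].append(i)
    read_seeds = defaultdict(list)
    for name, read in reads.items():
        for j in range(0, len(read) - seed_len + 1, seed_len):
            read_seeds[read[j:j + seed_len]].append((name, j))
    return ref_seeds, read_seeds

def reduce_extend(reference, reads, ref_seeds, read_seeds, max_mismatches):
    """'Reduce' phase: join on shared seeds, verify the full alignment."""
    hits = set()
    for seed, occurrences in read_seeds.items():
        for ref_pos in ref_seeds.get(seed, []):
            for name, offset in occurrences:
                start = ref_pos - offset
                read = reads[name]
                if start < 0 or start + len(read) > len(reference):
                    continue
                window = reference[start:start + len(read)]
                mism = sum(a != b for a, b in zip(read, window))
                if mism <= max_mismatches:
                    hits.add((name, start, mism))
    return hits

reference = "ACGTACGTTTGACCAGT"
reads = {"r1": "ACGTTTGA", "r2": "ACGTTCGA"}  # r2 carries one mismatch
ref_seeds, read_seeds = map_seeds(reference, reads, seed_len=4)
print(sorted(reduce_extend(reference, reads, ref_seeds, read_seeds, 1)))
# → [('r1', 4, 0), ('r2', 4, 1)]
```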
Mike Chelen

SourceForge.net: Running CloudBurst on Amazon EC2 - cloudburst-bio - 0 views

  •  
    Hadoop comes bundled with launch scripts to simplify initializing an Amazon Elastic Compute Cloud (EC2) cloud for Hadoop. Once initialized, running CloudBurst is identical to running on a local cluster. If you use EC2 regularly with the same datasets (e.g. the human genome as a reference), you will probably want to copy the data once to Amazon Simple Storage Service (S3) so you can quickly copy the data from S3 to your cloud at low cost.
Mike Chelen

NIF - 0 views

  •  
    The Neuroscience Information Framework (NIF) is a dynamic inventory of web-based neurosciences data, resources, and tools that scientists and students can access via any computer connected to the Internet. An initiative of the NIH Blueprint for Neuroscience Research, the NIF will advance neuroscience research by enabling discovery and access to public research data and tools worldwide through an open source, networked environment.
Mike Chelen

A pitfall of wiki solution for biological database...[Brief Bioinform. 2008] - PubMed R... - 0 views

  •  
    Many biologists consider a wiki a solution for managing and reorganizing data as a community. However, in its basic functionality, a wiki lacks a means to check data consistency and is not suitable as a database. To circumvent this pitfall, page dependency must be installed through in-line page searches. We also introduce two existing approaches that support in-line queries.
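The "in-line page search" remedy can be sketched as render-time query expansion: instead of a hand-maintained list that drifts out of sync with the pages it describes, a page embeds a query that is re-evaluated against the current page set on every render. The token syntax and page data below are invented for illustration.

```python
import re

# Invented page store: each page has tags and a body; GeneIndex embeds a
# query token instead of a hand-maintained (and easily stale) list.
pages = {
    "TP53": {"tags": ["gene", "tumor-suppressor"], "body": "TP53 encodes p53."},
    "BRCA1": {"tags": ["gene"], "body": "BRCA1 is a DNA-repair gene."},
    "GeneIndex": {"tags": [], "body": "All genes: {{query:tag=gene}}"},
}

def render(title):
    """Expand {{query:tag=...}} tokens into a live, always-consistent list."""
    def expand(match):
        tag = match.group(1)
        hits = sorted(t for t, p in pages.items() if tag in p["tags"])
        return ", ".join(hits)
    return re.sub(r"\{\{query:tag=([\w-]+)\}\}", expand, pages[title]["body"])

print(render("GeneIndex"))  # → All genes: BRCA1, TP53
```

Because the list is computed at render time, adding a new page with the `gene` tag updates the index automatically; this is the page dependency the abstract says plain wikis lack.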
Mike Chelen

Public MySQL Server - 0 views

  • For large amounts of data and more detailed analysis, we recommend you use our publicly-accessible MySQL server, ensembldb.ensembl.org, which you can access as user 'anonymous'. A second server, martdb.ensembl.org, provides public access to the BioMart databases.
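A sketch of querying the public server from Python. PyMySQL is an assumption (a third-party package, not something Ensembl names), the database-name pattern is illustrative, and the connection is kept inside a function so the sketch itself runs without network access.

```python
# Connection details from the Ensembl documentation; the SQL is illustrative
# (core database names are versioned, so a LIKE pattern is used).
HOST, USER = "ensembldb.ensembl.org", "anonymous"
SQL = "SHOW DATABASES LIKE 'homo_sapiens_core%'"

def connect_and_run(sql):
    """Run one query against the public server (requires network access)."""
    import pymysql  # assumption: a third-party driver, e.g. pip install pymysql
    conn = pymysql.connect(host=HOST, user=USER)
    try:
        with conn.cursor() as cur:
            cur.execute(sql)
            return cur.fetchall()
    finally:
        conn.close()

print(f"would run {SQL!r} on {HOST} as {USER}")
```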
Mike Chelen

Main Page - GenBioWiki - 0 views

  •  
    GenBioWiki is the student home page for the Genetics, Bioinformatics, and Computational Biology (GBCB) program at Virginia Tech. Bioinformatics and computational biology provide a research platform to acquire, manage, analyze, and display large amounts of data, which in turn catalyze a systems approach to understanding biological organisms, as well as making useful predictions about their behavior in response to environmental and other perturbations. Moreover, bioinformatics is the study of biological systems and large biological data sets using analytical methods borrowed from computer science, mathematics, statistics, and the physical sciences. This transdisciplinary approach to research requires graduates with extensive cross-cultural professional and technical training and provides ample employment opportunities for Ph.D. graduates. [1]
Mike Chelen

BioLit Project - 0 views

  •  
    The establishment of open access literature makes it possible for knowledge to be extracted from scholarly articles and included in other resources. BioLit aims to extract database identifiers and rich meta-data from open access articles in the life sciences and integrate that information with existing biological databases. We have begun prototyping this effort using a clone of the RCSB Protein Data Bank, a database of macromolecular structures.
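The identifier-extraction step BioLit describes can be sketched with a regular expression. The PDB ID shape (a digit followed by three alphanumeric characters) is real, but the surrounding context pattern and the example text are assumptions; a production extractor would validate every hit against the PDB itself.

```python
import re

# Loose pattern: 'PDB' (optionally 'PDB ID'), a separator, then a PDB-shaped
# identifier. The word boundary and required 'PDB' context reduce false
# positives from ordinary four-character words.
PDB_ID = re.compile(r"\bPDB(?:\s+ID)?[:\s]+([1-9][A-Za-z0-9]{3})\b")

def extract_pdb_ids(text):
    """Return the sorted, de-duplicated PDB IDs mentioned in text."""
    return sorted({m.group(1).upper() for m in PDB_ID.finditer(text)})

article = ("The structure (PDB ID: 1TUP) shows p53 bound to DNA, "
           "refined against PDB 2AC0.")
print(extract_pdb_ids(article))  # → ['1TUP', '2AC0']
```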
Mike Chelen

YouTube - Hans Rosling: No more boring data: TEDTalks - 0 views

  •  
    With the drama and urgency of a sportscaster, statistics guru Hans Rosling uses an amazing new presentation tool, Gapminder, to debunk several myths about world development. Rosling is professor of international health at Sweden's Karolinska Institute, and founder of Gapminder, a nonprofit that brings vital global data to life. (Recorded February 2006 in Monterey, CA.)
Mike Chelen

de.bezier.mysql - 0 views

  •  
    Processing (BETA) library to communicate with MySQL (or any other SQL) databases. Note that due to Java security restrictions this will not work with applets "out of the box", and that many remote MySQL servers will only allow local access ("localhost") or connections from trusted hosts (see notes). Also note that you should have some experience with SQL to put, change and retrieve data from the database.
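The library itself is Java/Processing, but the SQL workflow it assumes (put, change, and retrieve data) can be sketched language-neutrally against SQLite, which ships with Python; the table name and values below are invented.

```python
import sqlite3

# An in-memory SQLite database stands in for a MySQL server; the SQL for
# insert/update/select is the same basic workflow the library expects.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE readings (sensor TEXT, value REAL)")
conn.execute("INSERT INTO readings VALUES (?, ?)", ("probe_a", 21.5))  # put
conn.execute("UPDATE readings SET value = ? WHERE sensor = ?",
             (22.0, "probe_a"))                                        # change
rows = conn.execute("SELECT sensor, value FROM readings").fetchall()   # retrieve
print(rows)  # → [('probe_a', 22.0)]
```

Parameterized queries (the `?` placeholders) are worth the habit even in a sketch: they avoid SQL injection and quoting bugs when real data is substituted in.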