Skip to main content

Home/ OpenSciInfo/ Group items tagged base

Rss Feed Group items tagged

Mike Chelen

PS3Cluster Guide: By The Cluster Workshop - 0 views

  •  
    Our community guide allows you to set up your own MPI (Message Passing Interface) based supercomputer cluster with the Playstation 3. This guide was co-written by Gaurav Khanna, based on his previous work on the Gravity Grid and is a current run-time environment for the research of co-author (Chris Poulin), based on his current work in distributed pattern recognition. As such, we currently utilize the Fedora Core for this infrastructure and illustrate a "how-to" below. NOTE: We focus on the Fedora 8 distribution, due to prevalence of Fedora and its Cell SDK (3.0) compatibility. Finally, this content should be considered open source, and here is the license.
Mike Chelen

Bioclipse - Home - 0 views

  •  
    The Bioclipse project is aimed at creating a Java-based, open source, visual platform for chemo- and bioinformatics based on the Eclipse Rich Client Platform (RCP). Bioclipse, as any RCP application, is based on a plugin architecture that inherits basic functionality and visual interfaces from Eclipse, such as help system, software updates, preferences, cross-platform deployment etc.
Mike Chelen

Neuroscience Information Framework (Main.WebHome) - Neuroscience Information Framework ... - 0 views

  •  
    The advent of the World Wide Web has led to an explosion in the number of diverse resources available to neuroscientists. Despite the availability of powerful search engines, locating these diverse resources has become increasingly difficult and time consuming. The NIF project utilizes both advanced machine-based search technologies and old-fashioned human legwork to provide access to neuroscience-relevant resources on the Web. Resources include research materials, Web pages, software tools, data sets, literature and general information. The NIF has developed technologies that allow a user to search across these different types of resources, all from a single interface. A unique feature of the NIF is the ability to issue direct queries against multiple databases simultaneously, retrieving content that is largely hidden from traditional search engines. A second unique feature is an extensive vocabulary covering major neuroscience domains for describing and searching these resources. The NIF takes advantage of advances in knowledge engineering to broaden and refine searches based on related concepts. The NIF beta test site was developed to gain feedback on the NIF search interface and content. Users will be asked to search the NIF, explore the vocabularies, and answer a questionnaire about their experience.
Mike Chelen

ChemSpider Blog » Blog Archive » Adding Publications to ChemSpider via Digita... - 0 views

  •  
    We are focused on providing tools to our users to ensure that they can add information of interest to structure-based records in ChemSpider. We have introduced DOI-based associations recently allowing users to connect publications of interest to chemical compounds on our database. The process is simple. Find the structure record of interest, use the Add DOI function and Publish. The process is outlined graphically below.
Mike Chelen

USENIX IMC '05 Technical Paper - 0 views

  •  
    Existing studies on BitTorrent systems are single-torrent based, while more than 85% of all peers participate in multiple torrents according to our trace analysis. In addition, these studies are not sufficiently insightful and accurate even for single-torrent models, due to some unrealistic assumptions. Our analysis of representative BitTorrent traffic provides several new findings regarding the limitations of BitTorrent systems: (1) Due to the exponentially decreasing peer arrival rate in reality, service availability in such systems becomes poor quickly, after which it is difficult for the file to be located and downloaded. (2) Client performance in the BitTorrent-like systems is unstable, and fluctuates widely with the peer population. (3) Existing systems could provide unfair services to peers, where peers with high downloading speed tend to download more and upload less. In this paper, we study these limitations on torrent evolution in realistic environments. Motivated by the analysis and modeling results, we further build a graph based multi-torrent model to study inter-torrent collaboration. Our model quantitatively provides strong motivation for inter-torrent collaboration instead of directly stimulating seeds to stay longer. We also discuss a system design to show the feasibility of multi-torrent collaboration.
Mike Chelen

SWAN (Semantic Web Applications in Neuromedicine) Project - 0 views

  •  
    SWAN (Semantic Web Applications in Neuromedicine) is a Web-based collaborative program that aims to organize and annotate scientific knowledge about Alzheimer disease (AD) and other neurodegenerative disorders. Its goal is to facilitate the formation, development and testing of hypotheses about the disease. The ultimate goal of this project is to create tools and resources to manage the evolving universe of data and information about AD in such a way that researchers can easily comprehend their larger context ("what hypothesis does this support or contradict?"), compare and contrast hypotheses ("where do these two hypotheses agree and disagree?"), identify unanswered questions and synthesize concepts and data into ever more comprehensive and useful hypotheses and treatment targets for this disease. The SWAN project is designed to allow the community of AD researchers to author, curate and connect a diversity of data and ideas about AD via secure personal and public SWAN workspaces, using the emerging Semantic Web paradigm for deep interconnection of data, information and knowledge. We are initially focusing on developing a fully public Web resource deployed as part of the Alzheimer Research Forum web site (www.alzforum.org). After the public resource has been launched, we will also develop secure personal workspaces (MySWAN) and semi-private lab workspaces (LabSWAN). An essential component of this project is development of an initial, core knowledge base within SWAN, which will provide immediate value to researchers at the time of deployment. This is a critically important part of our strategy to ensure that the SWAN system gains wide adoption and active participation by the AD research community. As part of our development strategy, we are also recruiting a "beta test" community of AD researchers to enter their own hypotheses, add commentaries and citations, and provide feedback on the technology and content. SWAN is being developed by a collaborative team from
Mike Chelen

processingjs - 0 views

  •  
    Generates an HTML page including a Processing.js visualization based on entered or remote source code.
Mike Chelen

mobibot - 0 views

  •  
    mobibot is the #mobitopia IRC channel bot. It is built on Paul Mutton's PircBot Java-based Framework. mobibot is making extensive use of various open source libraries, including: * Commons CLI * Commons HTTPClient * Commons Logging * Commons Net * delicious-java * Google Tag Library * JWeather * MathEvaluator * Rome * Apache XML-RPC * Twitter4J mobibot was written by Erik C. Thauvin as a replacement for the channel's original ChumpBot. Features mobibot's main functionality is to capture URLs posted on the channel. The URLs are automatically gathered into a publishable RSS feed. Other features include: * Performing calculations * Converting between currencies * Rolling dice * Performing Google searches * Displaying the latest entries on Mobitopia * Performing DNS lookups * Recapping public channel messages * Performing Google spelling queries * Retrieving stock quotes * Displaying the time in various time zones * Listing the users on the channel * Displaying weather information * Posting to Jaiku's #mobitopia channel * Posting to Twitter Some of the internal features include RSS feed backlogs, rolling logs, debugging toggle and much more. If you have any feature suggestions, please post them to the mobibot wiki.
Mike Chelen

SourceForge.net: webcamstudio » home - 0 views

  •  
    What is WebcamStudio? Your own TV studio in your computer for broadcasting thru a virtual webcam. Mix multiple video source as webcams, movies, images, your desktop and log in your prefered video streaming site like Stikam or BlogTV and start your own show or video blog with: - Cool Special Effects - Text overlay - Video Source transition/rotation/movement... - Animations and Faces! - A webcam at your own image See the features! Why for GNU/Linux? Because there are no solution for Linux and Windows users already have acces to this kind of software. And mostly because I use a lot of Linux technologies that are not available under Microsoft Windows. (Sorry guys!) Based on... * Gstreamer libraries and plugins * gstreamer-java libraries * Java 6 (from Sun)
Mike Chelen

NIF - 0 views

  •  
    The Neuroscience Information Framework (NIF) is a dynamic inventory of web-based neurosciences data, resources, and tools that scientists and students can access via any computer connected to the Internet. An initiative of the NIH Blueprint for Neuroscience Research, the NIF will advance neuroscience research by enabling discovery and access to public research data and tools worldwide through an open source, networked environment.
Mike Chelen

Neuroscience Information Framework - 0 views

shared by Mike Chelen on 15 Dec 08 - Cached
  •  
    Through its resource registry and concept based query system, NIF enhances neuroscience research by enabling discovery and access to research data and tools worldwide.
Mike Chelen

FTP Download - 0 views

  •  
    If required, entire databases can be downloaded from our FTP site in a variety of formats, from flat files to MySQL dumps. Please be aware that these files can run to many gigabytes of data. To facilitate storage and download all databases are GNU Zip (gzip, *.gz) compressed. Please note: Ensembl supports downloading of many correlation tables via the highly customisable BioMart data mining tool. You may find exploring this web-based data mining tool easier than extracting information from our database dumps.
Mike Chelen

Qualifying Online Information Resources for Chemists | SciVee - 0 views

shared by Mike Chelen on 11 Dec 08 - Cached
  •  
    his meeting was about "Making the Web Work for Science and the Impact of e-Science and the Cyberinfrastructure." I provided an overview of how access to information has changed over the past 20 years for me. I talked about the challenges for publishers serving the chemistry community and how their business models are being challenged and how I empathize with the struggle to figure out how to deal with it. I talked about quality and how care must be taken when using information online. We are ALL challenged with errors - whether you consider PubChem, ChemSpider, Wikipedia or any of the other online databases they all have errors - how do you find them? Some of them are obvious and I pointed to obvious examples in the talk. I hoped to educate the attendees in regards to the value of InChI which, while not a perfect fit yet, is a great start to structure-based communication of chemistry. I publicly blessed the efforts of publishers such as the RSC and Nature Publishing group for the efforts they are making to support InChI and improve the quality of document presentation online. I blessed CAS as a treasure trove of information and the gold standard of curated chemistry. We need them all to be successful for the sake of our science. The challenge is how to fit into the ongoing proliferation of free access to information without modifying the business models.
Mike Chelen

Main Page - OpenResearch.org - 0 views

  •  
    This semantic wiki at OpenResearch.org aims at making the world of science more visible and accessible. Everybody can add his favorite events (e.g. conferences and workshops), co-workers, tools / datasets, community fora or journals. Pooled together these pieces of information constitute a vast knowledge base about who and what moves science forward.
Mike Chelen

WikiGenes - A wiki for the life sciences where authorship matters. - 0 views

  •  
    WikiGenes is the first wiki system to combine the collaborative and largely altruistic possibilities of wikis with explicit authorship. In view of the extraordinary success of Wikipedia there remains no doubt about the potential of collaborative publishing, yet its adoption in science has been limited. Here I discuss a dynamic collaborative knowledge base for the life sciences that provides authors with due credit and that can evolve via continual revision and traditional peer review into a rigorous scientific
Mike Chelen

Ubuntu -- Details of package avfs in intrepid - 0 views

  •  
    This FUSE-base VFS (Virtual FileSystem) enables all programs to look inside archived or compressed files, or access remote files without recompiling the programs or changing the kernel. At the moment it supports floppies, tar and gzip files, zip, bzip2, ar and rar files, ftp sessions, http, webdav, rsh/rcp, ssh/scp. Quite a few other handlers are implemented with the Midnight Commander's external FS.
Mike Chelen

BioMart - 0 views

shared by Mike Chelen on 11 Dec 08 - Cached
  •  
    BioMart is a query-oriented data management system developed jointly by the Ontario Institute for Cancer Research (OiCR) and the European Bioinformatics Institute (EBI). The system can be used with any type of data and is particularly suited for providing 'data mining' like searches of complex descriptive data. BioMart comes with an 'out of the box' website that can be installed, configured and customised according to user requirements. Further access is provided by graphical and text based applications or programmatically using web services or API written in Perl and Java. BioMart has built-in support for query optimisation and data federation and in addition can be configured to work as a DAS 1.5 Annotation server. The process of converting a data source into BioMart format is fully automated by the tools included in the package. Currently supported RDBMS platforms are MySQL, Oracle and Postgres. BioMart is completely Open Source, licensed under the LGPL, and freely available to anyone without restrictions.
Mike Chelen

Protocol for Implementing Open Access Data - 0 views

  • information for the Internet community
  • distributing data or databases
  • “open” and “open access”
  • ...69 more annotations...
  • requirements for gaining and using the Science Commons Open Access Data Mark and metadata
  • interoperability of scientific data
  • terms and conditions around data make integration difficult to legally perform
  • single license
  • data with this license can be integrated with any other data under this license
  • too many databases under too many terms already
  • unlikely that any one license or suite of licenses will have the correct mix of terms
  • principles for open access data and a protocol for implementing those principles
  • Open Access Data Mark and metadata
  • databases and data
  • the foundation to legally integrate a database or data product
  • another database or data product
  • no mechanisms to manage transfer or negotiations of rights unrelated to integration
  • submitted to Science Commons for certification as a conforming implementation
  • Open Access Data trademarks (icons and phrases) and metadata on databases
  • protocol must promote legal predictability and certainty
  • easy to use and understand
  • lowest possible transaction costs on users
  • Science Commons’ experience in distributing a database licensing Frequently Asked Questions (FAQ) file
  • hard to apply the distinction between what is copyrightable and what is not copyrightable
  • lack of simplicity restricts usage
  • reducing or eliminating the need to make the distinction between copyrightable and non-copyrightable elements
  • satisfy the norms and expectations of the disciplines providing the database
  • norms for citation will differ
  • norms must be attached
  • Converge on the public domain by waiving all rights based on intellectual property
  • reconstruction of the public domain
  • scientific norms to express the wishes of the data provider
  • public domain
  • waiving the relevant rights on data and asserting that the provider makes no claims on the data
  • Requesting behavior, such as citation, through norms rather than as a legal requirement based on copyright or contracts, allows for different scientific disciplines to develop different norms for citation.
  • waive all rights necessary for data extraction and re-use
  • copyright
  • sui generis database rights
  • claims of unfair competition
  • implied contracts
  • and other legal rights
  • any obligations on the user of the data or database such as “copyleft” or “share alike”, or even the legal requirement to provide attribution
  • non-legally binding set of citation norms
  • waiving other statutory or intellectual property rights
  • there are other rights, in addition to copyright, that may apply
  • uncopyrightable databases may be protected in some countries
  • sui generis rights apply in the European Union
  • waivers of sui generis and other legal grounds for database protection
  • no contractual controls
  • using contract, rather than intellectual property or statutory rights, to apply terms to databases
  • affirmatively declare that contractual constraints do not apply to the database
  • interoperation with databases and data not available under the Science Commons Open Access Data Protocol through metadata
  • data that is not or cannot be made available under this protocol
  • owner provides metadata (as data) under this protocol so that the existence of the non-open access data is discoverable
  • digital identifiers and metadata describing non-open access data
  • “Licensing” a database typically means that the “copyrightable elements” of a database are made available under a copyright license
  • Database FAQ, in its first iteration, recommended this method
  • recommendation is now withdrawn
  • copyright begins in and ends in many databases
  • database divided into copyrightable and non copyrightable elements
  • user tends to assume that all is under copyright or none is under copyright
  • share-alike license on the copyrightable elements may be falsely assumed to operate on the factual contents of a database
  • copyright in situations where it is not necessary
  • query across tens of thousands of data records across the web might return a result which itself populates a new database
  • selective waiving of intellectual property rights fail to provide a high degree of legal certainty and ease of use
  • problem of false expectations
  • apply a “copyleft” term to the copyrightable elements of a database, in hopes that those elements result in additional open access database elements coming online
  • uncopyrightable factual content
  • republish those contents without observing the copyleft or share-alike terms
  • cascading attribution if attribution is required as part of a license approach
  • Would a scientist need to attribute 40,000 data depositors in the event of a query across 40,000 data sets?
  • conflict with accepted norms in some disciplines
  • imposes a significant transaction cost
Mike Chelen

ChemSpider - Database of Chemical Structures and Property Predictions - 0 views

  •  
    ChemSpider is a free access service providing a structure centric community for chemists. Providing access to millions of chemical structures and integration to a multitude of other online services ChemSpider is the richest single source of structure-based chemistry information.
Mike Chelen

Open Knowledge Foundation Blog » Blog Archive » Comments on the Science Commo... - 0 views

  • the protocol does not discuss any of the possible attractions of allowing such provisions
  • Protocol gives 3 basic reasons for preferring the ‘PD’ approach
  • Science Commons Protocol for Implementing Open Access Data
  • ...7 more annotations...
  • I am not really convinced by any of these points that attribution or share-alike provisions should not be included in open data licenses
  • application of obligations based on copyright in situations where it is not necessary
  • non-copyrightable elements extends to the entire database and inadvertently infringe
  • If intellectual property rights are involved
  • requirements carrying a stiff penalty for failure
  • selective waiving of intellectual property rights
  • interpretative problems
1 - 20 of 27 Next ›
Showing 20 items per page