Group items tagged database - sensemaking

An Architecture and Object Model for Distributed Object-Oriented Real-Time Databases - ... - 0 views

citeseerx.ist.psu.edu/...summary

citeseerx collaboration architecture beehive

shared by Jack Park on 05 Jan 09 - Cached

Jack Park on 05 Jan 09

The confluence of computers, communications, and databases is quickly creating a distributed database where many applications require real-time access to both temporally accurate and multimedia data. This is particularly true in military and intelligence applications, but these features are needed in many commercial applications as well. We are developing a distributed database, called BeeHive, which could offer features along different types of requirements: real-time, fault-tolerance, security, and quality-of-service for audio and video. Support of these features and potential trade-offs between them could provide a significant improvement in performance and functionality over current distributed database and object management systems. In this paper, we present a high-level design for the BeeHive architecture and sketch the design of the BeeHive Object Model (BOM), which extends object-oriented data models by incorporating time and other features into objects, resulting in a highly reflective architecture.

Open Data Commons » Open Database Licence (ODbL) - 0 views

www.opendatacommons.org/...odbl

odbl data database license

shared by Jack Park on 29 Jun 09 - Cached

Jack Park on 29 Jun 09

This {DATA(BASE)-NAME} is made available under the Open Database License: http://opendatacommons.org/licenses/odbl/1.0/. Any rights in individual contents of the database are licensed under the Database Contents License: http://opendatacommons.org/licenses/dbcl/1.0/

Carrollogos: Copyright in Databases - 0 views

carrollogos.blogspot.com/...copyright-in-databases.html

carrollogos copyright databases blog tutorial law

shared by Jack Park on 21 Feb 09 - Cached

Jack Park on 21 Feb 09

I'm going to have more to say about data, databases, and intellectual property rights in the coming months. This longish post provides a basic primer on how U.S. copyright law applies to databases.

HCLSIG BioRDF Subgroup/aTags - ESW Wiki - 1 views

esw.w3.org/...aTags

atags biordf bio-ontologies bioinformatics tags tagging

shared by Jack Park on 06 Jan 09 - Cached

Jack Park on 06 Jan 09

1. The primary intention of creating aTags is not the categorization of the document, but the representation of the key facts inside the document. Key facts in the biomedical domain might be, for example, "Protein A interacts with protein B" or "Overexpression of protein A in tissue B is the cause of disease C".
2. An aTag comprises a set of associated entities. The size of the set is arbitrary, but will typically lie between 2 and 5 entities. For example, the fact "Protein A binds to protein B" can be represented with an aTag comprising the three entities "Protein A", "Molecular interaction" and "Protein B". Similarly, the fact "Overexpression of protein A in tissue B is the cause of disease C" can be represented with an aTag comprising the four entities "Overexpression", "Protein A", "Tissue B" and "Disease C".
3. Each document or database entry can be described with an arbitrary number of such aTags. Each aTag can be associated with the relevant portions of text or data at a fine granularity.
4. The entities in an aTag are not simple strings, but resources that are part of ontologies and RDF/OWL-enabled databases. For example, "Protein A" and "Protein B" are resources that are defined in the UniProt database, whereas "Molecular Interaction" is a class in the branch of biological processes of the Gene Ontology. They are identified with their URIs.
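
As a rough illustration of the aTag description above, here is a minimal Python sketch of an aTag as a set of entity URIs attached to a source document. The class name `ATag`, the helper `facts_about`, and all `example.org` URIs are hypothetical placeholders, not part of the aTags proposal or of any UniProt or Gene Ontology identifier scheme.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ATag:
    """One key fact: a set of associated entities, each identified by a URI."""
    entities: frozenset  # entity URIs (placeholders in this sketch)
    source: str = ""     # document or database entry the fact describes


# Placeholder URIs (assumption); real aTags would reference ontology resources.
PROTEIN_A = "http://example.org/uniprot/ProteinA"
PROTEIN_B = "http://example.org/uniprot/ProteinB"
INTERACTION = "http://example.org/go/MolecularInteraction"

# "Protein A binds to protein B" as an aTag of three entities.
binds = ATag(
    entities=frozenset({PROTEIN_A, INTERACTION, PROTEIN_B}),
    source="http://example.org/doc/1",
)


def facts_about(entity_uri, atags):
    """Return every aTag that mentions the given entity URI."""
    return [tag for tag in atags if entity_uri in tag.entities]
```

Because the entities are URIs rather than strings, a lookup such as `facts_about(PROTEIN_A, [binds])` matches on identity of the resource, not on spelling of a label.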

triplify.org : About - 0 views

triplify.org/About

database json rdf relational triplify

shared by Jack Park on 01 Sep 08 - Cached

Jack Park on 01 Sep 08

Triplify is based on the definition of relational database queries for a specific Web application in order to retrieve valuable information and to convert the results of these queries into RDF, JSON and Linked Data. Experience has shown that for most Web applications a relatively small number of queries (mostly between 3 and 7) is sufficient to extract the important information. After generating such database views, the Triplify software can be used to convert the views into an RDF, JSON or Linked Data representation, which can be shared and accessed on the (Semantic) Web.
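
The general idea of turning query results into triples can be sketched in a few lines of Python. This is not Triplify's own code or configuration format; the base URI, the `rows_to_ntriples` helper, the `posts` table, and the convention that the first column is the row identifier are all assumptions made for illustration, and literal escaping is simplified.

```python
import sqlite3

BASE = "http://example.org/"  # hypothetical base URI for generated resources


def rows_to_ntriples(conn, query, entity):
    """Run a query whose first column is the row id and emit one
    N-Triples-style line per remaining non-null column."""
    cur = conn.execute(query)
    columns = [d[0] for d in cur.description]
    lines = []
    for row in cur:
        subject = f"<{BASE}{entity}/{row[0]}>"
        for name, value in zip(columns[1:], row[1:]):
            if value is None:
                continue
            # Simplified escaping: backslashes and double quotes only.
            literal = str(value).replace("\\", "\\\\").replace('"', '\\"')
            lines.append(f'{subject} <{BASE}vocab/{name}> "{literal}" .')
    return lines


# Toy database standing in for a Web application's schema.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE posts (id INTEGER PRIMARY KEY, title TEXT, author TEXT)")
conn.execute("INSERT INTO posts VALUES (1, 'Hello', 'jack')")

triples = rows_to_ntriples(conn, "SELECT id, title, author FROM posts", "post")
```

Each selected column becomes a predicate under the hypothetical `vocab/` namespace, so a handful of such queries is enough to expose the main entities of an application.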

PATIKA Project Web site - 0 views

www.patika.org

bio-ontologies biofuel databases pathways patika

shared by Jack Park on 19 Oct 08 - Cached

Jack Park on 19 Oct 08

This is the homepage for an ongoing research and development project named PATIKA - Pathway Analysis Tools for Integration and Knowledge Acquisition. Within this project so far, among others, an ontology has been defined; a pathway database (which integrates and interfaces with several public pathway databases) has been constructed; and some software tools have been developed for effective integration, querying, analysis, and manipulation of pathway data.

Official Google Research Blog: Google Fusion Tables - 0 views

googleresearch.blogspot.com/...google-fusion-tables.html

fusion database collaboration cloudcomputing google

shared by Jack Park on 13 Jun 09 - Cached

Jack Park on 13 Jun 09

Database systems are notorious for being hard to use. It is even more difficult to integrate data from multiple sources and collaborate on large data sets with people outside your organization. Without an easy way to offer all the collaborators access to the same server, data sets get copied, emailed and ftp'd--resulting in multiple versions that get out of sync very quickly. Today we're introducing Google Fusion Tables on Labs, an experimental system for data management in the cloud. It draws on the expertise of folks within Google Research who have been studying collaboration, data integration, and user requirements from a variety of domains. Fusion Tables is not a traditional database system focusing on complicated SQL queries and transaction processing. Instead, the focus is on fusing data management and collaboration: merging multiple data sources, discussion of the data, querying, visualization, and Web publishing. We plan to iteratively add new features to the systems as we get feedback from users.

CrunchBase, The Free Tech Company Database - 0 views

www.crunchbase.com

web2.0 directory startup companies entrepreneurship techcrunch list database

shared by Jack Park on 11 Dec 08 - Cached

Jack Park on 11 Dec 08

CrunchBase is the free database of technology companies, people, and investors that anyone can edit.

List of bioinformatics databases - Biohack - 0 views

heybryan.org/...st_of_bioinformatics_databases

bioinformatics databases

shared by Jack Park on 10 Jul 08 - Cached

Jack Park on 10 Jul 08

List of bioinformatics databases

Welcome to FlyTED - 0 views

www.fly-ted.org

flyted drosophila genome data

shared by Jack Park on 05 Nov 08 - Cached

Jack Park on 05 Nov 08

FlyTED, the Drosophila Testis Gene Expression Database, is a public database currently containing 1,947 mRNA in situ images and ancillary data revealing the extent of expression of 623 individual genes involved in spermatogenesis in the testis of the fruitfly, Drosophila melanogaster, both in normal wild type flies and in seven meiotic arrest mutant strains.

The Trade & Environment Database - 0 views

gurukul.ucc.american.edu/...ted.htm

data database environment sensemaking

shared by Jack Park on 18 Apr 09 - Cached

Jack Park on 18 Apr 09

The Trade & Environment Database (TED) is a collection of categorical case studies that began with a focus on solely environmental issues, but did not include the economic consequences of other social policy choices, such as culture, rights, or other issues.

A Lightweight SQL Database for Cloud and Web in Launchpad - 0 views

launchpad.net/drizzle

database drizzle mysql SQL web scalability db rdbms cloudcomputing

shared by Jack Park on 15 May 09 - No Cached

Jack Park on 15 May 09

The Drizzle project is building a database optimized for Cloud and Net applications. It is being designed for massive concurrency on modern multi-cpu/core architecture. The code is originally derived from MySQL.

Technology Review: A Web Spider for Everyone - 1 views

www.technologyreview.com/printer_friendly_article.aspx

spider discovery

shared by Jack Park on 29 Sep 09 - Cached

Jack Park on 29 Sep 09

A user can start a Web crawl through 80legs's Web-based interface. The form on the company's site lets them set parameters for the project and upload custom code needed to control how the crawler does its job. For example, a user might want the crawler to find images and check them against a database of copyrighted ones. Deysarkar says his company's crawlers are capable of processing up to two billion pages a day. The company charges $2 for every million pages crawled, plus a fee of three cents per hour of processing used.
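
The quoted pricing can be worked through with a small calculation. The sketch below assumes both charges scale linearly (no minimums, tiers, or rounding), which the article does not state; the function name `crawl_cost` is made up for this example.

```python
def crawl_cost(pages, processing_hours):
    """Estimated cost in US dollars: $2 per million pages crawled
    plus 3 cents per hour of processing (linear pricing assumed)."""
    cents = 200 * pages / 1_000_000 + 3 * processing_hours
    return cents / 100


# Example: 5 million pages and 100 hours of processing.
example = crawl_cost(5_000_000, 100)
```

Under these assumptions the example works out to $10 for the pages plus $3 for processing, or $13 in total.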

mysqlicious - Google Code - 0 views

code.google.com/mysqlicious

mysqlicious database mysql delicious opensource php

shared by Jack Park on 23 Jan 09 - Cached

Jack Park on 23 Jan 09

MySQLicious provides automated mirroring/backups of Delicious bookmarks into a MySQL database.

Sphinx - Free open-source SQL full-text search engine - 0 views

www.sphinxsearch.com

search mysql database sphinx sql opensource php indexing searchengine gpl

shared by Jack Park on 21 Dec 08 - Cached

Jack Park on 21 Dec 08

Sphinx is a full-text search engine, distributed under GPL version 2. A commercial license is also available for embedded use. Generally, it's a standalone search engine, meant to provide fast, size-efficient and relevant full-text search functions to other applications. Sphinx was specially designed to integrate well with SQL databases and scripting languages. Currently, built-in data sources support fetching data either via a direct connection to MySQL or PostgreSQL, or using an XML pipe mechanism (a pipe to the indexer in a special XML-based format which Sphinx recognizes).

Syncable tools for the offline web - 0 views

syncwith.us

collaboration database p2p perl

shared by Jack Park on 13 Oct 08 - Cached

Jack Park on 13 Oct 08

A grounded, semirelational, peer to peer replicated, disconnected, versioned, property database with self-healing conflict resolution.

DeepPeep: discover the hidden web - 0 views

www.deeppeep.org

deeppeep search searchengine forms labels databases

shared by Jack Park on 23 Feb 09 - Cached

Jack Park on 23 Feb 09

DeepPeep is a search engine specialized in Web forms. The current beta version tracks 13,000 forms across 7 domains. DeepPeep helps you discover the entry points to content in Deep Web (aka Hidden Web) sites, including online databases and Web services.

Apache CouchDB: The CouchDB Project - 0 views

couchdb.apache.org/index.html

couchdb database RESTful api erlang opensource apache

shared by Jack Park on 07 Jun 09 - Cached

Jack Park on 07 Jun 09

Apache CouchDB is a distributed, fault-tolerant and schema-free document-oriented database accessible via a RESTful HTTP/JSON API. Among other features, it provides robust, incremental replication with bi-directional conflict detection and resolution, and is queryable and indexable using a table-oriented view engine with JavaScript acting as the default view definition language.

Snowden: "Narrative Research" (PDF, 2010) - 3 views

cognitive-edge.com/...e-Research_Snowden%20FINAL.pdf

Snowden Cognitive Edge cognitiveedge research complexity

shared by Stian Danenbarger on 23 Nov 10 - No Cached

Stian Danenbarger on 23 Nov 10

Narrative techniques provide a complementary form of what we will call pre-hypothesis research. Further, the use of narrative research techniques produces, through a single intervention, quantitative conclusions supported by narrative context, fragmented knowledge databases, and a mechanism for measuring impact and more complex issues such as mapping ideation cultures.

Stian Danenbarger on 23 Nov 10

Snowden again... Looks like a fairly interesting book is on its way, as well...?

iGlue beta - 0 views

iglue.com/beta

iglue cms topicmap-like

shared by Jack Park on 23 Mar 09 - Cached

Jack Park on 23 Mar 09

What is iGlue? The right meaning to the right word. iGlue is an online content management application that reorganizes data fragmented on the net. It arranges pictures, videos, people, notions and geographical locations into a unified and manageable structure. Through this collaboratively edited database iGlue shows the content network of any webpage.
