Skip to main content

Home/ sensemaking/ Group items tagged database

Rss Feed Group items tagged

Jack Park

An Architecture and Object Model for Distributed Object-Oriented Real-Time Databases - ... - 0 views

  •  
    The confluence of computers, communications, and databases is quickly creating a distributed database where many applications require real-time access to both temporally accurate and multimedia data. This is particularly true in military and intelligence applications, but these required features are needed in many commercial applications as well. We are developing a distributed database, called BeeHive, which could offer features along different types of requirements: real-time, fault-tolerance, security, and quality-of service for audio and video. Support of these features and potential trade-offs between them could provide a significant improvement in performance and functionality over current distributed database and object management systems. In this paper, we present a high level design for BeeHive architecture and sketch the design of the BeeHive Object Model (BOM) which extends object-oriented data models by incorporating time and other features into objects, resulting in a highly reflective architecture.
Jack Park

Open Data Commons » Open Database Licence (ODbL) - 0 views

  •  
    This {DATA(BASE)-NAME} is made available under the Open Database License: http://opendatacommons.org/licenses/odbl/1.0/. Any rights in individual contents of the database are licensed under the Database Contents License: http://opendatacommons.org/licenses/dbcl/1.0/
Jack Park

Carrollogos: Copyright in Databases - 0 views

  •  
    I'm going to have more to say about data, databases, and intellectual property rights in the coming months. This longish post provides a basic primer on how U.S. copyright law applies to databases.
Jack Park

HCLSIG BioRDF Subgroup/aTags - ESW Wiki - 1 views

  •  
    # The primary intention of creating aTags is not the categorization of the document, but the representation of the key facts inside the document. Key facts in the biomedical domain might be, for example, "Protein A interacts with protein B" or "Overexpression of protein A in tissue B is the cause of disease C". # An aTag is comprised of a set of associated entities. The size of the set is arbitrary, but will typically lie between 2 and 5 entities. For example, the fact "Protein A binds to protein B" can be represented with an aTag comprising of the three entities "Protein A", "Molecular interaction" and "Protein B". Similarly, the fact "Overexpression of protein A in tissue B is the cause of disease C" can be represented with an aTag comprising of the four entities "Overexpression", "Protein A", "Tissue B" and "Disease C". # Each document or database entry can be described with an arbitrary number of such aTags. Each aTag can be associated with the relevant portions of text or data in a fine granularity. # The entities in an aTag are not simple strings, but resources that are part of ontologies and RDF/OWL-enabled databases. For example, "Protein A" and "Protein B" are resources that are defined in the UniProt database, whereas "Molecular Interaction" is a class in the branch of biological processes of the Gene Ontology. They are identified with their URIs.
Jack Park

triplify.org : About - 0 views

  •  
    Triplify is based on the definition of relational database queries for a specific Web application in order to retrieve valuable information and to convert the results of these queries into RDF, JSON and Linked Data. Experiences showed that for most web-applications a relatively small number of queries (mostly between 3-7) is sufficient to extract the important information. After generating such database views the Triplify software can be used to convert the view into an RDF, JSON or Linked Data representation, which can be shared and accessed on the (Semantic) Web.
Jack Park

PATIKA Project Web site - 0 views

  •  
    This is the homepage for an ongoing research and development project named PATIKA - Pathway Analysis Tools for Integration and Knowledge Acquisition. Within this project so far, among others, an ontology has been defined; a pathway database (which integrates and interfaces with several public pathway databases) has been constructed; and some software tools have been developed for effective integration, querying, analysis, and manipulation of pathway data.
Jack Park

Official Google Research Blog: Google Fusion Tables - 0 views

  •  
    Database systems are notorious for being hard to use. It is even more difficult to integrate data from multiple sources and collaborate on large data sets with people outside your organization. Without an easy way to offer all the collaborators access to the same server, data sets get copied, emailed and ftp'd--resulting in multiple versions that get out of sync very quickly. Today we're introducing Google Fusion Tables on Labs, an experimental system for data management in the cloud. It draws on the expertise of folks within Google Research who have been studying collaboration, data integration, and user requirements from a variety of domains. Fusion Tables is not a traditional database system focusing on complicated SQL queries and transaction processing. Instead, the focus is on fusing data management and collaboration: merging multiple data sources, discussion of the data, querying, visualization, and Web publishing. We plan to iteratively add new features to the systems as we get feedback from users.
Jack Park

CrunchBase, The Free Tech Company Database - 0 views

  •  
    CrunchBase is the free database of technology companies, people, and investors that anyone can edit.
Jack Park

List of bioinformatics databases - Biohack - 0 views

  •  
    List of bioinformatics databases
Jack Park

Welcome to FlyTED - 0 views

  •  
    FlyTED, the Drosophila Testis Gene Expression Database, is a public database currently containing 1,947 mRNA in situ images and ancillary data revealing the extent of expression of 623 individual genes involved in spermatogenesis in the testis of the fruitfly, Drosophila melanogaster, both in normal wild type flies and in seven meiotic arrest mutant strains.
Jack Park

The Trade & Environment Database - 0 views

  •  
    The Trade & Environment Database (TED) is a collection of categorical case studies that began with a focus on solely environmental issues, but did not include the economic consequences of other social policy choices, such as culture, rights, or other issues.
Jack Park

A Lightweight SQL Database for Cloud and Web in Launchpad - 0 views

  •  
    The Drizzle project is building a database optimized for Cloud and Net applications. It is being designed for massive concurrency on modern multi-cpu/core architecture. The code is originally derived from MySQL.
Jack Park

Technology Review: A Web Spider for Everyone - 1 views

  •  
    A user can start a Web crawl through 80legs's Web-based interface. The form on the company's site lets them set parameters for the project and upload custom code needed to control how the crawler does its job. For example, a user might want the crawler to find images and check them against a database of copyrighted ones. Deysarkar says his company's crawlers are capable of processing up to two billion pages a day. The company charges $2 for every million pages crawled, plus a fee of three cents per hour of processing used.
  •  
    A user can start a Web crawl through 80legs's Web-based interface. The form on the company's site lets them set parameters for the project and upload custom code needed to control how the crawler does its job. For example, a user might want the crawler to find images and check them against a database of copyrighted ones. Deysarkar says his company's crawlers are capable of processing up to two billion pages a day. The company charges $2 for every million pages crawled, plus a fee of three cents per hour of processing used.
Jack Park

mysqlicious - Google Code - 0 views

  •  
    MySQLicious provides automated mirroring/backups of Delicious bookmarks into a MySQL database.
Jack Park

Sphinx - Free open-source SQL full-text search engine - 0 views

  •  
    Sphinx is a full-text search engine, distributed under GPL version 2. Commercial license is also available for embedded use. Generally, it's a standalone search engine, meant to provide fast, size-efficient and relevant fulltext search functions to other applications. Sphinx was specially designed to integrate well with SQL databases and scripting languages. Currently built-in data sources support fetching data either via direct connection to MySQL or PostgreSQL, or using XML pipe mechanism (a pipe to indexer in special XML-based format which Sphinx recognizes).
Jack Park

Syncable tools for the offline web - 0 views

  •  
    A grounded, semirelational, peer to peer replicated, disconnected, versioned, property database with self-healing conflict resolution.
Jack Park

DeepPeep: discover the hidden web - 0 views

  •  
    DeepPeep is a search engine specialized in Web forms. The current beta version tracks 13,000 forms across 7 domains. DeepPeep helps you discover the entry points to content in Deep Web (aka Hidden Web) sites, including online databases and Web services.
Jack Park

Apache CouchDB: The CouchDB Project - 0 views

  •  
    Apache CouchDB is a distributed, fault-tolerant and schema-free document-oriented database accessible via a RESTful HTTP/JSON API. Among other features, it provides robust, incremental replication with bi-directional conflict detection and resolution, and is queryable and indexable using a table-oriented view engine with JavaScript acting as the default view definition language.
Stian Danenbarger

Snowden: "Narrative Research" (PDF, 2010) - 3 views

  •  
    Narrative techniques both provide a complementary form of what we will call pre-hypothesis research, but further that the use of narrative research techniques produces, through a single intervention, quantitative conclusions supported by narrative context, fragmented knowledge databases, and a mechanism for measuring impact and more complex issues such as mapping ideation cultures.
  •  
    Snowden again... Looks like a fairly interesting book is on its way, as well...?
Jack Park

iGlue beta - 0 views

  •  
    What is iGlue? The right meaning to the right word. iGlue is an online content management application that reorganizes data fragmented on the net. It arranges pictures, videos, people, notions and geographical locations into a unified and manageable structure. Through this collaboratively edited database iGlue shows the content network of any webpage.
1 - 20 of 45 Next › Last »
Showing 20 items per page