Group items tagged entities - sensemaking

HCLSIG BioRDF Subgroup/aTags - ESW Wiki - 1 views

shared by Jack Park on 06 Jan 09 - Cached

Jack Park on 06 Jan 09

# The primary intention of creating aTags is not the categorization of the document, but the representation of the key facts inside the document. Key facts in the biomedical domain might be, for example, "Protein A interacts with protein B" or "Overexpression of protein A in tissue B is the cause of disease C".
# An aTag comprises a set of associated entities. The size of the set is arbitrary, but will typically lie between 2 and 5 entities. For example, the fact "Protein A binds to protein B" can be represented with an aTag comprising the three entities "Protein A", "Molecular interaction" and "Protein B". Similarly, the fact "Overexpression of protein A in tissue B is the cause of disease C" can be represented with an aTag comprising the four entities "Overexpression", "Protein A", "Tissue B" and "Disease C".
# Each document or database entry can be described with an arbitrary number of such aTags. Each aTag can be associated with the relevant portions of text or data at a fine granularity.
# The entities in an aTag are not simple strings, but resources that are part of ontologies and RDF/OWL-enabled databases. For example, "Protein A" and "Protein B" are resources that are defined in the UniProt database, whereas "Molecular Interaction" is a class in the branch of biological processes of the Gene Ontology. They are identified with their URIs.
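
A minimal sketch of the idea described above, assuming an aTag can be modelled as an unordered set of entity URIs plus an optional pointer to the annotated text. The class name `ATag`, the helper functions, and the `example.org` URIs are hypothetical placeholders, not the representation used by the BioRDF group or real UniProt / Gene Ontology identifiers.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ATag:
    """Hypothetical model of an aTag: a set of associated entity URIs."""
    entities: frozenset  # URIs of the associated entities
    source: str = ""     # optional pointer to the annotated text portion


def make_atag(*uris, source=""):
    """Build an aTag from any number of entity URIs."""
    return ATag(frozenset(uris), source)


def atags_mentioning(atags, uri):
    """Return the aTags whose entity set contains the given URI."""
    return [tag for tag in atags if uri in tag.entities]


# Placeholder URIs; real aTags would point at ontology or database resources.
PROTEIN_A = "http://example.org/uniprot/ProteinA"
PROTEIN_B = "http://example.org/uniprot/ProteinB"
INTERACTION = "http://example.org/go/MolecularInteraction"

# "Protein A binds to protein B" expressed as three associated entities.
binding = make_atag(PROTEIN_A, INTERACTION, PROTEIN_B, source="doc1#sentence3")
```

Because the entities are URIs rather than strings, two aTags created by different people can be compared or merged by simple set operations.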

Jigsaw Page - 0 views

www.cc.gatech.edu/jigsaw jigsaw java visual sensemaking visualisation visualization software

shared by Jack Park on 12 Jan 09 - Cached

Jack Park on 12 Jan 09

Jigsaw provides a collection of visualizations that each portray different aspects of the documents. We particularly focus on presenting the identifiable important entities (people, places, organizations, etc.) and their direct or indirect connections. Textual processing extracts the important entities from the documents and then the visualizations help an analyst to explore the relationships and connections among the entities. The system includes graph, calendar, scatterplot and tabular connections-based views, as well as views of individual documents' text and the report collections as a whole. Jigsaw essentially acts as a visual index onto the document collection, helping analysts identify particular documents to read and examine next.
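
As a rough illustration of "connections among the entities", the sketch below assumes that two entities are connected when they appear in the same document. This is an assumption made for the example, not a description of how Jigsaw itself computes connections; the function name and the sample data are invented.

```python
from collections import defaultdict
from itertools import combinations


def build_connections(doc_entities):
    """Map each pair of entities to the set of documents mentioning both."""
    connections = defaultdict(set)
    for doc_id, entities in doc_entities.items():
        # Sort so that every pair has one canonical (a, b) ordering.
        for a, b in combinations(sorted(set(entities)), 2):
            connections[(a, b)].add(doc_id)
    return connections


# Invented sample: entities already extracted from two reports.
docs = {
    "report1": ["Alice", "Atlanta"],
    "report2": ["Alice", "Bob", "Atlanta"],
}
connections = build_connections(docs)
```

Looking up a pair then works as a small index into the collection: `connections[("Alice", "Atlanta")]` lists the reports an analyst might read next.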

OYSTER: A configurable, open-source entity resolution engine in Java - 1 views

identityresolutiondaily.com/...yster-a-configurable-er-engine Talburt identity entity resolution Java opensource

shared by Stian Danenbarger on 09 Feb 11 - No Cached

Welcome to the web site of the OKKAM Large-Scale Integrating Project (GA#215032) - The ... - 0 views

www.okkam.org collections data entities information integration of okkam web

shared by Jack Park on 31 Aug 08 - Cached

Apache UIMA - Apache UIMA - 0 views

incubator.apache.org/uima uima nlp unstructured textmining TextMining harvesting discovery opensource apache

shared by Jack Park on 18 Nov 08 - Cached

Jack Park on 18 Nov 08

Unstructured Information Management applications are software systems that analyze large volumes of unstructured information in order to discover knowledge that is relevant to an end user. UIMA is a framework and SDK for developing such applications. An example UIM application might ingest plain text and identify entities, such as persons, places, organizations; or relations, such as works-for or located-at. UIMA enables such an application to be decomposed into components, for example "language identification" -> "language specific segmentation" -> "sentence boundary detection" -> "entity detection (person/place names etc.)". Each component must implement interfaces defined by the framework and must provide self-describing metadata via XML descriptor files. The framework manages these components and the data flow between them. Components are written in Java or C++; the data that flows between components is designed for efficient mapping between these languages. UIMA additionally provides capabilities to wrap components as network services, and can scale to very large volumes by replicating processing pipelines over a cluster of networked nodes.
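
The decomposition into components can be sketched in plain Python. This is not the UIMA API (which is Java/C++ and uses XML descriptors); it only assumes that each component reads and extends a shared analysis record, and every function name, the tiny gazetteer, and the sample text are hypothetical.

```python
import re

# Hypothetical gazetteer standing in for a real entity detector.
KNOWN_PEOPLE = {"Ada Lovelace"}


def detect_language(doc):
    """Stub component: a real one would inspect the text."""
    doc["language"] = "en"
    return doc


def split_sentences(doc):
    """Split on whitespace that follows sentence-ending punctuation."""
    parts = re.split(r"(?<=[.!?])\s+", doc["text"])
    doc["sentences"] = [p.strip() for p in parts if p.strip()]
    return doc


def detect_entities(doc):
    """Tag known person names found in any sentence."""
    doc["entities"] = [
        (name, "person")
        for sentence in doc["sentences"]
        for name in sorted(KNOWN_PEOPLE)
        if name in sentence
    ]
    return doc


def run_pipeline(text, components):
    """Pass one shared record through the components in order."""
    doc = {"text": text}
    for component in components:
        doc = component(doc)
    return doc


result = run_pipeline(
    "Ada Lovelace wrote programs. She worked in London.",
    [detect_language, split_sentences, detect_entities],
)
```

Each stage depends only on what earlier stages wrote into the record, which is what lets components be swapped or reordered independently.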

| KNOWLEDGE VILLAGE - HOME | - 0 views

www.kv.ae/default.asp dubai knowledge village

shared by Jack Park on 01 Aug 08 - Cached

Yago - A Core of Semantic Knowledge - 0 views

www.mpi-inf.mpg.de/...yago yago wordnet wikipedia ontology semantic semanticweb

shared by Jack Park on 30 Dec 08 - Cached

KIM Platform - 0 views

ontotext.com/kim kim gate annotation indexing co-occurrence timelie

shared by Jack Park on 30 Nov 08 - Cached

DallasWorkshop - NCBO Wiki - 0 views

bioontology.org/...DallasWorkshop clinicalresearch ncbo ontology slides workshop

shared by Jack Park on 05 Sep 08 - Cached

Jack Park on 05 Sep 08

The aims of clinical and translational research are to achieve a better understanding of the pathogenesis of human disease in order to develop effective diagnostic, therapeutic and prevention strategies. Biomedical informatics can play an important role in supporting this research by facilitating the management, integration, analysis and exchange of data derived from and related to the research problems being studied. A key aspect of this support is to bring clarity, rigor and formalism to the representation of 1. disease initiation, progression, pathogenesis, signs, symptoms, assessments, clinical and laboratory findings, disease diagnosis, treatment, treatment response and outcome, and 2. the interrelations between these distinct entities both in patient management and in clinical research, thus allowing the data to be more readily retrievable and shareable, and more able to serve in the support of algorithmic reasoning.

Semantic Search: The Myth and Reality - ReadWriteWeb - 0 views

www.readwriteweb.com/...earch_the_myth_and_reality.php readwriteweb search semantic semanticweb

shared by Jack Park on 27 Sep 08 - Cached

Alchemy - Open Source AI - 0 views

alchemy.cs.washington.edu alchemy markovlogic software opensource c++ knowledge discovery TextMining

shared by Jack Park on 15 Jan 09 - Cached

collection sensemaking [interface ecology lab | research] - 0 views

ecologylab.cs.tamu.edu/...collectionSensemaking.html collections games sensemaking

shared by Jack Park on 24 Aug 08 - Cached

Jack Park on 24 Aug 08

Sensemaking is the process through which humans put together understanding of related information. Sensemaking has been said to involve changes in cognitive representations during a human information processing task. Collection sensemaking involves understanding a collection of media entities, as a whole. One example of a sensemaking task is to compare the damage from Hurricane Katrina to homes, personal effects, and community buildings in different areas of New Orleans. Connected visual and semantic representations provide perspective to support users involved in collection sensemaking tasks. A zoomable map organizes images based on location at varying scales. Multiscale clusters based on zoom level organize images associated with events. The clusters afford contextualized thumbnail browsing and also maintain uniform information density on the map. Metadata enhances context and memory in the process of collection sensemaking.

Black: "Creating a Common Ground for URI Meaning Using Socially Constructed Web sites" ... - 2 views

www.ibiblio.org/...jblack.pdf Black uri semantic semantic web identity social PSI

shared by Stian Danenbarger on 10 Apr 10 - No Cached

Stian Danenbarger on 10 Apr 10

"The semantic web proposes to inject machine meaningful data into the existing human language oriented web. As part of this effort, on the semantic web, URIs are used to identify entities. But there is currently no standard way to specify what it is that any given URI is to identify, or to whom, or when. Recent work in linguistics offers ideas for a solution to this lack. It focuses on the pragmatics of actual language use among ensembles of people. Also, the World Wide Web provides a set of technologies, in the form of socially constructed web sites, that could be employed to provide a solution. In this paper, I suggest how such socially constructed web sites could be used to address the problem of establishing common ground among a community of machines of the referent of a URI used on the semantic web. The result is a proposal to automate social meaning by creating societies of machines that share knowledge representations identified by URIs."

Stian Danenbarger on 10 Apr 10

What tagging does point to convincingly is the social aspect of naming. In a given natural language, many sorts of identifiers, such as common words, are socially centralized. Other sorts of identifiers, such as proper names, are socially decentralized, varying from local context to local context. Black has noticed a correspondence between this socially grounded identification process and the use of socially constructed Web sites.

YAGO-NAGA - D5: Databases and Information Systems (Max-Planck-Institut für In... - 0 views

www.mpi-inf.mpg.de/yago-naga yago naga search semantic opensource knowledge discovery wordnet wikipedia

shared by Jack Park on 27 Apr 09 - Cached

Jack Park on 27 Apr 09

The YAGO-NAGA project started in 2006 with the goal of building a conveniently searchable, large-scale, highly accurate knowledge base of common facts in a machine-processible representation. We have already harvested knowledge about millions of entities and facts about their relationships, from Wikipedia and WordNet with careful integration of these two sources. The resulting knowledge base, coined YAGO, has very high precision and is freely available. The facts are represented as RDF triples, and we have developed methods and prototype systems for querying, ranking, and exploring knowledge. Our search engine NAGA provides ranked answers to queries based on statistical models.
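
To make "facts represented as RDF triples" concrete, here is a small sketch that stores facts as (subject, predicate, object) tuples and answers pattern queries with wildcards. The triples and relation names are illustrative only and are not taken from the YAGO data set; the `match` function is a hypothetical helper and does not reflect NAGA's statistical ranking.

```python
# Illustrative facts as (subject, predicate, object) tuples.
TRIPLES = [
    ("Albert_Einstein", "bornIn", "Ulm"),
    ("Albert_Einstein", "type", "physicist"),
    ("Ulm", "locatedIn", "Germany"),
]


def match(triples, s=None, p=None, o=None):
    """Return triples matching the pattern; None acts as a wildcard."""
    return [
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]


# All facts about one subject, and all "bornIn" facts.
about_einstein = match(TRIPLES, s="Albert_Einstein")
births = match(TRIPLES, p="bornIn")
```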

GoodRelations Ontology - 0 views

www.heppnetz.de/...v1 ontology owl business GoodRelations opensource

shared by Jack Park on 08 Apr 09 - No Cached

Jack Park on 08 Apr 09

The GoodRelations ontology provides the vocabulary for annotating e-commerce offerings (1) to sell, lease, repair, dispose, and maintain commodity products and (2) to provide commodity services. GoodRelations allows describing the relationship between (1) Web resources, (2) offerings made by those Web resources, (3) legal entities, (4) prices, (5) terms and conditions, and the aforementioned ontologies for products and services (6). For more information, see http://purl.org/goodrelations/ Note: The base URI of GoodRelations has changed to http://purl.org/goodrelations/v1. Please make sure you are only using element identifiers in this namespace, e.g. http://purl.org/goodrelations/v1#BusinessEntity. T
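
The namespace note above can be checked mechanically. The sketch below assumes element identifiers are formed by appending `#` and a local name to the base URI, as in the `BusinessEntity` example; the helper names are hypothetical and the check is a plain string comparison, not an RDF-aware validation.

```python
GR_BASE = "http://purl.org/goodrelations/v1"
GR_NS = GR_BASE + "#"  # assumed form: base URI, then "#", then a local name


def in_goodrelations_namespace(uri):
    """True if the URI is an element identifier under the v1 base URI."""
    return uri.startswith(GR_NS) and len(uri) > len(GR_NS)


def local_name(uri):
    """Return the part after '#', or None if the URI is outside the namespace."""
    if not in_goodrelations_namespace(uri):
        return None
    return uri[len(GR_NS):]


example = "http://purl.org/goodrelations/v1#BusinessEntity"
```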

Booker: "Identity Resolution in Criminal Justice Data: An Application of NORA and SUDA"... - 0 views

www.mirlabs.org/booker.pdf Booker identity mining entity disambiguation conceptual spaces Jonas

shared by Stian Danenbarger on 04 Dec 09 - No Cached

dianne cipollo liked it

Stian Danenbarger on 04 Dec 09

Identifying aliases is an important component of the criminal justice system. Accurately identifying a person of interest or someone who has been arrested can significantly reduce the costs within the entire criminal justice system. This paper examines the problem domain of matching and relating identities, examines traditional approaches to the problem, and applies the identity resolution approach described by Jeff Jonas and relationship awareness to the specific case of client identification for the indigent defense office. The combination of identity resolution and relationship awareness offered improved accuracy in matching identities.

Stian Danenbarger on 04 Dec 09

Further work building on Jeff Jonas' "data finds data", and his article in IEEE Security and Privacy entitled "Threat and Fraud Intelligence, Las Vegas Style".

Jeff Jonas: "Threat and Fraud Intelligence, Las Vegas Style" (IEEE, PDF, 2006) - 0 views

jeffjonas.typepad.com/IEEE.Identity.Resolution.pdf Jonas identity privacy entity resolution semantic reconciliation information information retrieval information systems IEEE

shared by Stian Danenbarger on 14 Dec 09 - No Cached

dianne cipollo liked it
