Group items tagged uima - sensemaking

Apache UIMA - Apache UIMA - 0 views

incubator.apache.org/uima

uima nlp unstructured textmining TextMining harvesting discovery opensource apache

shared by Jack Park on 18 Nov 08 - Cached

Jack Park on 18 Nov 08

Unstructured Information Management applications are software systems that analyze large volumes of unstructured information in order to discover knowledge that is relevant to an end user. UIMA is a framework and SDK for developing such applications. An example UIM application might ingest plain text and identify entities, such as persons, places, organizations; or relations, such as works-for or located-at. UIMA enables such an application to be decomposed into components, for example "language identification" -> "language specific segmentation" -> "sentence boundary detection" -> "entity detection (person/place names etc.)". Each component must implement interfaces defined by the framework and must provide self-describing metadata via XML descriptor files. The framework manages these components and the data flow between them. Components are written in Java or C++; the data that flows between components is designed for efficient mapping between these languages. UIMA additionally provides capabilities to wrap components as network services, and can scale to very large volumes by replicating processing pipelines over a cluster of networked nodes.

<div class="cArrow"> </div><div class="cContentInner">Unstructured Information Management applications are software systems that analyze large volumes of unstructured information in order to discover knowledge that is relevant to an end user. UIMA is a framework and SDK for developing such applications. An example UIM application might ingest plain text and identify entities, such as persons, places, organizations; or relations, such as works-for or located-at. UIMA enables such an application to be decomposed into components, for example "language identification" -> "language specific segmentation" -> "sentence boundary detection" -> "entity detection (person/place names etc.)". Each component must implement interfaces defined by the framework and must provide self-describing metadata via XML descriptor files. The framework manages these components and the data flow between them. Components are written in Java or C++; the data that flows between components is designed for efficient mapping between these languages. UIMA additionally provides capabilities to wrap components as network services, and can scale to very large volumes by replicating processing pipelines over a cluster of networked nodes. </div>

...

Cancel

UIMA COMPONENT REPOSITORY - 0 views

uima.lti.cs.cmu.edu/...Welcome.do

uima libraries TextMining knowledge discovery

shared by Jack Park on 27 Apr 09 - Cached

Jack Park on 27 Apr 09

Our goal in creating this site is to provide the basis for a thriving community of UIMA developers who can announce, discuss, design, share, and critique UIMA-compliant components, resources and solutions. The Unstructured Information Management Architecture (UIMA) is a software framework that supports rapid development and deployment of multimodal analytics - applications which provide value by processing human-readable text, audio and/or video in order to extract information, answer questions, summarize documents, etc.

<div class="cArrow"> </div><div class="cContentInner">Our goal in creating this site is to provide the basis for a thriving community of UIMA developers who can announce, discuss, design, share, and critique UIMA-compliant components, resources and solutions. The Unstructured Information Management Architecture (UIMA) is a software framework that supports rapid development and deployment of multimodal analytics - applications which provide value by processing human-readable text, audio and/or video in order to extract information, answer questions, summarize documents, etc.</div>

...

Cancel

alphaWorks : Text Analytics Tools and Runtime for IBM LanguageWare : Overview - 0 views

www.alphaworks.ibm.com/lrw

languageware ibm TextMining uima harvesting discovery

shared by Jack Park on 18 Nov 08 - Cached

Jack Park on 18 Nov 08

IBM® LanguageWare® is a set of run-time libraries and an easy-to-use Eclipse-based development environment for building custom text analyzers in various languages. Deployable in Apache UIMA, these analyzers can expose the information buried in text to any application. The Eclipse-based tools makes creating analyzers simple and fast, even for non-technical users. The tools make it easy to build dictionaries, ontologies, and rules for identifying key information, relationships and meaning.

<div class="cArrow"> </div><div class="cContentInner">IBM® LanguageWare® is a set of run-time libraries and an easy-to-use Eclipse-based development environment for building custom text analyzers in various languages. Deployable in Apache UIMA, these analyzers can expose the information buried in text to any application. The Eclipse-based tools makes creating analyzers simple and fast, even for non-technical users. The tools make it easy to build dictionaries, ontologies, and rules for identifying key information, relationships and meaning. </div>

...

Cancel

GATE, A General Architecture for Text Engineering - 0 views

gate.ac.uk

java nlp opensource ai gate software information_extraction language annotation

shared by Jack Park on 30 Nov 08 - Cached

Jack Park on 30 Nov 08

GATE is... * the Eclipse of Natural Language Engineering, the Lucene of Information Extraction, a leading toolkit for Text Mining * used worldwide by thousands of scientists, companies, teachers and students * comprised of an architecture, a free open source framework (or SDK) and graphical development environment * used for all sorts of language processing tasks, including Information Extraction in many languages * funded by the EPSRC, BBSRC, AHRC, the EU and commercial users * 100% Java reference implementation of ISO TC37/SC4 and used with XCES in the ANC * 10 years old in 2005, used in many research projects and compatible with IBM's UIMA * based on MVC, mobile code, continuous integration, and test-driven development, with code hosted on SourceForge

<div class="cArrow"> </div><div class="cContentInner">GATE is... * the Eclipse of Natural Language Engineering, the Lucene of Information Extraction, a leading toolkit for Text Mining * used worldwide by thousands of scientists, companies, teachers and students * comprised of an architecture, a free open source framework (or SDK) and graphical development environment * used for all sorts of language processing tasks, including Information Extraction in many languages * funded by the EPSRC, BBSRC, AHRC, the EU and commercial users * 100% Java reference implementation of ISO TC37/SC4 and used with XCES in the ANC * 10 years old in 2005, used in many research projects and compatible with IBM's UIMA * based on MVC, mobile code, continuous integration, and test-driven development, with code hosted on SourceForge</div>

...

Cancel

Group items tagged

Apache UIMA - Apache UIMA - 0 views

UIMA COMPONENT REPOSITORY - 0 views

alphaWorks : Text Analytics Tools and Runtime for IBM LanguageWare : Overview - 0 views

GATE, A General Architecture for Text Engineering - 0 views

Related searches