Skip to main content

Home/ HealthcareMetadata/ Group items tagged hadoop

Rss Feed Group items tagged

Malcolm McRoberts

Building a Hadoop Data Warehouse: Hadoop 101 for Enterprise Data Warehouse Professionals - 0 views

  • Dr. Kimball explains how Hadoop can be both: A destination data warehouse, and also An efficient staging and ETL source for an existing data warehouse
  • Building a Hadoop Data Warehouse: Hadoop 101 for EDW Professionals Dr. Ralph Kimball explains how Hadoop can be both a destination data warehouse, and also an efficient staging and ETL source for an existing data warehouse. Learn how enterprise conformed dimensions can be used as the basis for integrating Hadoop and conventional data warehouses.
    • Malcolm McRoberts
       
      Can't view this using IE from inside Harris. Use FF or try from home.
Malcolm McRoberts

Integrating R with Cloudera Impala for Real-Time Queries on Hadoop | BigHadoop - 0 views

  • R is one of the most popular open source statistical computing and graphical software. It can work with various data sources from comma separated files to web contents referred by URLs to relational databases to NoSQL (e.g. MongoDB or Cassandra) and Hadoop.
Malcolm McRoberts

Introducing Parquet: Efficient Columnar Storage for Apache Hadoop | Cloudera Developer ... - 0 views

  • Parquet is designed to bring efficient columnar storage to Hadoop. Compared to, and learning from, the initial work done toward this goal in Trevni, Parquet includes the following enhancements:
Malcolm McRoberts

Big Analytics For Hadoop and EDWs | Revolution Analytics - 0 views

  • Revolution R Enterprise transparently runs R analytics inside Hadoop and Teradata EDWs, providing your team with:
Malcolm McRoberts

Sentry - 0 views

  • Sentry Open Source, Fine-Grained Access Control for your Enterprise Data Hub Apache Sentry (incubating) is the next step in enterprise-grade big data security and delivers fine-grained authorization to data stored in Apache Hadoop
  • Improved Regulatory Compliance – Business teams can leverage the power of Hadoop while aligning with regulatory mandates like HIPAA, SOX, and PCI.
  • Role-Based Administration – Database administrators can unlock key role-based access control (RBAC) requirements and define what users and applications can do with data within a server, database, table, or view.
Malcolm McRoberts

Incubation Status Template - Apache Incubator - 0 views

  • Sentry is a highly modular system for providing fine grained role based authorization to both data and metadata stored on an Apache Hadoop cluster.
Malcolm McRoberts

How-to: Do Statistical Analysis with Impala and R | Cloudera Developer Blog - 0 views

  • To meet that goal, we have created a new R package, RImpala, which connects Impala to R. RImpala enables querying the data residing in HDFS and Apache HBase from R, which can be further processed as an R object using R functions. RImpala is now available for download from the Comprehensive R Archive Network (CRAN) under GNU General Public License (GPL3).
1 - 13 of 13
Showing 20 items per page