R is one of the most popular open source statistical computing and graphical software. It can work with various data sources from comma separated files to web contents referred by URLs to relational databases to NoSQL (e.g. MongoDB or Cassandra) and Hadoop.
Integrating R with Cloudera Impala for Real-Time Queries on Hadoop | BigHadoop - 0 views
How-to: Do Statistical Analysis with Impala and R | Cloudera Developer Blog - 0 views
-
To meet that goal, we have created a new R package, RImpala, which connects Impala to R. RImpala enables querying the data residing in HDFS and Apache HBase from R, which can be further processed as an R object using R functions. RImpala is now available for download from the Comprehensive R Archive Network (CRAN) under GNU General Public License (GPL3).
Introducing Parquet: Efficient Columnar Storage for Apache Hadoop | Cloudera Developer ... - 0 views
-
Parquet is designed to bring efficient columnar storage to Hadoop. Compared to, and learning from, the initial work done toward this goal in Trevni, Parquet includes the following enhancements:
Sentry - 0 views
-
Sentry Open Source, Fine-Grained Access Control for your Enterprise Data Hub Apache Sentry (incubating) is the next step in enterprise-grade big data security and delivers fine-grained authorization to data stored in Apache Hadoop
-
Improved Regulatory Compliance – Business teams can leverage the power of Hadoop while aligning with regulatory mandates like HIPAA, SOX, and PCI.
-
Role-Based Administration – Database administrators can unlock key role-based access control (RBAC) requirements and define what users and applications can do with data within a server, database, table, or view.
Building a Hadoop Data Warehouse: Hadoop 101 for Enterprise Data Warehouse Professionals - 0 views
-
Dr. Kimball explains how Hadoop can be both: A destination data warehouse, and also An efficient staging and ETL source for an existing data warehouse
-
Building a Hadoop Data Warehouse: Hadoop 101 for EDW Professionals Dr. Ralph Kimball explains how Hadoop can be both a destination data warehouse, and also an efficient staging and ETL source for an existing data warehouse. Learn how enterprise conformed dimensions can be used as the basis for integrating Hadoop and conventional data warehouses.
-
Big Analytics For Hadoop and EDWs | Revolution Analytics - 0 views
-
Revolution R Enterprise transparently runs R analytics inside Hadoop and Teradata EDWs, providing your team with:
1 - 7 of 7
Showing 20▼ items per page