Skip to main content

Home/ HealthcareMetadata/ Contents contributed and discussions participated by Malcolm McRoberts

Contents contributed and discussions participated by Malcolm McRoberts

Malcolm McRoberts

Building a Hadoop Data Warehouse: Hadoop 101 for Enterprise Data Warehouse Professionals - 0 views

  • Dr. Kimball explains how Hadoop can be both: A destination data warehouse, and also An efficient staging and ETL source for an existing data warehouse
  • Building a Hadoop Data Warehouse: Hadoop 101 for EDW Professionals Dr. Ralph Kimball explains how Hadoop can be both a destination data warehouse, and also an efficient staging and ETL source for an existing data warehouse. Learn how enterprise conformed dimensions can be used as the basis for integrating Hadoop and conventional data warehouses.
    • Malcolm McRoberts
       
      Can't view this using IE from inside Harris. Use FF or try from home.
Malcolm McRoberts

Fact Tables - Kimball Group - 0 views

  • Fact tables are the foundation of the data warehouse. They contain the fundamental measurements of the enterprise, and they are the ultimate target of most data warehouse queries.
  • The grain is the business definition of what a single fact table record represents.
  • the grain is the description of the measurement event in the physical world that gives rise to a measurement.
Malcolm McRoberts

Big Data Analytics: Descriptive Vs. Predictive Vs. Prescriptive - InformationWeek - 0 views

  • In any big data setup, the first step is to capture lots of digital information, "which there's no shortage of
  • The purpose of descriptive analytics is to summarize what happened. Wu estimated that more than 80% of business analytics -- most notably social analytics -- are descriptive.
  • In the most general cases of predictive analytics, "you basically take data that you have to predict data you don't have,"
  • ...2 more annotations...
  • "Prescriptive analytics is a type of predictive analytics," Wu said. "It's basically when we need to prescribe an action, so the business decision-maker can take this information and act."
  • In addition, prescriptive analytics requires a predictive model with two additional components: actionable data and a feedback system that tracks the outcome produced by the action taken.
Malcolm McRoberts

Data mapping - Wikipedia, the free encyclopedia - 0 views

  • In computing and data management, data mapping is the process of creating data element mappings between two distinct data models. Data mapping is used as a first step for a wide variety of data integration
Malcolm McRoberts

Schema crosswalk - Wikipedia, the free encyclopedia - 0 views

  • A Schema crosswalk is a table that shows equivalent elements (or "fields") in more than one database schema. It maps the elements in one schema to the equivalent elements in another schema.
  • This type of "translating" from one format to another is often called "metadata mapping" or "field mapping," and is related to "data mapping," and "semantic mapping."
Malcolm McRoberts

RStudio - Home - 0 views

shared by Malcolm McRoberts on 29 Apr 14 - No Cached
  • Powerful IDE for R RStudio IDE is a powerful and productive user interface for R. It’s free and open source, and works great on Windows, Mac, and Linux.
  • Web framework for R Shiny is an elegant and powerful web framework for building interactive reports and visualizations using R — with or without web development skills.
  • Open source R packages Our developers and expert trainers are the authors of several popular R packages, including ggplot2, plyr, lubridate, and others.
Malcolm McRoberts

How-to: Do Statistical Analysis with Impala and R | Cloudera Developer Blog - 0 views

  • To meet that goal, we have created a new R package, RImpala, which connects Impala to R. RImpala enables querying the data residing in HDFS and Apache HBase from R, which can be further processed as an R object using R functions. RImpala is now available for download from the Comprehensive R Archive Network (CRAN) under GNU General Public License (GPL3).
Malcolm McRoberts

Big Analytics For Hadoop and EDWs | Revolution Analytics - 0 views

  • Revolution R Enterprise transparently runs R analytics inside Hadoop and Teradata EDWs, providing your team with:
Malcolm McRoberts

Integrating R with Cloudera Impala for Real-Time Queries on Hadoop | BigHadoop - 0 views

  • R is one of the most popular open source statistical computing and graphical software. It can work with various data sources from comma separated files to web contents referred by URLs to relational databases to NoSQL (e.g. MongoDB or Cassandra) and Hadoop.
Malcolm McRoberts

Sentry - 0 views

  • Sentry Open Source, Fine-Grained Access Control for your Enterprise Data Hub Apache Sentry (incubating) is the next step in enterprise-grade big data security and delivers fine-grained authorization to data stored in Apache Hadoop
  • Improved Regulatory Compliance – Business teams can leverage the power of Hadoop while aligning with regulatory mandates like HIPAA, SOX, and PCI.
  • Role-Based Administration – Database administrators can unlock key role-based access control (RBAC) requirements and define what users and applications can do with data within a server, database, table, or view.
« First ‹ Previous 81 - 100 of 245 Next › Last »
Showing 20 items per page