Arquitectura: Group items tagged hive

Pablo Lalloni

shark

  • "Shark is a large-scale data warehouse system for Spark designed to be compatible with Apache Hive. It can execute Hive QL queries up to 100 times faster than Hive without any modification to the existing data or queries. Shark supports Hive's query language, metastore, serialization formats, and user-defined functions, providing seamless integration with existing Hive deployments and a familiar, more powerful option for new ones."
Pablo Lalloni

Shark - Lightning Fast Data Warehouse System

  • "Shark is a large-scale data warehouse system for Spark designed to be compatible with Apache Hive. It can answer Hive QL queries up to 100 times faster than Hive without modification to the existing data or queries. Shark supports Hive's query language, metastore, serialization formats, and user-defined functions."
Pablo Lalloni

Rosetta Code · twitter/scalding Wiki

  • A collection of MapReduce tasks translated (from Pig, Hive, MapReduce streaming, etc.) into Scalding for comparison.
Pablo Lalloni

kevinweil/elephant-bird

  • "Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code."
Pablo Lalloni

elasticsearch/elasticsearch-hadoop

  • "Read and write data to/from Elasticsearch within Hadoop/MapReduce libraries. Automatically converts data to/from JSON. Supports MapReduce, Cascading, Hive and Pig."
Pablo Lalloni

nathanmarz/cascalog · GitHub

  • "Cascalog is a fully-featured data processing and querying library for Clojure or Java. The main use cases for Cascalog are processing 'Big Data' on top of Hadoop or doing analysis on your local computer. Cascalog is a replacement for tools like Pig, Hive, and Cascading and operates at a significantly higher level of abstraction than those tools."