Group items matching "spark" in title, tags, annotations or url - Arquitectura?

andypetrella/spark-notebook - 0 views

github.com/...spark-notebook

development data-science bigdata spark tools

shared by Pablo Lalloni on 05 Oct 15 - No Cached

Pablo Lalloni on 05 Oct 15

"The main intent of this tool is to create reproducible analysis using Scala, Apache Spark and more. This is achieved through an interactive web-based editor that can combine Scala code, SQL queries, Markup or even JavaScript in a collaborative manner. The usage of Spark comes out of the box, and is simply enabled by the implicit variable named SparkContext. You should also check the website, http://Spark-notebook.io."

dnafrance/vagrant-hadoop-spark-cluster - 0 views

github.com/...vagrant-hadoop-spark-cluster

development spark vagrant tools

shared by Pablo Lalloni on 07 Jan 15 - No Cached

Pablo Lalloni on 07 Jan 15

"Vagrant project to spin up a cluster of 4 32-bit CentOS6.5 Linux virtual machines with Hadoop v2.6.0 and Spark v1.1.1"

Apache Spark: 100 terabytes (TB) of data sorted in 23 minutes | Opensource.com - 1 views

opensource.com/...apache-spark-new-world-record

spark data

shared by Sebastián Zaffarano on 20 Jan 15 - No Cached

Pablo Lalloni liked it

Sebastián Zaffarano on 20 Jan 15

"In October 2014, Databricks participated in the Sort Benchmark and set a new world record for sorting 100 terabytes (TB) of data, or 1 trillion 100-byte records. The team used Apache Spark on 207 EC2 virtual machines and sorted 100 TB of data in 23 minutes."

Akka, Spark or Kafka? Selecting The Right Streaming Engine For the Job - 1 views

info.lightbend.com/gine-for-the-job-register.html

architecture akka spark kafka data-streaming streaming fast-data fastdata stream-processing

shared by Pablo Lalloni on 29 Mar 18 - No Cached

Spark Release 0.5.0 - 1 views

www.spark-project.org/release-0.5.0.html

spark release cloud-computing grid-computing parallel clustering development

shared by Pablo Lalloni on 13 Jun 12 - No Cached

shark - 0 views

github.com/wiki

development programming bigdata big-data distributed-computing cloud-computing spark hive

shared by Pablo Lalloni on 05 Aug 13 - No Cached

Pablo Lalloni on 05 Aug 13

"Shark is a large-scale data warehouse system for Spark designed to be compatible with Apache Hive. It can execute Hive QL queries up to 100 times faster than Hive without any modification to the existing data or queries. Shark supports Hive's query language, metastore, serialization formats, and user-defined functions, providing seamless integration with existing Hive deployments and a familiar, more powerful option for new ones."

Shark - Lightning Fast Data Warehouse System - 0 views

shark.cs.berkeley.edu

hive spark bigdata hadoop warehouse data development tools cloud-computing distributed-computing infrastructure

shared by Pablo Lalloni on 04 Jun 13 - No Cached

Pablo Lalloni on 04 Jun 13

"Shark is a large-scale data warehouse system for Spark designed to be compatible with Apache Hive. It can answer Hive QL queries up to 100 times faster than Hive without modification to the existing data nor queries. Shark supports Hive's query language, metastore, serialization formats, and user-defined functions."

Apache Spark and the Typesafe Reactive Platform: A Match Made in Heaven - 1 views

typesafe.com/...latform-a-match-made-in-heaven

apache spark scala development programming bigdata mapreduce cloud-computing java python distributed-computing

shared by Pablo Lalloni on 11 Aug 14 - No Cached

Cloud native Data with Spark 2.3 and Kubernetes - Tim Park - Medium - 0 views

medium.com/...-3-and-kubernetes-938b04d0da57

spark kubernetes distributed-computing containerization architecture development

shared by Pablo Lalloni on 12 Apr 18 - No Cached

Spark Cluster Computing - 1 views

www.spark-project.org

cloud-computing scala distributed-computing jvm

shared by Pablo Lalloni on 16 Jul 11 - No Cached

Ferry | Big Data Development Environment Using Docker - 0 views

ferry.opencore.io/...index.html

development programming hadoop tools docker

shared by Pablo Lalloni on 05 Nov 14 - No Cached

Pablo Lalloni on 05 Nov 14

"Ferry helps you create big data clusters on your local machine. Define your big data stack using YAML and share your application with Dockerfiles. Ferry supports Hadoop, Cassandra, Spark, GlusterFS, and Open MPI."

Ferry | Big Data Development Environment Using Docker - 0 views

ferry.opencore.io/latest

development cloud-computing infrastructure devops big-data

shared by Pablo Lalloni on 05 Sep 14 - No Cached

Pablo Lalloni on 05 Sep 14

"Ferry helps you create big data clusters on your local machine. Define your big data stack using YAML and share your application with Dockerfiles. Ferry supports Hadoop, Cassandra, Spark, GlusterFS, and Open MPI."

Spark Just Passed Hadoop in Popularity on the Web--Here's Why - Prismatic - 0 views

prsm.tc/dxZZEO

hadoop spark big-data distributed-computing development infrastructure programming scala java

shared by Pablo Lalloni on 30 Nov 14 - No Cached

Big Data Exploration, Visualization, Analytics - 0 views

www.zoomdata.com

data-visualization big-data real-time data-analysis tools spark hadoop elasticsearch mongodb oracle

shared by Pablo Lalloni on 11 Apr 15 - No Cached
