Skip to main content

/ Future of the Web/ Group items tagged Terabytes spark

Group items tagged

Filter: All | Bookmarks | Topics Simple Middle

Apache Spark: 100 terabytes (TB) of data sorted in 23 minutes | Opensource.com - 0 views

opensource.com/...apache-spark-new-world-record

apache spark Terabytes data sort like tthunder open source

shared by Gonzalo San Gil, PhD. on 16 Jan 15 - No Cached

Gonzalo San Gil, PhD. on 16 Jan 15

"In October 2014, Databricks participated in the Sort Benchmark and set a new world record for sorting 100 terabytes (TB) of data, or 1 trillion 100-byte records. The team used Apache Spark on 207 EC2 virtual machines and sorted 100 TB of data in 23 minutes."

<div class="cArrow"> </div><div class="cContentInner">"In October 2014, Databricks participated in the Sort Benchmark and set a new world record for sorting 100 terabytes (TB) of data, or 1 trillion 100-byte records. The team used Apache Spark on 207 EC2 virtual machines and sorted 100 TB of data in 23 minutes."</div>

...

Cancel
Gonzalo San Gil, PhD. on 16 Jan 15

"In October 2014, Databricks participated in the Sort Benchmark and set a new world record for sorting 100 terabytes (TB) of data, or 1 trillion 100-byte records. The team used Apache Spark on 207 EC2 virtual machines and sorted 100 TB of data in 23 minutes."

<div class="cArrow"> </div><div class="cContentInner">"In October 2014, Databricks participated in the Sort Benchmark and set a new world record for sorting 100 terabytes (TB) of data, or 1 trillion 100-byte records. The team used Apache Spark on 207 EC2 virtual machines and sorted 100 TB of data in 23 minutes."</div>

...

Cancel

Startup Crunches 100 Terabytes of Data in a Record 23 Minutes | WIRED - 0 views

www.wired.com/...rabytes-data-record-23-minutes

startup data record transfer Terabytes minutes Big Data

shared by Gonzalo San Gil, PhD. on 14 Oct 14 - No Cached

Gonzalo San Gil, PhD. on 14 Oct 14

"There's a new record holder in the world of "big data." On Friday, Databricks-a startup spun out of the University California, Berkeley-announced that it has sorted 100 terabytes of data in a record 23 minutes using a number-crunching tool called Spark, eclipsing the previous record held by Yahoo and the popular big-data tool Hadoop."

<div class="cArrow"> </div><div class="cContentInner">"There's a new record holder in the world of "big data." On Friday, Databricks-a startup spun out of the University California, Berkeley-announced that it has sorted 100 terabytes of data in a record 23 minutes using a number-crunching tool called Spark, eclipsing the previous record held by Yahoo and the popular big-data tool Hadoop."</div>

...

Cancel

1 - 2 of 2

Showing 20▼ items per page

Related searches