Skip to content

EC Dataplatform v1.0.0#

Features#

Big data environment for development and testing proof-of-concepts big data projects#

  • Compute cluster that runs Hadoop and can run other applications from the Hadoop ecosystem.
  • Compute cluster runs on top of AWS EMR
  • The following applications are included:
  • Hadoop (>v2.7.2)
  • Ganglia(>v3.7.2)
  • Hive (>v2.1.0)
  • Hue (>v3.10.0)
  • JupyterHub (>v0.8.1)
  • Mahout (>v0.12.2)
  • Oozie (>v4.2.0)
  • Spark (>v2.0.0)
  • TensorFlow (>v1.9.0)
  • Zeppelin (>v0.6.1)
  • RStudio Server (>v1.2.1335)
  • Shiny Server (>v1.5.9.923)
  • MXNet (>v0.12.0)
  • Pig (>v0.16.0)
  • Livy (>v0.4.0)
  • The following distributions of programming languages can be used on the cluster:
  • Python (v2.7, v3.6)
  • R (v3.6.0)
  • Scala (v2.11.12)
  • Java (>v8.0.0)
  • The compute cluster is always launched in a private network for security purposes.
  • The compute cluster has access to AWS S3, where data can be stored and results can be persisted. Users can upload data to S3 from their local machines and ingest this data directly into HDFS.

Fixes#

No new fixes were added in this release.

Improvements#

No new improvements were added in this release.

Projects#

No new projects were added in this release

Operations#

No new operations were added in this release.