• 设为首页
  • 点击收藏
  • 手机版
    手机扫一扫访问
    迪恩网络手机版
  • 关注官方公众号
    微信扫一扫关注
    公众号

Toree: Apache Toree 提供了与远程 Apache Spark 进行交互和访问的机制 ...

原作者: [db:作者] 来自: 网络 收藏 邀请

开源软件名称:

Toree

开源软件地址:

https://gitee.com/apache/toree

开源软件介绍:

Build StatusLicenseJoin the chat at https://gitter.im/apache/toree

Apache Toree

Apache Toree is a Juypter Notebook kernel. The main goal of Toree is to provide the foundation forinteractive applications that connect to and use Apache Spark using Scala language.

Overview

Toree provides an interface that allows clients to interact with a Spark Cluster. Clients can send libraries andsnippets of code that are interpreted and executed using a preconfigured Spark context.These snippets can do a variety of things:

  1. Define and run spark jobs of all kinds
  2. Collect results from spark and push them to the client
  3. Load necessary dependencies for the running code
  4. Start and monitor a stream
  5. ...

Apache Toree supports the Scala programming language. It implements the latest Jupyter message protocol (5.0),so it can easily plug into the latest releases of Jupyter/IPython (3.2.x+ and up) for quick, interactive data exploration.

Develop

This project uses make as the entry point for build, test, and packaging. To perform a local build, you need toinstall sbt, jupyter/ipython, and other development requirements locally on your machine.

To build and interact with Toree using Jupyter, run

make dev

This will start a Jupyter notebook server. Depending on your mode, it will be accessible at http://localhost:8888 or http://192.168.44.44:8888. From here you can create notebooks that use Toree configured for Spark local mode.

Tests can be run by doing make test.

NOTE: Do not use sbt directly.

Build & Package

To build and package up Toree, run

make release

This results in 2 packages.

  • ./dist/toree-<VERSION>-binary-release.tar.gz is a simple package that contains JAR and executable
  • ./dist/toree-<VERSION>.tar.gz is a pip installable package that adds Toree as a Jupyter kernel.

NOTE: make release uses docker. Please refer to docker installation instructions for your system.

Run Examples

To play with the example notebooks, run

make jupyter

A notebook server will be launched in a Docker container with Toree and some other dependencies installed.Refer to your Docker setup for the ip address. The notebook will be at http://<ip>:8888/.

Install

This requires you to have a distribution of Apache Spark downloaded to the system where Apache Toree will run. The following commands will install Apache Toree.

pip install --upgrade toreejupyter toree install --spark_home=<YOUR_SPARK_PATH>

Dev snapshots of Toree are located at https://dist.apache.org/repos/dist/dev/incubator/toree. To install using oneof those packages, you can use the following:

pip install <PIP_RELEASE_URL>jupyter toree install --spark_home=<YOUR_SPARK_PATH>

where PIP_RELEASE_URL is one of the pip packages. For example:

pip install https://dist.apache.org/repos/dist/dev/incubator/toree/0.2.0/snapshots/dev1/toree-pip/toree-0.2.0.dev1.tar.gzjupyter toree install --spark_home=<YOUR_SPARK_PATH>

Reporting Issues

Refer to and open issue here

Communication

You can reach us through gitter or our mailing list

Version

We are working on publishing binary releases of Toree soon. As part of our move into Apache Incubator, Toree will start a new version sequence starting at 0.1.

Our goal is to keep master up to date with the latest version of Spark. When new versions of Spark require specific code changes to Toree, we will branch out older Spark version support.

As it stands, we maintain several branches for legacy versions of Spark. The table below shows what is available now.

BranchApache Spark Version
master3.x.x
[0.4.x][0.4.x]2.x.x
0.1.x1.6+

Please note that for the most part, new features will mainly be added to the master branch.

Resources

We are currently enhancing our documentation, which is available in our website.


鲜花

握手

雷人

路过

鸡蛋
该文章已有0人参与评论

请发表评论

全部评论

专题导读
热门推荐
热门话题
阅读排行榜

扫描微信二维码

查看手机版网站

随时了解更新最新资讯

139-2527-9053

在线客服(服务时间 9:00~18:00)

在线QQ客服
地址:深圳市南山区西丽大学城创智工业园
电邮:jeky_zhao#qq.com
移动电话:139-2527-9053

Powered by 互联科技 X3.4© 2001-2213 极客世界.|Sitemap