Disco

来自开放百科 - 灰狐
2010年9月21日 (二) 09:23Allen (讨论 | 贡献)的版本

跳转到: 导航, 搜索

Disco is an open-source implementation of the MapReduce framework for distributed computing. As the original framework, Disco supports parallel computations over large data sets on unreliable cluster of computers.

The Disco core is written in Erlang, a functional language that is designed for building robust fault-tolerant distributed applications. Users of Disco typically write jobs in Python, which makes it possible to express even complex algorithms or data processing tasks often only in tens of lines of code. This means that you can quickly write scripts to process massive amounts of data.

Links

分享您的观点
个人工具
名字空间

变换
操作
导航
工具箱