Kettle

来自开放百科 - 灰狐
(版本间的差异)
跳转到: 导航, 搜索
 
(Eclipse)
 
(未显示1个用户的31个中间版本)
第1行: 第1行:
 
Kettle is a powerful, open source, metadata-driven ETL tool
 
Kettle is a powerful, open source, metadata-driven ETL tool
  
License: GNU General Public License (GPL)
+
==简介==
 +
Pentaho Data Integration (ETL, EII - jobs & transformations), License: Apache v2
  
==Used for==
+
==组成==
 +
Pentaho Data Integration (Kettle) 包含以下工具:
 +
*Spoon:一个创建转换和任务的图形化IDE
 +
*Kitchen:一个运行任务的命令行工具
 +
*Pan:一个运行转换的命令行工具
 +
*Carte:一个在远程主机运行任务和转换的轻量级服务器
 +
 
 +
==用处==
 
* Data warehouse population with built-in support for slowly changing dimensions, junk dimensions and much, much more.
 
* Data warehouse population with built-in support for slowly changing dimensions, junk dimensions and much, much more.
 
* Export of database(s) to text-file(s) or other databases
 
* Export of database(s) to text-file(s) or other databases
第13行: 第21行:
 
* Application integration
 
* Application integration
  
==Links==
+
==插件==
*http://www.cs.waikato.ac.nz/~ml/
+
Kettle的插件体系
*http://kettle.pentaho.org/
+
*[[OSGi]]
*http://sourceforge.net/projects/weka/
+
*[[Apache Karaf]]
 +
 
 +
==仓库==
 +
MySQL
 +
create database kettle;
 +
然后通过资源库的创建向导完成数据库的创建,会新建41个表 (3.0.1版本),然后每次启动登录资源库
 +
admin/admin
 +
可把整个资源库导出: repositories.xml, 也可把资源库从xml导入
 +
 
 +
[[Kettle Repository DDL]]
 +
 
 +
==集群==
 +
./carte.sh 127.0.0.1 8080
 +
http://127.0.0.1:8080
 +
cluster/cluster
 +
 
 +
==Eclipse==
 +
基于[[Eclipse]]体系和框架进行重构
 +
 
 +
==大数据==
 +
*[https://github.com/pentaho/big-data-plugin Big Data Plugin]
 +
 
 +
==图集==
 +
<gallery widths=100px heights=100px perrow=6>
 +
Image:kettle-repository.png|Kettle Repository
 +
Image:kettle-01.png
 +
Image:kettle-02.png
 +
Image:kettle-03.png
 +
Image:spoon-transformation.png|设计转换
 +
Image:spoon-job.png|设计任务
 +
Image:spoon-logview.png|日志
 +
Image:spoon-welcome.png|欢迎页
 +
Image:spoon-verify.png|验证转换
 +
Image:spoon-debug.png|调试转换
 +
image:pdi-kettle-database-plugin.png|数据库插件
 +
image:pentaho-kettle-spoon.png|Spoon
 +
</gallery>
 +
 
 +
==链接==
 +
*[http://community.pentaho.com/projects/data-integration/ Kettle官网]
 +
*[https://github.com/pentaho/pentaho-kettle Kettle @ GitHub]
 +
*http://docs.huihoo.com/kettle/
 +
*[https://sourceforge.net/projects/pentaho/files/Data%20Integration/ Kettle下载]
 +
 
 +
[[category:ETL]]
 +
[[category:MDM]]
 +
[[category:pentaho]]
 +
[[category:metadata]]
 +
[[category:OSGi]]
 +
[[category:huihoo]]

2021年2月24日 (三) 10:48的最后版本

Kettle is a powerful, open source, metadata-driven ETL tool

目录

[编辑] 简介

Pentaho Data Integration (ETL, EII - jobs & transformations), License: Apache v2

[编辑] 组成

Pentaho Data Integration (Kettle) 包含以下工具:

  • Spoon:一个创建转换和任务的图形化IDE
  • Kitchen:一个运行任务的命令行工具
  • Pan:一个运行转换的命令行工具
  • Carte:一个在远程主机运行任务和转换的轻量级服务器

[编辑] 用处

  • Data warehouse population with built-in support for slowly changing dimensions, junk dimensions and much, much more.
  • Export of database(s) to text-file(s) or other databases
  • Import of data into databases, ranging from text-files to excel sheets
  • Data migration between database applications
  • Exploration of data in existing databases. (tables, views, synonyms, )
  • Information enrichment by looking up data in various information stores (databases, text-files, excel sheets, )
  • Data cleaning by applying complex conditions in data transformations
  • Application integration

[编辑] 插件

Kettle的插件体系

[编辑] 仓库

MySQL

create database kettle;

然后通过资源库的创建向导完成数据库的创建,会新建41个表 (3.0.1版本),然后每次启动登录资源库

admin/admin 

可把整个资源库导出: repositories.xml, 也可把资源库从xml导入

Kettle Repository DDL

[编辑] 集群

./carte.sh 127.0.0.1 8080
http://127.0.0.1:8080
cluster/cluster

[编辑] Eclipse

基于Eclipse体系和框架进行重构

[编辑] 大数据

[编辑] 图集

[编辑] 链接

分享您的观点
个人工具
名字空间

变换
操作
导航
工具箱