Kettle

来自开放百科 - 灰狐
(版本间的差异)
跳转到: 导航, 搜索
(Eclipse)
 
(未显示1个用户的9个中间版本)
第1行: 第1行:
 
Kettle is a powerful, open source, metadata-driven ETL tool
 
Kettle is a powerful, open source, metadata-driven ETL tool
  
Pentaho Data Integration (ETL, EII - jobs & transformations)
+
==简介==
 
+
Pentaho Data Integration (ETL, EII - jobs & transformations), License: Apache v2
License: GNU General Public License (GPL)
+
  
 
==组成==
 
==组成==
第21行: 第20行:
 
* Data cleaning by applying complex conditions in data transformations
 
* Data cleaning by applying complex conditions in data transformations
 
* Application integration
 
* Application integration
 +
 +
==插件==
 +
Kettle的插件体系
 +
*[[OSGi]]
 +
*[[Apache Karaf]]
  
 
==仓库==
 
==仓库==
第35行: 第39行:
 
  http://127.0.0.1:8080
 
  http://127.0.0.1:8080
 
  cluster/cluster
 
  cluster/cluster
 +
 +
==Eclipse==
 +
基于[[Eclipse]]体系和框架进行重构
 +
 +
==大数据==
 +
*[https://github.com/pentaho/big-data-plugin Big Data Plugin]
  
 
==图集==
 
==图集==
第48行: 第58行:
 
Image:spoon-verify.png|验证转换
 
Image:spoon-verify.png|验证转换
 
Image:spoon-debug.png|调试转换
 
Image:spoon-debug.png|调试转换
 +
image:pdi-kettle-database-plugin.png|数据库插件
 +
image:pentaho-kettle-spoon.png|Spoon
 
</gallery>
 
</gallery>
  
 
==链接==
 
==链接==
*http://kettle.pentaho.org/
+
*[http://community.pentaho.com/projects/data-integration/ Kettle官网]
 +
*[https://github.com/pentaho/pentaho-kettle Kettle @ GitHub]
 
*http://docs.huihoo.com/kettle/
 
*http://docs.huihoo.com/kettle/
*http://download.huihoo.com/kettle/
+
*[https://sourceforge.net/projects/pentaho/files/Data%20Integration/ Kettle下载]
 
+
{{comment}}
+
  
 
[[category:ETL]]
 
[[category:ETL]]
第61行: 第72行:
 
[[category:pentaho]]
 
[[category:pentaho]]
 
[[category:metadata]]
 
[[category:metadata]]
 +
[[category:OSGi]]
 +
[[category:huihoo]]

2021年2月24日 (三) 10:48的最后版本

Kettle is a powerful, open source, metadata-driven ETL tool

目录

[编辑] 简介

Pentaho Data Integration (ETL, EII - jobs & transformations), License: Apache v2

[编辑] 组成

Pentaho Data Integration (Kettle) 包含以下工具:

  • Spoon:一个创建转换和任务的图形化IDE
  • Kitchen:一个运行任务的命令行工具
  • Pan:一个运行转换的命令行工具
  • Carte:一个在远程主机运行任务和转换的轻量级服务器

[编辑] 用处

  • Data warehouse population with built-in support for slowly changing dimensions, junk dimensions and much, much more.
  • Export of database(s) to text-file(s) or other databases
  • Import of data into databases, ranging from text-files to excel sheets
  • Data migration between database applications
  • Exploration of data in existing databases. (tables, views, synonyms, )
  • Information enrichment by looking up data in various information stores (databases, text-files, excel sheets, )
  • Data cleaning by applying complex conditions in data transformations
  • Application integration

[编辑] 插件

Kettle的插件体系

[编辑] 仓库

MySQL

create database kettle;

然后通过资源库的创建向导完成数据库的创建,会新建41个表 (3.0.1版本),然后每次启动登录资源库

admin/admin 

可把整个资源库导出: repositories.xml, 也可把资源库从xml导入

Kettle Repository DDL

[编辑] 集群

./carte.sh 127.0.0.1 8080
http://127.0.0.1:8080
cluster/cluster

[编辑] Eclipse

基于Eclipse体系和框架进行重构

[编辑] 大数据

[编辑] 图集

[编辑] 链接

分享您的观点
个人工具
名字空间

变换
操作
导航
工具箱