Kettle
From the Open Encyclopedia - Huihoo
Latest revision as of 10:48, 24 February 2021 (Wednesday)
Kettle is a powerful, open-source, metadata-driven ETL tool.
Introduction
Kettle is also known as Pentaho Data Integration. It covers ETL and EII through jobs and transformations. License: Apache v2.
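Components
Pentaho Data Integration (Kettle) includes the following tools:
- Spoon: a graphical IDE for creating transformations and jobs
- Kitchen: a command-line tool for running jobs
- Pan: a command-line tool for running transformations
- Carte: a lightweight server for running jobs and transformations on a remote host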
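As a rough illustration of how the two command-line tools above are usually invoked, the sketch below builds and prints the command for running a transformation with Pan and a job with Kitchen. The installation directory `/opt/data-integration` and the file names `load_customers.ktr` and `nightly_load.kjb` are hypothetical placeholders; `-file` and `-level` are the options the tools use to select a file and a log level.

```shell
#!/bin/sh
# Assumption: PDI is unpacked in this directory; adjust to your installation.
PDI_HOME="${PDI_HOME:-/opt/data-integration}"

# Print the Pan command that would run one transformation (.ktr) file.
pan_command() {
    printf '%s/pan.sh -file=%s -level=Basic\n' "$PDI_HOME" "$1"
}

# Print the Kitchen command that would run one job (.kjb) file.
kitchen_command() {
    printf '%s/kitchen.sh -file=%s -level=Basic\n' "$PDI_HOME" "$1"
}

# Hypothetical file names, for illustration only.
pan_command /tmp/load_customers.ktr
kitchen_command /tmp/nightly_load.kjb
```

The functions only print the command lines so that they can be reviewed first. Running the printed line starts the tool, and since both tools signal failure through a non-zero exit code, they fit into cron jobs or other scripts.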
Uses
- Data warehouse population with built-in support for slowly changing dimensions, junk dimensions and more
- Export of database(s) to text file(s) or other databases
- Import of data into databases, ranging from text files to Excel sheets
- Data migration between database applications
- Exploration of data in existing databases (tables, views, synonyms, etc.)
- Information enrichment by looking up data in various information stores (databases, text files, Excel sheets, etc.)
- Data cleaning by applying complex conditions in data transformations
- Application integration
Plugins
Kettle's plugin system:
- OSGi
- Apache Karaf
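Repository
MySQL
create database kettle;
Then finish setting up the database with the repository creation wizard, which creates 41 new tables (version 3.0.1). After that, log in to the repository at every startup with:
admin/admin
The whole repository can be exported to XML (repositories.xml), and a repository can also be imported from XML.
See also: Kettle Repository DDL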
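The MySQL preparation step above can be sketched as a small script that prints the statements to run before starting the repository wizard. Only the database name `kettle` comes from the text. The dedicated `kettle` user, the `localhost` host part, the character set and the password `change-me` are assumptions added for illustration, and `CREATE USER IF NOT EXISTS` assumes MySQL 5.7 or later.

```shell
#!/bin/sh
# Emit the SQL that prepares an empty MySQL schema for the repository wizard.
# Assumption: a dedicated 'kettle' user; $1 is a placeholder password and
# must not contain single quotes.
repository_sql() {
    cat <<EOF
CREATE DATABASE IF NOT EXISTS kettle DEFAULT CHARACTER SET utf8;
CREATE USER IF NOT EXISTS 'kettle'@'localhost' IDENTIFIED BY '$1';
GRANT ALL PRIVILEGES ON kettle.* TO 'kettle'@'localhost';
EOF
}

# Review the statements first; to apply them, pipe them into the client:
#   repository_sql 'change-me' | mysql -u root -p
repository_sql 'change-me'
```

Keeping the repository in its own schema with its own user means the tables created by the wizard stay separate from application data.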
Cluster
Start a Carte server on host 127.0.0.1, port 8080:
./carte.sh 127.0.0.1 8080
Then open the web interface and log in with cluster/cluster:
http://127.0.0.1:8080
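Once Carte is running, its status page offers a quick check that the server is reachable. The sketch below is a minimal example that reuses the host, port and cluster/cluster login from above. The function names are hypothetical, and `/kettle/status/?xml=Y` is Carte's status page in XML form.

```shell
#!/bin/sh
# Host and port as in the example above; override through the environment.
CARTE_HOST="${CARTE_HOST:-127.0.0.1}"
CARTE_PORT="${CARTE_PORT:-8080}"

# Build the URL of Carte's status page; xml=Y asks for XML instead of HTML.
carte_status_url() {
    printf 'http://%s:%s/kettle/status/?xml=Y\n' "$CARTE_HOST" "$CARTE_PORT"
}

# Query the status page. curl exits non-zero when the server is unreachable
# or, because of --fail, when the login is rejected.
carte_status() {
    curl --silent --fail --user cluster:cluster "$(carte_status_url)"
}

# Print the URL only; call carte_status once a server is actually running.
carte_status_url
```

Calling `carte_status` from a monitoring script gives a simple health check for each node in a cluster.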
Eclipse
Refactoring based on the Eclipse architecture and framework.
Big Data
- Big Data Plugin: https://github.com/pentaho/big-data-plugin
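Gallery
Screenshots (images not shown): Kettle Repository, designing a transformation, designing a job, log view, welcome page, verifying a transformation, debugging a transformation, database plugin, Spoon.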
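Links
- Kettle official site: http://community.pentaho.com/projects/data-integration/
- Kettle @ GitHub: https://github.com/pentaho/pentaho-kettle
- http://docs.huihoo.com/kettle/
- Kettle downloads: https://sourceforge.net/projects/pentaho/files/Data%20Integration/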