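PDI (Kettle) is an open-source ETL solution. The book describes how to use PDI for the common ETL tasks: data profiling, cleansing, validation, extraction, transformation, and loading. Beyond large ODS/DW applications, Kettle can also give small and medium-sized businesses flexible data extraction and data processing. In addition to relational databases and NoSQL sources such as HBase and MongoDB, Kettle supports small data sources such as Excel and Access, and plugins extend it to further data sources. The book covers in detail the data sources Kettle can handle and how to extract incremental data with it. Kettle's data processing is also powerful: besides the common operations of selecting, filtering, grouping, joining, and sorting, its Java expression, regular expression, JavaScript, and Java class features are flexible and suit a wide range of data processing tasks.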
How to get PDI up and running on Linux

Ubuntu 12.04 and later:
The libwebkitgtk package needs to be installed. This can be done by running
    apt-get install libwebkitgtk-1.0-0
Unzip the downloaded file. Run the spoon.sh file; it should be under /data-integration.
On some installations of Ubuntu 14.04, Unity doesn't display the menu bar. To fix that, spoon.sh has a setting that disables this integration:
    export UBUNTU_MENUPROXY=0
You can try removing that setting to see whether it works properly on your machine.

CentOS 6 Desktop:
The libwebkitgtk package needs to be installed. This can be done by running
    yum install libwebkitgtk
Unzip the downloaded file and run spoon.sh; it should be under /data-integration.
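The two install commands above can be wrapped in a small script that picks the right one for the machine. This is only a sketch: webkit_install_cmd and detect_pkg_manager are hypothetical helper names, not part of PDI, and the sudo prefix on the yum command is an assumption. The package names are the ones quoted above.

```shell
#!/bin/sh
# Sketch only: these helpers are hypothetical and not shipped with PDI.

# Print the install command for the WebKitGTK package that Spoon needs.
# $1 is the package manager name: apt-get (Ubuntu) or yum (CentOS).
webkit_install_cmd() {
    case "$1" in
        apt-get) echo "sudo apt-get install libwebkitgtk-1.0-0" ;;
        yum)     echo "sudo yum install libwebkitgtk" ;;
        *)       echo "unsupported package manager: $1" >&2; return 1 ;;
    esac
}

# Print the first supported package manager found on PATH.
# Returns non-zero and prints nothing when neither is available.
detect_pkg_manager() {
    for pm in apt-get yum; do
        if command -v "$pm" >/dev/null 2>&1; then
            echo "$pm"
            return 0
        fi
    done
    return 1
}
```

A typical use is to print the command first and run it by hand, for example webkit_install_cmd "$(detect_pkg_manager)", and then start Spoon with ./spoon.sh from the unzipped data-integration directory.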
#######################################################################
WARNING: no libwebkitgtk-1.0 detected, some features will be unavailable
Consider installing the package with apt-get or yum.
e.g. 'sudo apt-get install libwebkitgtk-1.0-0'
#######################################################################
Java HotSpot(TM) 64-Bit Server VM warning: ignoring option MaxPermSize=256m; support was removed in 8.0
org.eclipse.swt.SWTError: No more handles [gtk_init_check() failed]
    at org.eclipse.swt.SWT.error(Unknown Source)
    at org.eclipse.swt.widgets.Display.createDisplay(Unknown Source)
    at org.eclipse.swt.widgets.Display.create(Unknown Source)
    at org.eclipse.swt.graphics.Device.<init>(Unknown Source)
    at org.eclipse.swt.widgets.Display.<init>(Unknown Source)
    at org.eclipse.swt.widgets.Display.<init>(Unknown Source)
    at org.pentaho.di.ui.spoon.Spoon.main(Spoon.java:649)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:498)
    at org.pentaho.commons.launcher.Launcher.main(Launcher.java:92
Solution:
[Desktop Entry]
Version=7.1
Name=kettle
Exec=path to start script xxx/spoon.sh
Icon=path to ico /spoon.ico
Terminal=false
Type=Application
Categories=Application;
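The launcher entry above can also be generated by a script, so the two path placeholders are filled in consistently. This is a minimal sketch: write_spoon_launcher is a hypothetical helper, the field values are copied from the entry above, and it assumes spoon.sh and spoon.ico sit in the same PDI directory.

```shell
#!/bin/sh
# Sketch only: write_spoon_launcher is a hypothetical helper, not part of PDI.
# $1: directory that contains spoon.sh and spoon.ico (assumed to be together)
# $2: path of the .desktop file to write
write_spoon_launcher() {
    pdi_dir=$1
    out_file=$2
    # Unquoted heredoc delimiter so that $pdi_dir is expanded in the body.
    cat > "$out_file" <<EOF
[Desktop Entry]
Version=7.1
Name=kettle
Exec=$pdi_dir/spoon.sh
Icon=$pdi_dir/spoon.ico
Terminal=false
Type=Application
Categories=Application;
EOF
}
```

For example, write_spoon_launcher /opt/data-integration "$HOME/.local/share/applications/kettle.desktop" writes a per-user launcher; /opt/data-integration is only an illustrative install path.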
# A fatal error has been detected by the Java Runtime Environment:
#
# SIGSEGV (0xb) at pc=0x00007f4ab4f35164, pid=4011, tid=0x00007f4b09bd7700
#
# JRE version: OpenJDK Runtime Environment (8.0_151-b12) (build 1.8.0_151-b12)
# Java VM: OpenJDK 64-Bit Server VM (25.151-b12 mixed mode linux-amd64 compressed oops)
# Problematic frame:
# C [libglib-2.0.so.0+0x5e164] g_match_info_unref+0x4
#
# Failed to write core dump. Core dumps have been disabled. To enable core dumping, try "ulimit -c unlimited" before starting Java again
As I already mentioned on #1245468, I could not verify that changing GTK_MODULES, UBUNTU_MENUPROXY, or GTK_IM_MODULE helps in any way. However, I could verify that the problem goes away in Kubuntu/KDE when doing: System Settings -> Application Themes -> GTK -> Choose GTK2 Theme, then choosing 'Radiance' instead of 'oxygen-gtk'.
In Kettle's startup file spoon.sh, add the parameter -Dfile.encoding=utf8 to the JVM startup options (substitute the character set you need).
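One way to add the flag without editing the options line by hand is to build the option string with a helper. This is a sketch under assumptions: add_file_encoding is a hypothetical function, and the usage note assumes your copy of spoon.sh reads the PENTAHO_DI_JAVA_OPTIONS environment variable, which you should confirm in the script itself.

```shell
#!/bin/sh
# Sketch only: add_file_encoding is a hypothetical helper, not part of PDI.
# $1: existing JVM option string (may be empty)
# $2: character set to use, e.g. utf8
# Prints the options with -Dfile.encoding appended, unless one is already set.
add_file_encoding() {
    opts=$1
    charset=$2
    # Pad with spaces so a flag at the very start of the string also matches.
    case " $opts " in
        *" -Dfile.encoding="*) echo "$opts" ;;
        *) echo "${opts:+$opts }-Dfile.encoding=$charset" ;;
    esac
}
```

Under that assumption, the flag could be applied with export PENTAHO_DI_JAVA_OPTIONS="$(add_file_encoding "$PENTAHO_DI_JAVA_OPTIONS" utf8)" before running ./spoon.sh. If the variable was previously unset, setting it may replace the script's default memory options, so check what spoon.sh does in that case.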