Apache Sqoop:修订间差异
外观
删除的内容 添加的内容
小 加粗(MOS:B) |
小 维护清理 |
||
(未显示另一用户的1个中间版本) | |||
第7行: | 第7行: | ||
| released = {{Start date and age|2009|06|01|df=yes}} <!-- https://blog.cloudera.com/blog/2009/06/introducing-sqoop/ --> |
| released = {{Start date and age|2009|06|01|df=yes}} <!-- https://blog.cloudera.com/blog/2009/06/introducing-sqoop/ --> |
||
| discontinued = yes |
| discontinued = yes |
||
| latest release version = 1.4.7 |
|||
| latest release date = {{Start date and age|2017|12|06}} |
|||
| latest preview version = |
|||
| latest preview date = |
|||
| programming language = [[Java]] |
| programming language = [[Java]] |
||
| operating system = [[跨平台]] |
| operating system = [[跨平台]] |
||
第18行: | 第14行: | ||
| website = {{URL|https://sqoop.apache.org}} |
| website = {{URL|https://sqoop.apache.org}} |
||
}} |
}} |
||
'''Apache Sqoop'''是用于在[[关系型数据库]]和[[Apache Hadoop|Hadoop]]之间传输数据的开源工具。<ref>{{Cite web |title=Sqoop - |url=https://sqoop.apache.org/ |website=sqoop.apache.org |access-date=2022-06-24}}</ref> 该项目始于2009年,在2021年6月结束,并被移至[[Apache Attic]]。<ref>{{Cite web |title=moving Sqoop to the Attic |url=http://mail-archives.apache.org/mod_mbox/sqoop-user/202106.mbox/browser |url-status= |
'''Apache Sqoop'''是用于在[[关系型数据库]]和[[Apache Hadoop|Hadoop]]之间传输数据的开源工具。<ref>{{Cite web |title=Sqoop - |url=https://sqoop.apache.org/ |website=sqoop.apache.org |access-date=2022-06-24 |archive-date=2022-07-07 |archive-url=https://web.archive.org/web/20220707224939/https://sqoop.apache.org/ |dead-url=no }}</ref> 该项目始于2009年,在2021年6月结束,并被移至[[Apache Attic]]。<ref>{{Cite web |title=moving Sqoop to the Attic |url=http://mail-archives.apache.org/mod_mbox/sqoop-user/202106.mbox/browser |url-status=no |website=mail-archives.apache.org |access-date=2021-06-27 |archive-date=2021-06-27 |archive-url=https://web.archive.org/web/20210627201658/http://mail-archives.apache.org/mod_mbox/sqoop-user/202106.mbox/browser }}</ref> |
||
== 概要 == |
== 概要 == |
||
Sqoop支持增量更新,将新记录添加到最近一次的导出的数据源上,或者指定上次修改的时间戳。导入也可以填充[[Apache Hive|Hive]]或[[Apache HBase|HBase]]中的表。<ref>{{Cite web |title=Apache Sqoop - Overview : Apache Sqoop |url=https://blogs.apache.org/sqoop/entry/apache_sqoop_overview |website=blogs.apache.org |access-date=2022-06-24}}</ref> 导出则支持将Hadoop的数据放入关系数据库中。Sqoop得名于“SQL-to-Hadoop”。Sqoop于2012年3月成为顶级Apache项目。<ref>{{Cite web |title=Apache Sqoop Graduates from Incubator : Apache Sqoop |url=https://blogs.apache.org/sqoop/entry/apache_sqoop_graduates_from_incubator |website=blogs.apache.org |access-date=2022-06-24}}</ref> |
Sqoop支持增量更新,将新记录添加到最近一次的导出的数据源上,或者指定上次修改的时间戳。导入也可以填充[[Apache Hive|Hive]]或[[Apache HBase|HBase]]中的表。<ref>{{Cite web |title=Apache Sqoop - Overview : Apache Sqoop |url=https://blogs.apache.org/sqoop/entry/apache_sqoop_overview |website=blogs.apache.org |access-date=2022-06-24 |archive-date=2022-06-24 |archive-url=https://web.archive.org/web/20220624141140/https://blogs.apache.org/sqoop/entry/apache_sqoop_overview |dead-url=no }}</ref> 导出则支持将Hadoop的数据放入关系数据库中。Sqoop得名于“SQL-to-Hadoop”。Sqoop于2012年3月成为顶级Apache项目。<ref>{{Cite web |title=Apache Sqoop Graduates from Incubator : Apache Sqoop |url=https://blogs.apache.org/sqoop/entry/apache_sqoop_graduates_from_incubator |website=blogs.apache.org |access-date=2022-06-24 |archive-date=2022-06-24 |archive-url=https://web.archive.org/web/20220624141045/https://blogs.apache.org/sqoop/entry/apache_sqoop_graduates_from_incubator |dead-url=no }}</ref> |
||
Informatica从10.1版开始提供基于Sqoop的连接器。Pentaho自4.5版开始在其[[ETL]]套件Pentaho Data Integration中提供基于开源Sqoop的连接器,Sqoop导入<ref name="2015-12-10_PSI" />和导出<ref name="2015-12-10_PSE" />。<ref name="2012-07-27_dbta" />[[微软]]使用基于Sqoop的连接器将数据从[[Microsoft SQL Server]]传输到Hadoop。<ref>{{cite web |title=Microsoft SQL Server Connector for Apache Hadoop |url=https://www.microsoft.com/en-us/download/details.aspx?id=27584 |access-date=Sep 8, 2012}}</ref>Couchbase还通过Sqoop提供Couchbase Server-Hadoop连接器。<ref>{{cite web |title=Couchbase Hadoop Connector |url=http://www.couchbase.com/develop/connectors/hadoop |url-status=dead |archive-url=https://web.archive.org/web/20120825184036/http://www.couchbase.com/develop/connectors/hadoop |archive-date=2012-08-25 |access-date=Sep 8, 2012}}</ref> |
Informatica从10.1版开始提供基于Sqoop的连接器。Pentaho自4.5版开始在其[[ETL]]套件Pentaho Data Integration中提供基于开源Sqoop的连接器,Sqoop导入<ref name="2015-12-10_PSI" />和导出<ref name="2015-12-10_PSE" />。<ref name="2012-07-27_dbta" />[[微软]]使用基于Sqoop的连接器将数据从[[Microsoft SQL Server]]传输到Hadoop。<ref>{{cite web |title=Microsoft SQL Server Connector for Apache Hadoop |url=https://www.microsoft.com/en-us/download/details.aspx?id=27584 |access-date=Sep 8, 2012 |archive-date=2016-04-13 |archive-url=https://web.archive.org/web/20160413023204/http://www.microsoft.com/en-us/download/details.aspx?id=27584 |dead-url=no }}</ref>Couchbase还通过Sqoop提供Couchbase Server-Hadoop连接器。<ref>{{cite web |title=Couchbase Hadoop Connector |url=http://www.couchbase.com/develop/connectors/hadoop |url-status=dead |archive-url=https://web.archive.org/web/20120825184036/http://www.couchbase.com/develop/connectors/hadoop |archive-date=2012-08-25 |access-date=Sep 8, 2012}}</ref> |
||
== 参考书目 == |
== 参考书目 == |
2023年8月1日 (二) 13:13的最新版本
開發者 | Apache Software Foundation |
---|---|
首次发布 | 2009年6月1日 |
最终版本 |
|
源代码库 | Sqoop Repository |
编程语言 | Java |
操作系统 | 跨平台 |
类型 | 数据管理 |
许可协议 | Apache License 2.0 |
网站 | sqoop |
Apache Sqoop是用于在关系型数据库和Hadoop之间传输数据的开源工具。[1] 该项目始于2009年,在2021年6月结束,并被移至Apache Attic。[2]
概要
[编辑]Sqoop支持增量更新,将新记录添加到最近一次的导出的数据源上,或者指定上次修改的时间戳。导入也可以填充Hive或HBase中的表。[3] 导出则支持将Hadoop的数据放入关系数据库中。Sqoop得名于“SQL-to-Hadoop”。Sqoop于2012年3月成为顶级Apache项目。[4]
Informatica从10.1版开始提供基于Sqoop的连接器。Pentaho自4.5版开始在其ETL套件Pentaho Data Integration中提供基于开源Sqoop的连接器,Sqoop导入[5]和导出[6]。[7]微软使用基于Sqoop的连接器将数据从Microsoft SQL Server传输到Hadoop。[8]Couchbase还通过Sqoop提供Couchbase Server-Hadoop连接器。[9]
参考书目
[编辑]- White, Tom. Chapter 15: Sqoop. Hadoop: The Definitive Guide 2nd. O'Reilly Media. : 477–495. ISBN 978-1-449-38973-4.
参考资料
[编辑]- ^ Sqoop -. sqoop.apache.org. [2022-06-24]. (原始内容存档于2022-07-07).
- ^ moving Sqoop to the Attic. mail-archives.apache.org. [2021-06-27]. (原始内容存档于2021-06-27).
- ^ Apache Sqoop - Overview : Apache Sqoop. blogs.apache.org. [2022-06-24]. (原始内容存档于2022-06-24).
- ^ Apache Sqoop Graduates from Incubator : Apache Sqoop. blogs.apache.org. [2022-06-24]. (原始内容存档于2022-06-24).
- ^ Sqoop Import. Pentaho. 2015-12-10 [2015-12-10]. (原始内容存档于2015-12-10).
The Sqoop Import job allows you to import data from a relational database into the Hadoop Distributed File System (HDFS) using Apache Sqoop.
- ^ Sqoop Export. Pentaho. 2015-12-10 [2015-12-10]. (原始内容存档于2015-12-10).
The Sqoop Export job allows you to export data from Hadoop into an RDBMS using Apache Sqoop.
- ^ Big Data Analytics Vendor Pentaho Announces Tighter Integration with Cloudera; Extends Visual Interface to Include Hadoop Sqoop and Oozie. Database Trends and Applications (dbta.com). 2012-07-27 [2015-12-08]. (原始内容存档于2015-12-08).
Pentaho’s Business Analytics 4.5 is now certified on Cloudera’s latest releases, Cloudera Enterprise 4.0 and CDH4. Pentaho also announced that its visual design studio capabilities have been extended to the Sqoop and Oozie components of Hadoop.
- ^ Microsoft SQL Server Connector for Apache Hadoop. [Sep 8, 2012]. (原始内容存档于2016-04-13).
- ^ Couchbase Hadoop Connector. [Sep 8, 2012]. (原始内容存档于2012-08-25).