Apache Sqoop: Difference between revisions

Wikipedia, the free encyclopedia

Latest revision as of 13:13, 1 August 2023 (Tuesday)

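Apache Sqoop
Developer: Apache Software Foundation
Initial release: 1 June 2009; 15 years ago (2009-06-01)
Final release: 1.4.6 (11 May 2015)
Repository: Sqoop Repository
Programming language: Java
Operating system: Cross-platform
Type: Data management
License: Apache License 2.0
Website: sqoop.apache.org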

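Apache Sqoop is an open-source tool for transferring data between relational databases and Hadoop.[1] The project began in 2009, was retired in June 2021, and was moved to the Apache Attic.[2]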

Overview


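Sqoop supports incremental updates, either by appending the records added since the most recent transfer or by selecting rows according to a last-modified timestamp. Imports can also populate tables in Hive or HBase.[3] Exports move data from Hadoop into a relational database. The name Sqoop comes from "SQL-to-Hadoop". Sqoop became a top-level Apache project in March 2012.[4]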
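
The two incremental modes described above correspond to the `append` and `lastmodified` values of the `--incremental` option of `sqoop import`, used together with `--check-column` and `--last-value`. The following is a minimal sketch, not an official client: it only assembles the command-line arguments and does not run Sqoop. The helper name `incremental_import_args`, the JDBC URL, the table name, and the target directory are hypothetical and chosen for illustration.

```python
from typing import List

# The two incremental import modes accepted by `--incremental`.
VALID_MODES = ("append", "lastmodified")


def incremental_import_args(connect: str, table: str, target_dir: str,
                            mode: str, check_column: str,
                            last_value: str) -> List[str]:
    """Build the argument list for an incremental `sqoop import` run.

    The list is only constructed here; nothing is executed.
    """
    if mode not in VALID_MODES:
        raise ValueError(f"mode must be one of {VALID_MODES}, got {mode!r}")
    return [
        "sqoop", "import",
        "--connect", connect,            # JDBC connection string
        "--table", table,                # source table in the database
        "--target-dir", target_dir,      # destination directory in HDFS
        "--incremental", mode,           # append or lastmodified
        "--check-column", check_column,  # column examined for new rows
        "--last-value", last_value,      # highest value seen in the previous run
    ]


# Hypothetical connection string, table, and path, for illustration only.
example = incremental_import_args(
    connect="jdbc:mysql://db.example.com/shop",
    table="orders",
    target_dir="/data/orders",
    mode="append",
    check_column="id",
    last_value="1000",
)
print(" ".join(example))
```

In `append` mode the check column is typically an increasing row identifier, while `lastmodified` expects a timestamp column; in both cases only rows beyond `--last-value` are meant to be transferred on the next run.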

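Informatica has provided a Sqoop-based connector since version 10.1. Pentaho has provided connectors based on open-source Sqoop, Sqoop Import[5] and Sqoop Export,[6] in its ETL suite Pentaho Data Integration since version 4.5.[7] Microsoft uses a Sqoop-based connector to transfer data from Microsoft SQL Server to Hadoop.[8] Couchbase also provides a Couchbase Server-Hadoop connector via Sqoop.[9]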

Bibliography


References

  1. Sqoop -. sqoop.apache.org. Retrieved 2022-06-24. Archived from the original on 2022-07-07.
  2. moving Sqoop to the Attic. mail-archives.apache.org. Retrieved 2021-06-27. Archived from the original on 2021-06-27.
  3. Apache Sqoop - Overview : Apache Sqoop. blogs.apache.org. Retrieved 2022-06-24. Archived from the original on 2022-06-24.
  4. Apache Sqoop Graduates from Incubator : Apache Sqoop. blogs.apache.org. Retrieved 2022-06-24. Archived from the original on 2022-06-24.
  5. Sqoop Import. Pentaho. 2015-12-10. Retrieved 2015-12-10. Archived from the original on 2015-12-10. "The Sqoop Import job allows you to import data from a relational database into the Hadoop Distributed File System (HDFS) using Apache Sqoop."
  6. Sqoop Export. Pentaho. 2015-12-10. Retrieved 2015-12-10. Archived from the original on 2015-12-10. "The Sqoop Export job allows you to export data from Hadoop into an RDBMS using Apache Sqoop."
  7. Big Data Analytics Vendor Pentaho Announces Tighter Integration with Cloudera; Extends Visual Interface to Include Hadoop Sqoop and Oozie. Database Trends and Applications (dbta.com). 2012-07-27. Retrieved 2015-12-08. Archived from the original on 2015-12-08. "Pentaho’s Business Analytics 4.5 is now certified on Cloudera’s latest releases, Cloudera Enterprise 4.0 and CDH4. Pentaho also announced that its visual design studio capabilities have been extended to the Sqoop and Oozie components of Hadoop."
  8. Microsoft SQL Server Connector for Apache Hadoop. Retrieved 2012-09-08. Archived from the original on 2016-04-13.
  9. Couchbase Hadoop Connector. Retrieved 2012-09-08. Archived from the original on 2012-08-25.