Jump to content

Ab Initio Software: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Revert.
Line 16: Line 16:
==The Co>Operating System==
==The Co>Operating System==
Runs across a variety of Operating Systems and Hardware Platforms including OS/390 on [[Mainframe computer|Mainframe]], [[Unix]], [[Linux]], and [[Microsoft Windows|Windows]]. Supports distributed and parallel execution. Can provide scalability proportional to the [[hardware]] resources provided. Supports platform independent data transport using the Ab Initio data manipulation language. The Co>Operating System is the underlying system of all parts of the product suite allowing communication and integration of all parts into the platform.
Runs across a variety of Operating Systems and Hardware Platforms including OS/390 on [[Mainframe computer|Mainframe]], [[Unix]], [[Linux]], and [[Microsoft Windows|Windows]]. Supports distributed and parallel execution. Can provide scalability proportional to the [[hardware]] resources provided. Supports platform independent data transport using the Ab Initio data manipulation language. The Co>Operating System is the underlying system of all parts of the product suite allowing communication and integration of all parts into the platform.

The product supports three types of parallelisms - data, component, pipeline to improve the performance of huge data processing.


==The Component Library==
==The Component Library==

Revision as of 05:11, 27 April 2007

If you are looking for the Latin term, see ab initio.

Ab Initio Software Corporation was founded in the mid 1990's by the former CEO, Sheryl Handler, and several other former employees of Thinking Machines Corporation, after the bankruptcy of that company.

The Ab Initio software is a fourth generation data analysis, batch processing, data manipulation graphical user interface (GUI)-based parallel processing tool that is commonly used to extract, transform and load data.

The Ab Initio software is a suite of products that together provides a platform for data processing applications. The Core Ab Initio Products are:

  • Co>Operating System
  • The Component Library
  • Graphical Development Environment
  • Enterprise Meta>Environment
  • Data Profiler

The Co>Operating System

Runs across a variety of Operating Systems and Hardware Platforms including OS/390 on Mainframe, Unix, Linux, and Windows. Supports distributed and parallel execution. Can provide scalability proportional to the hardware resources provided. Supports platform independent data transport using the Ab Initio data manipulation language. The Co>Operating System is the underlying system of all parts of the product suite allowing communication and integration of all parts into the platform.

The product supports three types of parallelisms - data, component, pipeline to improve the performance of huge data processing.

The Component Library

Reusable software Modules for Sorting, Joining, Data Transformation, database Loading Etc. The components adapt at runtime to the record formats and business rules controlling their behavior. Components also include various system "connectors" giving access to various storage engines.

The Graphical Development Environment

Provides graphical interface for editing and executing Ab Initio computer programs. This development environment utilizes the available components from the library to enable various ETL activities to occur. The Co>Operating System can execute these programs directly. Allows for monitoring of running applications to quantify data volumes and execution times for performance estimation.

An Ab Initio computer program is called a graph as it behaves similar to its math counterpart. A graph contains one ore more components or vertices, each joined by a flow or edge through which data flows. Data flows only in one direction, which allows the graph to run in a parallel processing environment. Each graph is compiled by the GDE into a Korn Shell script which can be run by the Co>Operating System. It is also possible to write Ab Initio programs without the GDE using a common text editor.

Enterprise Meta>Environment

Along with the Development environment and Co>Operating system is a datastore that tracks changes in development of graphs, as well as metadata pertaining to the development, how data is used, and potential of other means of data classification. The storage of graph related Metadata allows for data impact analysis to occur, giving the user a visual sense of how the data is changing in a graph, and the impacts that those changes have on another graph. Additionally the EME doubles up for config/change management allowing the latest version of a graph to reside post subsequent code changes thereby ensuring the latest code.

Data Profiler

The Data Profiler is a graphical data analysis tool that runs on top of the Co>Operating system. It can be used to characterize data range, scope, distribution, variance, and quality.