Jump to content

Star schema: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
1984 (talk | contribs)
m +es:
Line 85: Line 85:


[[de:Sternschema]]
[[de:Sternschema]]
[[es:Esquema en estrella]]
[[pt:Esquema estrela]]
[[pt:Esquema estrela]]
[[ru:Схема звезды]]
[[ru:Схема звезды]]

Revision as of 20:44, 21 May 2007

The star schema (sometimes referenced as star join schema) is the simplest data warehouse schema, consisting of a single "fact table" with a compound primary key, with one segment for each "dimension" and with additional columns of additive, numeric facts. The name star schema is derived from the fact that the schema diagram is shaped like a star.

The star schema makes multi-dimensional database (MDDB) functionality possible using a traditional relational database. Because relational databases are the most common data management system in organizations today, implementing multi-dimensional views of data using a relational database is very appealing. Even if a specific MDDB solution is used, its sources likely are relational databases.

Another reason for using star schema is its ease of understanding. Fact tables in star schema are mostly in third normal form (3NF), but dimensional tables are in de-normalized second normal form (2NF). If you want to normalize dimensional tables, they look like snowflakes (see snowflake schema) and the same problems of 3NF databases arise - you need complex queries and business users cannot easily understand the meaning of data. Although query performance may be improved by advanced DBMS technology and hardware, highly normalized tables make reporting difficult and applications complex.

Example SQL

SELECT
 sum (f_sales.units_sold)
FROM
 f_sales, d_customer, d_time, d_store, d_product
WHERE
 f_sales.customer_id   = d_customer.customer_id AND
 f_sales.date_id       = d_time.date_id AND
 f_sales.store_id      = d_store.store_id AND
 f_sales.product_id    = d_product.product_id AND
 d_time.year_id        = 1997 AND
 d_product.category_id = "tv"
GROUP BY
 d_product.brand, d_store.country_iso_id

Equivalent ANSI SQL-92 Examples

SELECT
      sum (f_sales.units_sold)
FROM
      f_sales
INNER JOIN d_customer ON d_customer.customer_id = f_sales.customer_id
INNER JOIN d_time     ON d_time.date_id         = f_sales.date_id
INNER JOIN d_store    ON d_store.store_id       = f_sales.store_id
INNER JOIN d_product  ON d_product.product_id   = f_sales.product_id
WHERE
      d_time.year_id        = 1997
  AND d_product.category_id = "tv"
GROUP BY
      d_product.brand, 
      d_store.country_iso_id
SELECT
      sum (f_sales.units_sold)
FROM
      f_sales
INNER JOIN d_customer USING (customer_id)
INNER JOIN d_time     USING (date_id)
INNER JOIN d_store    USING (store_id)
INNER JOIN d_product  USING (product_id)
WHERE
      d_time.year_id        = 1997
  AND d_product.category_id = "tv"
GROUP BY
      d_product.brand, 
      d_store.country_iso_id

Alternate ANSI SQL-92 Example

SELECT
 sum (f_sales.units_sold)
FROM
 f_sales
NATURAL JOIN 
 d_customer
NATURAL JOIN
 d_time
NATURAL JOIN
 d_store
NATURAL JOIN
 d_product
WHERE
 d_time.year_id = 1997 AND
 d_product.category_id = "tv"
GROUP BY
 d_product.brand, d_store.country_iso_id

See also