


 

Glossary -- S




Sample Population

A statistically significant subset of data.

Sampling

The technique of randomly acquiring a small percentage of data from a source. The technique is based on the theory that analyzing a statistically significant sample of a data set will reveal the same, or close to the same, information as analyzing the complete data set would. It is often used in lieu of extracting and reviewing every row or data element in a file when the data volume is very large.
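
As a rough illustration of the idea, the following Python sketch draws a 1% simple random sample and compares its mean with the mean of the full data set. The function name `sample_rows`, the generated order amounts, and the 1% fraction are all assumptions made for this example, not part of any particular DW tool.

```python
import random
from statistics import mean


def sample_rows(rows, fraction, seed=None):
    """Return a simple random sample holding roughly `fraction` of `rows`."""
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    if not rows:
        return []
    rng = random.Random(seed)
    # Always keep at least one row so that small inputs still yield a sample.
    k = max(1, round(len(rows) * fraction))
    return rng.sample(rows, k)


# Hypothetical source data: 100,000 generated order amounts.
generator = random.Random(42)
orders = [generator.uniform(5, 500) for _ in range(100_000)]

sample = sample_rows(orders, 0.01, seed=7)
full_mean = mean(orders)
sample_mean = mean(sample)
print(f"full mean: {full_mean:.2f}, sample mean: {sample_mean:.2f}")
```

The two means should be close, which is the property that makes sampling a reasonable substitute for a full scan when volumes are very large.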

Scheduling

Setting the execution sequence and timing for movement of data files from the source system environment to the DW target environment.

Scrubbing

The automated correction of data anomalies during data warehouse processing. See cleansing.
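
A minimal sketch of what automated correction might look like for a single record is shown below. The field names (`name`, `state`, `quantity`) and the rule that a negative quantity is a sign error are hypothetical; real scrubbing rules depend on the source data.

```python
def scrub_record(record):
    """Return a corrected copy of one record (illustrative rules only)."""
    cleaned = dict(record)
    # Collapse repeated whitespace and normalize capitalization.
    cleaned["name"] = " ".join(record["name"].split()).title()
    # Trim and upper-case the state code.
    cleaned["state"] = record["state"].strip().upper()
    # Hypothetical rule: treat a negative quantity as a sign error.
    cleaned["quantity"] = abs(int(record["quantity"]))
    return cleaned


raw = {"name": "  jane   DOE ", "state": " ca", "quantity": "-3"}
clean = scrub_record(raw)
print(clean)
```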

Secondary Level

A data warehouse level that is populated from another data warehouse level, usually the atomic level.

Sizing

Determining needed disk, CPU and communications configurations. 

Factors that impact data warehouse sizing include data volumes, data transport volumes and frequency, data load volumes and frequency, typical data access and user report volumes.

Snapshot

A view of the data at a particular instant in time.

Source System

The system of record or the system that contains the operational data that is to be extracted and loaded into the DW.

Source/Target

See source system / target system.

Star-Schema

A data model with a particular topology that is typical of the subject-oriented data relationships in a DW. The star-schema topology is typified by a large "fact" table that has "one to many" relationships with a number of smaller "dimension" tables: each dimension row relates to many fact rows.
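
The topology can be sketched with an in-memory SQLite database, as below. The table and column names (`fact_sales`, `dim_product`, `dim_date`) and the sample rows are invented for illustration; they only show one central fact table referencing two smaller dimension tables.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Small dimension tables: one row per product and per date.
CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, product_name TEXT);
CREATE TABLE dim_date (date_id INTEGER PRIMARY KEY, calendar_year INTEGER);

-- Large fact table: many rows point at each dimension row.
CREATE TABLE fact_sales (
    product_id INTEGER REFERENCES dim_product(product_id),
    date_id INTEGER REFERENCES dim_date(date_id),
    amount REAL
);

INSERT INTO dim_product VALUES (1, 'Widget'), (2, 'Gadget');
INSERT INTO dim_date VALUES (10, 2023), (11, 2024);
INSERT INTO fact_sales VALUES
    (1, 10, 100.0), (1, 11, 150.0), (2, 11, 75.0), (1, 11, 50.0);
""")

# A typical star-schema query joins the fact table to its dimensions
# and aggregates the measure by dimension attributes.
rows = conn.execute("""
SELECT p.product_name, d.calendar_year, SUM(f.amount)
FROM fact_sales AS f
JOIN dim_product AS p ON p.product_id = f.product_id
JOIN dim_date AS d ON d.date_id = f.date_id
GROUP BY p.product_name, d.calendar_year
ORDER BY p.product_name, d.calendar_year
""").fetchall()

for row in rows:
    print(row)
```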

Stress Test

A test to determine how many resources a system will use under heavy load.

Subject Area

A major subset of corporate data such as customer, transaction, product, part, vendor.

Subject Area Model

A data model of a particular subject area in the DW.

Subject-Oriented

To focus on the subject versus the business process or organization. The data architecture of a DW is subject oriented versus process oriented.

Summarized

See lightly summarized and highly summarized.

Symmetric Multi-Processor (SMP)

Computer hardware architecture allowing multi-threaded execution of multiple processes through shared CPUs and memory space.

Systems Of Record

The source system that has been selected as the best and most accurate source of a particular subject area or subset of data for the data warehouse.




