Wednesday, July 7, 2010

ReE#1

Re-engineering #1

Many companies put their computers to providing automated facility for their employees without deliberating effort in the first place to gaining optimal benefits out of their investment.

A typical example is, providing an order entry screen, completely removing the paper order form step that the salesmen wrote in before the computer data entry clerks keyed the orders into the computer.

But, why not providing the computerized order entry facility directly to the customers, enabling them to place their orders in their convenient space and time, and save our efforts?

Monday, April 19, 2010

The Need of a Data Warehouse

The Need of a Data Warehouse
By: Djoni Darmawikarta

You have a need of data for analysis and reporting.

Whatever the specific reasons are, fundamentally, you, whether you are a technical developer or a business user, need a data warehouse because accessing the data directly from their operational systems (transactional systems) has not been satisfying.

Your dissatisfaction is either because you have difficulty to understand or access, or you get a slow response time, or both. Technically, the problems happen because the operational/transaction systems are not developed for your data access purpose.

A data warehouse is the middleware between the source (the operational/transactional systems) and you. A data warehouse resolves the problems by dedicating its design, implementation and delivery, to serve your analytical data and reporting requirement.

Your data warehouse collects data you need from various sources, integrate, structure, store, and make them available such that you can access them in easy manner, at the time you need, and with your expected response time.

Wednesday, March 31, 2010

Data Warehousing issue

Data Warehousing issue: Untimely reference data

By Djoni Darmawikarta

 

 

Coming ahead of time

You can have a large new reference data coming early, no transaction uses them at the time they land in your data warehouse as yet. If you load them into you data warehouse, they can slow down user queries unnecessarily. For example, your company has just acquired a large volume survey data that the marketing folks plan to use for email marketing. The data primarily contains list of names and their email. You don’t have any transaction attaches to them until you really send them email. You can defer its loading into the data warehouse.

 

You have the following options
1) Let them stay in the staging area, don’t load them into the data warehouse

2) Option 1 and load only the reference data at the time needed by incoming transaction

3) Load all of them whenever any of them, even just one, needed by any incoming transaction

4) Load all of them regardless of any need

 

Trade off between disk space and speed, option 4 is fastest in loading, takes most disk space, but can slow down user query.


Thursday, March 11, 2010

star schema rule 0 - a star

A star schema contains one fact table and one or more dimension tables.
The dimensions relate to the fact directly and on their surrogate keys.

star schema rule 4 ... no fact to fact

In a multi-stars schema, a fact cannot relate to any other fact

star schema rule 3 ... no dimension to dimension

In a star schema or multi-stars schema, a dimension cannot relate to any other dimension